a oufnal of - 


. Experimental Psychology ` 


Davin A. Grant, Editor 
University of Wisconsin—Madison 
LYLE E. Bourne, Jr., Associate Editor 
« University of Colorado 


Lro POR: Associate Editor 
University’ of California, Berkeley 
Detos D. Wickens, Associate Editor 
Ohio State University 


Haroto W. Haxe, Associate Editor 


3 University of Illinois, Urbana * 


Consulting Editors ^ 


Nancy S, Anverson, University of Maryland 
Norman H. ANDERSON, 

* University of California, San Diego 
Mosne AxisrkELD, Yeshiva University 
WiLLIAM F. Barrio, University of Colorado 
Ina McD. BiLoprAU, Tulane University 
Isanet M. Brknnaum, 

University of California, Irvine 

Grorce E. Bricos, New Mexico State University 
E. J. Cavaunt, Purdue University 
Vincit R. CARLSON, 


National Institute of Mental Health ` 


Hetnert H. Crank, Stanford University 
Ferous I. M. Craix, Erindale College 
Anprew Crier, Williams College 
Hexen B. DALY, 
State, University.of New York College at Oswego 
Rocer L, DOMINOWSKI, + >» 

D University of Illinois at Chicago Circle 
SngLDoN M. EBEN HOLTZ, 

University of Wisconsin—Madison 
Howanp Ecetn, Johns Hopkins University 
Bruce R. ExsTRAND, University of Colorado 
Henry C. Ertis, University of New Mexico 
pene R. Ericxson, Ohio State University 
OBERT Fox, Vanderbilt University 


Q* Murray GLANZER, New York University 

1 Jopera GocGiIN, University of Texas at El Paso 
FRANCES K, GRAHAM, 
University of Wisconsin—Madison 
^«  WiLLIAM W. Grinas, 


4^5 e S MALE 


` 


L4 


, 


. Vol. 102, No. 1 


ij 


«C 


University of Southern California 
Doucias HiNTZMAN, University of Oregon 
ERR HocnzERG, Columbia University 
erry J. House, University of Connecticut 
WitLiAM C. HowkLL, Rice University 
Srewarr H. Hursk, Johns Hopkins University 
GEOFFREY KEPPEL, . 
University of California, Berkeley 
Grecory A. KiwnLE, University of Colorado 
Patrick R. Laucuiin, University of Illinois 
Herscuer W. Lernowitz, 
Pennsylvania State University 


Gregory R. Locxueap, Duke University 
Frank A. Locan, University of New Mexico 
Steven F Maier, University of Colorado 
IgviNG MALTZMAN, 
University of California, Los Angeles 
RicuaxpD B. MintLwanp, Brown University à 
Bennet B. Mukbock, Jr., University of Toront 
RavwoNp S. NICKERSON, 
Bolt, Beranek, and Newman, Inc. 
Cambridge, Massachusetts 
State U niversity 


vei 
Davip C. Raskin, University of Utah 
L. SrAnLING Rib, University of Virginia 
Donacp A. RILEY, 

University of California, Berkeley 
Hexnert RUBENSTEIN, Lehigh University 
Wittarp N, Ruyguist, University of Alberta 
Peter H. SCHILLER, 

Massachusetts Institute of Technology 

Runotru ScnuLz, University of Iowa 
MARIAN SCHWARTZ, 

University of Wisconsin—Madison 
Ronert W, SEkvLEn, Northwestern University 
Bryan E. Surer, Brown Universit 
Harvey G. Suutman, Ohio State University 
J. Ricwarp Sron, University of Iowa 
PauL SLovic, Oregon Research Institute 
Epwanp E, Smiru, Stanford University 
Kirk H. Satu, Bowling Green State University 


Jaxer T. Spence, University of Texas at Austin | 


Herpert H. Stenson, _ 

University of Illinois at Chicago Circle 
Davip R. Tuomas, University of Colorado 
Micwaen T, Turvey, University of Connecticut 
Wittiam P, Wattace, University of Nevada 
Tuomas S. WALLSTEN, 

University of North Carolina 

Wayne A. WickELcREN, University of Oregon 
Gorpon Woop, Michigan State University 


ARTICLES 


Effects of Masking Tasks on Differential Eyelid Conditioning: A Distinction between 
Knowledge of Stimulus Contingencies and Attentional or Cognitive Act 
volving Them: MicRAEL N. Netson AND Leonarp E. Ross .. x 


Seeing the Same Pattern Twice: J. Ross, J. H. Hocsen, AND V. Dt loo 


h CONTENTS 


(Continued on inside back cover) 


^. -—- 
American 


t A 
Ne ^ 


COO OW GM aue 


ogical Association 
tra 


"ur 


January 1974 


"a 


The Journal of Experimental Psychology publishes original experimental investigations which 
are judged by the Editors to contribute substantially and significantly toward the development 
of psychology as an experimental science. Studies with normal human subjects are favored ovec 
studies involving abnormal or animal subjects, except when the latter are specifically oriented 
toward the extension of general psychological theory. Experimental psychometric studies and 
studies in applied experimental psychology or engineering psychology may be accepted if they 
have broad implications for experimental and theoretical psychology. 


Manuscripts. Submit manuscripts in duplicate to the Editor: David A. Grant, Depart- 
ment of Psychology, University of Wisconsin, Psychology Building, Charter and West Johnson, 
Madison, Wisconsin 53706. Manuscripts must be accompanied by an abstract of 100 to 150 word 
typed on a separate sheet of paper. The abstract should conform to the style of Psychologica: 
Abstracts. Detailed instructions for preparation of the abstracts appear in the APA Publication 
Manual, pp. 11-12. " 


The Journal offers its authors the option of a “blind” review of their submitted manuscripts 
under the conditions described in the Special Announcement (1972, Vol. 94, No. 3, p. 325). If 
the author wishes to have his identity hidden from the reviewer, he should request a blind review 
and take sole responsibility for preparing his manuscript in such a way as to hide his identity. 


Manuscripts must adhere to the conventions concerning reference citations, preparation of 
tables and figures, manuscript format, etc., as described in the 1967 revision of the Publi- 
cation Manual of the American Psychological Association, When in doubt about practices 
of the Journal, authors should examine a recent issue. 


Clear, concise exposition is demanded ; verbose manuscripts may be returned for revision t 
fore being reviewed. Piecemeal experiment-by-experiment reporting is discouraged, and in 
grated series experiments executed simultaneously (e.g., most doctoral dissertations) must 
reported in a single paper. Regular papers cannot exceed 20 printed pages. Simple expe 
ments, replications, and supplementary experiments may be published as Short Reports of n 
more than three printed pages, Reports of negative results and failures to replicate publish 
work are also acceptable if they stand on their own merits as contributions to knowledge. 


Normally papers are published in order of their receipt, and the current publication lag i 
abdat ten months. All of the cost of an author's alterations in galley proof is charged to th 
author. 


Most of the editorial decisions on manuscripts submitted to the Journal are made by the 
Editor or one of the Associate Editors. In addition, although listed on the Masthead as Con- 
sulting Editors, Norman H. Anderson, Helen Bohmer Daly, Murray Glanzer, and Judith 

_ Goggin serve as Special Action Editors. As such, they make the editorial decisions on a limited 
. number of manuscripts rather than serve as reviewers for the editors. 


Subscriptions: Subscriptions are available on a DAMES year basis only (January through December). 
, er fal hae ae $60 domestic, $61 foreign, $6 single issue. ABA member rate: $25. Write t 


E: -. Reprints: Authors may order reprints of their articles from the printer when they receive galley proofs. 


t pos Back Pi" 5e Back Volumes: For information regarding back issues or back volumes write to Order 
* 


Microform Editions: For information regarding microform editions write t. y of the followi 
Associates, Inc, 175 Fifth Avenue, New York, New Y EUntversity Mierotime, Ann Arbor Michigus 
Ortes e Smet ens | e pa ied PUER University Microfilms, Ann Arbor, Michigan 


Change of Address: Send change of ‘address notice and a recent maili i | 
Che 4 X ailing label to the att 

uon A te ied px mt a E change of address. U pides roris Le Fein 
|. post office in writing that they will e a a rade d pere, beu a 


 Hanorp P. Van Corr Anita DeVivo 


H pP i 1 Louise RosENBLA | 
Managing Editor . Executive Editor pipisi Technical Cer Sura | 
i "n > E anager itor Technical Editors — . 
. Forrest T. s 
F. M e ANNE REDMAN 
Manager, Publications Business ' e Subscription Manager | 
Services P. 


D.C. 20036. Primed in the USA, "Second dum ponage gad at N SE ead Saat 


E " ak ^ 
Journal of | 


Experimental Psychology 


+ Copyright © 1974 by the American Psychological Association, Inc, 


Vor. 102, No. 1 


January 1974 - 


= = = 
EFFECTS OF MASKING TASKS ON DIFFERENTIAL 
4 EYELID CONDITIONING: 
SA DISTINCTION BETWEEN KNOWLEDGE OF STIMULUS 
E CONTINGENCIES AND ATTENTIONAL OR COGNITIVE 2 


ACTIVITIES INVOLVING THEM ! 


MICHAEL N. NELSON anp LEONARD E. ROSS? 


University of Wisconisn—Madison 


Two experiments investigated the effects of extraneous masking tasks and the 
viewing of a silent movie on differential eyelid conditioning. In Experiment I, 
involving 144 college students, masking tasks and movie viewing both signif- 
icantly decreased differential conditioning performance. More masking-task 
than no-ma: 


s of a postexperimental 


When ‘Ss in a differential classical con- 
ditioning situation are required to perform 
an extraneous or “masking” task during the 
conditioning session, their differential con- 
ditioning performance is much poorer than 
in the absence of such a task. This effect 
has been obtained in electrodermal response 
(EDR) conditioning studies under a variety 
of masking-task conditions (e.g., Baer & 
Fuhrer, 1970; Dawson, 1970), and in eyelid 
cohditioning experiments (e.g, Mayer & 


1 This research was supported by U.S. Public 
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Training Grant 5-TOI-HDOO-117 from the Na- 
tional Institute of Child Health and Human 
Development. 
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ig-task Ss were classified as unaware of stimulus contingencies 
s questionnaire, and unaware Ss showed 
significantly less differential responding. The masking task 
reduced the differential conditioning performance of aware Ss, however, which 
suggested that factors other than awareness. were important in obtaining the 
Similar masking-task and movie effects were found in Experiment II 
in which Ss were informed of the stimulus contingencies prior to the start of 
the conditioning. Results of the studies were interpreted as indicating that 
attentional or cognitive activities involving cue contingencies during the 
experimental session are important factors which should be differentiated 
from knowledge of the contingencies per se, 
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HUMAN 


and movie also 


Ross, 1969; Ross, Wilcox, & Mayer, 1967) 
which have used a time-estimation masking 
task. 

Studies utilizing masking tasks with the 
EDR have generally done so in the context 
of investigating the relationship. of aware- 
ness to classical conditioning performance, 
with the masking task being used to de- 
crease S's awareness of stimulus contin- 
gencies. Data from studies such as those 
cited above suggest that differen tial classi- 
cal conditioning of the EDR is related to 
awareness as defined by postexperimental 
questionnaire reports, and under some cir- 
cumstances does not occur in the absence 
of such awareness. 

In contrast, differential eyelid condition- 
ing studies have not directly examined the 

L s-differential-conditioning rela- 


ionship when masking tasks are employed, 
though previous eyelid conditioning data 
ggest that awareness factors may be 
involved in the masking task's effect upon 
differential conditioning, In this regard, 
‘Spence (1966) reported that a light-guessing 
"or probability-learning masking task slows 


~ " < n" . 
the extinction of the conditioned eyelid 


response by decreasing S's recognition of 
the acquisition-extinction transition. Simi- 
Jarly, Spence and Platt's (1967) failure to 
find a partial reinforcement decrement in 
acquisition under masking-task conditions 
yas interpreted as reflecting the absence 
— of a cognitive set assumed to be operative 
- "as a function of nonreinforcement in stan- 
‘dard eyelid conditioning situations. 
d Given these single-cue conditioning ef- 
fects, it appears that the reduction in dif- 
Terential eyelid conditioning that has been 
und under masking-task conditions might 
e a result of the masking task's effects on 
S's awareness of the relationships among 
— the positive conditioned stimulus (CS+), 
negative conditioned stimulus (CS—), and 
nconditioned stimulus (UCS). One pur- 
pose of the present study was to investigate 
tis possibility by using a postexperimental 
~ questionnaire to assess S’s awareness of the 
stimulus contingencies. 
—. While the data of eyelid conditioning 
tudies clearly show that a time-estimation 
_ masking task reduces differential condition- 
ing performance, there has been no attempt 
— to determine if specific characteristics of 
ie masking task are important in obtaining 
» effect, In particular, it is possible tha 


| the time-estimation masking-task pro- 
ure might in itself be an important 
or in reducing differential conditioning 
ormance. To examine this possibility, 
. "e present study compared differential 
— conditioning performance under conditions 
of (a) no masking task, (b) the previously 
ised Ume-estimation task, and (c) a modi- 
fime-estimation task which required S 
respond differently to the CS+ and 
Both the original and modified 
| tasks required 2 time-estimation 
responses (button presses) during 
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every intertrial interval (ITI), © it 
modified masking task required Ss 
spatially order their estimation »on 


depending upon whether the CS4 
had been presented on the previ 
ditioning trial. 

A final factor included in thc y 


the presence or absence of a sile: 10010) 
picture for S to view during the ition: 
ing. Since films have been i 

number of previous differential | con: 
ditioning studies that employed | ditor 
stimuli, it was of interest to rmim 
whether or not viewing a film, ar t thai 
does not involve any specific task d: sands; 


might function as a masking tas! 
ing S's awareness of stimulus cont 
and differential conditioning perfi 

Experiment I consisted of a stud 
replication. The replication was added 
when it appeared that too few Ss in the 
original study were unaware of the stimulus 
contingencies to make meaning com- 
parisons of aware and unaware Ss 


encies 
nance 
and its 


EXPERIMENT | 


Method 
Subjects. One hundred and sixty-five students 
enrolled in introductory psychology courses at the 


University of Wisconsin served as Ss to fulfill part 
of a course requirement. The data from 21 ss were 
rejected, 19 because of movie projector or proe 
gramming equipment failure and 2 because of Æ 
error. The remaining 144 Ss, 72 males and 727 
females, were randomly assigned to groups except 
for the restriction that cach group contain an equal | 
number of males and females. The original study - 
and its replication each consisted of 6 groups A 
12 Ss each. 

Apparatus. The experiment was conducted in à 
semisoundproof  3-room laboratory, with pro- 
gramming and recording equipment located in al 
control room serving 2 testing rooms. Each S satini. 
an adjustable ophthalmologist's chair facing a movie 
screen, To make their time-estimation responses: 
Ss in masking-task groups pressed buttons located 
on the ends of the arms of their chairs, Monitor 
lights in the control room that flashed whe thë 
buttons were pressed allowed Æ to check for com 
pliance with instructions. 

Pure-tone stimuli were generated by Hewlett- 
Packard audio oscillators and presented through 3 
Grason-Stadler electronic switch set for fast (7 10" 
usec.) onset and offset ramps. The tones were de) 
livered binaurally to each S through Sharpe H.A-10A 
earphones, Movement of S's right eyelid was re 
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ising standard potentiometer-polygraph 


zn and procedure. The design of each study 
W 3X 2 X 2 X 2 factorial with 3 masking- 
conditions (masking task, modified masking 


d no masking task), movie vs. no movie, and 
t ounterbalanced factors of frequency of CS+ 
ex of S. The pure-tone CSs of 800 and 2,100 


were presented for 900 msec. in a continuous 
ground. The loudness of the back- 
round white noise was matched by 4 judges to 
5-db. (re .0002 dynes cm?) free-field white noise. 
be loudness of the 2 tones was matched by the same 
judges to a 1,000-Hz. tone 85 db. above the mean 
hreshold, The UCS was a .75-psi air puff of 100- 
nsec. duration delivered through a 1-mm. aperture 
to S's right cornea. An 800-msec. interstimulus in- 
rval and 15-, 20-, and 25-sec. ITIs (M = 20 sec.) 
re used, Equal numbers of CS+ and CS— trials 
vere presented in a Gellermann sequence. 
[he original study and its replication were identi- 
cept that masking-task Ss in the original study 
eceived a total of 110 trials with a 1-sentence 
recorded instruction to cease performing their ti 
between Trials 100 and' 101, while no-masking-t: 
Ss were interrupted with a l-sentence statement 
concerning either the silent film or attention to the 
fixation point. Since the data from Trials 101—110 
did not show systematic group differences, they were 
omitted from further consideration, and Ss in the 
replication were given only 100 uninterrupted trials. 
Instructions were given to Ss in 2 parts. The 
first portion was spoken to S as the headset was 
attached and adjusted. Additional recorded instruc- 
tions were delivered through S's earphones after E 
had left the testing room. For the movie group, 
preliminary no-masking-task instructions involved 
telling S that the purpose of the experiment was to 
(measure S's reactions to "certain events" which 
would occur while S watched a silent motion picture. 
Both modified-masking- and masking-task Ss were 
|told that the experiment was concerned with time 
imation during distraction, and that certain 
"distracting events" would occur while they were 
viewing a silent film. No-movie groups were treated 
identically except that all references toa movie were 
omitted. Instead of viewing a silent film, Ss in no- 
movie groups were asked to fixate a 7.6-cm.-diam. 
flat-back disk fastened to a movie screen and illumi- 
nated by a rectangular patch of diffuse white light 
MES by a movie projector running without 
film 


hite-noise 


ale 


additional recorded instructions presented 
after E left the testing room began with standard 
neutral instructions which asked S to neither aid nor 
inhibit his natural responses. Masking-task Ss were 
Iso instructed to judge 2- and 5-sec. time intervals 
‘from tone offset on each trial, i.e., 2 sec. after a tone 
had terminated S was to push the button on the 
right arm of his chair, and 5 sec. after the same tone 
had terminated S was to push the button on the left 
arm of his chair. Modified-masking-task Ss were in- 
structed to vary the order in which they pressed the 


buttons to estimate the 2- and 5-sec. intervals, 


P 


to press the right-hand button 2 sec. after the high- 
frequency tone terminated and to press the left-hand — 
button 5 sec. after the high tone had terminated, —— 

iy, 


Half of the modified-masking-task Ss were instructed. 3 


but to begin with the left-hand button when per- —- 
forming the same two estimations after the low- 
frequency tone. The other half of the Ss were —- 
instructed to report the estimations in the reverse - 
spatial order, i.e., left then right for the estimations 
after the high tone, and right then left after the low 
tone, After questions had been answered, masking- 
and modified-masking-task Ss were given the same 
recorded pretrial instructions given to no-masking- 
task Ss, i.e., that they would now be hearing some 
of the tones that would occur during the experiment. 
Three pretrials were then given: 1 800-Hz, tone, 1” 
2,100-Hz. tone, and 1 unpaired air puff, Modified- 
masking and masking-task Ss who initially failed to 
press their time-estimation buttons in the proper ` 
order were corrected by E between the first few 
conditioning trials. 
Immediately after the last trial each S was given 
a postexperimental questionnaire.’ All Ss were 
asked the same 12 questions (1 question per page) 
which differed only slightly among the groups in 
order to be coi ent with the appropriate task 
and movie condition. The first 2 questions were 
open ended: The first asked S what he thought the 
purpose of the experiment had been, and the second 
asked S what relationships he had noticed between 
the various stimuli and other aspects of the experi- 
ment, Questions 3-8 required S to note, on an 11- 
point scale, his resistance to the experimental pro- 
cedures, his anxiety, annoyance, alertness, attention 
to the movie or the fixation point, and his irritation 
with the air puff. The next 3 questions asked S 
(a) if he noticed a change in: the strength of the air 
puff, (b) if he prepared himself for the arrival of | 
the air puff, and (c) to estimate the total number of 
air puffs and tones he had received during the experi- — 
ment. The last question asked S to report, on an ~ 
11-point scale, the point during the experiment when — 
he had noticed a relationship between the air pull 
and the tones. Masking- and modified-masking- 
task Ss were given 1 extra question, inserted between. 
Questions 7 and 8, which asked them to report how. 
diligently they had performed their time-estimation 
tasks. The Ss were not allowed to modify answers - 
to earlier questions. y 


Results D 


A conditioned response was defined as 
the first pen deflection of at least 2 mm. 


? The questionnaire used was originally developed — 
by G. A. Kimble, and modified by J. W. Moore. 
The present study used most of the items reported — — 
by Moore to correlate with performance, together 
with some additional questions developed concern- 
ing masking-task conditions. 
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Mean percentage responses to the positive conditioned stimulus 
(CS+) and the negative conditioned stimulus (CS—) asa function of masking 
task and movie-viewing conditions collapsed across both replications in Experi 


FIGURE 1. 
ment I. 


(1 mm. at the eye) in the interval begin- 
ning 200 msec. after CS onset and ending 
at the UCS onset. 

Figure 1 presents mean differential re- 
sponding to CS+ and CS— as a function 
of task condition and trial block for the 
movie and no-movie groups of both stud ies, 
with the data collapsed across the counter- 
balanced factors of frequency of CS+ and 
sex of S. Inspection of the figure reveals 
that differential responding was greatest 
in the no-masking-task condition, and 
poorest in the masking-task condition 
under both movie and no-movie conditions, 
The movie decreased differential responding 
| under all task conditions. 

__ A 6-factor analysis of variance was per- 
formed on difference scores for Trials 
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1-100, with between-Ss factors of repi 
cations, task condition, movie conditio 
frequency of CS+, and sex of S, and 
within-Ss factor of 10-trial blocks. Signt! 
cant main effects of task condition, F (2,9 
= 10.81, p < .001, movie condition, F Vd 
96) = 10.23, p < .005, and trial bloc 1 
F (9, 864) = 13.61, p < .001, were 
tained. The replication factor did 
reach the .10 level, and the only signifi 
interaction, which proved to be unim 
pretable, was that of Replication X T 
X Sex X Frequency, F (2, 96) = &U 
p «.025. 

Subsequent comparisons of the mask 
task conditions revealed that the modi 
masking-task group was intermediate | 
performance, i.e., significantly superior v 
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the masking-task group, F (1, 96) = 4.37, 
^ ,05, but inferior to the no-masking- 
sk group, F (1, 96) = 5.39, p « .025. 
he performance of no-masking- and mask- 
ing-task groups differed significantly, F 

1, 96) = 19.48, p < .001. 

Only 18 of the 144 Ss were classified as 
voluntary-form responders by the Hartman 
and Ross (1961) derivative criterion, which 
was modified to prevent nonresponders 
from being classified as voluntary-form 
responders. Since the data remained 
virtually unchanged when the voluntary- 
form-responder data were omitted, separate 
analyses were not performed. 

The postexperimental questionnaire data 
for the original study and replication were 
closely examined to determine whether the 
interruption between Trials 100 and 101 in 
the original study might have had an effect 
on S's questionnaire responses. No effect 
was apparent since Ss’ mean response 
scores on the questions were quite similar 
for corresponding groups in the 2 studies. 
Consequently, Ss from both studies were 
lassified as aware or unaware of the stim- 
ulus contingencies as defined by a correct 
statement on the first 2 open-ended ques- 
tions to the effect that the high (or low) 
‘tone was followed by a puff of air. Although 
it proved possible to make the classifica- 
tions on a quite objective basis, all classi- 
fication of Ss as aware or unaware was 
performed without knowledge of the group 
assignment or conditioning performance 
of S. 

Table 1 presents both the frequencies 
of Ss classified as aware and unaware and 
mean differential responding over all trials 
for the 6 major group conditions, combined 
lor the study and its replication. It was 
found that unaware Ss were distributed 
»nequally across the 6 groups shown in 
Table 1; 33 (2) = 802, p «02. The 
table shows that (a) the majority of un- 
aware Ss were in the 2 original-masking- 
task groups, (b) the modified-masking- and 
no-masking-task groups did not differ in 
number of unaware Ss, and (c) the movie 
had little effect on the distribution of un- 
aware Ss, 


A 3x2X2X10 expected cell fre- 


TABLE 1 


MEAN PERCENTAGE DISCRIMINATION AS A FUNC- 
TION OF MaskiNG-Task AND MOVIE CONDITIONS 
FOR Ss CLASSIFIED "AwWARE'-OR “UNAWARE” 
ON THE Basis OF POSTEXPERIMENTAL 
QuESTIONNAIRE RESPONSES 


Task condition 


Movie condition 
and reported 


| 
ie 
awareness | a | Modified No Aine Nee 
um masking task task 
= | | x 
Movie | 
Aware 12.4 (9) 11.6(18) 21.6(17) 
Unaware | 13(15) 6.7 (6) 12.5(7) 
No movie | 
Aware | 16.5(12) 26.1(17) 40.2(18) 
Unaware | -9(12) 19.2(7) 31.3(6) 


Note. Number of ssi in E group is given in parentheses. 


quencies analysis of variance involving 
the factors of task, movie-no movie, 
awareness, and trial blocks was performed 
on the CS+/CS— difference score data 
shown in Table 1. A major finding of the 
analysis was that Ss classified as aware of 
the stimulus contingencies showed signif- 
icantly greater differential responding than 
those classified as unaware, F (1, 132) 
= 6.02, p < .025. As in previous analyses, 
significant main effects of task, F (2, 132) 
= 7.56, p < .001, and movie condition, 
F (1, 132) 2 9.72, p< .005, were ob- 
tained. The trial blocks effect was signif- 
icant, F (9, 1188) = 9.89, p < .001, but 
none of the interactions involving the 
trials variable reached the .10 level. 
Although task and movie factors did not 
interact with the awareness variable (both 
Fs <1), planned individual comparisons 
were carried out on masking-task and 
movie-no-movie differences for aware Ss 
only. The no-masking-task—masking-task 
difference was significant, F (1, 132) 
= 15.94, p < .001, as was the no-masking- 
task-modified-masking-task difference, 7 
(1, 132) = 8.84, p < .005. The masking- 
task-modified-masking-task difference was 
not signific ant, F (t, 132) 2 1:112; 5 5:207 
‘The movie-no-movie per fiance of aware 
Ss also differed significantly, F (1, 132) 

= 13.66, p < .001. A similar analysis 
red out without the trials factor on 
terminal differential performance (Trials 
61-100) found the same effects significant, 
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both in the original analysis and individual 
comparisons. 

The data from each group in Table 1 
were also examined to determine if signifi- 
cantly more responses were made to CS+ 
than to CS—. Dependent ¢ tests of the 
total number of responses to the 2 stimuli 
revealed greater responding to the CS+ 
in each aware group (p < .05) while no 
unaware group showed significant differ- 
ential responding. This was also true for 
terminal responding (Trials 61-100), except 
that the difference in responding to CS+ 
and CS— for aware Ss tested under movie 
and original masking conditions only 
reached the .10 level of significance. 

The remaining questions on the post- 
experimental questionnaire dealing with S 
anxiety, number of tones and air puffs 
received, annoyance, alertness, attention 
to task or movie, adaptation to USC, etc., 
showed nonsignificant low correlations with 
differential responding ; no change in results 
was noted when the questionnaire data for 
aware and unaware Ss were considered 
separately. 

The questionnaire results did not change 
appreciably when a less specific definition 
of awareness was used, ie, when any 
Statement to the effect that the tones 
were followed by air puffs (CS+ and CS— 
relationships not necessarily specified) was 
used as the criterion of awareness. 


Discussion 


The data of Experiment I clearly demon- 
strate that performing a time-estimation task 
or viewing a silent film during the conditioning 
Session results in much poorer differential con- 
ditioning performance than would otherwise 
be the case. Further, the degree to which dif- 
ferential conditioning is reduced can depend 
upon specific features of the masking task since 
differential conditioning performance with the 
original masking task was found to be signifi- 
cantly poorer than that found with the modi- 
fied-masking task in the analy 
all Ss. 

The major difference between the masking 
tasks was the requirement of a common post- 
_ trial response as compared to different, cue- 
‘contingent, posttrial responses. While the 
Mata might be interpreted as indicating that 


sis involving 


this response difference was responsible for the 
performance difference, it cannot be deter 
mined from the present study whether it was 
response mediation, a difference in attention 
to the cues, or some other factor that was the 
basic mechanism involved. In any case, t 
data do clearly demonstrate that a m 


sl 

task can significantly impair differentia! con 
ditioning performance even when S is required 
to attend to the CS+ and CS— in order to 
make different responses contingent upon 
which stimulus is present. 

It also should be noted that the masking 
tasks differed in complexity and und. dy 


imposed different task demands, i.e. i- 
tive loads, on Ss. However, comparing | 
masking-task groups, poorer differentia 
sponding was found with the original masking 
task, which involved considerably less in the 
way of task demands. Since poorer differential 
conditioning performance was also found with 
movie viewing, which required no response on 
the part of Ss, the data from this study do not 
support the position that cognitive load is 
inversely related to differential conditioning 
performance. Obviously studies which system 
atically vary load factors over a wide range 
of values will be necessary to provide definitiy 
evidence regarding a cognitive-load-differe: 
tial-conditioning relationship. 

A primary purpose of the study was to 
determine whether the masking-task effe: 
could be accounted for by Ss’ awareness of 
stimulus contingencies. Since there were more 
unaware Ss in the masking-task groups, and 
unaware Ss performed significantly more poorly 
across all conditions, it would appear that the 
masking task may have prevented some Ss 
from becoming aware of the stimulus contin- 
gencies and thus affected their differential con- 
ditioning performance. Such an effect would 
be consistent with interpretations of EDR 
masking-task data (e.g., Dawson, 1970). How- 
ever, in view of other features of the data, 
this factor is not sufficient in itself to account 
for the detrimental effects of the masking task. 
Most important in this respect is the fact that 
Ss who clearly were aware also showed mask- 
ing-task effects. Similarly, although the movie 
apparently did not reduce awareness (since 
the numbers of aware Ss were very similar 
under movie and no-movie conditions), aware 
Ss who did not view the movie did much better 
than aware Ss who did view the movie. Thus, 
some factor other than knowledge of the rela- 
tionship among the stimuli, as defined by the 
postexperimental reports of Ss, would appear 
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to operate to reduce differential responding 
under masking-task and movie conditions. 

It might be argued that the masking tasks 
and the movie simply delayed awareness until 
later in the conditioning session, and that the 
poorer differential performance of 7 
masking-task and movie Ss reflected that del 
This interpretation rece apparent 
support from an examination of aware Ss' re- 


yes some 


sponses on the questionnaire item which asked 
S to rate the point at which a relationship 
between the air puff and the tone had been 
noticed. A 3 X 2 analysis of variance of the 
point of awareness data for the masking-task 
and movie groups did reveal that the no- 
masking, modified-masking-task, and masking- 
task group means of 89.4, 84.4, and 67.6 
differed significantly, F (2, 84) = 6.42, 
p < 005. The pattern of individual point of 
awareness differences did not parallel the pat- 
tern of differential conditioning performance, 
however, since subsequent comparisons showed 
a significant difference between the 2 masking 
tasks, F (1, 84) = 14.26, p < .001, but no 
significant difference between no masking task 
and the modified masking task, F (1, 84) 
= 1.82, p> .10: the reverse was true, i.e. 
significant and nonsignificant ‘differences, re- 
spectively, in the case of the conditioning per 
formance comparisons. Further, the point of 
awareness means for movie and no-movie aware 
Ss were quite similar, 81.8 and 83.0, respec- 
ly (F < 1), although a large and significant 
difference was found in their differential con 
ditioning performance. 

Although these data lead to the conclusion 
that the point of awareness data do not mirror 
conditioning differences in a manner that 
would suggest a causal relationship, it was 
decided to investigate further the relationship 
between awareness and differential condition- 
ing in an experiment in which Ss were informed 
about the stimulus contingencies prior to the 
start of the conditioning session. Presumably, 
if the movie and masking-task effects found 
with aware Ss were due to reduced or delayed 
awareness of stimulus contingencies, the movie 
and masking task should have little effect on 
performance under conditions where S 
given this information prior to the conditioning 
session. On the other hand, decrements similar 
to those found in Experiment I would support 
the conclusion that knowledge of stimulus 
contingencies is not the sole factor responsible 
for the masking-task and movie-viewing effects 
on differential conditioning. 


was 


EXPERIMENT l 
Method 


Subjects, Fifty-three Üniversity of Wisconsin 
undergraduates enrolled in introductory. psychology 
courses served as Ss to fulfill part of the course 
requirement. The data from 5 Ss were rejected 
due to movie projector or programming: equipment 
failure. The Ss were randomly assigned to groups 
with the restriction that an equal number of males 
and females were included in each group. 

Apparatus. The apparatus remained unchanged 
from that used in Experiment I, except for the sub- 
stitution of solid-state timing equipment for Tek- 
tronix waveform and pulse generators, 

Design and procedure. The design of the experi- 
ment was a 2 X 2 X 2 X 2 factorial with 2 task 
conditions (masking task and no mas ing task), 
movie vs. no movie, and counterbalanced factors of 
sex and frequency of CS+ (800 or 2,100 Hz.), The 
original masking task was used in this study 

The procedure replicated that of Experiment 1 
ept that each S was given the following additional 
instruction prior to the start of conditioning trial 
"During the experiment you will be he; ig two 
tones varying in pitch, or frequenc: The high 
[low] tone will always be followed by a puff of air, 
while the /ow [high] tone will never be followed bya 
puff of air." 

The questionnaire used in Experiment T was also 
iven to Ss in this experiment, 


Results 


Mean differential responding data as a 
function of masking-task and movie con- 
ditions are presented in Figure 2 with the 
data collapsed across the counterbalanced 
factors of sex and stimulus frequenc 
Inspection of the figure reveals that the 
time-estimation masking task and the 
movie produced fairly large decreases in 
differential responding relative to the no- 
movie-no-masking-task group. 

A 5-factor analysis of variance was per- 
formed on the data of Experiment Il, 
including between-Ss factors of m ing 
task, movie, sex, and stimulus frequency, 
and the within-Ss factor of 10-trial blocks. 
Performance under the no-m sking-task 
condition was found to be significantly 
superior to masking-task performance, ^ 
(1, 32) = 8.06, p < .01, and differential 
responding increased significantly across 
trial blocks, F (9, 288) = 7.89, p « .001. 
A significant Movie X Trial Blocks X Fre- 
quency interaction was obtained, F (9, 288) 
= 245, p < .05. It was not possible to 
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(CS-F) and the negative conditioned 
task and movie-viewing conditions in 


interpret this interaction in a meaningful 
way, and a similar interaction was not 
present in either replication of Experiment 
l. 

Since it had been decided prior to the 
study to compare the movie-no-movie 
effect separately for masking and no- 
masking Ss, the analyses were carried out 
despite the absence of a significant Masking 
"Task X Movie interaction. Under no- 
masking-task conditions, the movie effect 
was significant, F (1, 32) = 4.59, p< .05, 
while no effect was found under masking- 
task-conditions. Inspection of Figure 2 
suggests that the masking task so reduced 
differential responding that movie effects 
were not possible. 

Only 5 Ss in Experiment II were cl 
fied as voluntary-form responders, 
removal had little effect on the data. Since 
the questionnaire items were phrased in 
terms of relationships noticed during the 
session, Ss’ answers would be expected to 
refer to their experiences during the con- 
ditioning session rather than to the in- 
formation given them by E prior to the 
start of the session. The Ss’ comments 
during and after the questionnaire sug- 


assi- 
and their 


Mean percentage responses to the positive conditioned stimulus 


mulus (CS—) as a function of masking- 


?xperiment IT, 


gested that this was the case, but confusion 
was expressed regarding the apparent con- 
tradiction of being informed of the con- 
tingencies and then being asked to report 
when they were noticed. In view of the 
difficulty in interpreting the questionnaire 
responses of informed Ss of this study, no 
further analyses of the questionnaire data 
are reported. 


GENERAL DISCUSSION 


The results of Experiments I and II clearly 
indicate that the effect of an extraneous task, 
at least under the experimental conditions of. 
these studies, is not due solely to a concealing 
or masking of the contingencies from Sin the 
sense that S is not aware or does not have 
knowledge of them. It may be that the mask- 
ing task prevented some Ss from becoming 
aware of the stimulus contingencies so that 
they could not be reported, and for that 
matter even without the masking task some 
Ss were not able to report the relationships. 
It is extremely difficult to determine if Ss are 
unaware in any absolute sense since it is not 
possible to be certain that Ss classified as 
unaware by one measure would be similarly 
classified with a different measure. In any 
case, Ss classified as aware in the present 
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studies performed better than those classified 
as unaware, and this finding, together with 
the fact that more masking-task Ss were 
classified as unaware, suggests that part of 
the effect of the masking task was to prevent 
some Ss from noticing or becoming aware of 
the cue contingencies so that they could later 
report them. 

In addition, however, aware Ss from Ex- 
periment I, as well as Ss from Experiment I 
who were told the contingencies prior to the 
start of the study, showed reduced differential 
responding under masking-task and movie 
conditions. These data strongly suggest that 
a fundamental factor in differential condition- 
ing is S's attention to the stimulus contin- 
gencies, or the result of cognitive activities 
involving the contingencies (e.g. mediating 
effects of verbal expectancy), during the con- 
ditioning session. 

If we assume that some attentional or cog- 
nitive activity involving stimulus contingen- 
cies enhances differential conditioning, the 
results of Experiments I and II follow. First, 
those Ss who were unaware throughout the 
study, due to the demands of the masking 
task or for other reasons, would be less likely 
to attend to or engage in cognitiv ity 
with respect to cue contingencies and there- 
fore would be expected to perform more poorly. 
Masking-task or movie 5s who became aware, 
or who had knowledge of the contingencies 
prior to the experiment, would also be less 
likely to attend to or become cognitively in- 
volved with the cue contingencies, but in 
this case because of the demands of the mask- 
ing task or the distraction of viewing the 
movie, Thus, unaware Ss and aware Ss 
under masking-task demands or movie-viewing 
conditions should both show poorer differ- 
ential conditioning performance. 

Assuming the above to be correct, it appears 
necessary to distinguish between knowledge 
of stimulus contingencies, eg. as measured 
by postexperimental questionnaires, and Ss’ 
attentional or cognitive activity involving 
the relationships among the CS+, CS—, and 
UCS. Itis clear that Ss may have information 
about the contingencies, but whether this 
affects differential conditioning performance 
undoubtedly depends upon a number of vari- 
ables such as the particular demands and dis- 
tracting features of the experimental situation. 

Finally, it should be noted that although 


performance levels varied somewhat across 
conditions from study to study, the overall 
picture is one of the masking tasks and movie 
resulting in a high level of response to the 
CS—, with good acquisition curves to both 
CS+ and CS— in almost all groups. Ross 
and Ross (1972) have discussed the fact that 
a number of variables affect differential but 
not single-cue conditioning, and have sug- 
gested that single-cue conditioning involves 
a detection process, while differential con- 
ditioning involves detection plus recognition 
process Within this framework the major 
effects of the masking tasks and movie appear 
to be on the recognition processes necessary 
for good differential conditioning rather than 
upon detection of the CS+ and CS— per se. 
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SEEING THE SAME PATTERN TWICE? 
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Two dot patterns, each either brief (3 r 


on an oscilloscope one after the other, 


640 msec. The second pattern wa: 


AND V. Dr LOLLO 


'edlands, Western Australia, Aust. 


msec.) or long (250 msec.), were di 
separated by an interval between 5 and 


shown in the same field as the first, o 1 
field which was shifted a small or a large amount to the right. Three Ss juc 


whether the second pattern was the same as the first, or different 


radically alters performance for brief p 
criminative visual cues. 
by shift, since Ss rely on memory, no 
ditions, "The results were contrasted 


Pollack (1972) carried out a large study 
on the detection of changes in spatial posi- 
tion. He made 2 observations about his 
findings. The first is that the "most 
striking result from the extended program 
is related to the sharp maximum in detec- 
tion performance for interflash intervals 
(IFTs) of about 64 msec. for brief displays 
[1972, p. 24]" The second is that "'per- 
formance is nearly at chance levels for 
IFIs of 250 msec., which has been taken 
as an estimate of the duration of the visual 
image [p. 24." 

At least 3 distinct time ranges are span- 
ned within Pollack’s (1972) study: (a) a 
range within which successive patterns ap- 

© Pear to be simultaneous (Eriksen & Col- 
lins, 1967; Fraisse, 1966; Lichtenstein, 
1961; McFarland, 1965); (b) a range within 
Which apparent movement is observed be- 
{ween successive patterns or pattern com- 
ponents (Kahneman, 1967); and (c) a range 
beyond apparent movement, within which 
the first pattern is no longer visible when 
the second pattern appears. The purpose 
of the studies reported here is to examine 
the bases for judgments of pattern iden- 
tity for dot patterns presented in the same 
field of view, as in Pollack's study, and 
for patterns presented in different fields. 
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Performance with long patterns 


patterns by modifying the validity 
altered mu 
t on visual effects, under almost 1 
to conclusions reached by Pollac! 


If the bases for judgme: 
discriminative cues that : 
time ranges, and if the 
of patterns alters the 

available in each range, 
of patterns should gov: the effect of 
temporal parameters such as stimulus dura- 
tion and interstimulus interval 


identity are 
in different — 
lative location 
idity of cues 
lative location 


For example, apparent movement is a 
valid cue for patterns presented in the 
same field. Where the second pattern is 


identical to the first, there will be no ap- 
parent movement, since the second set of 
dots is in registration with the first. 
However, where the second pattern is dif- 
ferent, some dots appear in new positions, 
and apparent movement will be seen inj 
the appropriate time range. An O can 
conclude validly that the second pattern 
was different if he sees apparent movement. 
The same is not true if the second pattern 
is displaced. Then none of the dots ap- | 
pear in old positions, and apparent move- 
ment will be seen both for identical and 
for different pairs. The movement cue 
loses its validity if patterns are displaced. 
Numerosity is another cue that is valid 
in one case but not in another. Two 
identical patterns each of 8 dots seen as 
simultaneous will be seen as one pattern 
of 8 dots if the second pattern is not dis- 
placed. Two differentzpatterns, on the 
other hand, will be seen as one pattern 
of more than 8 dots, since some of the 
dots in the second pattern occupy new 
Positions. When patterns are displaced, 
numerosity is no longer a valid cue, since - 
2 patterns will be seen as one containing 
10 
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16 dots, whether they are the same or 
different. 

The appearance of 2 different patterns 
seen as simultaneous is illustrated, for the 
case of no displacement, in Figure 1. Not 
only are more dots seen, but there is an 
obvious "doubling" of some dots, those 
shifted to new positions to make the second 
pattern different from the first. This 
doubling is a prominent and valid cue for 
patterns that are not displaced. In an 
effort to destroy its validity, patterns were 
constructed to look like a pair of different 
patterns seen as simultaneous (see Figure 
1), and were used in Experiment II. 


METHOD 
Experiment I 


Apparatus. The visual display, consisting of 2 
fields of random dots, was presented on a gridless 
Tektronix 602 point plotter equipped with fast 
P15 phosphor. 

The S sat inside a lightproof cubicle and viewed 
the display through a Tektronix (Model 016- 
0154-00) viewing hood. The scope was positioned 
just outside the cubicle and the viewing hood was 
inserted through a light-sealed opening in the wall. 
The S viewed the display binocularly with his face 
resting against the viewing hood. In that position 
the display surface was approximately 28 cm. from 
S's pupils. Low-level illumination of the display 
surface highlighted a rectangular outline at the 
periphery of the display area (10 X 8 cm.) and 
aided S's focusing and convergence. 

The dot fields were generated by a PDP-8/S 
computer, which also performed all the timing and 
scoring functions. The duration of all timing loops 
as determined with the aid of a Marconi (Model 
TF 2414) timer/counter under software control. 

Task definition. In all conditions of the experi- 
ment the basic display consisted of the presenta- 
tion of 2 fields each containing a pattern of 8 dots 
and separated by a variable time interval. On a 
random half of the trials the configuration of the 
dots in the second field was the same as that in 
the first field. On the remaining half of the trials 
a random 4 of the 8 dots in the second field were 
displaced to the S's right by 2.5 mm. from the 
positions they held in the first field. The posi- 
tioning of the dots within a field was random, 
subject to the constraint that no 2 dots should 
appear within 2.5 mm. of each other in either 
field. The coordinates of all dots in both fields 
were computed and saved in a table before each 
trial. 

A trial was initiated by the depression of a push 
button hand-held by the S. The S's task was to 
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decide whether the configuration of dots in the 2 
fields was or was not the same. He then tapped 
a telegraph key once or twice to indicate a same 
or different response, respectively. The taps were 
sensed and scored by the computer, which then 
proceeded to set up the conditions for the next 
trial. The setting-up procedure took less than 
1 sec. 

Subjects and procedure. The 3 authors, all ex- 
perienced Os at this type of task, acted as Ss. 
It might be noted that although the Os were not 
naive, they used a forced-choice design to overcome 
the operation of any theoretical and response biases 
they might have had. Each tried his best to make 
correct judgments. 

Three variables were manipulated in this ex- 
periment: duration of the display field, location 
of the second field with respect to the first on the 
display surface, and interstimulus interval (ISI). 
"These are considered in turn. 

The duration for which a field was displayed 
was either brief (3 msec.) or long (250 msec.). 
On brief displays each of the 8 dots was plotted 
once only. On long displays the 8 dots were plotted 
sequentially and the total display was continuously 
refreshed for 250 msec. Brief and long field dura- 
tions were never mixed within a trial: On any 
given trial both fields were always of the same 
duration. 

The dots in the first field were chosen randomly 
on each trial from a 341 X 341 sample space matrix 
and were plotted on S's left half of the display 
area within an imaginary square 27 mm. on a side 
(about 5° 30’ of visual angle). The first field was 
shown in the same location of the display area 
under all experimental conditions. The second 
field was shown in 1 of 3 locations, depending on 
the experimental condition. In the no-shift con- 
dition the second imaginary square was in the same 
location as the first (see Figure 1), in the small-shift 
condition it was displaced 5 mm. (see Figure 2), 
and in the large-shift condition it was displaced 
40 mm. to the right of the first (see Figure 3), 

The time interval between the 2 dot fields 
presented on each trial was one of the following: 
5, 10, 20, 40, 80, 160, 320, or 640 msec. 

The design of the experiment was thus a 2 (Field 
Durations) X 3 (Field Shifts) X 8 (ISI) factoria 
with 3 replications in each cell. 

Each S was tested in 30 experimental sessions of 
approximately 10-min. duration. In any one 
sion the levels of field duration and field shift were 
fixed, and 20 consecutive trials were done at each 
level of ISI. The sequence in which the levels of 
ISI were presented varied randomly for each ses- 
sion, Within a given session, each of the 8 subsets 
of 20 trials began with a warning signal (2 dots 
displayed vertically on the midline of the display 
area) followed by 4 practice trials not scored for 
correctness. Twenty experimental trials then fol- 
lowed at the same ISI as the practice trials. Within 
each block of 20 trials, same and different pairs 
occurred with equal probability. The Ss were 
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aware of this fact, which helped to minimize re- 
sponse bias.* 

Five sessions were run at each of the 6 combina- 
tions of field duration and shift. Each S worked 
through the 30 sessions in a sequence of his own 
choice, making 100 judgments in each of the 48 
combinations of the 3 factors, 


*An analysis of errors for Experiment I1 shows 
that errors were predominantly (almost 5 to 1) 
made by calling different cases same, rather than 
Same cases different. This may be interpreted to 
mean either that differences tended to be missed, 
-9r that Ss adopted the strategy of responding same 
More frequently, when in doubt. No detailed 
lysis of errors is available for Experiment 1. 
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Judgment of identity and difference with patterns of brief dura- 
(Filled circles show performance when all 
1 8 dots, Experiment I. Open circles show performance when 
ontain 12 dots, Experiment II, and are constructed to resemble 
the appearance of 2 different. 8-dot patterns that are seen as simultaneous. 
imultaneity, apparent movement, and succession without move- 
arance of pairs of patterns within the range of 


The 5 received no immediate feedback on the 
correctness of his responses. However, scores were 
available at the end of a session, and judgmental 
strategies and cues were liberally discussed among 
Ss between sessions. 


Experiment II 

Experiment 11 differed from Experiment I in 2 
respects. First, all displays were presented in the 
no-shift condition: that is, the second field of dots 
was always presented within the same imaginary 
square as the first field. Second, each of the 8 


subsets of 20 trials in a given session contained — 
3 types of displays presented randomly as follows? — 
(a) on 25% of the trials the display consisted of | 
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Judgment of identity and difference with brief patterns (3 msec.) 


and small shift (5 mm.) of field. (All patterns contain 8 dots, Experiment I. 
Ranges for simultaneity, apparent movement, and succession without move- 
ment are indicated, and the appearance of pairs of patterns within the range 


of simultaneity is illustrated.) 


8 random dots, which, after the appropriate ISI, 
reappeared in the same positions on the screen; 
(b) on 50% of the trials a random 4 of the 8 dots 
were displaced 2.5 mm. to the right of their original 
positions; (c) on the remaining 25% of the trials 
the display consisted of 8 random dots plus another 
4 dots, each of which was plotted 2.5 mm. to the 
right of one of the other 8 dots. The display 
therefore consisted of 4 pairs of dots and 4 single 
dots (see Figure 1). All 12 dots reappeared in 
the same locations after the appropriate ISI. 

As in Experiment I, the .5's task was to decide 
whether the configuration of dots in the 2 fields 
was or was not the same. 

As can be seen, Conditions a and b replicated 
the same conditions in Experiment I. Condition c 
was an attempt to simulate the typical appearance 
assumed by the total display at short ISIs in 
Condition 5. 


In all other respects, Experiment I was the 
same as Experiment I. It should be noted that, 
since Experiment Il was run only in the no-shift 
condition, each S was tested in 10 sessions: 5 with 
displays of brief duration and 5 with displays of 
long duration. 


RESULTS 
Brief Patterns 


Figure 1 shows the percentage of cor- 
rect judgments averaged over 3 Os, for 
patterns of brief duration and no shift of 
field, which is the shift condition used in 
Pollack's (1972) study. The curve for 
Experiment I, in which only 8-dot pat- 
terns were used, shows high performance 
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at most ISI values and performance well 
above chance at an ISI of 640 msec. 
A sharp maximum in the range 80-160 
msec., like that reported by Pollack, can 
be seen in the curve for Experiment II 
(Figure 1) in which some patterns con- 
tained 12 dots in an arrangement designed 
to cause confusion with 2 different 8-dot 
patterns seen as simultaneous. 

Figure 2 shows results in the small-shift 
condition for brief patterns. Performance 
is at à minimum and almost chance level 
when ISI is 80 msec., and reaches a maxi- 
mum at the 2 longest ISI values, 320 and 
640 msec. Performance is at an inter- 
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mediate level for ISI values in the range 
5-40 msec. 

Figure 3 shows results in the large-shift 
condition for brief patterns. Performance 
is at a low level for all ISIs except the 
longest, 640 msec., and, as in small shift, 
falls to a minimum, almost chance, level 
at an ISI of 80 msec. 


Long Patterns 


Results for all 3 shift conditions with 
stimuli of long duration are shown in 
Figure 4. Whereas the amount of shift 
makes a very marked difference in the 
shape of the performance curves for brief 
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stimuli, the effect is much less for long 
stimuli. With increasing shift there is a 
progressive decline in level of performance, 
and some change in the shape of perfor- 
mance curves. For both small and large 
shift, performance is best when ISI is 320 
msec., but still high at 640 msec. 

A striking feature of the results, illus- 
trated in Figure 5, is that any shift causes 
performance for brief stimuli separated by 
an ISI of 80 msec. to drop from near 
perfect to a chance level. When there is 
no shift, performance is virtually 100%, 
but when there is shift, performance is 
just above 50%. On the other hand, the 
effect of shift is much less and much the 
same for brief stimuli separated by the 
longest ISI, 640 msec., and for long stimuli 
separated by an ISI of either 80 or 640 
msec, 


Discussion 


Brief Patterns 


Consider brief patterns first. In the no- 
shift case, at ISI values below 40 msec., 
2 identical patterns look like 1 pattern of 


8 dots, and 2 different patterns like 1 pat- 
tern of 12 dots. An O can tell the difference 
on the basis of numerosity or, even more 
easily, by the doubling illustrated in Figure 1, 
If, as in Experiment II, the validity of both 
cues is lowered, the task is harder in this ISI 
range. There is still a subtle apparent bright- 
ness cue, since some dots are plotted only 
once when patterns are different, but not all 
Os can use it effectively. 

When ISI is within the range 40-160 msec., 
a sequence of different brief patterns with no 
shift produces apparent movement (Kahne- 
man, 1967) and can readily be discriminated 
from identical patterns on this ba At longer 
ISI values, apparent movement gives way to 
an apparent succession of patterns both for 
identical and different patterns, the second 
nowk!being examined in terms of what is 
remembered of the fir 

In the shift conditions, brief patterns, 
whether different or not, appear to be simul- 
taneous at ISI. values of 40 msec. and below, 
The small shift allows identical and different 
patterns to be distinguished on the basis of 
the equality of distances between pairs of 
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points (see Figure 2). As the shift is in- slight rise in performance betwee nd 
creased to the large value, differential dif- 160-msec. ISI values may reflect ime 


ference becomes more difficult to judge, and 
performance drops to a low level. 

When ISI is between 40 and 160 msec., 
pronounced apparent movement is observed 
in both shift conditions for identical as well 
as different pairs of patterns. What was a 
discriminative cue in the no-shift condition 
not only fails to provide a basis for discrimina- 
tion in both shift conditions, but removes the 
differential cues available in the lower ISI 
range. 

At long ISI values patterns appear to come 
in sequence whether there is a shift or not, 
and performance in all conditions is based on 
memory, not on discriminative visual effects. 

The main features of Pollack's (1972) study 
are captured in the performance curve for 
Experiment II (Figure 1) and the top curve 
of Figure 4. The first shows a sharp maximum 
in the apparent movement range; the second 
shows a progressive decline at long ISI values. 
A comparison of the Pollack-type curve for 
brief stimuli (Experiment II, Figure 1) with 
the other curves for brief stimuli (Figures 1, 2, 
and 3) shows that the Pollack curve is not at 
all typical. Its shape is explained entirely 
by the visual effects that are valid as dis- 
criminative cues in each of the 3 ranges of 
ISI values. The different shapes of the other 


3 curves can also be explained in exactly the 
same terms,4 


Long Palterns 


Long patterns are seen as successive at all 
ISI values, except in the no-shift condition, 
Where, at low ISI values, the points that are 
not displaced are seen as continuously present, 
and displaced points as intruding within an 
established context. Except at the low ISI 
range in the no-shift condition, there is no 
obvious visual cue and all comparisons of long 
patterns can be said to be based on memory, 
Indeed, the comparison may be based on an 
encoded version of the first pattern, and the 


‘Tt should be noted that at short ISI values for 
brief stimuli, Pollack's (1972) Ss would have had 
available the doubling cue illustrated in Figure 1. 
We assume that their performance remained low 
in this range because the experimental design made 
it difficult for them to discover the cue. Pollack's 
design is not reported in sufficient detail for us to 
be certain on this point. 


required for encoding. 


Conclusion 


The results of this study do not 
contradict Pollack's (1972) observ 
show that they hold true only und: 
circumstances. Performance in com 
patterns, or detecting pattern change, 


mal at ISIs of about 64 msec. and mal 
above 250 msec., as Pollack observ: nly 
if patterns are presented in the s: ield 
of view, and if the experimental dt de- 
stroys the validity of numerosity as a for 
brief patterns shown at short in als. 
Otherwise, as we have shown, ther: no 
sharp maximum in the apparent mov- vent 
range. When shift of field is introduce jer- 
formance may be minimal at an inter of 
80 msec. and maximal at intervals a ng 
as 640 msec. 

We believe that Pollack's (1972) results, 
and ours, fit comfortably with the vic: iat 
judgments of pattern identity and chang ly 
on visual cues, when they are available and 


valid. When they are not, the task of com- 
parison is either impossible or becomes a task 
of memory, comparing features of a paitern 
seen with what is remembered of a paitern 
that has been seen but is no longer visible 
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KAESS? 


The development of shape constancy as a function of the distance, size, and 
angle of orientation of the standard rectangle was studied, Shape constancy 
performance improved between 4 and 19 yr., and remained constant to 75 yr. 
However, a large Age X Distance interaction effect revealed that the age trend 
occurred with the standard rectangle at 457.20 cm., but not at 91.44 cm., 


regardless of the size of the object. 
improves from 


The perception of the physical shape of 
an object regardless of its orientation from 
the observer's fronto-parallel plane is com- 
monly referred to as shape constancy per- 
formance. Klimpfinger (1933) found that 
performance on the shape constancy task 
improves with increasing age from 3 yr. 
until early adolescence, but then decreases 
with adults aged 31-70 yr. This appar- 
ent inverted-U development trend was 
replicated by Brault (1962) with data com- 
bined from different experiments. Sub- 
sequently, however, Meneghini and Leibo- 
witz (1967) have reported a contradictory, 
linearly decreasing developmental trend 
with Ss 4.5-19 yr. of age. 

Because Klimpfinger (1933) presented 
the standard object at a 5-m. observational 
distance, Meneghini and Leibowitz (1967) 
supposed that the increasing portion of her 
obtained age trend might be an artifact 
of the development of perception at a 
"far" distance. The "distance hypothesis" 
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nursery school subjects available; Patrick Mooney, 
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Kaess at every stage of this project is gratefully 
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* Requests for reprints should be sent to Dale W, 
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17 


Apparently shape constancy performance 
"near" to "far" space as development progresses, 


suggests that the child first becomes fa- 
miliar with objects within his reach and 
only gradually is able to perceive objects 
ata distance (Piaget, 1967; Vernon, 1962), 
Wohlwill (1963) has reviewed studies 
which show that size constancy, or the 
perception of the size of an object regard- 
less of its distance, is reduced with in- 
creased observational distance, and the 
effect of distance is apparently greater 
with younger Ss than with adults. How- 
ever, there are few data available concern- 
ing the parameter of observational distance 
in shape constancy development. 

The only study of the development of 
shape constancy in which observational 
distance was systematically varied was 
reported by Meneghini and Leibowitz 
(1967). Children 4.5-19 yr. old were shown 
a rotated 3-cm.-diam. disk at either 91.44- 
or 457.20-cm. distances, and required to 
compare it with the comparison series 
objects presented at a 91.44-cm. distance. 
Shape constancy performance apparently 
decreased with age with the test object 
at a 91.44-cm. distance, but was poor at 
all ages with the test object at the 457.20- 
cm. distance. The data were interpreted 
to demonstrate the validity of the distance 
hypothesis. However, it is possible that 
the poor performance by all age groups at 
the 457.20-cm. distance represents chance 
performance resulting from the use of too 
small an object at that distance. The 3-cm. 
disk shown at 457.20 cm. subtended a 
visual angle of less than 1?. Thus, the 
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task was probably difficult for the Ss to 
perform (Gibson, 1969, p. 365). 

Furthermore, since the visual angle sub- 
tended by the standard object varied 
directly with the observational distance, 
projective size and distance were com- 
pletely confounded in the Meneghini and 
Leibowitz (1967) study. That is, the 
projective size of the standard object was 
always 5 times larger at the 91.44-cm. 
distance than it was at the 457.20-cm. 
distance. Consequently, their results could 
be interpreted in terms of either observa- 
tional distance and the distance hypothesis 
or in terms of the projective size of the test 
object as determined by the visual angle 
subtended by it. This latter interpretation 
in terms of the size of the test object in- 
volves the assumption that acuity for the 
cues necessary to shape constancy perfor- 
mance varies with the projective size of 
the object being judged. 

Consequently, the present study is de- 
signed to investigate the influence of ob- 
servational distance upon the development 
of shape constancy performance. Standard 
objects of 2 sizes have been presented, so 
that the visual angle subtended by the 
objects at the 2 distances was systematic- 
ally varied. Furthermore, elderly Ss were 
also used in an attempt to replicate the 
decreasing trend with older adults, as ob- 
tained by Klimpfinger (1933). 


METHOD 


Subjects. There were 96 Ss comprising 4 age 
groups averaging 4.3, 11.4, 19.5, and 75.6 yr. Each 
of the 24 Ss of each age group was randomly assigned 
to one of 2 treatment combinations denoted by the 
distance from which the standard object was viewed, 
so that there were 6 males and 6 females in each 
group. All Ss reported having normal binocular 
vision, at least with corrective lenses, 

A pparatu. . The standard objects were presented 
15 cm, inside a 122 X 91 X 30 cm. flat-black viewing 
box with a 30-cm.-sq. aperture in the front. The 
background, 15 cm. behind the object, was also flat 
black in color. This apparatus was placed on the 
front edge of a table at S's eye level so that the 
standard object was either 457.20 or 91.44 cm. from 
S. Comparison objects were presented individually 
on the background of a similar 122 X 91 X 15 cm. 
box placed at eye level 91.44 cm. to S" right or left. 
The Ss viewed the objects binocularly. The tops of 
the viewing boxes were open, so that illumination 
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was provided by overhead room lights re 
no visible shadows present on either stir 

Stimuli. Two rectangular standard o! ere 
used. They were cut from white text: ster 
board and mounted upon 2-mm. wires their 
longer, vertical dimensions. The stand jects 
were 7.62 X 8.89 cm. and 38.10 X 44.45 thus 
preserving the same horizontal-to-verti ratio. 
The sizes were chosen so that the larger o! vould 
present the same projective size at 457.2 that 
the smaller object presented at the 91.4 dis- 
tance. Each of these standard objects w tated 


about its vertical axis to either 25? or 75 n the 
fronto-parallel plane for presentation. 


The comparison objects were 2 separat of 17 
rectangles each with the same height as orre- 
sponding standard object (8.89 or 44.45 « and 
varying in steps of .63 em. from 5.04 cm. ower 
to 5.04 cm. wider than the corresponding dard 
object. These comparison objects wer cut 
from white textured poster board. 

Procedure. On an individual trial, onc re 2 
standard objects was presented at cither ra 
75? orientation with either a 91.44- or a cm. 
observational distance. The rectangles of 1 ome 
parison series were shown individually in lom 
order 91.44 cm. to S's right or left. The | or 
small comparison series forms were always presented 
with the rectangle of the same height. A diticrent 
random order used with each trial for cach Sy 
and order of presentation of trials was counter- 
balanced across Ss in each Age X Distance group. 

Instructions, All Ss were given instructions 
similar to those used by Kaess (1970). lt was ex- 
plained that 5 was participating in a test of how 
well he could tell the "real" shape of an object even 


when it is turned to the side. The S was shown (by 
measuring the physical shape of a rotated rectangle) 
that the real shape remains the same regardless of the 
orientation. However, no mention was made of the 
apparently narrower projective shape, or that 5 
should compensate for the effects of angle. The 
youngest Ss were also told that they were “playing 
a game," and that they should attempt to be correct 
every time. No feedback was given during testing; 
however. 


RESULTS 


The basic data were the median widths 
of those comparison series rectangles judged 
to be equal to each standard rectangle in 
each series of trials by each S. ‘These 
values were converted to constant error 
scores by subtracting the width of the 
standard rectangle from that of the median 
comparison rectangle judged equal to it, 
so that the data for the 7.62- and 38.10-cm. 
rectangles would be comparable (Table 1). 
The constant errors were subjected to a 4 
(age) X 2 (sex) X 2 (distance) X 2 (size) 
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TABLE 1 
Cons CRRORS FROM POINTS OF OBJECTIVE 
E (PorNT OF SUBJECTIVE EQUALITY 


MINUS STANDARD) 


| 7.62-cm, form 38.10-cm. form 


2592 |e rss 359 | 75° 
anism anne a (| ule 

91.44-em. distance 

j 
4 | .05 25e .06 | B 
1 8 -04 | —.12 .04 | —.03 
10 E15 —.27 415| -=:02 
33 fo 05]| —.07 | —.13 | —.17 

| mE [S SO RE EN 

157.20-cm. distance 
44.| .3 | st | —3 p —27 | =.78 
ss 9 —.29 —.54 —.05 —.30 
19:2 .02 —.28 28 =.12 
75,9 | OL -03 | —.09 | —.47 


X 2 (angle) analysis of variance, with age, 
sex, and distance as between-Ss effects. 


The main effect of age, F (3, 80) = 3.48, 
p < .025, was significantly large. Com- 
parisons of individual means (Neuman- 
Keuls technique) indicate that the 4-year- 


olds' mean constant error was significantly 
than that of the 11-year-olds, which 
was in turu larger than the means of both 
the 19- and 78-year-old. Apparently, 
shape constancy performance improves 
between 4 and 19 yr. of age, and remains 
relatively constant with older Ss. 

The effects of distance, F (1, 80) = 13.46, 
p < .001, and of angle, F (1, 80) = 4.33, 
p < .05, were also significantly large. The 
S’s judgments of the widths of the rotated 
forms were more accurate when the forms 
were viewed from a near, 91.44-cm., dis- 
tance or when the forms were viewed at 
25°, rather than 75°, rotation from their 
fronto-parallel plane. The Sex X Distance 
interaction, F (1, 80) = 4.37, p < .05, was 
also significantly large, however. Com- 


larger 


"n of individual means reveals that 
while the effect of distance obtained with 
Ss of both sex, the effect was more pro- 
nounced with the male Ss. 

As can be seen in Figure 1, the Age X 


Distance interaction, F (3, 80) 4.21, 


P < .01, was large. Comparisons of indi- 
vidual means reveal that there were no 
significant differences between age groups 
with the standard object at the 91.44-cm. 
distance. But with the standard object 
at the 457.20-cm. distance, the 4-year-olds' 
mean constant error was significantly larger 
than that of the 11-year-olds, which was in 
turn larger than all others. The mean 
constant errors of the 19- or 75-year-olds 
with the 2 observational distances did not 
differ significantly. Thus, the influence of 
the distance of the standard object is re- 
duced as age increases, and does not pro- 
duce a significant difference in the shape 
constancy performance of 19- and 75- 
year-olds. 

Finally, the Age X Size interaction, 
F (3, 80) = 2.83, p < .05, was also signifi- 
cantly large. This effect was apparently 
produced by a curvilinear developmental 
trend with the 38.10-cm. test object, but a 
linear trend with the 7.62-cm. object. In 
particular, the 19-year-olds overestimated 
the width of the 38.10-cm.-wide form, but 
all other group means for the Age X Size 
factor represented underestimates. Thus, 
this interaction apparently resulted from 
an anomalously high mean by the 19-year- 
olds with the large standard object. 

Neither the main effect of size nor any 
additional interactions involving size or 
Age X Distance were significantly large in 


Constant Error (ind 


4 n" 19 75 
Age tyn 


FIGURE 1. Mean constant errors (point of sub- 
jective equality minus standard) of judgments of 
the physical widths by Ss who viewed the forms 
from 91,44-cm. (solid line) and 457.20-cm. (dashed 
line) distances. (Mean constant errors are averaged 
over sizes and angles of rotation.) 
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the analysis of these data. Apparently, 
the effect of presenting 7.62- or 38.10-cm.- 
wide standard objects at the 2 distances 
so that the same projective size is main- 
tained does not decrease the effect of dis- 
tance upon the development of shape con- 
stancy performance. 


Discussion 


The results of the present study show that 
shape constancy performance improves with 
age, at least between 4 and 19 yr. of age and 
with objects at a 457.20-cm. distance. The 
present results thus contradict those of Mene- 
ghini and Leibowitz (1967). As suggested by 
Gibson (1969, p. 365), it seems probable that 
the decreasing age trend found by Meneghini 
and Leibowitz resulted from the type of in- 
structions used in their study. Their instruc- 
tions can probably be termed “apparent” 
(Landauer, 1969), Such instructions have 
been shown to result in lower degrees of con- 
stancy performance and decreasing develop- 
mental trends (Kaess, 1970). When care is 
taken to insure Ss’ understanding of the task, 
shape constancy performance improyes with 
age (Kaess, 1970, 1971). Klimpfinger's (1933) 
data also reveal an increasing developmental 
trend with children. However, the older 
groups of her study showed an apparent de- 
crease of shape constancy performance, 
while the older groups of the present study 
do not, This discrepancy may also have re- 
sulted from differences in instructions, since 
Klimpfinger required projective shape judg- 
ments while the present study involved 
physical shape judgments. Apparently, when 
the perceptual task involves judgments of the 
physical shape of the object, shape constancy 
performance improves with age until adult- 
hood and does not decrease with older Ss. 

Superimposed upon this developmental trend 
are the effects of other factors which affect the 
level of shape constancy performance. Shape 
constancy performance is more accurate when 
the object is nearly facing S, and when the 
object is at a 91.44-cm. distance. The reduc- 
tion of shape constancy performance as a result 
of increased distance does not seem to be a 
product of the reduced visual angle subtended 
by the distant object, as expected, since there 
was no significant Age X Distance X Size 
interaction. Apparently, the "distance effect" 
is obtained with both large and small rec- 
tangles and is not entirely produced by differ- 
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ences in the visual angles of the si and 
comparison forms. 
It is possible that the distance ef sults 


from the particular comparison 1 jue of 
the method of limits paradigm used The 
present study involved variation dis- 
tance of only the standard object com- 
parison series was always presented e near 
distance. Thus, the Ss were require: natch 
the standard stimulus to the compa series 
forms when the distances of the f were 
equal (near distance) and when t! ances 
were disparate (far distance). A í r test 
of the distance hypothesis in whi > dis- 
tance of the comparison series is var: imul- 
taneously with that of the standard object 
would be of interest. In this manner :t might 
be possible to determine whether th: erva- 
tional distance effect results from ju; <inents 
of the standard form per se, or from ch- uges in 
the ability to judge 2 forms, each oí ich is 


at a different distance. 


Regardless of the locus of the effect, however, 
the significant Age X Distance interac! ion ob- 
tained in the present study suggests that the 
effect of observational distance depends upon 
the age of the Ss. The results of the present 


study thus support the hypothesis of "non 
(1962) and Piaget (1967) that perceptual 
development proceeds from near to far space 
as age increases. At least with these experi- 
mental techniques, the effect of increased dis- 
tance of the standard form is apparently 
greater with younger than with older Ss. 
Consequently, developmental trends of shape 
constancy performance are similar to those for 
size constancy performance. Each task in- 
volves judgments of an "'object-at-a-distance'" 
(Gibson, 1969, p. 366), and developmental 
trends of performance on each task depend 
upon the observational distance involved. 
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ACQUISITION AND RETENTION OF MNEMONIC 
INFORMATION IN LONG-TERM MEMORY ! 


PAUL M. WORTM/ 


N 


Northwestern University 


Paivio's (1969) 2-process theory of associative learning postulating differential 
effects on recall for word abstractness-concreteness, and imaginal and verbal 
mediators, was evaluated within the context of mnemonic encoding 


mnemonic learning was self-paced, in: 
and delayed test of retention after 1 
of a trade-off in differential acquisi 


formance in situations where presentation rate was not strictly con 
A large amount of forgetting was found, independent of word or mediator type, 
thus supporting the notion that acquisition and retention are separate mecha- 


nisms. Finally, a highly significant 
the relaxation in presentation rate, 
theory was discussed. 


The recent revival of interest in mne- 
monic techniques has led to a new theory 
of paired-associate (PA) learning (Bower, 
1972; Paivio, 1969). According to this 
formulation, PA learning consists of 2 
processes or mediational sets involving the 
formation of either verbal or imaginal asso- 
ciations. These processes interact differen- 
tially with 2 types of to-be-learned informa- 
tion—abstract and concrete words. Thus, 
image processes are supposed to be more 
predominant in learning concrete words 
than abstract words, since concrete nouns 
have readily available imaginal components 
of meaning, whereas abstract nouns do not, 
Verbal mediators, on the other hand, are 
assumed to be unaffected by changes in 
word type. While this predicted inter- 
action has proved somewhat difficult to 
obtain, it has received empirical confirma- 
tion (Paivio & Foth, 1970). 

The rather 


"elusive" nature of this 
phenomenon, however, raises a number of 
questions. On methodological grounds, 


how dependent is the interaction of word or 
item type and mediational set on the rather 
long study intervals used by E 


aivio and 
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volving multiple lists with an im 
wk. The results supported a pr 
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effect for word type was found 
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and its implication for modif 
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Foth (1970)? And, in particular, 
happens when these intervals are 
even longer? On the theoretical side, what 
is the relative importance of these inter 
acting variables? And do they affect th 
retention of information in long-term 
memory (LTM)? The results reported by 
Paivio and Foth indicate a consistent 
superiority in learning of concrete over ab 
stract PAs that is independent of encodini 
process, while the differential effects of thesé 
associative strategies seem highly depend 
ent on temporal and instructional variables. 

The present study seeks to assess thi 
2-process theory within the context 
mnemonic learning. Mnemonic systems 
provide a convenient bridge between thi 
theory and the study of LTM since they 
form the basis for the former and are also 
purported to enhance retention in the latter: 
(Yates, 1966). Moreover, this interactive: 
effect has not yet been observed with mne- 
monic strategies (cf. Paivio, 1968). In 
addition, this study examines mnemonic 
learning under circumstances typically em- 
ployed with such techniques and ones 
which have not yet been subjected to em- 
pirical scrutiny. For example, nonlabo- 
ratory Ss most often determine the rate at 
Which items are learned, usually employ the 
mnemonic more than once, and expect to 
recall the information at some later time. 
Thus, the following experiment involved 
S-paced mnemonic learning of multiple lists 
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ywith both immediate and delayed tests of 
retention. 

Under these conditions the 2 mediational 
processes should result in either differential 
time to master the to-be-learned informa- 
tion or differential retention of these items 
on an immediate test of retention. But, if 
Paivio (1969) is correct in assuming that 
image mediators differ in their availability, 
then both of these effects should not co- 
occur since differences in learning time 
should compensate for differences in im- 
mediate retention. Furthermore, if Olton 
(1969) is correct in that acquisition and re- 
tention (or forgetting) of information in 
LTM are separate processes, then these 
mediational sets should have no effect on 
delayed retention. 

While mnemonic systems encompass 
class of techniques, they are essentially 
similar in principle; one of these, in par- 
ticular, has been employed in a number of 
experimental studies. This is the mne- 
monic rhyme or "peg-word" system, ONE 
BUN, TWO-SHOK .TEN-HEN (Bugel 
1968; Keppel & Zavortink, 1969; Paivio, 
1968). This technique allows items to be 
learned by forming ordered pairs. For 
example, the first word would be memorized 
by associating it with the peg word BUN 
via a mediating image. The above studies 
have demonstrated that this mnemonic 
increases the rate of item acquisition and 
that imagery is essential for this to occur. 
Paivio developed a comparable rhyme com- 
posed of abstract words (ie. ONE-FUN, 
etc.) but failed in his attempt to show its 
inferiority to the ONE-BUN mnemonic which 
composed of high-imagery, concrete 
words. The following experiment seeks to 
evaluate the effects of these mnemonics on 
the acquisition and retention of information 


in L'TM. 


is 


METHOD 


Subjects and design. The Ss wi 36 Duke Uni- 
versity undergraduates whose participation ful- 
filled a course requirement. ch S was randomly 
igned to one of 6 mnemonic acquisition condi- 
tions, so that there were 6 Ss per group ina 2 X 3 
X 3 X 2 factorial design. 

These various conditions represented 2 crossed, 
between-Ss variables, which included the abstract- 
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ness-concreteness of the mnemonic rhyme pairs and 
mediational sets of bizarre images, common images, 
and verbal connections. While the 2-process theory 
of PA learning refers to the images usually or com- 
monly available to and associated. with concrete 
words, a number of studies have investigated the 
effects of bizarre images (Delin, 1968; Wood, 1967), 
Moreover, it is possible, especially under conditions 
where presentation time is not limited to a few 
seconds, that the formation of bizarre, as opposed 
to common, images reduces interference effects and 
thereby enhances reeall. This would occur if such 
es were more elaborate and contained idio- 
tic meaning or if, as is likely, they were more 
entiated and distinct than a common or ordi- 
nary image, For this reason, both of these visual 
imagery techniques were included along with the 
il mediation condition, 

The last 2 factors in the design were composed of 
within-Ss variables. The first repeated measure in- 
cluded 3 lists of to-be-learned responses, and the 
temporal retention variable consisting of an 
and delayed recall test. 

Lists. Six 10-item lists of words were constructed 
using Paivio's tables (Paivio, Yuille, & Madigan, 
1968). Three lists contained a total of 30 concrete 
words having a mean imagery (7) value (on a scale 
of 1-7) of 6.54 and a mean concreteness (C) value of 
6.90, The 3 remaining lists were composed of 30 
abstract words having a mean J value of 3.23 and à 
mean C value of 2.04, The J and C values for the 
2 10-item peg-word mnemonics were approximately 
the same as those for the concrete and abstract word 
lists. All lists were counterbalanced for sequence of 
presentation within each 6-5 group and for word 
frequency within each list, 

Procedure. The procedure was patterned after 
Paivio and Foth's (1970) Experiment 1. All Ss were 
run individually by the same Æ Each S read a 
sheet containing a general outline of the experiment 
as well as providing the necessary cover story to 
avoid suspicion that they were returning a week 
later for a delayed retention test, The Ss then 
learned a number rhyme—the peg-word mnemonic. 
Half them (18 Ss) were given the abstract 
rhyme—ONE-FUN, TWO-TRUE, THREE-FREE, FOUR 
BORE, FIVE LIVE SIX-TRICKS, N-LEAVEN, 
EIGHT-FATE, NIN D, TEN-ME the others 
learned the concrete rhyme--oNE-BUN, TWO-SHOK, 
THREE-TRER, FOUR-DOOR, FIVE-HIVE, SIX-STICK 
SEVEN-HEAVEN, EIGHT-GATE, NINE- WINE, and TEN 
HEN, After hearing the mnemonic twice, all Ss were 
able to say it correctly aloud once and write it down 
twice. Each S was then given heet with appro- 
priate illustrative examples explaining the use of the 
rhyme in learning word associations (by forming 
appropriate mediators), and containing 2 practice 
pairs to insure that he understood the instructions. 
This sequence was employed to avoid confounding 
learning the rhyme with using it to learn other words. 

Each S then learned 30 words (3 10-word lists) 
employing one of 3 mediational methods to in- 
corporate the list word and corresponding rhyme 


second 


of 
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word into either (a) a bizarre (strange or unusual) 
image, (b) a common (ordinary or usual) image, 
or (c) a sentence or phrase (verbal connection). For 
example, if the first word were GARDEN, the 3 
mediators might be "flowers in the garden growing 
buns,” “eating a bun in the garden," or “I saw a 
BUNny in the garden," respectively. Regardless of 
the mediational set, each S went through the rhyme 
list 3 times in learning the 30 words (i.e., one study 
trial for each list), The Ss were cautioned to make 
each image or verbal connection distinct, thus 
hopefully preventing a linking together of 2 or 3 
list words with the same corresponding rhyme word 
into one largeimage. To insure that S followed these 
instructions, he was required to write down (on 
another index card) either a description of the image 
mediator or the actual verbal connection. Paivio 
and Foth (1970) have reported that this method 
forces S to follow the particular mediational 
instructions. 

All Ss using the abstract rhyme learned abstract 
nouns and those using the concrete rhyme learned 
concrete nouns. The rationale for this was to avoid 
confounding the item attribute of the PAs with the 
mediational set and also to follow Paivio and Foth 
(1970), but with a between-Ss mediation design. 
These authors noted that the interaction they found 
between these 2 variables may have resulted, in part, 
from their within-Ss design. 

Each S paced himself in learning the 30 words. 
He was given up to 1 min., if necessary, to form an 
appropriate image or phrase. This limit was seen 
as nonrestrictive since prior research (Bugelski, 
Kidd, & Segmen, 1968) has shown that 8 sec. is an 
optimal time to form an image, The E presented 
the words, one at a time, on index cards with one list 
following another. 

Upon finishing the 3 lists, S was given a total 
recall test in which the numbers 1-10 were presented 
in random order, and he attempted to remember 
the 3 list words corresponding to the rhyme word 
for that number. Before ending the session, each S 
was then given 10 items from the Remote Associates 
Test (Mednick, 1962) to distract his attention and 
thus reduce rehearsal. One week later, the Ss were 
given another total recall test during which they 
also wrote down the rhyme and all the mediators 


they had used to associate the peg words with the 
responses, 


E 


RESULTS AND DISCUSSION 


The results for the initial retention trial 
and their relation to the total time needed 
to form the associations to the rhyme words 
are depicted in Figure 1. Two Ss—one in 
the concrete word, common image condi- 
tion, and the other in the abstract word, 
bizarre image condition—failed to return 
for the delayed retention test, and their 


data were not included in the analysis, 
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The mean numbers of words ini 
called with both common anc 
imagery and verbal mediators, res 
were 23.8, 23.7, and 24.7 for concre 
and 15.2, 16.8, and 16.0 for abstra 
A univariate analysis of variance « 
the usual highly significant main « 
the item attribute of the mnemo 

pair (i.e, abstract or concrete), 

= 25.33, p < .001. As in Wood 
ments (1967), no differential effc 
call was found for mediational se: 
interacting with the mnemonic rh 


re-4 
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conditions (Fs < 1). However, pre- 
dicted, the Ss did take different unts 
of time to acquire the same nun er of 
items, F (2, 28) = 5.85, p < .01 rhe 
bizarre images, not unexpected! took 


significantly longer to generate thao «ther 
the common, F (1, 28) — 5.18, p 5, or 
the verbal mediators, F (1, 28) 1.287 
b < .01, since they were not as readily 
available. 

These findings appear to preclude 
ential overlearning resulting from self- 
pacing as a possible explanation. Where 
there were differences in pacing as in the 
mediation conditions, there was no differ- 
ence in performance. But, this is precisely 
the situation in which such differences due 
to overlearning (ie., having more study 
time) would be predicted. On the other 
hand, there was no difference in total 
learning time between abstract and con- 
crete conditions (F < 1.5), but there was 
a large difference in retention. Similarly, 
the ability of all Ss to learn the mnemonic 
rhyme perfectly on the first trial and to 
recall it correctly 1 wk. later seems to 
preclude differential overlearning of the 
thyme and subsequent transfer effects as 
an explanation for these findings. 

These results, moreover, clearly support 
the 2-process theory of verbal learning. 
There was a significant interaction in total 
learning or acquisition time between item 
attribute and mediational set for common 
images and verbal associations, F (1, 28) 
= 5.33, p < .05. The contrasts yielded the 
expected pattern with common images tak- 
ing significantly longer to generate than 
verbal mediators for abstract nouns, F (1; 


ffer- 
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28) = 5.85, p < .05. On the other hand, 
the difference was not significant for con- 
crete nouns (F < 1). Similarly, common 
images took less time to generate (i.e., they 
appear more available) for concrete than 
abstract words, F (1, 28) = 4.15, p « .05, 
while verbal connections were equally avail- 
able for both (F < 1.5). The interaction, 
contrasts, and simple effects confirm both 
2-process theory predictions of a trade-off 
between acquisition time and recall per- 
formance and the differential availability of 
imaginal mediators for concrete and ab- 
stract nouns. 

On the delayed retention test 1 wk. later, 
the mean numbers of words recalled with 
both common and bizarre imagery and ver- 
bal mediators, respectively, were 16.2, 15.2, 
and 15.7 for concrete nouns, and 7.3, 8.0, 
and 8.2 for abstract nouns. A highly signifi- 
cant main effect was found for the retention 
variable, F (1, 28) = 118.73, p < .0001, 
indicating a large amount of forgetting dur- 
ing the week separating the 2 recall trials. 
In fact, this loss in retention was approxi- 
mately constant for all conditions (about 
8.2 items), showing no effects for either 
mediational set or item attribute (Fs < 1). 
Given the rather large initial differences in 
retention between abstract and concrete 
noun pairs, this constant forgetting resulted 
in a differential rate of forgetting (50% and 
35%, respectively). However, this did not 
produce a significant interaction over the 
2 retention trials (F < 1). 

"This supports Olton's results (1969) in- 
dicating that acquisition and retention in 
LTM are separate processes with mne- 
monic encoding affecting the former, but 
not the latter, process. Moreover, this 
constant forgetting contradicts Bower's 
(1972) notion that the decay rate in LTM 
is faster for verbal mediators than for 
imaginal ones. Instead, it necessitates a 
clear distinction between availability and 
decay. The former affects acquisition and 
the latter retention; and as these data in- 
dicate, it is mediator availability in mne- 
monic learning that reflects the interaction 
of associative techniques and item type. 
Moreover, the similarity and consistency 
of the between-Ss findings for item attribute 
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FIGURE 1. 
time as a function of item concreteness 
and mediational technique. 


and mediational set over the 2 retention 
tests indicate the absence of differential 
effects on the second trial resulting from the 
first retention probe, 

There is, in addition, no differential or 
learning-to-learn effect for the 3 lists 
(F < 1.5). It is possible, however, that 
the amount of forgetting from LTM was 
overestimated due to a procedural error. 
Since the first recall test followed ini- 
mediately after the study trial for the third 
list, with no interpolated activity, not all 
of the items recalled may have been in 
LTM. The mean amounts of forgetting 
from the 3 lists were 2.2, 2.5, and 3.6 items, 
respectively. This produced a significant 
List X Retention Trial interaction, F (2, 
27) = 9.17, p < .01, with forgetting from 
List 3 significantly greater than that from 
the other 2 lists, both Fs (1, 28) > 6, 
p «.05. Thus retention was underesti- 
mated by approximately one item—a rela- 
tively small effect compared with the large 
number forgotten. Finally, while mediator 
availability has been found to be an im- 
portant factor in the acquisition or encoding 
of an item, it is also possible that mediator 
decodability affects performance. An anal- 
ysis of the missed items on the delayed 
recall trial revealed that it was more diffi- 
cult to decode the responses from abstract 
mediators (a mean of 3.0 items not de- 
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coded) than concrete mediators (a mean 
of 1.2 undecoded items), F (1, 28) = 7.53, 
D 05- 

CONCLUSIONS 


The results of this study indicate that when 
(mnemonic) learning is S paced instead of £ 
paced, there is a trade-off in the performance 
for acquisition or learning time. That is, S- 
paced learning results in differential rates of 
acquisition for various mediating strategies but 
not in differential performance, while the re- 
verse is true in E-paced studies (where the 
presentation rate is typically less than 15 sec.). 
The availability hypothesis that common 
images are more readily evoked by concrete 
than abstract nouns, with no such difference 
occurring with verbal mediators, was also 
supported. While these findings are in agree- 
ment with Paivio's (1969) 2-process theory of 
verbal learning, the relatively large effects on 
recall due to abstractness-concreteness, des- 
pite the S-paced differences in acquisition rate, 
indicate that a modification of theory is neces- 
sary. The factor of item attribute absolutely 
affects performance independent of presenta- 
tion rate, and as such is prepotent; while 
mediational set primarily affects the informa- 
tion-processing rate (i.e, item acquisition in 
LTM) and only incidentally (ie, at fast 
presentation rates) the performance of the 
learning system. 
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The 5s judged the likelihood that a sample had come from 1 of 2 binary urns 


of specified proportions of red and white beads. 


The compositions of the 


sample and of the 2 urns were varied in a 3-way design. A ratio model from 


integration theo: 
of 2 competing response tendencies 


fit the data quite well. The response is treated as a resultant 
that reflect the felt likelihoods that the 


sample comes from either urn. A novel application of analysis of variance for 


nonlinear models was employed in the test of fit. Relations to Bay 
, as well as previous work on serial integration 
allsten, were discussed. 


choice theory, representativene 
by J. C. Shanteau and T, S. W 


This report considers the 2-urn task that 
has been used in much of the experimental 
work on the Bayesian approach to decision 
making. A single sample was drawn from 
1 of 2 specified binary urns, of 80/20 and 
40/60 composition, for instance, and S 
judged the probability that the sample had 
come from the 80/20 urn. However, in- 
stead of applying the Bayesian statistical 
model (Edwards, 1968; Slovic & Lichten- 
stein, 1971), the present report follows the 
lead of Shanteau (1970, 1972) in seeking a 
descriptive, psychological model for subjec- 
tive inference. 

In preliminary work, conversation with 
5s suggested that their judgments were 
based on a comparison between the 2 
probabilities that the sample came from 
either urn. On that basis, the overt 
response is considered as a compromise 
between 2 covert response tendencies. 
Upon seeing the sample, S evaluates the 
subjective probabilities that it comes from 
either urn. These 2 response tendencies 
then determine the actual response. The 
specific question at issue is the integration 


! This research was supported by National Science 
Foundation Grant GB-21028 and facilitated by 
National Institute of Mental Health grants to the 
Center for Human Information Processing, Uni- 
versity of California, San Diego. We wish to thank 
E. F. Alf, M. H. Birnbaum, G. C. Oden, J. C. 
Shanteau, and T. S. Wallsten for their helpful 
comments on an earlier draft. 

* Requests for reprints should be sent to Manuel 
Leon, Department of Psychology, University of 
California, La Jolla, California 92037. 


27 


esian theory, 


rule that governs how the 2 
tendencies combined. 

Two particular integration rules sug- 
gested themselves. One was a subtracting 
model from conflict. theory (Anderson, 
1962b). The 2 competing response tenden- 
cles would be conceptualized as approach 
and avoidance tendencies; the effective 
response would equal the difference in their 
strengths. 

The other integration rule derives from 
averaging theory (Anderson, 1974). Each 
covert response tendency would be assumed 
to have a scale value and a weight, eg. 51 
and for the more likely urn, sy and ws 
for the less likely urn. ‘The response is 
then the weighted average: 


response 


R = (wisi + wess)/ (wi + ws). [1] 


In the present task, the scale values may 
be specified as sı = 1, są = 0, since these 
represent the end points of the decision 


scale. Equation 1 then reduces to 
R = wy/ (wi + w). [2] 
Equation 2 is a ratio rule: The overt 


response is the ratio of the strength of 1 
response divided by the sum of the strengths 
of the 2 responses. This ratio rule can be 
generalized directly to the case of several 
competing responses, corresponding to 
several possible parent urns. In this 
situation, therefore, integration theory 
leads to a ratio rule similar to that used by 
Luce (1959) and others. Unlike Luce’s 
choice theory, however, integration theory 


allows deterministic numerical response 
measures. 

In testing such algebraic models, func- 
tional measurement methodology typically 
employs factorial designs. An attempt was 
made in the present experiment to avoid 
any assumptions about relations among 
samples of different composition. Accord- 
ingly, single samples were used, and the 
factors of the design were the compositions 
of the more likely urn and of the less likely 
urn, a device also used by Wallsten (1972). 
The subtracting model then predicts a 
zero between-urns interaction in the analy- 
sis of variance (Anderson, 1962b). The 
ratio model is nonlinear and not testable 
so simply. However, an important new 
test procedure was developed that may be 
generally useful in the analysis of nonlinear 
models. 


METHOD 


On each trial, S saw a single sample of red and 
white beads and 2 binary parent urns. His task 
was to choose the more probable urn and then make 
a graphic rating "to indicate on the scale how likely 
it is that you are correct,” 


Design 


The main experimental design was a 3 X 4 X 8 
factorial. "There were 3 levels of the less probable 
urn in which the red/white ratios were 10/90, 20/80, 
and 40/60, "There were 4 levels of the more probable 
urn in which the red/white ratios were 60/40, 70/30, 
80/20, and 90/10, Finally, there were 8 samples: 
4 homogeneous, with 1, 2, 3, or 4 red beads, and 4 
mixed, with 3/1, 4/1, 3/2, and 6/2 red/white 
ratios. "This design was balanced over red and white 
to yield 192 judgments. In addition, 16 extreme 
anchor sets were included. 

An auxiliary design was also. employed in which 
the parent urns were 70/30 and 30/70 for all 
samples, There were 100 samples, which included 
All possible red/white ratios in which the number. 
of red and white beads each varied from 1 to 10. 
Four additional extreme anchor sets were also used 
in this condition. : 


Procedure 


Each urn was represented by a transparent plastic 
bag with 100 beads in the Seed beu 
together with a numerical label, such as 60/40, to 
indicate its contents. On each trial, an index card 
was presented on which the 2 relevant urns were 
listed (e.g., 60/40 and 20/80), and on which the 
sample was represented by rows of large red and 
white dots, one color in each row. 
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Responses were made by placing a pin i cm. 


scale, read from the rear by E. The rigt left 
halves of the scale were used to indicate 100d 
judgments of the mostly red and the mc hite 
urns, respectively, with the center of cale 
corresponding to equally likely. 

The judgments were collected in the g 4 
consecutive stages: (a) preliminary tions 
followed by a practice phase which incl e 16 


anchor sets plus 64 sets from the main (b) 
the 16 anchors plus the 192 sets fron 


nam 
design, (c) the 104 judgments of the auxil ign, 
and (d) direct judgments of the likeliho« each 
of the 8 samples coming from each 1 of irns 
used in the main design. These last da not 


seem very useful and are not reported her ept 


for practice, the deck of index cards w: filed 
separately for each 5 for each stage. Stimi ards 
were presented at approximately one eve: ! sec. 

The Ss were 24 introductory psychology lents 
who served as part of a course requireme: Each 
served in a single session of approximately min, 
with a 5-min, rest after the main design, 

RESULTS 

Empirical Results 

Main design. Each curve in Figure 1 
shows the mean response to 1 of ihe 8 
samples, plotted as a function of the more 


probable urn. The 2 main trends a 
expected : the likelihood judgment inc: 
(a) as the composition of the more probable 
urn becomes more extreme, and (b) as the 
difference between red and white beads in 
the sample becomes more extreme. The 
main effects of these 2 variables were 
highly significant, Fs (3, 69) — 72.55 and 
36.92, as was also the effect of the less 
probable urn, F (2, 46) — 58.11. 

Neither the difference between red and 
white, nor the ratio of red to white, is the 
sole determinant of judged likelihood. 
First, to each homogeneous sample in the 
left panel there is a corresponding mixed 
sample in the right panel that has the 
same difference between red and white. 
Since the 2 sets of curves have different 
overall shape and elevation, the response 
is determined by more than the difference 
between red and white. Second, the curve 
for the 6/2 sample is substantially higher 
than for the 3/1 sample. Thus, the re- 
sponse is determined by more than the 


sample ratio, Supporting evidence is given. 
in the next subsection. 
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Frcuns 1, Mean likelihood judgment as a func- 
tion of the composition of the more likely urn, and 
X sample composition as listed by each curve, 
(Data averaged over less likely urn.) 


Special theoretical significance attaches 
io the case in which the 2 urns have 
symmetrical red/white proportions, From 
the Bayesian model, the statistically 
optimal response then depends only on the 
difference between red and white. Descrip- 
tive models might also embody such an 
assumption on the ground that each 
red-white pair of beads is uninformative 
and would be ignored. Because of the 
theoretical significance of these data, they 
ire presented separately in Figure 2. 

ri levant comparison in Figure 2 is 
between the left and right panels. They 
should be identical if the response is 
determined by the difference between red 
and white. But in fact the 2 panels differ 
considerably. The response to homoge- 
neous samples is more extreme in either 
direction, and covers a greater vertical 
range. Also, the mixed samples show no 
effect of set size for the 60/40 urn pair. 
"That the 2 panels are reliably different is 
shown by the Sample Type X Urn Com- 
position interaction, F (2, 46) = 6.65. 

Auxiliary design. In this design, the 2 
urns were complementary, of 70/30 and 
30/70 composition. All 10 X 10 possible 
samples, with 1—10 beads of each color, 
were used. Illustrative data are shown in 
Figure 3. 

In the left panel of Figure 3, the near 
identity of the 3 curves shows that the 
response was completely governed by the 
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difference between the number of red and 
white beads in the sample (plotted on the 
horizontal). At any point, the curves 
represent samples that have the same 
difference between red and white, though 
differing in sample size. The curve labeled 
9 + 10 is averaged over the samples in 
which there were 9 or 10 beads of the 
dominant color. The curves labeled 7 + 8 
and 5 + 6 were obtained similarly. It was 
not feasible to plot all the data points 
because they all lay on top of one another. 
As these curves show, however, sample size 
per se had no effect in this phase of the 
experiment. 

The same conclusion is illustrated in 
different form in the right panel of Figure 
3. The solid curve represents the 5 
samples with a 1/2ratio. The dashed curve 
represents the 3 samples with a 2/3 ratio. 
The upward trend shows that something 
more than the sample ratio controls the 
response. The near equality of the 2 
curves illustrates again that the sample 
difference is the controlling factor. 

The data from this auxiliary design 
should be interpreted cautiously. The very 
simplicity in the pattern of results suggests 


that Ss may have adopted a cancellation 
strategy. Canceling red against white 
would lead to the obtained results, Since 


the 2 urns were complementary and fixed 


over all trials, Ss could devote their 
attention to the characteristics of the 
sample itself. 

osor 

90/0 

Il 90/10 
"m 
2 80/20 80/20 
© oso is 
a 
g 
ERE 
s 
Ei 
2 
Son 
E m 
Z 
n 60/40 
= 
= 

oso} 

ta 

AILLhL L " LLL 
Re a ERE URGET SAS YET T ETE 
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Ficure 2. Mean likelihood judgment as a func- 
tion of sample composition and urn composition, 
(Data for symmetrical urns only.) 
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FIGURE 3. 


SAMPLE COMPOSITION 


Mean likelihood judgment from the auxiliary design 


with 70/30 and 30/70 urns. 


In the main design, such a cancellation 
strategy would seem less appropriate. In 
fact, the difference between the left and 
right panels of Figure 1 shows that some- 
thing more than the sample difference 
controlled the response. That was true 
even for the cases in which the urns were 
symmetric as already noted. Deviations 
from the difference rule were moderately 
small, however, so it may be a good first 
approximation. 


Model Analysis 


Model predictions. Vigure 4 plots mean 
likelihood response as a function of the 
more and less likely urns. The left and 
right panels are averages over the 4 
homogeneous samples and the 4 mixed 
samples, respectively. 

The curves in Figure 4 are theoretical, 
obtained by fitting the ratio model of 
Equation 2 as described below. The fit 
is quite close which supports the ratio 
model. 

An additional aspect of Figure 4 is the 
nonparallelism of the data points in each 
panel. A model that assumed 2 response 
tendencies that combined by a subtracting 
rule would predict parallelism. That the 
- Observed nonparallelism is reliable is shown 
_ by the significant More Likely Urn X Less 


Likely Urn interaction, F (6, 138) 
Wallsten's (1972) data, converte 
odds to probabilities, show the same -^ape; 
with an even more marked inter:«tion, 
On the face of it, therefore, a subtiocting 
model does not account for the data 

Figure 4, incidentally, illustrates an 
optical illusion that affects apparent paral- 
lelism, as the data seem more parallel than 
they are in fact. The vertical spread of 
data points for the 60/40 urn is greater 
than for the 90/10 urn by about 75% for 
the homogeneous samples and 30% for the 
mixed samples. In making graphic assess- 
ments of parallelism, it is often helpful to 
turn the graph sidewise and measure the 
physical distances. 

Weight estimates. Figure 5 plots mean 
weight estimates from the ratio model as à 
function of urn and sample composition. 
The trends can be understood most easily 
by reference to the theory, in which the 
values of w; and w represent the response 
strengths of the 2 alternatives. 

Theoretically, therefore, z should in- 
crease as the composition of the more 
likely urn becomes more extreme, and also 
as sample composition becomes more 
extreme. Both trends are quite marked. 

The trends for wə, in contrast, are much 
less marked. A mild effect of the composi- 
tion of the less likely urn can be seen, but 


1.99, 
rom 
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the effect of the sample was so small that 
the points could not be graphed separately. 
To a first approximation, therefore, the 
less likely urn appears to have a nearly 
constant response strength. 

Estimation and goodness of fit. The model 
was tested by a novel and potentially 
important application of functional mea- 
surement methodology. Since Equation 2 
is nonlinear, the parameter estimates are 
not statistically independent, and the 
number of degrees of freedom used in the 
estimation is unknown. If only relatively 
few parameters were estimated, the rule of 
one degree of freedom per parameter would 
presumably be an adequate approximation. 
Even so, there would be a difficult problem 
of incorporating the variabilities of both 
the observed and the predicted data points 
in the test. In general, exact analysis of 
nonlinear models is not easy. 

The method developed here may bypass 
all these difficulties in a simple way. 
Chandler's (1969) STEPIT routine was 
applied to estimate the weight parameters 
of Equation 2 for each of the 8 samples for 
each S. "The means of these estimates were 
used in Figure 5. Predicted responses were 
also generated for each of the 8 samples for 
each S by using these weight estimates in 
Equation 2. The means of these predicted 
responses were used in Figure 4. 

To test goodness of fit, the predicted 
responses were subtracted from the ob- 
served to yield deviation scores. The data 
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FIGURE 5. 


for statistical analysis are thus the 8 3 X 4 
matrices of deviation scores for each S. 
"These deviation scores were analyzed in 
an ordinary repeated-measurements analy- 
sis of variance. The homogeneous and 
mixed samples were treated separately so 
that there were 2 3-way analyses of vari- 
ance. If the model is correct, then these 
deviations should be no more than chance. 
The F ratios in Table 1 should cluster 
around 1 and none should be significant. 
In fact, 1 of the 16 is significant, but that is 


TABLE 1 


SUMMARY ANALYSES OF VARIANCE OF DEVIATIONS 
FROM RATIO MODEL 


Mixed 


] 
| Homogencous | 


samples samples 

Source dr dc iip c 
F | Emo | £F | Error 
| arl 1.36 | 2568 | 1.22 | 1169 
| 3| Aas[| A204 | 178 | 1069 
2 | 160 | 7015 | 1:13 | 4058 

More likely urn 
(MLU) a | .82| .7956 | 1.00 | .5465 
S XLLU | 6 | rz | 4683 | 1.63 | 357 
S X MLU |» | 102 | 3685 | 95 | ‘3741 
LLU X MLU | 6| 35 | 282653 | ‘ot | 211473 
S XLLU X MLU | 18 | 145 | 1s nis | zors | 175208 


Note. Error mean squares have 23 times as many degrees of 
freedom as corresponding systematic source, Error mean 
squares multiplied by 10,000. 

*p«.05. 
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about the chance expectation with 16 
tests. Thus, this goodness of fit test 
supports the model. 

This test may be disturbing because a 
good fit is guaranteed by using 7 param- 
eters to fit 12 data points. However, the 
error term is itself composed of the small 
deviations from that good fit. Thus, the 
test should be sensitive even to small dis- 
crepancies if they are systematic across Ss. 

The main technical problem in this test 
is that the predicted scores, and hence also 
the deviation scores, are not independent 
but are correlated through their joint 
dependence on the estimated parameters. 
However, the analysis of variance for 
repeated measurements allows for corre- 
lated scores (Winer, 1971, p. 279). 

Two comments may help relate this test 
to the ordinary applications of analysis 
of variance. First, any linear contrast on 
the deviation scores would have an exact ¢ 
test, obtained by comparing its mean to 
its variability across Ss. Applied to main 
effects or interactions on one degree of 
freedom, this / test would be equivalent to 
the above F test. Second, if the present 
method were applied to an additive rather 
than a nonlinear model, it would be 
equivalent to the ordinary analysis of 
variance except that all main effects would 
necessarily be zero. 

"Tests based on group data are sensitive 
to discrepancies that are systematic across 
Ss. However, real discrepancies at the 
individual level may get overlooked if they 

are idiosyncratic. Table 1 contains a 
warning in this regard since the error mean 
squares for the last 2 interactions are much 
larger than for the others. Since the error 
term for each listed source is its interaction 
with Ss, the size of these errors suggests 
that there may be real discrepancies though 
of different form for different individuals. 

=i he present method could be applied 
directly to single Ss by replicating the 
experiment several times. These repli- 
cations would be treated exactly as Ss in 

the present analysis. The deviation scores 
within each replication would be correlated, 


but such correlation is allowed in the given 
analysis, 


This method for testing goodn f fit 
is presented tentatively. If it is: »rted 
in further work, it could be mely 
important because it provides 'xact 
analysis for nonlinear models. peci- 


is to 
id be 
non- 
njoint 


ally interesting possible applicat 
response scaling. Thus, the test 
used to provide an error theory 
metric approaches, such as used in 


measurement (Krantz, Luce, ^ s, A 
Tversky, 1971), or Kruskal's (19: nono- 
tone analysis of variance, for h an 


error theory is presently lacking Weiss 


& Anderson, 1972). 


In such apj tions, 
the test would be made on the d ences 
between the monotonically tran: ormed 
data points and the best-fit litive 


representation. 


DISCUSSION 
Ratio Model 
The basic assumption of the present 
proach is that the overt response i 


promise between 2 competing respons: 
cies. One specific integration rule « 


ap- 
com- 
tenden- 
idered 


here was the ratio rule from averaging theory, 
ie. that the overt response was the average 
of the 2 competing response tendencie This 


ratio rule gave a good account of the data and 
led to meaningful parameter estimates. The 
strength parameter for the less likely urn was 
approximately constant over conditions, 8$ 
though it exerted much the same effect 
regardless of its composition or the composition, 
of the sample. In contrast, the strength 
parameter of the more likely urn varie! 
markedly across conditions, suggesting that 
this urn was given the main attention. 

An alternative, the subtracting model, was 
also considered in which the overt response 1S 
just the difference between the 2 response 
strengths. Interestingly enough, the ratio ani 
subtracting models cannot be distinguished in 
the present experiment as they are mono- 
tonically equivalent. If R obeys a ratio 
model, then log(R-! — 1) obeys a subtracting 
model. Since functional measurement method- 
ology allows for monotone response transfor- 
mation (Anderson, 1962b), the present data 
do not allow the conclusion that the ratio 
model is superiór to the subtracting model. 
Either might be correct. 

Some preference may be expressed for the 
ratio model because it fits the raw data 
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whereas response transformation would be 
required for the subtracting model. Previous 
work on functional measurement has shown 
that raw ratings are often near-interval scales 
and that argues for the tentative acceptance 
of the ratio model. On the other hand, 
parameter estimates for the 2 urns would be 
more comparable under the subtracting model. 

The ratio model and the subtracting model 
are not monotonically equivalent in general. 
For 3 or more urns, the ratio model is 
inherently nonlinear in the sense that no 
monotone transformation will make w/ 
(2 + we + ws) linear in all 3 variables at 
once. The subtracting model, wi — we — ws, 
remains linear, at least in form. 

A limitation on the applicability of at least 
the ratio model is suggested by the marked 
difference between the 2 weight parameters 
obtained here. Whereas the weight parameter 
for the more likely urn showed a strong 
dependence on the stimulus variables, the 
weight parameter for the less likely urn was 
almost constant. This suggests that the 
parameter valuation process may be configural, 
influenced by attentional factors. To ensure 
constant parameters, it would thus seem 
advisable to keep the same urn predominant 
in all cases. 


Choice Theory 


Although the ratio model of integration 
theory has the same form as Luce's (1959) 
choice model, there are several basic differ- 
ences between them. The most important 
of these center on the difference between 
choice probabilities and numerical response 
measures. Luce’s model applies, in effect, 
only to imperfect discrimination, or proba- 
bilistic threshold choices. In contrast, inte- 
gration theory deals with numerical-valued 
response strengths, and applies even when 
there is no probabilistic element in the choic! 

This difference is well illustrated in the 
present task, in which the appropriate response 
is never in doubt. Unless the sample is split 
evenly, the appropriate choice of urn is 
specified by the majority proportion in the 
sample. Since the choice probabilities are 0 
or 1, Luce's formulation does not apply. 

Whether the present averaging model might 
be applicable to threshold choices is uncertain. 
Substantively, if not formally, that would 
seem to require that the actual choice process 
can be conceptualized as an averaging process 
even in the case of imperfect discrimination. 


Some evidence that supports this idea is given 
by Friedman, Carterette, and Anderson (1968), 
Friedman, Rollins, and Padilla (1968), Himmel- 
farb (1970), Robbins and Medin (1971), and 
Levin, Dulberg, Dooley, and Hinrichs (1972). 


Bayesian Theory 


The Bayesian approach is similar to inte- 
gration theory in its empha on problems 
of combining information to reach a judgment. 
The wide use of the 2-urn task reflects the 
pioneering efforts of the Bayesian workers in 
studying how information is integrated in 
decision making. 

In a strict sense, comparison of these 2 
approaches is not appropriate. Integration 
theory is a descriptive psychological theory, 
whereas the normative Bayesian model derives 
from mathematical statistics and prescribes 
the optimal response in an actuarial sense. In 
various practical applications, knowledge of the 


optimal, normative response can be most 
important. 
Almost inevitably, however, normative 


models are forced into a descriptive mode for 
which they are ill-suited. Thus, a dominant 
concern in the Bayesian work has been with 
explanations of “conservatism,” the general 
tendency for the response to be less extreme 
than the optimal response, Conservatism was 
obtained in the present experiment also, but 
no attempt has been made to explain it 
because there is nothing to explain. Conserya- 
tism is not a phenomenon, only a name. It 
exists only by reference to a statistical model 
that has no psychological relevance, “Explana- 
tions" of conservatism in terms of mispercep- 
tion, misaggregation or response bias are 
inappropriate because there is no psychological 
phenomenon to be explained. 

To be fair to the Bayesians, it should be 
repeated that conservatism can be important 
in practical applications in which there is a 
cash value on correct responding. Pursuing 
such problems in the experimental laboratory. 
can be quite useful. Nevertheless, commitment 
to the study of optimal behavior is not innocu- 
ous. It has diverted attention from the actual 
behavior, as is implied by the papers of 
Shanteau (1970, 1972), Slovic and Lichtenstein 
(1971), Wallsten (1972), and Anderson (1971, 
p. 195). In particular, the data reported in the 
Bayesian studies are ordinarily derived sta- 
tistics, calculated from the Bayesian model 
itself, so that identical behavior in different 
conditions will produce different derived 
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statistics. Such data analysis makes it hard to 
understand what Ss actually did. 


Representativeness 


Kahneman and Tversky (1972) have argued 
from a heuristic of sample-population "repre- 
sentativeness" that the response to single 
samples in the Bayesian task should be 
controlled almost entirely by the sample 
proportion. In particular, they claim that the 
response should be virtually independent of 
the sample size, and independent of the urn 
proportions (pp. 446, 450). The data of their 
Table 1 support their claim since the responses 
are very nearly equal to the actual sample 
proportions. 

The present results differ radically from 
those of Kahneman and Tversky (1972). 
Figures 1, 2, and 4 show a very strong effect 
of the urn proportions, in agreement with the 
results of Shanteau (1972) and Wallsten (1972). 
Further, the right panels of Figures 1 and 3 
show a strong effect of sample size with sample 
proportion held constant. In both respects, 
therefore, the present data disagree with those 
given by Kahneman and Tversky. 

"The simplest explanation of this difference 
is that Kahneman and Tyersky (1972) failed 
to make their task clear. Their Ss were run 
in large classroom groups in 1 or 2 min. of a 
regular class session, and each S judged only 
one problem. That the task was not clear is 
suggested by the fact that about 10% of the 
Ss had to be eliminated (pp. 432, 447). It 
hardly seems surprising, therefore, that their 
5s might have failed to understand the task 
and simply interpreted their job as one of 
judging the sample proportion. 

A warning on this problem was sounded 
by Shanteau (1970) who found no difference 
between estimation and inference instructions 
with serial presentation. Logically, these 2 
instructions should produce quite different 
behavior, with the inference group tending 
toward 0 or 1, and the estimation group toward 
the prevailing proportion. The lack of differ- 
ence was surprising, therefore, yet it was 
replicated in a follow-up experiment in which 
Shanteau used special care in explaining the 
judgment task to Ss. 
à Shanteau's (1970) results indicate that Ss 
in the Bayesian task are not making inference 
judgments, as has been assumed in tests of 
the Bayesian model. Instead, they appear to 
be doing something qualitatively different, 
E uo This illustrates an 
: point that appears 
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to be especially relevant to judgim: cory. 
The meaning of a response is not gi mply 
by the words that the investigator but - 


rather by the network of empirica! 
that surround the behavior. 


tions 


Simultaneous Integration 


The present work is most closely 
that of Shanteau (1970, 1972) and of 
(1972). In all this work, primary 
is on the algebraic rule or psycho! 
that governs the integration of the in! 


ed to 
illsten 
»hasis 
i] law 
ation. 


Wallsten attempted to develop a iptive 
analogue of the Bayesian model,  hereas 
Shanteau employed a model deri from 
information-integration theory. Shai 1 was 
mainly concerned with serial integrati: which 
is discussed below. Simultaneous prese ation, 


as used here, seems to involve diffc 
cesses and requires a few preliminary c 
A basic question for simultaneous 


pro- 
rents. 


esen- 


tation is whether the effect of th: t ol 
stimulus information can be represents) as à 
simple function of the elements of ihe set. 
Considerable evidence for this view has been 
obtained in person perception and various 
other tasks (e.g. Anderson, 19622, 1967, 
1974). 


In the 2-urn task, a similar view has been 
taken by Shanteau (1970, Equation 3) and 
by Wallsten (1972). In particular, Wallsten’s 
multiplying assumption implies that the differ- 
ence between the number of red and white 
beads is the only relevant property of the 
sample for the special case of symmetric urns. 
This follows from Wallsten's Equation 3a, 
together with the added assumption that à 
sample split evenly between red and white 
has no effect when the urns are symmetric. 

Unfortunately for the 2-urn task, the present 
data indicate that the effect of the sample is 
not a simple function of its elements. The 
qualitative test of Figure 2 shows that the 
difference between red and white is not the 
sole determiner of the response, Sample 
composition appears to have a gestalt elfect. 
The data from the auxiliary design (Figure 3) 
did obey a cancellation property, but that 
may reflect a more deliberate strategy potenti- 
ated by special task conditions. Shanteau's 
(1970, Figure 4) data also imply that the 
effect. of a simultaneous sample is not a simple 
function of its elements. This problem, it 
should be noted, did not arise in Shanteau's 


results with serial presentation, but he used only 
one-bead samples. 
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Serial Integration 


Two basic reports by Shanteau (1970, 1972) 
have studied behavior in the Bayesian task 
using sequences of one-bead samples. A serial 
integration model gave a very good account 
of the data whereas the Bayesian model did 
poorly. A surprising result, noted above, was 
the lack of difference between estimation and 
inference instructions. This result caused a 
serious conceptual problem for the Bayesian 
formulation because the evidence suggested 
that Ss were doing something akin to esti- 
mation regardless of the instructions. 

However, Shanteau's (1970, 1972) success 
with the observed behavior obscured a theo- 
retical difficulty with his model as applied to 
inference rather than estimation. No problem 
es if the same sample is presented at each 
serial position; unléss it was split evenly, the 
response would always asymptote at 0 or 1. 
However, that would not necessarily happen if 
sample composition varied across serial posi- 
tion. In particular, a sequence of random 
samples of single beads from any urn would 
asymptote at the urn proportion. Despite its 
success, therefore, the serial integration model 
used by Shanteau would not seem able to 
handle general inference judgments without 
further elaboration. 

Wallsten (1972) has also attempted to 
develop a model for serial integration. How- 
ever, it has been worked out explicitly only 
for the case of a sequence of identical samples, 
and in that form it is effectively equivalent to 
the serial integration model used by Shanteau. 
Wallsten's approach seems more general since 
it explicitly allows for monotone response 
transformation, but the functional measure- 
ment methodology used by Shanteau has the 
same capability. Since Shanteau's analysis 
showed that the untransformed response was 
already an interyal scale, response transfor- 
mation was not needed. Wallsten's nonmetric, 
ordinal analyses did not allow a satisfactory 
test of the basic model. 

The present formulation deals explicitly 
only with a single sample, and it is uncertain 
how it should be extended to handle serial 
integration of successive samples. Three 
possibilities deserve mention. The first is that 
separate cumulative counts of the red and 
white beads are maintained, and that the 2 
competing response tendencies are recomputed 
from the cumulative counts after each succes- 
sive sample. In effect, this reduces the serial 
case to the simultaneous case. However, it 


would allow serial position weights for each 
separate count so that serial position effects 
could be allowed for. 

The second possibility is similar to the first, 
but allows only one counter which cumulates 
the difference between red and white. Weight- 
ing as a function of serial position would be 
allowable, but a cancellation strategy would 
be required within any single sample. Cancel- 
lation is not entirely consistent with present 
results, though it might provide a reasonable 
first approximation 

The third possibility is that each later 
sample sets up 2 competing response tendencies 
in exactly the same way as with the first 
sample, These are then averaged in with the 
single resultant response produced at the 
previous serial position, The resultant would 
have a scale value of 0 or 1, and a weight that 
would reflect the total amount of information. 


Concluding Comments 


Three theoretical problems are basic in the 
analysis of the 2-urn task. The first problem 
concerns the integration model for a single 
sample. Present data give some slight prefer- 
ence for the ratio model over the subtracting 
model, but a critical test between them 
remains to be done. 

The second problem is how the response to 
any sample relates to its component elements, 
Unfortunately, as noted above, this problem 
may not have a simple solution like that 
found for verbal stimuli. Present data 
indicate some departure from the simple 
cancellation strategy even with symmetrical 
urns. 

The third problem is that an adequate 
general theory must handle serial presentation 
as well as simultaneous presentation. Serial 
presentation involves additional complications, 
however, and it may be desirable to place 
greater attention on the first 2 problems which 
have been relatively neglected in previous 
work. 
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ROLE OF CENTRAL MONITORING OF EFFERENCE 
IN SHORT-TERM MEMORY FOR MOVEMENTS 
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University of Queensland, St, Lucia, Brisbane, Queensland, Australia 


A necessary and sufficient condition for retention of movements over a short 
time interval may be central monitoring of efferent signals to the muscles 
(CME) rather than proprioceptive feedback (PFB). It is shown that volun- 
tary movements may be rehearsed over a 15-sec. unfilled retention interval 
even when location cues are removed, whereas passive movements and con- 
strained movements—movements made to a stop set by E, which B. Jones 
has shown may not be centrally monitored—are subject to decay. Voluntary 
movements are affected only by an attention-demanding task during the 
retention interval. Since augmented PFB does not increase accuracy of volun- 
tary movement duplication, CME may be a necessary and sufficient condition 


for rehearsal in motor short-term memory. 


if we are to describe a pattern of move- 
ments as skilled, the component move- 
ments must at the least be carried out in 
the correct temporal sequence. Such timing 
could conceivably depend upon peripheral 
feedback or may result from central moni- 
toring of efferent commands to the muscles 
(CME). The present article is concerned 
with the role of CME compared with pro- 
prioceptive feedback from joints and/or 
muscles (PFB) in short-term retention of 
single movements (MSTM). Since even 
the simplest voluntary act requires the 
tion of at least 2 antagonistic muscle 


reguli st 2 
groups coupled at a joint, timing is a 
feature of MSTM. 

Posner (1967) showed there is little 


forgetting of visually guided movements 
over an unfilled retention interval of 20 
sec. although an attention-demanding ta 
during the interval did interfere with per- 
formance. When Ss were blindfolded, on 
the other hand, considerable forgetting 
occurred after 20 sec. and the interpolated 
task had no further effect. Posner (1967) 
argued that central processing capacity is 
available to the visual channel which func- 
tions as a "rehearsal" mechanism and that 
absence of such capacity makes little 
difference to the retention of kinesthetic 
information, even if the interpolated task 


1 Requests for reprints should be sent to Bill 
Jones, who is now at the Department of Psychology, 
University of Waterloo, Waterloo, Ontario, Canada. 
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requires movements as in writing (Posner 
& Konick, 1966). 

However, some movements, including 
replications of the standard movement 
during the retention interval (Stelmach & 
Bassin, 1971), do interfere with MSTM 
for blind movements. Keele (1968), there- 
fore, suggested that duplication of some 
movements may be based upon PFB from 
the muscles, which is not rehearsed, whereas 
input from the joints has access to central 
storage and hence may be subject to inter- 
ference. He did not consider whether 
CME may be involved, and his PFB 
argument rests in any case on 2 invalid 
assumptions. 

First, Keele (1968) assumes that only 
afferents from joint receptors project to 
the sensory cortex. This is not the case in 
the monkey and the cat and there is no 
reason to think that it is the case in man 
(see Paillard & Brouchon, 1968). The 
second implied assumption is that the joint 
receptors signal direction and extent of 
voluntary movement. However, while we 
know that joint receptor impulses mediate 
knowledge of passive movements and that 
the muscle spindles are not involved here 
(see Rose & Mountcastle, 1959), there is 
clear evidence that voluntary movements 
may be accurately duplicated when the 
joint capsule is anesthetized, thus elimi- 
nating joint feedback (Browne, Lee, & 
Ring, 1954). We can argue, therefore, 
either that the duplication of voluntary 
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movement is based upon CME or that 
PFB from the muscle spindles is tied in 
with voluntary but not with passive move- 
ment (Browne et al, 1954; Paillard & 
Brouchon, 1968). 

Jones (1972) attempted to test the alter- 
native hypothesis by comparing the ability 
of blindfolded Ss to produce a voluntary 
movement based upon one of 3 initial move- 
ments. Either (a) S made a voluntary 
movement, (b) S moved to a stop set by E 
("constrained movement"), or (c) S's arm 
was passively moved to the stop ("passive 
movement"). There was little difference 
in voluntary duplication of constrained 
and passive movements, which were dupli- 
cated with significantly less accuracy than 
were voluntary movements. Jones argued 
that only in the voluntary condition could 
S predict the end point of the initial move- 
ment before he began to move, so that only 
here was retention mediated by CME. 
Since both voluntary and constrained condi- 
tions required maintenance of muscle tone 
during the initial movement, there is no 
reason to think that the behavior of 
muscle spindles in the 2 movements would 
have differed; and yet without the oppor- 
tunity to make a preset movement, repro- 
duction accuracy decreased to the level of 
passive movement. It may be that both 
passive movement and constrained move- 
ment are based upon joint inflow, or that 
duplication of constrained movement re- 
quires such a complex integration of CME 
over sections of the movement that the 
initial movement did not lay down an 
adequate trace. In any case, there is 
evidence against the hypothesis of spindle 

involvement in voluntary movement and 
in favor of the CME position. 

The hypothesis tested in Experiment I 
is that the initiation of a voluntary move- 
ment results in a central record of efferent 
discharge, or “efferent copy” (von Holst, 
1954), which is in itself a motor memory 
Storage system without the requirement 
of peripheral feedback. Since the efferent 
copy is central it may be subject to inter- 
ference, unlike PFB which has no access 
to central mechanisms. In other words, 


BILL JONES 


CME is a necessary condition for re -:rsal 
in MSTM. 
EXPERIMENT I 

Most previous studies of motor tion 
have had Ss duplicate a constrain ove- 
ment and so may have pro ittle 
opportunity for retention via CM The 
first experiment compared re: of 
voluntary, constrained, and passis 10ve- 
ments over both filled and unfilled ntion 
intervals. The main measure of m nent 
reproduction accuracy was variable error 
(VE)—mean within-Ss standard de ation 
about the algebraic mean. Mar ther 
studies of movement duplication (i iding 
my own) have had absolute error i) as 
the dependent variable. Howe, AE 
confounds VE with algebraic or «stant 
error (CE) (Laabs, 1973; Pepper Her- 


man, 1970). 


Method 
Subjects. The Ss were 24 undergraduates in 
psychology who took part in the experiment as à 


course requirement. 


Apparatus. A freely moving slide on which 5 
placed his index finger was mounted on a straight 
track. The slide could be moved along the track 
by either S or E since a small handle was fixed t0 


the slide on the opposite side to where S sat and could 
be gripped by E and used to move the slide with 5'5 
arm relaxed. A pointer fixed to the slide moved 
along a scale so that linear displacements of the 
slide could be measured. Two freely moving stops 
were mounted on the track so that position an 
extent of displacement could be controlled by E 
A stopwatch was used to time retention intervals. 
Procedure. The S sat facing E, and the different 
conditions were explained to him. ‘The S was then 
blindfolded. Three movement conditions-volun- 
tary, constrained, and passive—were performed 
under conditions of 0-sec. delay, 15-sec. rest interval: 


and 15-sec. task interval. For voluntary movement | 


S placed his index finger on the slide, moved it 
along the track to whatever position he chose, an 
immediately returned to the initial starting position. 
The Ss were instructed to vary the extent of the 
initial displacement over trials between approxi 
mately 10 and 60 cm. Following the retention in- 


terval S attempted to voluntarily duplicate the | 


initial movement. 


d In the constrained movement 
conditions, 


S moved the slide to a stop set by 


and again returned the slide to the initial starting | 
point. During the retention interval E removed the | 


stop wl 


hich indicated terminal position. In the 


Aan 
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TABLE 1 
VARI/ARLE, ABSOLUTE, AND CONSTANT ERROR (IN CENTIMETERS) OF MOVEMENT DUPLICATION 


IN EXPERIME? 


Movement condition 


Retention ir al Voluntary | Constrained Passive 
VE AE | CE | VE | AE | CE VE | AE CE 
) [seras "spo g 2387. (03:30: | ats 12 120} 1.00 ll 13:39 7 seas 
rest 29 1.65 | 0.00 | 498 | 5.20 | 2.00 | 4.74 | 5.90 —1.35 
sek | 393 | 4.10 | 3.10 | 4:53 | 5,39. | 2.61 | 5.27 | 5:55 —46 
z Note, V variable error, AE absolute error, CE = constant error. is 


passive condition, S placed his index finger on the 
slide and relaxed his arm and E then removed the 
slide to the initial starting point. The S always 
used his preferred arm and initial displacement was 
always an extensor movement. In all conditions 
Ss were instructed to make both the initial displ: 
ment and the attempted duplication as quickly as 
possible. 

Attempted duplication was made either immedi- 
ately upon returning to the starting point (0-sec. 
delay) or after 15 sec. The 5 either simply sat with 
his index linger on the slide for the retention interval 
until given the instruction "Move" (15-sec. rest), 
or he was required to count backwards from a 3-digit 
number spoken by Æ as soon as S had returned to the 


starting point (15-sec. task 
Both movement and interval conditions were 
within«Ss. variables counterbalanced in ABCCBA 


fashion. That is, the order of blocks of movement 
conditions was voluntary, constrained, passive, 
passive, constrained, voluntary, and within each 


block order of retention intervals was O-sec. delay, 
rest, task, task, rest, 0-sec. delay, yielding 36 cells 
each of 6 trials (216 trials overall). “To insure 
approximately equal distributions of initial displace- 
ments across conditions, settings in the constrained 
and passive conditions followed initial settings for 
each S of the settings he had made in the correspond- 
ing cell of the first block of voluntary tr For 
example, initial displacements made during the 
second presentation of 6 trials in the ^, rest 
condition in the first block of voluntary trials were 
used as standards for duplication in the second 
presentation of 15-sec. rest in both blocks of con- 
strained and passive movement trials. 


Resulls und Discussion 


There were no significant sequence effects 
and mean VE, AE, and CE for each condi- 
tion are given in Table 1. A 3 X 3 analysis 
of variance (ANOVA) of VE with both 
main effects as within-Ss variables showed 
highly significant effects for movement con- 


ditions, / (2, 46) = 157.56, p < .000001, 
and interval conditions, F (2, 46) = 80.32, 
p < .000001, and a highly significant inter- 
action term, / (4, 92) = 14.52, p < .000001. 
Duncan's multiple range test indicated a 
significant difference (p < .01) between 
the voluntary condition and the other 2 
movement conditions with 0-sec. delay. 
At this interval passive and constrained 
movement did not differ, thus confirming 
previous findings (Jones, 1972) in terms of 
VE. There was no significant difference 
in either the constrained or the passive 
condition between rest and task intervals. 
In the voluntary condition, on the other 
hand, only the interpolated task produced 
significant forgetting (p < .01), so that 
the pattern of results for voluntary move- 
ment compares with Posner's (1967) find- 
ings for visually guided movements. 

Figure 1 is a plot of VE against initial 
displacement. The .S's initial displace- 
ments were grouped to the nearest 5 mm. 
and averaged. Inspection of the figure 
shows no evidence of any tendency for 
error to vary with extent of movement, 
Since, however, there was an unequal 
number of observations for each point on 
the graph, no statistical check has been 
attempted. 

Following Posner's (1967) criteria it is 
clear that retention and duplication of ' 
voluntary movement may depend upon 
central processing, since there was no 
forgetting of voluntary movements over 
the rest interval while the attention-de- 
manding task interfered with retention. 
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FIGURE 1. 


In the constrained and passive conditions, 
on the other hand, forgetting occurred 
over time and there was no interference 
effect. Since only the voluntary condition 
allowed S to preset the initial movement, 
the present experiment provides evidence 
that CME rather than joint PFB is a 
necessary condition for motor rehearsal. 
Keele and Ells (in press) have recently 
shown that passive movements may show 
little forgetting in the absence of distrac- 
tion, indicating that efferent consistency 
may not be required for motor rehearsal. 
However, they carried out passive move- 
ment at a constant and steady rate, which 
may have been highly susceptible to a 
counting strategy (Laabs, 1973, has shown 
that counting is a very efficient way of re- 
ducing VE in movement duplication), un- 
like the fast rate of movement used in the 
present experiment. Consequently, per- 
formance in the Keele and Ells study may 
not have depended upon joint PFB. 
Against the present position it might be 
argued that when Ss are asked to make a 
voluntary movement they decide on a 
stopping position and later remember that 
position rather than efference from the 
movement. The second experiment was 
designed to test such a possibility. 


EXPERIMENT II 
Method. 


Subjects, The Ss were 15 students who had not 
taken part in Experiment I. 
Apparatus and procedure, 


t The apparatus was 
that used in Experiment I. 


The Ss carried out 


kauf. pem: rami er - Gos NEUE EUR UE 
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Plot of variable error against initial displacement (Experiment I). 
voluntary movement duplication under itionse 
The first (constant start) followed « |y the 
voluntary movement procedure in the xperi- 
ment. The second (variable start) was ime às 
the first except that the starting posi of the 
standard movement and of the attempt: uplicar 
tion were randomly varied on each tr ver 6 
randomly chosen positions. The secc third, 
fourth, fifth, and sixth positions were, respe: tively 
extensions of 9, 13, 25, 32, and 45 cm. fr« he first 
position, the midline of S’s body. Consequently, S. 
had to duplicate the extent of the initia! displace 


* a Y$secs Toss 
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ment, and without the pos 

Both movement cond 
tions were within-Ss varia x 
Experiment I. Each S received 240 trials 
blocks of 10 each. 


Results and Discussion 


Table 2 gives mean VE and mean CE for 
the various conditions. Analysis of vari- 
ance of VEs showed that only the intervals 
factor was significant, F (2, 28) — 362.34, 
b « .000001. Significance was not ob- 
tained for either the movements main effect, 
F (1, 14) = .01, or the Movements X In- 
tervals interaction, F (2-28) <=. 140] 
Duncan's multiple range test showed that 
differences between the 0-sec. and 15-sec. 
tasks were significant (p « .01) for both 
variable and constant start conditions) 
although for neither was there a significant 
difference between 0-sec. and 15-sec. rest. 
Experiment II consequently provides evi- 
dence that MSTM for blind voluntary 
movements follows the pattern of retention 
of visually guided movements (Posner, 


1967) even when location cues are 
eliminated. 


CENTRAL MONITORING OF EFFERENCE IN STM FOR MOVEMENTS 


TABLE 2 


VARIABLE ERROR (IN CENTIMETERS) AND CONSTANT 


ERROR (IN CENTIMETERS) OF MOVEMENT DUPLI- 
CATION IN EXPERIMENT II 


Movement condition 


Rétention Constant start 2 Variable start 

VE | CE E | » 

0 sec. aus ursspu nas es —44 

15-sec. rest 136 | —47 | 155 | —.60 

Hec task | 436 | 249 | 4i | 3:73 
"u) 


Note. VE = variable error, CE. = constant error. 


EXPERIMENT III 


In general the results of Experiments I 
and Il are good evidence that CME is a 
necessary condition for rehearsal in MSTM 
processes. So far the evidence does 
not show whether CME is both necessary 
and sufficient. In bilaterally deafferented 
monkeys CME certainly seems to be 
sufficient for motor learning (Taub & 
Berman, 1968), but in intact human Ss 
it may be that PFB "stamps in" the 
efferent copy. The present experiment 
was designed to test the hypothesis that 
PFB facilitates voluntary movement dupli- 
cation since, if this is the case, CME would 
be a necessary but not a necessary and 
sufficient condition for retention of a 
voluntary movement over time. By having 
S move the slide against varying degrees 
of tension, PFB was manipulated. Such 
a recourse has frequently been held to 
increase PFB (e.g., Bahrick, 1957) and, 
since Pepper and Herman (1970) have 
reported that MSTM for force is analogous 


TABLE 
(ERS) AND CONSTANT ERROR (IN CENTIMETERS) OF 


VARIABLE ERROR (IN C 


AL 


to MSTM for extent and position, it is 
reasonable to expect that increasing force 
information could lower VE of voluntary 
movement duplication, given that PFB in- 
teracts with CME to stabilize memory for 
extent of movement. 


Method. 


Subjects. The Ss were 15 undergraduates who had 
not taken part in either Experiment I or Experi- 
ment II. 

Apparatus. The apparatus was the slide and track 
system used previously, with the addition that a 
weight could be attached to the slide by a cord and 
pulley 10 cm, in diameter mounted at one end of 
the track. 

Procedure. The general procedure followed that 
already described for the variable start condition in 
Experiment II, with differential weight on the slide 
of 0, .5, 1.0, and 1.5 kg. Weight was a within-Ss 
variable counterbalanced across trials as were re- 
tention intervals. Each 5 received 240 trials in all, 


Results and Discussion 


Table 3 gives mean VE and mean CE for 
each condition. A 4 X 3 ANOVA of VEs 
with weights and intervals as within-Ss 
factors showed, as for Experiment II, that 
only the main effect of intervals was signifi- 
cant, F (2, 28) = 329.27, p < .000001. 
Neither the effect of differential weighting, 

(3, 42) — 1.77, nor the Weights X In- 
tervals interaction, / (6, 84) = .42, reached 
significance. 

Breakdown of the analysis of variance by 
Duncan's test did not indicate significant 
differences between movement conditions 
at any of the interval levels. Thus the 
present findings for voluntary movements 
parallel those of Stelmach (1968) for con- 


Move XPERIMENT III 
Weight 
Retention interval 0 kg. | 5 kg. | 1 kg. 1.5 kg. 
ve | ce | we CE | Vi cE VE CE 
| 157 | 000 146 | —74 | 135 —.55 1.37 —.50 
i; | 1.54 AT 1.49 —.36 1.45 —.68 1.37 -—.M 
15-sec. task | 4.21 | 2.68 4.33 | 3.47 | 3.99 2.74 4.17 2.58 


Note. VE. = variable error, CE = constant error, 


42 = 


strained movements in that augmenting 
PFB without also giving visual or verbal 
knowledge of results has little effect on 
accuracy of movement duplication. Con- 
sequently, there is evidence for the hy- 
pothesis that CME is both a necessary and 
sufficient condition for rehearsal in MSTM. 
Limiting factors on the present conclusion 
are that force and extent of movement may 
be subserved by different neural mecha- 
nisms despite Pepper and Herman's (1970) 
findings, and also that VE in the 0-kg. 
condition may represent the limits of ac- 
curacy at present levels of practice so that 
augmenting PFB could not facilitate motor 
retention. Further experimentation could 
study the effects of various tension levels 
of more long-term practice on blindfolded 
voluntary movement duplication, since it 
may be that augmented PFB results in 
rather faster lowering of VE over a longer 
time span. 


GENERAL. DISCUSSION 


"The present results are good evidence that 
CME is a necessary condition for rehearsal in 
MSTM, and they also provide a suggestion 
that CME is necessary and sufficient. Most 
if not all previous studies of MSTM have 
failed to consider the CME hypothesis since 
they have compared the duplication of con- 
strained movements under various conditions. 
Peripheral feedback theories of MSTM have 
therefore been given a generality on the basis 
of insufficient evidence. This is not to say 
that memory storage capacity for propriocep- 
tive inputs does not exist. It is simply that 
MSTM for proprioceptive inputs need not 
involve central mechanisms since storage may 
be via "preperceptual" kinesthetic afteri mages 
comparable to the sensory afterimages found 
in vision and hearing, although probably with 
a relatively longer decay function. Itis largely 
m ningless to speak of interference here; any 

new inputs to the preperceptual store would 
erase existing afterimages rather than inter- 
acting in classical interference fashion. It is 
conceivable that volunta y movement also 
results in an afterimage which may or may not 
be read in. The absence of any proprioceptive 
facilitation effect on duplication of voluntary 
movement would suggest that, if it occurs, 
__-Preperceptual storage of data from proprio- 
— ceptors resulting from voluntary movement 
— has little functional utility. The adaptation 
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theory developed by Pepper and man 
(1970) and extended by Laabs (197 ems 
to give a good fit to the data on sl term 
retention of constrained movement How- 
ever, such a model may have little ext n to 
the wider field of skilled performan: )rdi- 
narily skill means a pattern of correct; imed - 
voluntary movements, and the pre data 
show that voluntary and constrain: ove- 
ments differ in important ways. [v killed 
performance, generally, the initial rning 
stages may depend upon visual (© other 
exteroceptive) feedback matched agaist the 
motor memory or efference copy : e ify 


error of movement. With increasin, 
Ss may come to depend solely on C -, con- 
trary to the widely held theory (e.g., i 
1957) that PFB control replaces visu: 
of movements. 

The neural basis of CME possibly pends 
upon cerebello-cerebral relationships. It is 
known that coordinated movements 
rupted by cerebellar damage even tho 
motor cortex remains intact. Ruch 
speculates that preprogramming of mov: 
would be possible if the motor cortex i 
discharges into a cireular pathway involving 
the cerebellum, so that impulses would be 
stored to be discharged after a fixed delay. 
Such a "reverberatory circuit" could con- 
ceivably mediate memory for voluntary move. 
ments over the 15-sec. rest interval in the 
present experiment. 


ming 


e dis- 
h the 
1951) 
nents: 
itially 
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INFORMATION AMPLIFIED: 
MEMORY FOR COUNTERFACTUAL CONDITIONALS ! 


SAMUEL FILLENBAUM 1 
University of North Carolina at Chapel Hill | 


People remember not only information explicitly specifed but also information 
that can be inferred from that provided directly, It was found, in a recognition 
memory task that compared to control items Ss "recognized" the negated 
antecedent and consequent propositions of previously encountered counter- 
factual conditionals significantly more often, the latter effect being distinctly 
stronger. A similar result was obtained for causals related to previously en- 
countered counterfactual conditionals and counterfactual conditionals related 
to previously encountered causals, the latter being the stronger effect. "These 
results were evaluated in the context of the observation that a counterfactual 
conditional presupposes (a) the negation of its antecedent propositiomy strongly 
suggests (b) the negation of its consequent proposition, and (c) is intimately 
related to a causal in which a and 4 are conjoined, with a taken as the grounds 
for b. 


Characteristically, people remember nei- These examples represent relativel» sim- 
ther the particular surface syntactic form ple cases of information compound!:: of 
of the sentences they have encountered the drawing of simple 1-step infereices. 
nor even their deep structure, but rather There is very little knowledge of «hat 
remember semantic information or propo- may happen when more subtle types of 
sitional content that includes not only inferences are involved, even for the case 
information explicitly specified but also of single sentences, as regards the recall 
information that can be inferred from that 
provided directly (for a survey of relevant 


of what is either presupposed or pos- 
sibly entailed by a sentence, as com- 
evidence see Filenbaum, 1973). Thus, pared to recall for what is explicitly as- 
Tieman (1971), for example, has shown serted in it. The research presented herein 
that if a sentence pair such as John is is addressed to these questions, Recently, 
lall/ John is shorter than Bill is presented, linguists have become very interested in 
Ss may later often falsely recognize Bill presuppositional analysis (see, e.g., Fill- 
is fall, which can be inferred from the more & Langendoen, 1971). Psychologists 
factually presented sentences, Bransford, have begun to consider how to differen- 
Barclay, and Franks (1972) provide related — tiate between the knowledge involved in 
evidence in studies concerned with both what is explicitly asserted and what is 
memory for inferences that may be drawn presupposed in an expression and to ex- 
from sets of semantically related sentences amine the role of knowledge involved in 
and recognition memory for information such presuppositions (see, e.g, Miller, 
inferable from single sentences, given the — 1969; Osgood, 1971). If nt processing 
proper understanding of lexical items such information one is responsive to not only 


às a crucial preposition. what is explicitly and directly provided 
NT — wa E a Ai Sat ua but also to what can or must be assumed 
E HER Saver Chee M Pu 2, if the directly provided information is to 
tional Institute of Mental Health. The author i; Ve Coherent, then a concern with the 


indebted to R. Hall for his aid in gathering and functions of th iti «nowledge 
oair tha data. gathering à € presuppositional know ledg 


BERG foe tepsia should ba sedE T6 Si becomes inescapable. Particularly intrigu- 
Fillenbaum, L. L. Thurstone Psychometric Labo- Ing questions arise with regard to possi- 
Fatory, University of North Carolina, Chapel Hill, bilities of information gain or amplification 


e ag in memory, as opposed to the more usual 


emphasis on abstraction and information 
loss. 

The conceptual status and proper analy- 
sis of counterfactual conditionals have long 


concerned and perplexed philosophers; 
neyerthelsv counterfactual conditionals 
may i us with test material that is 
partici useful and relevant for the 
stud enory for information implicit 
in sentences. A counterfactual conditional 
presupposes the negation of the antecedent 
proposition and, while there is some dispute 


as to whether or not it also entails nega- 
tion of the consequent proposition, it would 
seem that it (at the least) strongly sug- 
gests such negation and "invites" this as 
an inference (see Karttunen, 1971), Thus, 
given Lf he had caught the plane he would 
have arrived on time, it is presupposed that 
He did not catch the plane, and in most 
contexts it is at least strongly suggested 
that He di! not arrive on time, A counter- 
factual conditional may also be seen as 
related to a causal in which the negation 
of the antecedent is given as the reason 
for the negation of the consequent, in our 


example, Becouse he did not catch the plane 
he did e on time or, more stylistic- 
ally & on, He did not arrive on time 


becouse he did not catch the plane, 

Giver presentation of a counter- 
factual conditional, one may ask if S will 
later falsely recognize (a) the negation of 
the antecedent proposition, which is pre- 
supposed or (b) the negation of the con- 
sequent proposition, which is strongly 

suggested if not entailed; and whether 
false recognition is more frequent for a 
or b. Further, given a counterfactual 
conditional, one may ask if S will later 
falsely recognize its causal relative, which 
consists of the causal conjunction of a and 
b, with a given as the reason for 6. This 
relation between counterfactual conditional 
and causal may be viewed as going in the 
other direction as well (ie, from causal 
to counterfactual), since there would ap- 
pear to be few, if any, occasions appropriate 
for the causal (He did not arrive on time 
because he did not catch the plane) which 
would not also assume the counterfactual 

If he had caught the plane he would have 


interested in | comparing the n eq 
recognition errors in the 2 dh 

Two experiments in recognition m 
are presented herein, the first of wh 
concerned with memory for the si 
offspring of counterfactual condition 
and the second with memory for re 
complex sentences where the original 
tences are either counterfactuals or ca 
and the recognition sentences include t 
causal and counterfactual relatives, 


METHOD 
The basic format of the 2 experimenta was the 
same, The Sy were told that they would hen 


à series of sentences on tape and were Fea 
to listen carefully, since they would later be 
questions about the sentences,” After the Rene 
series wan finished, they were told that they would 
hear another series of sentences and would then 
have to decide whether each sentence was old 
(one they had actually beard before) or new (one. 
that had not appeared in the first series), The — 
sentences were presented at a S«sec. rate both in 
the initial series and in the later recognition series, 
The Ss were run in groups ranging in size from $ 
or 5 to 20 or so. There were 110 Se in Ex, 
ment | and 92 Se in Experiment II, all under. - 
graduates fulfilling course requirements. 

Design Experiment 1. Both the original and LA 
test series consisted of a mixture of coun 
conditionals and simple propositions, Whether 
the original or in the test series, all counterfacti 
were positive in both of their clauses with the 
if clause always coming first, as in f he had caught — 
the plane he would have arrived on time, and all. 
simple propositions were we such as in He 
did not catch the plane. Some physical or social 
nexus always obtained between the clauses of the 
counterfactuals, as in If he had elem the fish he 
would have become sick or rir the eA 
the boss he would have got a raise, Matters were 
arranged so that there were (a) conditionals ap 
pearing both in the original and the test 
(6 items), (b) simple propositions appen 

he original and the test series (8 items), 
ditionals appearing only in the test series (12 
(d) simple propositions appearing only in the test 
series (8 items), (e) conditionals appearing in tl 
original series whose negated antecedent appeare 
in the test series (8 items), (/) conditionals - 
pearing in the original series whose negated con- 
sequent appeared in the test series (8 items), and 
(g) filler conditionals and i do he al 
pearing both at the start and the 
series and at the start of the test series, as well 
as scattered through the original series, so as to 
make the proportions of conditionals and simple 
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propositions comparable in the original and test 
series, Each item appeared twice in the original 
series; after every item had occurred once, the 
items appeared again in a scrambled order. 
Design Experiment II. The original series con- 
sisted of a mixture of counterfactual conditionals, 
simple conditionals, and causals involving negation 
in both of their propositions, Simple conditionals 
were included as a control so as to be able to de- 
termine whether any systematic recognition effects 
obtained were simply topic effects, i.e, were due 
to the prior occurrence of the same or similar 
content. The same type of sentence content used 
in Experiment I appeared in this experiment. 
In the causals, the because clause always came 
second, as in He didn't pass the test because he 
didn't study, The simple conditionals involved the 
same kinds of connections between their clauses 
as did the counterfactuals, eg., Zf he is treated 
immediately he will survive or If the watches the 
- show he will be bored. Matters were arranged so 
that there were (a) counterfactual conditionals, 
simple conditionals, and causals appearing both in 
the original and test series (6 items of cach kind), 
- (b) counterfactual conditionals, simple conditionals, 
and causals appearing only in the test series (6 items 
- of each kind), (c) counterfactual conditionals ap- 
- pearing in the original series whose related causals 
- Appeared in the test series (6 items), (d) causals 
Appearing in the original series whose related 
counterfactual conditionals appeared in the test 
series (6 items), (e) simple conditionals appearing 
- in the original series whose corresponding causals 
- Appeared in the test series (6 items), and ( f) some 
- filler items, Again, each item appeared twice in 
the original series, and after every item had oc- 
curred once the items appeared again in a scram- 


Eu 
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- For each S, one may determine the number or 
“proportion of old ses for each item class and 
then make te comparisons across item 
t in addition, one may determine the mean 
of old responses taken over Ss for each 


gne These figures will constitute our 
> 
13 ResuLTs 
Consider first the results for Experi- 
ment l ne 


Old counterfactuals C88) and 
simple Propositions (.53) were recognized 
. As old significantly more frequently. than 
new counterfactuals (12) and new simple 
. Propositions (.14)*; old counterfactuals 
- (88) were recognized as old significantly 
- more often than were old simple proposi- 
tions (53); and there was no difference 


, Unless explicitly noted otherwise, all differences 
paer as significant represent p « 01 on a 
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in recognition between new counterfa: als 
(12) and mew simple propositions 1). 
Except for the finding that old cou er- 
factuals are recognized better tho. old 
simple propositions, these results ar« no 
particular interest except to show th. ihe 
situation is one in which Ss can distin ish 
between old and new items, and thi one 
where the results for offspring iter) will 
be relevant. It is indeed the casc hat 
offspring sentences, whether negated ote- 
cedent (.25) or negated consequent . 14), 


were judged old significantly less frequ atly 
than old simple propositions (.53), but 
more interestingly, it is also the cas ‘hat 


these offspring sentences were jud; to 
g [ j 


be old sentences significantly more | ‘ten 
than the control new simple propos | ons 
(14). Finally, if we compare offs, ing 
sentences where the consequent is ne. ted 
with those where the antecedent is ne; ted, 
we find that the former (.44) are ju ged 
old significantly more often than the 
latter (.25). 

In Experiment Il, old counterfa tual 


conditionals (.79), old simple conditionals 
(.84), and old causals (.73) are recognized 
as old more than new counterfactual con- 
ditionals (.05), new simple conditionals 
(04), and new causals (.08), with cach 
of these differences highly significant. 
While the differences between old proposi- 
tions are significant, with simple condi- 
tionals recognized significantly more often 
than counterfactual conditionals or causals 
and counterfactual conditionals recognized 
more often than causals (p < .05, sign 
test), these differences are quite small com- 
pared to those between old and new 
propositions. The differences between 
various sorts of new propositions are slight 
and nonsignificant. Related sentences are 
judged old significantly less often than 
Sentences that actually had appeared be- 
fore, but more interestingly, related sen- 
Xences are judged old significantly more 
often than their mew controls. Further, 
looking in a little more detail, we find 
that counterfactual conditionals related to 
Previously presented causals are "recog- 
nized” significantly more often (.47) than 
causals related to previously presented 
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counterfactuals (34), and that the latter 
are "recognized" significantly more often 
than causals corresponding to previously 
presented simple conditionals (.25), As 
noted above, okl counterfactual condi- 
tionals are recognized more often than old 
causals (79 vs. .73), so perhaps the dif- 
ference between counterfactuals related to 
previously presented causals (47) and 
causala related to previously presented 
counterfactuals (.34) represents only a bias 
toward recognition of  counterfactuals, 
Thus, one needs to correct. recognition 
rates for related sentences by correspond- 
ing rates for old sentences in order to 
determine whether the difference between 
AT and 23 is reliably greater than that 
between .79 and .73, Comparing for each 
5 the difference in recognition rate be- 
tween old counterfactuals and old causals 
with that between related counterfactuals 
and related causals, we find that the latter 
difference is greater for a significant ma- 
jority of Ss (p < 08), Thus, even with 
this correction taken into account, counter- 
factual conditionals related to previously 
Presented causals are recognized signifi- 
cantly more often than causals related to 
previously presented counterfactuals. 


Disct ston 


The basic trends of the results are quite 
clear in each experiment and consistent over 
the 2 experiments, While offepring or related 
sentences are recognized less frequently than 
eld sentences, they are recognized more fre- 
quently than mew control sentences. Listeners 
appear to operate on the information direetly 
provided to them, amplifying and elaborating 
that information. In considering the results 
more chwely, one important caution needs to 
be voiced with regard to their interpretation. 
There is no statistical warrant for generalizing 
these results beyond the specific samples of 
linguistic material actually employed. There 
are obviously very severe problems in defining 
A population of counterfactual conditionals 
and in sampling at random from such a popu- 


. lation; for a discussion of the "language-a«- 


fixed-effect fallacy” and some suggestions for 
corrective measures, see Clark (in press). 

In might perhaps be argued that the posi- 
tive findings of Experiment | merely represent 


some sort of a topie effect, with negated 
tecedent and consequent propositions f 
recognized because S remembers having « 
countered some similar content in the 
nal series (where these contents were act 
embedded in various counterfactual con 
tionals), This argument would not appe: 
be too convincing given the findings of E; 
periment I, where there was significant 
more false recognitions of causals related 
prior counterfactual conditionals than of ca 
als corresponding to prior simple conditional, — 
If all that we have is a topic effect, t 
there should have been no difference in false 
recognition of causals, whether the antecedent | 
conditionals were ple or counterfactual — 
Nor would the topic hypothesis account in 
any obvious way for the fact that in Experis 
ment I, false recognition of negative conse. — 
quents was significantly more frequent than — 
that of negative antecedents. Why should — 
this be the case if all that is involved is the 
recognition of propositions that had occurred — 
earlier in some form? Finally, note that — — 
recognition involved the negation of the ex- — 
plicit prior contents, and we know that gist 
memory is generally good, so that on recall. 
* can discriminate rather well between an 
expression and its negation, Nevertheless, 
à built-in control against the topic hypothesis 
may be desirable in future work, and one 
suggests itself readily, In the recognition 
series, in addition to sentences which negate 
the antecedent and consequent propositions — 
of earlier counterfactual conditionals, which —— 
we might call “legitimate” offspring, one might 
abo include sentences which asert the antes 
cedent and consequent propositions Thee 
might be called "illegitimate" offspring. On — 
the hasis of a topic hypothesis, no difference 
should be expected in false recognition of the 
legitimate and illegitimate offspring. or perhaps. F 
we might expect even greater recognition. of — 
the latter than the former (since the f 
legitimate offspring do not involve. negation - 
of previously encountered materials), In terms — 
of the position argued here, however, one 
would expect more false recognition of legiti- 
mate- than of illegitimate-offspring sen c 
With regard to the first experiment, we 
can offer no plausible reason as to why old 
counterfactual conditional should have been —— 
recognized substantially more often than old — 
simple propositions (88 vs. $3) except for 
noting that there were substantially fewer of 
the former (18) than of the latter (32) in 
the recognition series and this may have 
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resulted in less confusion for the former. 
A problem more directly related to the con- 
cern of the present research is posed by the 
finding that sentences which negate the con- 
sequent of a counterfactual conditional are 
significantly more frequently recognized than 
those which negate the antecedent of the 
counterfactual conditional (.44 vs. .25).! 
Recall that the negation of the antecedent 
is strictly presupposed by the counterfactual 
and that one cannot coherently conceive of a 
counterfactual without assuming that its an- 
tecedent proposition is negated, while it has 
been argued that the negation of the con- 
sequent is strongly suggested in most circum- 
stances but does not appear to be required. 
Thus, a sentence such as Jf he had caught the 
plane, which he did, he would have arrived. on 
lime seems inconsistent, while there is in 
principle nothing contradictory in saying Jf he 
had caught the plane he would have arrived. on 
lime, which he did anyway because he look a 
fast train. Hence it would appear that, if 
anything, the negated antecedent is more 
strongly implied by the counterfactual con- 
ditional than is the negated consequent, and 
yet our results indicate substantially more 
false recognition of the latter than of the 
former. In order to make sense of this 
finding, consider the circumstances under 
which one might be most likely to employ 
a counterfactual conditional. For example, 
one might most commonly say If he had 
caught the plane he would have arrived on time 
as an explanation for the fact that he was 
late, a fact that may be simply expressed by 
negating the consequent proposition, i.e., 
saying He did not arrive on time. If the fact 
that is being explained is somehow central 
or focal to the discourse, then it may be 
reasonable that a proposition that embodies 
this fact, i.e., the denied consequent, should 
be more likely to be inferred and falsely 
remembered than one that only purports to 
give a reason for the fact, viz., the subordinate 


* Actually, recognition memory for offspring sen- 
tences was examined both for the case where just 
l offspring of a particular counterfactual condi- 
tional was presented in the recognition series (with 
a negated antecedent provided on half of the 
occasions and a negated consequent on the other 
half) and for the case where both the negated 
antecedent and the negated consequent offspring 
of a counterfactual were presented. Since it made 
essentially no difference whether one or both off- 


Spring were tested, only the pooled results are 
„presented. 
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denied antecedent. All this, whil 


haps 
sensible enough, is of course somewha hoc, 
since one could argue that the sı nate 
causal, just because it gives an ex; ition, 


is the more central proposition t is 


needed is some manipulation that sys- 
tematically change the focus betw ie 2 
propositions, and a determination : ther 
such a shift in focus has any iatic 
effect on recognition errors in a t the 
sort used in the present study. 

With regard to the second experi: it is 
again hard to know why there sho: dif- 
ferences in recognition of the vari cinds 
of old sentences, but here the diffe: are 
certainly not of any great magnit with 
figures of .79, .84, and .73 for coun ctual 
conditionals, simple conditionals, an« isals, 
respectively, The finding that cau cor- 
responding to simple conditionals a ecog- 
nized more often than new causals gests 
that in some measure there may ir i ob- 
tain a sort of topic effect, such that a rence 
involving propositional material enc ered 
earlier may be falsely recognized. To ever, 
the finding that causals related to pre. ously 
encountered counterfactual condition are 
significantly more frequently falsely recog- 


nized than causals corresponding to previously 
encountered simple conditionals indicates that 
such a topic effect cannot totally explain 
present results. Perhaps the most interesting 
result is the finding that both causals related 
to counterfactual conditionals and counter- 
factual conditionals related to causals are 
recognized significantly more often than their 
controls, with the latter effect the stronger one. 
But why should the counterfactual relative of 
a causal be more frequently recognized than 
the causal relative of a counterfactual? Con- 
sider what is assumed or what can be assumed 
given, respectively, a causal and its related 
counterfactual conditional, e.g., the sentences 
(a) He did not arrive on time because he did 
not catch the plane and (b) If he had caught 
the plane he would have arrived on time. One 
cannot very easily assert a without assuming 
or taking for granted the correctness or truth 
of b, but it is not clear that the converse 
necessarily holds. For, as was pointed out 
earlier, b could be true and yet there is no 
absolute requirement that the person in ques- 
tion must have been late, for (c) If he had 
ESTER BP IY 

* Another possibility, pointed out by a reviewer, 
simply that semantic material from the main 


clause is better remembered than material from 
the adverbial if clause, 


is 


caught the blane he would have arrived on time, 
which he did anyway because he caught a fast 
train, is certainly a possible elaboration. The 
point is simply that a counterfactual condi- 
tional does not appear to entail the denial of 
the consequent proposition and thus, insofar 


as this is the case, does not require its related 
causal. But this analysis too may appear 
ad hoc. At least it would be supportive of 
these suggestions if, given causals and their 
counterfactual relatives, Ss were more willing 


to accept the latter as inferences on the basis 
of the former than the converse. 

Even though our interpretation of some of 
the details of the results must be quite ten- 
tative with regard to within-conditions dif- 
ferences, viz. (a) the difference between 
offspring sentences which negate the ante- 
cedent and those which negate the consequent 
and (b) the difference between causals related 
to counterfactual conditionals and counter- 
factual conditionals related to causals, it seems 
clear that there are substantial and consistent 
between-conditions differences, such that off- 
spring sentences and related sentences are 
consistently recognized more often than their 
new controls. This occurs with regard to 
both simple offspring propositions (Experi- 
ment I) and complex related propositions 
involving the causal conjunction of the simple 
offspring (Experiment II). 

Wittingly or unwittingly, Ss seem to go 
beyond the givens, using information ex- 
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plicitly provided as a clue to what is implicit 
or can be assumed given that information. 
If this is the case, then it is incumbent on 
us to investigate the processes of inference 
or natural logic involved in such elaboration 
of information. 
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MULTIPLE-EXPOSURE EFFECTS ON RESOLUTIONS 
OF FOUR BASIC CONFLICT TYPES * 


BARRY D. SMITH? LEONARD GEHL 


University of Maryland New Jersey College of Medici 
"The present study examined the 4 Lewinian conflicts in the context of a nega- 
tive-start paradigm. The Ss were given 3 trials of each of the 4 conflicts, inter- 
spersed among 60 nonconflict trials, Double approach-avoidance (Type IV) 
conflict produced less adequate resolutions than avoidance-avoidance (Type 
111), but the 2 conflicts were similar in response modes employed, slow speed 
of resolution, and high error rate, In addition, considerably blocking was 
found to Type III and IV conflicts than in work performed by Hovland and 
Sears in 1938, Repeated exposure to conflicts produced increases in speed. 
Resolution lability took the form of increased use of double and blocking re- 
sponses and decreased use of single responses across trials and was affected by 
conflict type and response adequacy. Approach-avoidance conflict showed 
the greatest overall lability, while interclass shifts were greatest for double 


approach-avoidance, 


Two recent studies of human cognitive- 
motor conflict (Epstein & Smith, 1967; 
Smith & Epstein, 1967) analyzed responses 
to approach-approach (Type 1), approach 
avoidance (Type II), and avoidance-avoid- 
ance (Type lI) conflict. These studies 
modified an experimental paradigm used 
earlier by other investigators (Hovland & 
Sears, 1938; Sears & Hovland, 1941) to 
include a negative-start condition in which 
Ss were instructionally forced to leave the 
starting point in the conflict field imme- 
diately at the onset of any conflict or 
nonconflict trial. Results of the Epstein— 
Smith and Hovland-Sears studies were 
quite discrepant on some points. ln par- 
ticular, Hovland and Sears found that 
blocking (remaining at the starting point 
after trial onset) was a common response 
to Type III conflict, while Epstein and 
Smith, using the negative-start paradigm, 
found blocking to be quite infrequent. 

Although the Epstein-Smith (Epstein & 
Smith, 1967 ; Smith & Epstein, 1967) studies 
did introduce the negative-start paradigm, 
they limited the examination of conflict 
responses to the first 3 conflict types, 


1 The authors are indebted to Frank Rosillo for 
his help in running Ss in the study. Computer 
time was provided by the Maryland University 
Computer Science Center. 

? Requests for reprints should be sent to Barry 
D. Smith, Department of Psychology, University 
of Maryland, College Park, Maryland 20742. 


eliminating the double approach dance 
(Type IV) conflict examined b; »vland 
and Sears (1938), and to a single senta- 
tion of each conflict, with the onflict 


trials. 
lution 
s were 


trials interspersed among nonconi!! 
Hovland and Sears did study the : 
of Type IV conflict, but their rest 


undoubtedly affected by their fa lure to 
use a negative starting point. In addition, 
comparisons between Type IV and the 


other 3 types in the Hovland and Sears 
study are difficult, since they used only 
20 preliminary trials with each of the first 
3 conflict types and 80 with Type IV. 
The present study was carried out, in part, 
to evaluate Type IV conflict effects in - 
comparison with the other conflict types 

Hovland and Sears (1938) also studied 
the effects of repeated exposure to con- 
flict, but only with regard to Type I 
conflict. A major purpose of the present 
study was thus to examine, in the context 
of the negative-start paradigm, response 
lability (the tendency to change or shift 
from one response mode to another with 
repeated exposure to conflict) for all con- 
flict types. 


METHOD 

Subjects 
A total of 52 Introductory Psychology students 
(24 male, 28 female) were employed as Ss. 


ditional Ss (7) were run or partially run but had 
to be dropped due to failure to understand basic 
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instructions (2), overt refusal to follow. instruc- 
tions (1), or equipment failure (4). 


paratus 


e conflict apparatus consisted of a board on 
ich were mounted 4 lights (one red and one 
ute light at each of 2 goals), a control panel that 
rmitted Æ to switch on any light or any 2 or 
re lights simultaneously, and a Standard electric 
ier wired to start simultaneously with light onset 
d to be stopped with the termination of S's 
ponse. The response field was a simple maze, 
nsisting of a diamond-shaped section followed by 
1 V-shaped section, with paths terminating at 2 
goals. The effect of the maze was such that the 
first section directed S's vision toward one goal, 
the second toward the opposite goal, and the third 
section again toward the first goal. Crossways 
between the sides of the maze permitted .S to 
detour from one side to the other during his re- 
The actual maze was one presented in an 
earlier study by Epstein and Smith (1967). The 
only major difference in the present study was 
that the outside pathw; of the maze were nar- 
2 mm. to increase error frequency and 


sponse. 


rowed to 2 
thus make error a meaningful measure, 


Procedure 


Each S was instructed to approach white lights, 
which he was told represented a million dolla’ 
and to avoid red lights, representing an atomic 
bomb, by moving a pencil along the paths of the 
maze. The starting point was made negative by 
indicating that it would represent a bomb detona- 
tion point whenever any light indicating trial onset 
came on, 

The experimental session consisted of a total of 
72 trials, including 3 trials for each conflict type, 
interspersed among 60 nonconflict (simple ap- 
proach or avoidance) trials. More specifically, 
after 14 nonconflict trials, 3 different random se- 
quences of 5 trials each were interspersed among 
nonconflict trials. Each sequence contained all 4 
conflicts and a randomly selected control (non- 
conflict) trial, All Ss received the same resulting 
random schedule of conflict and nonconflict presen- 
tations. Type I conflict involved the presentation 
of the 2 white lights simultaneously, Type II con- 
isted of the red and white lights at the same goal 
appearing simultaneously, Type II was 2 red 
lights, and Type IV w: 1 4 lights. A new copy 
of the response maze used for each succes- 
sive trial. 


RESULTS 


The results of the study were analyzed 
in terms of response speed, errors made in 
responding, mode and adequacy of conflict 
resolution, and response lability as a func- 
tion of conflict repetition. All analyses 


involved all 52 Ss except where specifically 
noted. 


Response Speed and Error 


Effects of conflict type and practice were 
examined through analyses of the rate at 
which Ss traversed the maze in response 
to a given conflict presentation and the 
relative frequency of errors made. Anal- 
yses of rate involved a speed score, cal- 
culated as the ratio of total response time 
on a given trial to distance traversed. 
The speed score was calculated for each S$ 
on each of the 3 presentations of each 
conflict type and on each of the 3 cor- 
responding random control (nonconflict) 
trials. 

A repeated-measures analysis of variance 
applied to speed scores on the conflict and 
control trials, with conflicts and trials as 
sources of main effect variance, yielded a 
significant trials main effect, Æ (2, 56) 
= 13.71, p < .001. Speed increased across 
trials, with mean values of 4.66, 5.18, and 
5.43 mm/sec for combined Conflict and 
Control Trials 1, 2, and 3, respectively. 
Conflict type was also significant, F (4, 112) 
= 30.46, p < .001 (Figure 1). Speed de- 
creased with increasing conflict difficulty 
except for a slight increase on Type IV. 
A Scheffé (1959) test indicated that the 
difference between Type III and Type IV 
conflicts was not significant, F (4, 112) 
= 1.85, p > .05. The difference between 
Type | conflict and control trials was also 
clearly nonsignificant. 
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A test was planned of the expectation, 
drawn from the work of Worell (1967), 
that response speed decreases with re- 
peated exposure to Type 1V conflict. The 
necessary planned comparison was carried 
out as a Scheffé (1959) test. Means for 
Trials 1 and 3 on Type IV conflict were 
3.83 and 4.71, respectively, representing 
a significant increase in speed rather than 
the hypothesized decrease, F (8, 224) 
= 1351, p < 10. A probability of .10 
was considered significant due to the highly 
conservative nature of the Scheffé test. 

For the error analyses, an error was de- 
fined as any movement of S's response line 
that crossed the boundary defining a path- 
way of the response maze. The measure 
utilized was error rate (errors/speed). The 
Significant effect of conflict type, F (4, 112) 
= 5.52, p < .001, resulted from an in- 
crease in error rate across the 4 major 
conflict types (Figure 2). 

, 
Response Mode and Adequacy 


Response mode and response adequacy 
were scored, using the criteria of earlier 
studies (Epstein & Smith, 1967; Smith & 
Epstein, 1967). The modes scored were: 
(a) single response, in which the goal is 
reached directly, without detour; (b) double 
response, in which both goals are reached 
either by going from one goal to the other 
or by returning to the starting point and 
proceeding to the second goal; (c) com- 
promise, in which the response terminates 
at a point intermediate between the start- 
ing point and the 2 goals; (d) detour, in 
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FiGURE 2. Errorrateasa function of conflict type. 


which the response includes ceding 
through one or more cross f the 
maze; (e) blocking, which is s a fail- 
ure to leave the starting po Ilowing 
stimulus onset; and (f) dis ed re- 
sponses, in which no attempt ide to 
remain within the confines maze. 
Adequacy was scored in relati specific 
conflict types. Detour, bloch nd dis: 
organized responses were alw ored as 
inadequate. A compromise 1 se was 
adequate only for Type III pe IV 
conflict, and the double ri was 
adequate only for Type I cont Single 
responses were adequate in 1 | con 
flict, in Type II conflict (to oncon- 
flicted goal), and on control t; (to the 
correct goal). 

Conflict and corresponding c jl trials 
were scored for mode and adi v, and 
appropriate percentages were í mined. 
Results of the present study regard 
to the first 3 conflict types wi n close 
agreement with the earlier worl Epstein 
and Smith (1967). Dominani response 
modes for each conflict type w similar 
to those in the earlier work, and adequacy 
decreased with increasing conflict difficulty 
from Type I through Type IIL conflict, 


Type 1V conflict. Since it was not fully 
investigated in earlier work, the l'ype IV 
conflict is of particular interest here. Re 
sponse mode percentages for Type lV 
conflict in the present study were quite 
similar to those for Type III, in most 
instances. Single (38.5% for Type ul 
and 40.4% for Type 1V) and compromise 
(33.3% and 28.805) modes predominated 
for both conflict types, while detouh 
double, and disorganized responses welt 
infrequent. Blocking was seen in a numbef 
of instances in response to both Type Hu 
(17.9%) and Type IV (16.6%) conflict 
Hovland and Sears' (1938) percentages 
were quite different, with 72.5% blocking 
responses and 12.5% double responses t0 
Type IV conflict, 

; Though resolution modes were similaf 
in the present study, adequacy differen 
tated Type III and Type IV conflict) 
A chi-square analysis of adequacy involv! 

only those 25 Ss who differed in adequacy, 
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from one conflict to the other. The result- 
ing chi-square demonstrated that the Type 
IV conflict did result in a greater frequency 
of inadequate responses than did Type III, 
X? (1) = 9.7, 5 < .01. 


Response Lability 


Mode lability analyses examined shifts 
in response mode from Trial 1 to Trial 2 
and from Trial 2 to Trial 3 (Table 1). 
It was found that Type II conflict resulted 
in the greatest degree of lability, while 
Type I conflict and control trials showed 
the least total mode shifts, X* (4) = 26.1, 
p < .001. 

Lability data were further examined to 
determine the extent to which Ss tended 
to shift toward or away from each major 
response mode with increased conflict ex- 
perience. Summed over conflict types, 
there were significant differential tenden- 
cies to shift away from the single response 
mode over trials, X? (1) = 4.25, p < .05, 
and toward the adequate double response 
mode, X? (1) = 7.11, p < .01, and the in- 
adequate blocking response, X* (1) — 7.12, 
p< .01. 

A further analysis of shift involved 
grouping the various responses to form 
the 2 classes originally defined by Hovland 
and Sears (1938). Class 1 contains re- 
sponses that generally terminate at a goal, 
including double, single, and disorganized 
responses; while Class 2 contains only the 
blocking and compromise modes, which do 
not terminate at goals. The number of 
shifts from Class 1 to Class 2 modes (24) 
was considerably greater than the number 
of shifts (8) in the reverse direction, 
X? (1) = 8.0, p < .01; and the number of 
interclass shifts (23) from Trial 1 to Trial 2 
was significantly greater than the number 
of shifts (9) occurring from Trial 2 to 
Trial 3, X? (1) = 61,  <.02. In addi- 
tion, frequency of interclass shifts in- 
creased with increasing conflict difficulty, 
with the greatest number occurring on 
Type IV conflict, x?(4) = 19.6, p < .001. 

Two additional factors influencing re- 
sponse lability were suggested by the work 
of Lewin (1935) and Hovland and Sears 
(1938). Lewin hypothesized that a single 


TABLE 1 


RESPONSE LABILITY (% Suiers) AS A FUNCTION 
or Conrrict TYPE 


Response mode shifts 
Conflict 


Trial 1 to Trial 2 Trial 2 to Trial 3 


15.4 9.6 
48.1 38.5 
32.7 25.0 
38.5 25.0 
Nonconflict 11.5 17.3 


Note. All percentag e based on 52 possible shifts (one per 
S) for each conflict type between each 2 trials. 


response to a given goal in a Type I con- 
flict situation weakens the valence of the 
selected goal, resulting in a shift to the 
opposite goal. This hypothesis was not 
confirmed in the present study, since only 
6.5% of the Ss shifted goals on either the 
second or third trial of the approach- 
approach conflict. An alternative hypoth- 
esis is that response lability is more a func- 
tion of the reaction type or mode itself 
than of the field situation or conflict type 
(Hoyland & Sears, 1938). This conten- 
tion was not supported by present results, 
since the major response modes with which 
Hovland and Sears were concerned— single, 
double, and compromise—were associated 
with the very similar shifting values of 
30.3%, 37.3%, and 28.1%, respectively. 

A final consideration was the extent to 
which response lability is a function of 
the adequacy of the initial response. 
Response lability between Trials 1 and 3 
was greater for inappropriate than for ap- 
propriate responses. Specifically, the mean 
percentage shift across the 6 inappro- 
priate response modes (inappropriate de- 
tour, double, compromise, single, disorgan- 
ized, and blocking responses) was 60.2%, 
while the mean for appropriate modes (ap- 
propriate double, single, and compromise 
responses) was 16.8%. An exception was 
the inappropriate blocking response, which 
was among the most stable responses, with 
only 20% shifts. 


Discussion 


Two basic questions studied in the present 
investigation concerning the Type IV conflict 
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related to its resolution under conditions of 
negative start and its relationship to other 
conflicts, particularly Type III. First, it was 
apparent that under the negative-start con- 
ditions of the present study, Type IV conflict, 
like Type III, elicited markedly less blocking 
than occurred in the work of Hovland and 
Sears (1938). Type III and Type IV con- 
flicts were also similar to each other in rela- 
tion to further response modes, speed, and 
error measures, while differing from the other 
conflicts. Only adequacy differentiated the 
2 conflicts, with Type IV producing less 
adequate responding. 

Both the Type III and Type IV conflicts, 
as well as the remaining 2 conflict types, 
showed some effects of repeated exposure. 
The overall increase in speed with practice 
was accompanied by differential mode lability, 
which was a function of conflict type and 
initial response adequacy. The hypotheses of 
both Lewin (1935), who stated that approach— 
approach conflict is subject to frequent shift- 
ing from goal to goal, and Hovland and Sears 
(1938), who said that lability is largely a func- 
tion of initial response mode, had to be 
rejected on the basis of present results, The 
failure of the Lewin hypothesis may have 
been due to the use of left and right goals 
rather than, for example, forward and reverse 
goal points, since 93.9% of all single responses 
on Type I conflict were to the right-hand goal. 
Thus, right-lateral dominance, coupled with 
high preexperimental response strength for 
responses involving movement toward the 
right, may have affected results, Somewhat 
more massed practice may also be a neces- 
sary condition for the confirmation of the 
Lewin hypothesis. The negative-start condi- 
tion of the present study, combined with the 
use of a maze rather than an open-field 
situation, and the fact that Hovland and 
Sears examined lability of only Type II con- 
flict, may have contributed to the discrepant 
findings in the 2 studies concerning their 
hypothesis, 

Further lability results showed that re- 
Sponses tend to shift away from inadequate 
early attempts and to shift from goal (Class 1) 
to nongoal (Class 2) modes, while interclass 
shifting becomes more Írequent, the more 
difficult the conflict. These findings are re- 
lated in that it is principally the Type III 
and Type IV conflicts that elicit inadequate 
responses, produce the most compromise and 
blocking (Class 2) responses, and have the 
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highest interclass lability. In ei 
inappropriate responses to Type 1i 
conflicts often result in shifts to 
equate compromise or inadequate ! ng re- 
sponses, depending on relevant 
difference variables. 

The fact that approach-avoidan« 
ras the most labile conflict whe 
class was not considered can be e 
relation to the Lewin (1935) and M 


pe II) 
sponse 
ned in 
(1944) 


models. Both of these theories cluded 
that the response to Type II sh reach 
some compromise point between t arting 
point and the conflicted goal,  ! in hy- 
pothesized, in addition, that Ss « ienced — 
in responding to approach-avoida: onflict 
situations may leave the field, a | thesis 
consistent with the position tha onflict 
produces a state of increased drive Brown 
& Farber, 1951). In the latter casc caving 
the field would be seen as one way of roducing 
the drive resulting from the conflict ation. 


Present results for Type II show single j 
responses to the nonconflicted (51.9°%) and 
conflicted (25%) goals to be most frequent, 
with compromise (3.2%) and detour (13.5%) 
modes relatively rare. The instructions and i 
obvious boundaries of the paper maze in the]! 


present study made it difficult for S to | 
leave the field. It was, however, c pos: 
sible to leave (detour or compromise) or avoid 
(single response to nonconflicted goal) they 
path leading directly to the conflicted goal | 
and hence reduce the drive. Given these | 
response alternatives, the unique ambiguity | 
of the Type II conflict, resulting from the || 
absence of any information (lights) concern- | 
ing the status of the nonconflicted goal, may — 
well have been responsible for the observed 
lability of this conflict. Moreover, the pre- | | 
dominance of single responses is consistent | 
| 


rally 


with the fact that the single mode is the 


most practiced, due to its use on noncon- | 
flict trials. 


In conclusion, given a conflict situation in | 
which the starting point is negative and field | 
departure difficult, the present study provided | 
mixed support for several hypotheses drawn | 
from earlier conflict theories and demonstrated 
that only response adequacy clearly differen- | 
tates Type IV conflict from Type Ill | 
Lability was shown to be determined primarily 
by initial response adequacy and conflict type 
Further Work systematically varying starting- 
point status, repetition of exposure, difficulty 
of field departure, situational ambiguity, and 
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relative goal location should make it possible 

to suggest some specific modifications of 

earlier conflict theories. 
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MODES OF EXTRACTING INFORMATION IN CONCEPT 


ATTAINMENT AS A FUNCTION OF SELECTION 
VERSUS RECEPTION PARADIGMS 


NEAL S. SMALLEY * 


State University of New York College at Plattsburgh 


Measures of 2 modes of inference were developed: the V measure counts in- 
ferences about attributes which are varied from a focus instance, and the C 
measure counts inferences about constant attributes. In 2 experiments 

a selection paradigm were compared with a yoked group in a reception para 
digm. Both experiments supported the prediction that the rı ption group 
would give more C inferences, but found no difference in V inferences. The 
results are explained in terms of the selective attention to S's current hy pothesis 
which is produced by the task of selecting, rather than receiv ing, information. 
The use of the V and C scores to measure focusing and scanning strategies i 


discussed. 


The purpose of this investigation was to 
examine 2 methods of extracting informa- 
tion from instances in a concept attainment 
(CA) task. In particular, it was concerned 
with whether the method differs when S can 
choose the instances he wishes to examine 
(selection paradigm), as opposed to a situa- 
tion where the choice of instances is not in 
his control (reception paradigm). 

Attempts to experimentally compare the 
selection and reception paradigms (Hunt, 
1965; Huttenlocher, 1962 ; Laughlin, 1972; 
Lowenkron & Johnson, 1968; Murray & 
Gregg, 1969; Schwartz, 1966) have based 
the comparison on effectiveness of per- 
formance as measured by number of trials 
(instances chosen) to criterion or solution 
time. No consistent pattern of results, 
favoring either selection or reception con- 
ditions, has emerged. 

One alternative approach is to focus 
research attention on the more funda- 
mental processes at each step of the CA 
task rather than on effectiveness of overall 
performance. The present investigation 
focused on the methods employed by Ss to 
extract information from each instance en- 
countered, employing measures of 2 such 
methods which were identified in a pre- 
vious, unpublished study using the selec- 
tion paradigm. For each instance which S 


* Requests for reprints should be sent to Neal S, 
Smalley, Department of Psychology, State Uni- 
versity College, Plattsburgh, New York 12901, 
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chose he was asked for a reason for 
choice and, after he was told whether 
card was positive (an exemplar) or negativi 
(a nonexemplar), he was asked for am 
inference he had drawn on the basis of t 
instance. The statements which Ss ma 
were classified into 3 types—V, C, and 
—which are illustrated in Table 1. 


utes) which are varied from the positive 
example which S is given at the beginni Ig 
of the trial. The C inference is a statement 
about one or more attributes (or values 
which are constant, or the same as the 
attributes of the given instance. The VE 
inference is a statement about both vai 
and constant attributes. Table 1 giv 
examples of the 3 inference modes for eat 
of the 4 main types of situations that com 
monly occur in conjunctive CA tasks. T d 
task given as an example has stimuli with 
4 attributes (represented as letters) and 2 
values for each attribute (represented 
numbers immediately following each letter, 
The concept shown is a 2-element co 
junctive concept, and the inferences given 
are the logically correct ones for each 0 
the 3 inference types. In addition to minor 
differences in wording, the inferences actus 
ally given by Ss showed an occasional 
logical error and were often less comple! 
in the cases where multiple inferences até 
possible (e.g., Situation I-C). 
Employing a measure of mode of inf 
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mation extraction has the additional ad- 
vantage of overcoming a problem inherent 
in the design of experiments comparing 
reception and selection paradigms. In 
order to equate the groups on stimuli re- 
ceived, previous studies have employed the 
yoked group design in which each recep- 
tion ,S received the sequence of instances 
actually chosen by an S in the selection 
group. However, Lowenkron and Johnson 
(1968), following Church (1963), have 
argued that Ss differ in their ability to solve 
CA problems and that selection Ss will 
choose sequences of instances whose diffi- 
culty is appropriate to theirabilities. Since 
reception Ss are paired with selection Ss 
at random, the correlation between ability 
level and sequence difficulty will not be 
present in the reception group, and this 
may result in a negative bias against the 
reception group. 

A study which uses as its dependent 
variable a measure of information extrac- 
tion can avoid this bias because differences 
between Ss in their abilities to extract in- 
formation should be minimal. With a 2-ele- 
ment conjunctive concept, the task of draw- 
ing an inference from each instance is 
relatively easy and almost all Ss shoüld 
produce a logically correct inference for 
each instance, as was the case in the 
exploratory study. Thus Ss should not 
differ substantially in their ability to draw 
inferences but rather in the manner in 
which they draw them. 

The present study compared selection 
and reception groups, hypothesizing that 
Ss in the selection condition would make 
more V inferences and that Ss in the re- 
ception condition would make more C 
inferences. Consistent with previous sug- 
gestions (Dienes & Jeeves, 1965; Hunt, 
1965), the selection condition, which pro- 
vides the opportunity for manipulation of 
the type and order of information to be 
examined, is assumed to thereby permit 
Ss to take a more active role, systematically 
varying the attributes of the stimulus 
materials and thus focusing their attention 
on V attributes. On the other hand, the 
reception condition, since it does not pro- 
vide this opportunity, is assumed to en- 


TABLE 1 


THREE INFERENCE TYPES FOR THE Four Most 
FREQUENT SITUATIONS WITH A Two-ELEMENT 
CONJUNCTIVE CONCEPT 


Situ- ate. | Infer- 
ation] Instance | Cäte-| ence Inference 
type gory | mode 
ALBLCLDI| + (The given instance.) 
1 F | v. | Atisnot part of the con- 


cept. 
| C (Bi and C1), (B1 and 
D1), and (C1 and D1) 
are possible concepts. 
VC | (Aland B1), (Al and 
C1), and (Aland D1) 
z are not the concept, 
IL | A2,B1,C1,D1 | — Vv A1 is part of the concept. 
Cc (B1 and C1), (B1 and 
D1), and (C1 and D1) 
are not the concept. 
(A1 and B1), (A1 and 
C1), and (A1 and D1) 
| 4 are possible concepts. 
IH | A2,B2,C1,D1 | + | V | Neither A1 nor B1 is 
| part of the concept, 
> (C1 and D1) is the con- 


cept. 
VC | (At and B1), (A1 and 
| CI), (A1 and DI), (B1 
and C1), and (BT and 
| D1) are not the con- 


| vc 


cept. 
IV | AZ,B2,CT,Df x V Either A1 or Bl or both 
x are part of the concept, 
| € (C1 and D1) is not the 


concept. 

VC | (Al and B1), (Al and 

| C1), (Al and D1), (BL 
and C1), and (B1 and 

D1) are possible con- 

| cepts, 

i I 


Nole, The V inference is a statement about one or more a 
tributes (or values of tho ttributes) which are varied from 
the positive example which < iven at the beginning of a trial, 
The C inference is a statement about one or more attributes (or 
ues) which are constant, or the same as the attributes of the 
given instance. The VC inference is a statement about both 
varied and constant attributes. 


gender a more passive role in which § 
attends to patterns of relationships common 
to the instances observed and thus focuses 
on C attributes. 


EXPERIMENT | 
Method. 


Subjects and design. The Ss were 28 college fresh- 
men who volunteered during a summer orientation 
program prior to their first year of college. Two 
groups were employed, a reception (REC) and a 
selection (SEL) group. In order to control the 
stimuli received by the 2 groups, a yoked group 
design was used in which Ss in the REC and SEL 
groups were randomly paired and each REC S re- 
ceived the instances chosen by his paired SEL S. 
Thus Ss were randomly assigned to groups with the 
necessary restriction that a particular S not be 
assigned to the REC group unless an unpaired SEL. 
S who had already completed the experiment was 
available. 

Materials. The instances were hand-lettered on 
5.1 X 7.6 cm. white cards and arranged in an ordered 
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array. Each instance contained 4 capital letters—A, 
H, Q, and R—each of which had either the number 
1 or the number 2 printed as a subscript below it. 
The letters were the attributes and the subscripts 
were the values of those attributes; therefore this 
was a 4-attribute, 2-value task, The array con- 
tained all possible instances ordered so that 
card was adjoined on all sides only by cards which 
1 from it in a single attribute, 

ce the purpose of the experiment was to study 
types of inferences, it was desirable to control possi- 
ble sources of erro confusion for the Ss. Ex- 
ploratory studies seemed to indicate that unitary 
stimuli such as the usual geometric figures used in 
CA experiments occasionally produced such con- 
fusion, presumably due to failures to separate the 
attributes combined in the single geometric figures. 
The letter-and-subscript form of the stimuli in the 
present experiment was an attempt to reduce this 
confusion by physically separating the attribute 
value pairs. 

Procedure. The directions read to S explained the 
construction of the cards, stated that for cach prob- 
lem the concept would involve only 2 attributes 
conjunctively combined, and gave an example of 
one such concept. The directions for the 2 groups 
differed only in whether the Se were instructed to 
select cards from the array (the $ group) or were 
told that Æ would give them the cards (REC group). 

All Ss received 4 problems, or different concepts, 
each of which began with the card A;H,Q;R;, which 
was always positive. For each successive card, 4 
steps were followed. First, a SEL S chose his card; 
a REC 5 was given the card chosen by his paired 
SEL S. Second, Sp in both groups were asked: 
"Dy you have any hypotheses about what the con- 
cept might involve?" Third, the category of the 
oird (positive or negative) was announced, and the 
«ard was placed in either of 2 columns headed with a 
plus or minus sigan, Fourth, the Ss were asked: 
"Does that card lead you to any particular con- 
clusions?" In the SEL group a problem was com- 
pleted when 5 announced the correct concept— 
Woless that announcement was a guess, that is, 
wnless SN had not chosen enough cards to be logically 
certain. When such guesses occurred, S was en- 
couraged to pick additional cards to “test” his guess, 
Ta the REC group a problem was not completed 
tantil S had received all of the cards which his paired 
SEL S had chosen for that problem, even if the 
REC S attained the concept earlier than the SEL S. 
M à REC S had not attained the concept when the 
SEL S's cards ran out, he was encouraged to go back 
over the cards and attempt to figure out the concept. 
He was not given any additional cards. These pro- 
cedures for the 2 groups insured that each pair of 
Se would have the opportunity to examine exactly 
the same cards from which to draw inferences, 

The semions were tape-recorded using a cassette 
recorder which was in full view of S. In addition, 
Æ wrote down each card that S chose or was given 
And attempted to note every card to which 5 pointed 
his verbal explanations. 
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value pairs mentioned were the same) 
or constant with, the given card, Thus 
inferences were always about attribi 
which had a subscript of “1” on the @ 
in question. Finally, an inference | 
classed as VC if the attributes or attrib 
value pairs mentioned in the inference tl 
cluded at least one which was V type dl 
one which was C type, as just defi 
Examples of each type of inference in W 
situations most common for this task 
given in Table 1. In cases such as the 
statement for Situation |, 
possible concepts which S actually 
was counted as a separate C inference: 
There were, then, 3 response measure 
the total number of V, of C, and of 
responses over the 4 problems. The 
and REC groups were compared on S 
of these 3 measures by means of a matchi 


groups / test, The mean number of C 
inferences in the REC group was 24.6, and 
in the SEL group it was 14.9. The differ- 
coce was in the predicted direction and was 
tatistically significant, ¢ (13) = 3.15, p < 
obo The V means10.8 (SEL) and 8,9 
KEC)—were in the predicted direction 
ut were not significant, / (13) = ,68. No 
prediction was made about the VC meas- 
ire, nor was it found to significantly dis- 
'inguish the 2 groups, / (13) = 1.40, 

"Thus the results were consistent with the 
hypothesis that the reception paradigm 
tends to produce a more passive, pattern- 
oriented approach to extracting informa- 
tion from instances than does the selection 
approach, However, there was no support 
for the hypothesis that the selection 
paradigm produces a more active, sys- 
tematic approach to information extraction, 

For comparison with previous studies 
employing measures of effectiveness of per- 
formance, the number of trials (instances) 
required to attain each of the 4 concepts 
wis . A concept was said to te 
attained on the first trial in whieh (a) 5 
announced the correct concept without 
montioning any alternative concepts, and 
(b) it was logically possible to definitively 
conclude what the concept was Using 
this criterion, the mean number of trials 
required 10 solve all 4 problems was 167 
in the REC group and 17.4 in the SEL 
group, a difference which was statistically 
significant, £ (13) = 2.86, p< .02. The 
superior performance of the REC group 
obviously contradicts the possibility of a 
negation lias against the reception group 
resulting from the yoked control design. 
Additional evidence against this bias was 
also furnished by the error rates, which 
failed to support a difference in difficulty 
letween the stimulus sequences for the 2 
groups. There were no differences in either 
the total percentage of logically erroneous 
inferences (5.2°% for SEL and 4.8% for 
REC) or in errors in particular types of 
inferences—for V inferences, 6.0% (SEL) 
and 4.0% (REC), and for C inferences, 
3.4% (SEL) and 4.6% (REC) 

One possible objection to Experiment | 
centers on the part of the procedure that 

M 
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asked S to give any hypotheses he held 
about the concept. In answering this ques: 
tion, which was asked just after he ha 
made his choice of an instance, the SEL 
S may have given the hypothesis he was 
attempting to test with that choice, rather 
than giving all the possible hypotheses: 
which might be tested with that instance, — 
The REC Ss, unrestricted by a prior hypo- 
thesis, may have been able to give more 
complete lists of possible hypotheses, Since 
the hypotheses were combined with the | 
inferences given in response to the question 
asking for S's conclusions, it is possible that 
the observed differences were due pri- 
marily to the difference between the num- 
ber of hypotheses given, However, a 
repetition of the / tests based only on the 
inferences given to the "conclusions" ques- 
tion produced findings nearly identical to 
those obtained initially : for the C measure, 
L3) = 390, p < 001; for V, £ (13) = 92, 
m; and for VC, £ (13) = LAS, ns. 

This revision in the C measure docs not, 
however, entirely meet the objection. The 
“hypotheses” question itself may have 
directed the REC Ss to look for possible 
hypotheses which they would not other 
wise have produced, Therefore, asking for 
hypotheses may have facilitated the per- 
formance of REC Ss while giving no cor- 
responding help to SEL Ss, who were 
presumably set by the hypothesis they had 
already produced in the process of selecting — 
an instance. To eliminate this interpreta: 
tion of the results, a second twas 
run which replicated the first one except 
that the request for hypotheses was omitted 
from the procedures. 
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the concept might involve?” was omitted from the 
procedures for both groups. Thus the only oppor- 
tunity to make verbal inferences was in response to 
the question asking for conclusions from each card. 


Results 


The 3 measures were derived in exactly 
the same way as in Experiment l. Again, 
there was a considerable difference between 
the mean number of C inferences in the 
REC group (24.6) and the SEL group 
(17.8), a difference which was statistically 
significant, ¢ (15) = 3.51, p < .01. For 
the V measure, neither the direction nor 
the magnitude was as predicted—mean 
SEL = 5.6, mean REC = 6.8, ¢ (15) = 
.62. Again, the VC measure did not pro- 
duce a significant difference, £ (15) = .62. 
In this experiment, there was no significant 
difference in trials to criterion—means of 
19.6 (SEL) and 19.9 (REC), ¢ (15) = 
providing less strong but still consistent 
evidence contrary to the hypothesis of 
negative bias against the REC group. As 
in Experiment I, no significant differences 
in error rates were found. For the SEL 
and REC groups respectively, the percent- 
ages of erroneous inferences were 4.3% and 
7.2% for all inferences combined, 3.3% and 
3.7% for V inferences, and 3.9% and 8.4% 
for C inferences. 

The C score may be seen not only as a 
measure of a mode of drawing inferences 
but also as a measure of the completeness 
with which these inferences are drawn. 
Recall that in Situations I and II, 1, 2, or 
all 3 possible concepts may be given. Each 
concept given was counted into the total 
C score. Thus, the more such concepts S 
gave for each instance, the more complete 
was the inference process and the greater 
was the information extracted from the 
instance. However, we may ask whether 
the difference in C scores simply reflects 
greater completeness in the REC condition 
than in the SEL condition, or whether there 
is an actual preference in the REC condi- 
tion for inferences of the C type. To get 
some indication of this from the present 
data; the C score was revised by counting 
only one CZinference per instance in cases 
“where 2 or 3 such inferences were given; 
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the mean C scores for the REC and SEL 
groups then became 17.5 and 15.2, re- 


spectively. The difference is in the pre- 
dicted direction, although not significant, 
t (15) = 2.01. 


DISCUSSION 


The major difference between the selection 
and reception conditions appears to be in the 
completeness of the C inferences drawn by 5s 
in the selection condition. Thus, instead of 
the hypothesis of active and passive roles 
initially suggested, a better formulation can 
now be offered. The task of selecting instances 
may set S to extract only information relevant 
to his current hypothesis and may therefore 
result in his ignoring additional information 
which the instances contain. When S receives 
rather than selects instances, he may be less 
set by any hypotheses he has and therefore 
able to extract more information from each 
instance. 

More generally, the modes of information 
extraction identified in thi appear to 
underlie the CA strategies des by Bruner, 
Goodnow, and Austin (1956). In focusing 
strategies, S selects an instance which varies 
one or more attributes from the given instance. 
His goal is to gain information about those 
attributes, and so his inferences should be 
about varied (V) attributes. In scanning stra- 
tegies, S tests one or more possible concepts 
which are directly represented in the test in- 
stance. Since these possible concepts will also 
be embodied on the given card, the inferences 
drawn will be about constant (C) attributes. 
However, the V and C scores are not perfect 
indexes of the strategies employed because Ss’ 
verbal statements include additional deduc- 
tions about attributes and concepts not directly 
tested by a particular card. The V and C 
scoring system compares favorably with the 
measure of focusing developed by Laughlin 
(1965) in that it requires fewer, simpler de- 
cisions by the scorer and is applicable to both 
reception and selection paradigms. 

Perhaps because the V and C measures 
reflect modes of extracting information from 
instances rather than either patterns of in- 
stance selection or effectiveness of perfor- 
mance, they may prove applicable to a variety 
of experimental situations and problems. 


REFERENCES 


Bruner, J. S., Goopnow, J. J., & Austin, G. A. 
A sae) of thinking: New York: Wiley, 1956. 


SELECTION VERSUS RECEPTION PARADIGM 61 


Cuuncu, R. M. 
on behavior. 
369-402. 

Dienes, Z. P., & Jeeves, M. A. Thinking in struc- 
tures. London: Hutchinson Educational, 1965. 

Hunt, E. Selection and reception conditions in 
grammar and concept learning. Journal of Verbal 
Learning and. Verbal Behavior, 1965, 4, 211-215. 

HUuTTENLOCHER, J. Effects of manipulation of 
attributes on efficiency of concept formation. 
Psychological Reports, 1962, 10, 503—509. 

LAUGHLIN, P. R. Selection strategies in concept 
attainment as a function of number of persons 
and stimulus display. Journal of Experimental 
Psychology, 1965, 70, 323-327. 

LAUGHLIN, P, R, 


The varied effects of punishment 
Psychological Review, 1963, 70, 


Selection versus reception con- 


cept-attainment paradigms for individuals and 


cooperative pairs. Journal of Educational Psychol- 
ogy, 1972, 63, 116-122. 
LOWENKRON, B., & Jounson, P. J. Yoked group 


comparisons of selection and reception paradigms 
of concept attainment: A biased procedure, 
PONE: Reports, 1968, 23, 1143-1149. 
Mur ., & GREGG, R. E. Reception versus 
A on ix edures in concept learning. Journal 
of Experimental Psychology, 1969, 82, 571-572. 
Trial-by-trial analysis of processes. 
unctive concept-attainment 
xperimental Psychology, 1966, 


in ES and dis 
tasks. Journal of E 
56-465. 


(Received March 22, 1973) 


Journai of Experimental DXCEUDEY 
1974, Vol. 102, No. 1, 


EFFECT OF NOISE ON THE STROOP TEST 


L. 
Medical Research. Council, 


Noninterfe! 
used to test 
were performed during the fi 
and quiet. 


ence between performance on interference and noninterference tes 


interference in noise. In E 
ed independently. 


R. HARTLEY? anb R. G. 


xperiment II, exposure duration and practice were 
The brief exposure to noise was beneficia 


ADAMS 


Applied Psychology Unit, Cambridge, England 


ence tests and 2 versions of the Stroop color interference test were 
s in loud noise and quiet. 


Interference and noninterference tests 


st and last 10 min. of a 30-min. exposure to noise 
Interference in noise and quiet was measured by taking the gitter- 


The . 


s. 
in Experiment I who were tested at both exposure durations showed increase d 


and de- 


creased interference, and the long exposure increased interference, suggesting a 


cumulative adverse effect of noise. 


Exposure to loud noise undoubtedly 
produces changes in human performance 
efficiency (Broadbent, 1953). "These per- 
formance changes have been attributed to 
the amount of stimulation being received : 
excessive stimulation from intense noise 
is supposed to overarouse S and cause 
impairment (Broadbent, 1963), but a mod- 
erate amount of stimulation can cause 
some improvement (Davies & Hockey, 
1966; McGrath, 1963). These performance 
changes have occurred toward the end of 
a lengthy (30-min.) test in continuous 
loud noise (Broadbent & Gregory, 1963). 
The changes occurring with continued ex- 
posure to loud noise include increases in 
errors and pauses in a serial reaction task 
. (Wilkinson, 1963) and increased attention 
| to high-probability sources of information 

(Hockey, 1970). Broadbent (1971) sug- 
gests that one interpretation of these re- 
sults is that the increased arousal accom- 
panying exposure to intense noise leads to 
an impairment of the mechanism that 
filters information from the environment 
for further processing. Some of the evi- 
dence for this view derives from a series 
of studies involving the visual identifica- 
tion of common and uncommon words in 
noise. It was found that the uncommon 
word was more difficult to see in noise, 
but only when both classes of words were 
present. Noise did not alter the ratio of 


1 Requests for reprints should be sent to L. R. 

— Hartley, Medical Research Council, Applied Psy- 

4 thology Unit, 15 Chaucer Road, Cambridge, 
. England. 
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misperceptions of common and uncon: 
words. Noise clearly affected the 
tion of the stimulus rather than cha: 
the bias in responding to one or the : 
class of words. 

In the Stroop test (Jensen & Rol: 
1966; Stroop, 1935) color names written i s 
inks of different hues, excluding ink of t 1a « 
hue the name indicates, are presented to Ss | 
who are required to name the hue of t 13. 
ink and not the color name. The corr, 
parison or control task may involve nar» 
ing monochromatic color names or nam: a 
the hue of the ink in which nonverl>-a 
symbols are written. Responding to tl» < 
hue of color names takes longer thza + 
either of the comparison tasks becau Em 
both name and hue are appropriate to th <- 
same set of responses. Hence, the irre~y - 
evant feature of the stimulus (color nam «~ " 
must be excluded from the analysis. T 14 < 
color name dimension must be filtered fro y E 
the hue dimension (Dyer, 1971). z 

It would be expected that if filtering 


. " Is 
affected by noise-induced arousal, t tia 
Stroop test should show impairment in 
noise which would increase with length oF 


exposure to noise, as other tests hax. 
shown it does. In the following expea-i — 
ments, versions of the test not requiri, y 
an overt verbal response were used. 


EXPERIMENT I 


Method 
Subjects, Eighteen enlisted men served as S 
and each § was tested individually. No S Rag 
< 
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done the test before. All Ss had normal hearing ; 
less than 35-db. loss in one ear or 30-db. loss in 
both ears at any of 7 test frequencies covering 
the audible range. 

Materials, Two packs of 100 cards, one experi- 
mental (E) and the other control (C), were used. 
Cards in the E pack displayed 5 color names 
red, blue, green, violet, and black—written in 

B uppe! lettering in inks of the 5 hues. No color 
name ever written in the hue the name in- 
dicated. There were 5 cards of each hue/name 
combination in the pack. ch card in the C pack 
displayed 5 crosses drawn in any one of the 5 hues 
of the E pack. There were 20 cards of each hue in 
the pack. The Ss sorted each of the 2 packs as 
quickly and accurately as possible into 5 piles 
corresponding to each of the hues. 

Procedure. All 18 Ss were tested in conditions 
of noise (N) and quiet (Q). These 2 tests were 
given on separate days at between 8:00 and 9:00 A.M. 
Each S sorted both the E and C pa in the first 
and last 10 min. of a 30-min. exposure to N and Q. 
The Ss read magazines in the interval between 
the tests at the beginning and end of the exposure. 

Broad-band noise was used in all conditions, 
having constant energy per cycle between 50 and 
4,000 Hz. In N, this was presented at 100 db., 
and in Q, at 70 db., measured on the C scale, having 
à substantially flat frequency response. 

Half of the Ss had the N condition on their 
first test and half had the Q condition. Presenta- 
tion order of the E and C packs to each group of 
9 Ss was counterbalanced within each S. Prior 
to the first test, practiced sorting both packs 
of cards in Q. ch S was told his scores after 
sorting each pack, and the scores were also dis- 
played on a group chart near .S. 


Results 


The times taken to sort each pack were 
analyzed using a repeated measures analy- 
sis of variance with logarithmic transfor- 
mation of the scores to achieve normality 
of distribution. Interactions of each factor 
with Ss were used as error terms. 

Sorting on the second test, following the 
20 min. of prior exposure, was faster than 
sorting on the first test in every condition 
(p « .008). Pooling tests in the N and Q 

- conditions, the type of pack sorted inter- 
acted with the position of the test in the 
i-hr. exposure, F (1, 17) = 6.32, p <.025. 
The difference in time taken to sort names 
and crosses was less at the end of the 
exposure than at the beginning. Com- 
paring tests in the N and Q conditions 
independently, N increased the time taken 
to sort the E pack by about 3% and 


TABLE 1 


MEAN TIME (IN MIN.) To Sort Each Carp Pack 
UNDER EACH CONDITION 


Time of sorting task 


| Card-pack stimuli 
| 
| 


Color names Crosses 

NI "ac RO 
Quiet condition 

First 10 min. 2.3167 

Last 10 min. 2.0722 
Noise condition 

First 10 min. | 2.6990 | 2.2593 

Last 10 min. 2.3880 2.0620 


Note. Time of sorting task refers to its occurrence within the 
30-min, exposure to either quiet or noise, 


decreased the time taken to sort the C 
pack by about 1% (Table 1). 

The difference in time taken to sort E 
and C packs was nearly twice as large in N 
(22.97 sec.) as in Q (14.69 sec.), F (1,511) 
= 455, p < .05. Although noise caused an 
increase in the difference in sorting time 
of the 2 packs, there was no evidence that 
it interacted with the position of the test 
in the 3-hr. exposure (p = .25). The in- 
creased difference in sorting time of the 
2 packs can therefore be regarded as 
similar at the beginning and at the end 
of 5 hr. of noise. 


EXPERIMENT II 


Experiment I examined whether noise 
increased the interference of color name 
with the selection by hue and whether 
this interference increased with the dura- 
tion of noise exposure. The fact that 
there was no increased interference with 
a long as compared to a short exposure 
to noise could be due to a number of 
factors; in particular, the fact that duration 
of exposure to noise was confounded with 
practice at test, and the latter reduced 
the interference. In Experiment II, these 
2 variables are assessed independently and 
a modified version of the test, involving 
a much smaller motor component, was 
used. 
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Method 


Subjects. "Two groups of 16 Ss each were tested. 
All were enlisted men, none of whom had taken 
part in Experiment I. The Ss were tested in 
groups of up to 6 at a time. All Ss had normal 
hearing; less than 35-db. loss in one or 30-db. loss 
in both ears at any of 7 test frequencies covering 
the audible range. 

Materials. In the modified version of the Stroop 
test, the material was presented in a written form, 
and .S ticked the appropriate item, rather than 
sorting it from a set of cards. The material was 
presented in a series of sheets, each containing 
32 lines of material. On E sheets, each line was 
made up of the color names, red, green, brown, 
black, and blue, written in inks of these hues. 
No color name written in the hue that the 
name indicated. Each line on the sheets contained 
1 stimulus color name on the left and the 5 pos- 
sible response color names on the right. Each .$ 
was instructed to select from the response set the 
color name appropriate to the hue of the stimulus 
name. As a control, a noninterference (C) test 
was also used. The format of these C sheets was 
identical to that of the E sheets, and they con- 
tained the same set and number of color names 
the E she however, color names were all printed 
in black ink. Each S was instructed to select 
from the response set the same color name as the 
stimulus color name? In both C and E tests, 
S crossed out the response word in as many lines 
of material as he could in 5 min. 

Procedure. All Ss were tested twice, once in 
both the N and Q conditions. A practice test 
was also given. These 3 tests took place on 3 con- 
secutive days between 8:00 and 9:15 A.M. 

Broad-band noise having equal energy per octave 
was used in all conditions. This noise was pre- 
sented at 95 db. in the N condition and at 70 db. 
in the Q condition, measured on the C scale. The 

- differences in sound-pressure level and spectrum 
of the noise between the 2 experiments reported 
here were necessitated by administrative reasons 
outside E's control. 

One group of 16 Ss (Group 1) was exposed to 
only 10 min. of Q or N, performing both C and E 
tests immediately following entry into the experi- 
mental sound room: a second group of 16 Ss 
(Group 2) performed both C and E tests in the 
last 10 min. of a 30-min. exposure to N and Q. 
The Ss in the latter group read magazines prior 
to the tests during the first 20 min. of both the N 
and Q conditions. In each condition, S did both 
E and C tests for the 5-min. period allowed. Half 
of the Ss in each group were tested in N on their 


2 This monochromatic control was selected in 
preference to material containing stimuli of dif- 
ferent colors for ease of duplication and prepara- 
tion, with a view to possible large-scale use of the 
test in the future. This experiment was part of 
à program of evaluation of the test for use in 
field studies of environmental stress. 
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first test day On cach test day 
half of the 
the C test first. Both tests were scored in terms 
of number of lines responded to correctly, All 

were instructed before each test to work as 
and accurately as possible, completing 
lines of the test as they could. The 5’s score: 
displayed in a group chart on the wall near him 


nd half in Q. 


juickly 


Results 


The number of lines completed 
analyzed using analysis of variance, 
Experiment I, and Wilcoxon's test, he 
Ss worked faster on their second test than 
on their first on E sheets (p < .001) nd 
on C sheets (p < .02), Wilcoxon's test. 
Every S worked more slowly on the 
than the C test in every condition. 8 
Table 2 shows, there was a tendency for 
Ss to work slower on the C sheets avd 
faster on E sheets in the 10-min. expost: e 
to N, as compared to Q. On the otier 
hand, the group with 30-min. exposure 
tended to work slower on E sheets aid 
faster on C sheets in N as compared to Q. 
None of these differences were reliable 
however. The changes in speed were 
about 7% on E sheets and about 1% on 
C sheets. 

The differences in number of items cor- 
rectly completed on E and C tests in each 
condition were used to assess the inter- 
ference caused by N as compared to Q. 
The Ss made very few errors on either 
test. The average number of errors on 
each C and E test was less than 1. Taking 
the difference, or interference score, be- 
tween C and E tests, there was no overall 
effect of N as compared to Q. There was 
however, a strong interaction between In- 
terference and Duration of Exposure to N 
and Q, F (1, 30) = 15.81, p < .001. This 
interaction occurred because the inter- 
ference, or difference between C and E 
tests, was less in the N condition (45.13) 
than in the Q condition (55.50) in the 
first 10-min. exposure, F (1, 30) = 7.46, 
p «.025. Only 2 out of 16 Ss did not 
show less interference in N. However, 
considering the other group of Ss, there 
was more interference with the 30-min. 
exposure to N (57.82) than in Q (40.00), 
F (1, 30) = 8.38, p < .01. Again, only 


s had the E test first and half had 


A. 
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2 out of 16 Ss did not show greater inter- 
ference in the 30-min. exposure to N than 
to Q. These 2 effects of noise did not 
relate systematically to either the order 
in which N and Q conditions were received 
or the order in which the 2 tests were 
received. Neither could there be any 
effect of instructional bias upon the group 
since not all members of a group had C 
and E tests presented in the same order. 


DISCUSSION 


The second version of the Stroop test is 
clearly more sensitive than the card-sorting 
version. In both experiments, nearly all the 
change in interference was contributed by 
performance on the experimental rather than 
the control material. The Ss in Experiment II 
showed a much more reliable change in inter- 
ference than those in Experiment I, notwith- 
standing the higher sound-pressure level in the 
first experiment. This is probably due in 
part to the much smaller motor and larger 
perceptual component in the test in Experi- 
ment II. 

In addition, in the first experiment, the 
response of sorting by color was unrelated to 
the semantic description of the hue. In Ex- 
periment II, although S did not verbalize, he 
selected the appropriate semantic response in 
relation to the hue. Pritchatt (1968) con- 
sidered 2 conditions similar to these experi- 
ments and also obtained considerably more 
interference when S was required to press 
a key labeled by color name rather than hue. 
The difference in interference between the 2 
versions is generally consistent with the view 
of Hock and Egeth (1970) that much of the 
interfering effect is due to disruption of the 
encoding of the name of the hue by the 


verbal dimension of the stimulus or its 
aftereffect. 
The effect of noise on this. interference 


depends on duration of exposure; interference 
was reduced by the short exposure, confirm- 
ing the results of O'Malley and Poplawsky 
(1971). The increased impairment during the 
long exposure is consistent with the change 
in performance in other tests during a long 
exposure to noise, where impairment has 
increased with exposure. Hartley (1973) has 
found that a 40-min. exposure to noise causes 
impairment in a serial reaction test performed 
only in the last 20 min. of the exposure. 
This impairment was worse than when 20 min. 


TABLE 2 


MEAN NUMBER or Lines COMPLETED 
UNDER EACH CINDITION 


Stimulus sheet. 
Grup — | 
Control Experimental 
Quiet condition 
a s — — Et XE uai 

1 170.13 114.63 

2 172.44 132.44 
Noise condition 

1 | 169.38 124.25 

2 176.88 125.06 


Note. Group 1 Ss were exposed to only 10 min. of either quiet. 
or noise; Group 2 Ss were tested after 20 min. of a 30-min. ex- 
posure to quiet or noise. 


of noise was presented in the last half of 
40 min. of performance. These results indi- 
cated that continuous noise led to a pro- 
gressive increase in impairment, independent 
of task duration. In Experiment II, the dif- 
ference in performance between the brief and 
the long exposure may also be construed as 
further evidence that the effect of noise on 
performance changes with exposure. Impair- 
ment appeared following the long exposure 
without any prior performance. 

There is some evidence to connect the 
difference in interference between short and 
long exposure to noise with changes in arousal. 
Agnew and Agnew (1963) and Tecce and 
Happ (1964), using threat of electric shock, 
and Callaway (1959), using an amphetamine, 
obtained results indicating reduced inter- 
ference under these conditions. On the other 
hand, Callaway (1959), using amylobarbital, 
and Ostfeld and Aruguete’ (1962), using 
hyoscine, obtained results indicating increased 
interference under these conditions. It would 
be reasonable to assume, although without 
substantiating evidence, that stimulants and 
shock raised arousal, while the sedatives de- 
pressed arousal. The beneficial and detri- 
mental effects of short and long exposure to 
noise may likewise be connected with changes 
in arousal level. 

A significant feature of Experiment II is 
the absence of any auditory cues in perfor- 
mance of the test. Some auditory cues, either 
generated by S or by the apparatus, have 
been present in nearly all tests that have 
shown impairmentin noise. Although the fact 
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that noise has shown its effect at the end of 
the test suggests that masking of these audi- 
tory cues did not contribute to the noise 
impairment observed, this factor could not 
be ruled out (Kryter, 1970). For example an 
interaction between time spent on the task and 
loss of auditory information could account for 
impairment in noise appearing at the end of 
the test. The results of Experiment II, a task 
with no auditory component, clearly indicate 
that the adverse effect of noise need not be 
due to the loss of these cues, particularly in 
view of the initial beneficial effect of noise 
also obtained. Nevertheless, these results do 
not exclude the possibility that loss of task- 
independent auditory information is connected 
with impairment when the loss is prolonged 
for 20 min, 
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VARIABILITY AND CONTROL IN DICHOTIC MEMORY! 


STANLEY R, PARKINSON ? 


Arizona State University 


Experiments are reported in which Ss were instructed on each trial to attend 
to one of 2 stimultaneously presented messages, Following a procedure de- 
veloped by M. P. Bryden, order.of report was manipulated by requiring Ss 
to either recall the attended message first and unattended mess: 
(AU report), or the unattended message first and attended me 
(UA report). While the group data of Experiments I and 11 provided a clear 
replication of Bryden's results, extensive individual differences in unattended- 
ear recall functions were obtained. Conventional single-channel digit spans 
were a: ed for Ss in Experiment II] and Ss were grouped according to their. 
digit span. Correlations between digit span and unattended-ear performance 
yielded coefficients of .88 and .84 in AU and UA reports, respectively. The 
theoretical implications posed by the relation between digit span and dichotic 
memory and the increase in experimental precision which is gained by the 
knowledge of digit span warrant the further use of digit span as a concomitant 
variable in dichotic memory research, 


Dichotic memory refers to a set of ex- 
perimental operations in which Ss are given 
2 different lists of items simultaneously, 
one to each ear, with instructions to re- 
produce as many of the items as they can 
following list presentation (Broadbent, 
1971). 

In the special case of the dichotic ex- 
periment where stimulus presentation is 
rapid (14-2 pair/sec) and where both 
lists consist of a single stimulus type (e.g., 
digits) recorded in the same voice, Ss tend 
to report items from one ear and then 
from the other ear (ear-by-ear report) 
rather than reporting items by pair (pair- 
by-pair report) according to their order 
of arrival (Broadbent, 1954; Bryden, 1962). 
When forced to recall pair by pair under 
these stimulus conditions, accuracy levels 
are decreased. 

The reliability of these findings moti- 
vated Broadbent (1971) to formulate a 
model for dichotic memory performance. 
Broadbent proposed the existence of 2 sys- 
tems: an "S" or storage system capable 
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of parallel processing and thus of holding 
information received by 2 ears simulta- 
neously, followed by a “P” or perceptual 
system limited to sequential information 
processing. According to the model, audi- 
tory-verbal information enters the orga- 
nism via the S system. A selective filter, 
operating in time between the S and P 
systems, selects the message from one ear 
for entry into the P system and attenuates 
processing of the message received by the 
other ear. The P system processes (cate- 
gorizes) the selected ear message and that 
message is reported first during recall. 
"The initially unselected message is held in 
the S system until processing of the first 
message has been completed, at which 
time it gains access to the P system. 
Another way of stating the operation of 
the model is that an individual attends to 
and rehearses the message received by one 
ear (P system) while the message received 
by the other ear is held in a very-short- 
term buffer or echoic store (S system). 
Following recall of the attended message, 
S shifts his attention to and recalls the 
second message. 

As Broadbent (1971) has noted, the S 
and P system nomenclature is somewhat 
misleading as the S system is viewed as 
a parallel processing system while the P 
system is assumed to be sequential in 
operation. Due to this confusion, and to 
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more closely align the terminology with 
other memory models, the term “echoic 
Store" (Neisser, 1967) will be used when 
referring to a very-short-term auditory 
buffer in which information is relatively 
unanalyzed (S system), and “active store" 
(Kroll, Parkinson, & Parks, 1972; Stern- 
berg, 1969) will be used when referring to 
a system consisting of attention and re- 
hearsal processes (P system). 

A finding of major importance for the 
present purpose is that recall accuracy for 
the ear reported second in dichotic memory 
tasks is generally lower than that for the 
ear reported first (e.g., Bryden, 1964). 
In terms of the model under consideration 
this finding implies that the echoic store 
is (a) limited in duration, and/or (b) lim- 
ited in capacity, and/or (c) vulnerable to 
either input or output interference (or 
both) resulting from first-ear processing. 

Broadbent (1957) conducted an experi- 
ment in which the 2 messages of dichotic 
pairs differed in length. The message to 
the ear to be reported first consisted of 
6 digits, while 2 digits were presented to 
the ear to be reported second. In order 
to test a decay or limited duration hy- 
pothesis, the 2-digit message was presented 
simultaneously with either the first 2, 
middle 2, or final 2 digits of the 6-digit 
message. Presentation rate was 2 pair/sec, 
making the difference in time between the 
extreme conditions on the order of 2 sec. 
Broadbent found more accurate recall 
when the 2-digit message coincided in 
presentation with the final 2 digits of the 
6-digit message, a result which is con- 
sistent with the hypothesis that echoic 
storage is subject to rapid decay (1-2 sec.). 

A similar time estimate was offered by 
Murray and Hitchcock (1969). They in- 
Structed Ss to either silently code (without 
observable mouth movement) or mouth 
code (articulate silently) one of a pair of 
S-digit messages presented dichotically. 
In a third group, labeled the “not code 
group," Ss were instructed to say the as 
each pair of stimuli arrived. At the end 
of each stimulus list, Ss were probed for 
their retention of a single digit on either 
the "coded" or “uncoded” message. Mur- 
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ray and Hitchcock found that, even with 
a probe recall procedure which minimi 
response interference, only one item was 
recalled above chance levels in the un- 
coded message of the silently-code and 
mouth-code groups and in both me 
of the not-code group. Given the < 
lus presentation rates employed (1 and 
pair/sec), they estimated that the d 
tion of echoic storage was between 1 aud 
2 sec. 

In another series of experiments directed 
towards determining the temporal paranx 
ters of echoic storage, however, Bryd 
(1971) obtained results which suggest th 
the time course of echoic storage is coi 
siderably longer (several seconds) tha 
previously indicated. Using a dichotic 
memory procedure, Bryden instructed his 
Ss to "attend" to one ear (either left < 
right) during list presentation. Order « 
report was manipulated by instructing the 
Ss to either recall attended ear first and 
unattended ear second  (attended-una:- 
tended or AU order), or to recall unat 
tended ear first and attended ear second 
(UA order) Bryden argued that if un- 
attended ear items were retained in an 
echoic store of very brief duration (12 
sec.), recall should be impaired when they 
were reported second (AU) relative to a 
condition in which they were repo: cd 
first (UA). 

The results which Bryden (1971) ob- 
tained were not in accord with his hy- 
pothesis; i.e., the accuracy of unattended- 
ear recall did not vary as a function of 
order of report. Positively increasing serial 
position functions were found with unat- 
tended-ear recall regardless of order of 
report. Serial position functions on the 
attended ear were relatively flat with both 
report orders, but unlike the unattended 
ear, attended-ear recall was impaired when 
reported second. This latter result would 
be expected if it were assumed that at- 
tended-ear items were retained in a re- 
hearsal-dependent active store, as report- 
ing of unattended items in UA reports 


would prevent rehearsal of attended-ear 
items. 


i 
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Bryden (1971) attempted to explain his 
failure to detect a difference between un- 
attended-ear recall as a function of order 
of report (and to retain the notion of an 
echoic store) by making 3 assumptions: 
(a) The S system or echoic memory lasted 
longer than the delay of reporting U items 
in AU reports; thus, no impairment of 
unattended items occurred. (b) Echoic 
storage is less vulnerable to output inter- 
ference than is active storage. This fol- 
lows from his finding that the difference 
between attended reports (A; — As) was 
greater than the difference between un- 
attended reports (Ui; — Us). (c) The 
echoic store is of limited capacity (1-2 
items). The latter interpretation was 
derived from the positively increasing 
al position functions obtained with 
unattended-ear recall. 

The present series of studies represents 
an attempt to further study the nature of 
recall processes in dichotic memory. The 
first experiment reported here was a rep- 
lication of the Bryden (1971) procedure, 
in which attention and recall instructions 
were manipulated in a within-Ss design. 
In the second experiment objective mea- 
sures of attention instructions were ob- 
tained by requiring Ss to either write down 
or verbalize the attended-ear message as 
it was presented. The third experiment 
was concerned primarily with providing a 
means for accounting for some of the 
variability in dichotic memory performance 
by grouping Ss according to conventional 
digit spans. 


EXPERIMENT I 
Method 


Subjects. Twelve students enrolled in an intro- 
ductory experimental psychology course at Arizona 
State University served as Ss in the present ex- 
periment. The Ss were all right-handed and 
varied in age from 19 to 25 yr. 

Materials and apparatus, Subjects were tested 
individually in a sound reduction chamber. Sub- 
jects received 42 dichotic lists h consisting of 
2 different lists of 4 digits presented simultaneously, 
one to each ear. Both channels (left and right) 
were recorded by the same male speaker at the 
rate of 2 pair/sec, Stimulus lists were recorded 
on a TEAC model TCA 42 4-channel tape deck 
and they were played to S$ through stereo head- 
phones (Superex Pro-BV). The stimulus lists were 


constructed from the set of numbers 0-9 and no 
number appeared twice on a given trial, 

Procedure. Each S was instructed to attend 
exclusively to one ear and to ignore the other ear 
during stimulus presentation, even though messages 
to both ears were to be recalled. One half of the 
Ss were told to attend to the right ear and one 
half were told to attend to the left ear, The ear 
of attention remained constant for each S during 
the course of the experiment. 

Subjects received 2 blocks of 21 trials, each 
block consisting of 5 practice trials followed by 16 
test trials. During the first block of trials, 6 Ss 
(3 attending to their right ear and 3 to their left 
ear) were instructed to recall the digits in the 
attended ear first and then to recall the digits in 
the unattended ear (AU order), while the remain- 
ing 6 Ss (again 3 right ear and 3 left ear) were 
given instructions to recall first the digits in the 
unattended ear followed by the digits in the at- 
tended ear (UA order), All Ss were reversed in 
the second block of 21 trials: those reporting AU 
in the first block reported UA in the second while 
those reporting UA initially were told to report 
AU in the second block, 

Serial recall instructions were given in all condi- 
tions; i in AU reports Ss were instructed to 
recall the attended-ear digits in the order in which 
they were received and then to recall the un- 
attended-ear digits again in the order received, 
While 5s were instructed to recall as many of the 
digits as they could remember on each trial, no 
attempt was made to force Ss to recall all 8 digits, 
If an S remembered only 2 unattended-ear digits, 
he was encouraged to report only those 2 in the 
order in which they were presented. Consider an 
example in which the unattended message was 6, 8, 
4, 2. If the S remembered only 4 and 2, he was 
instructed to report only 4 and 2, It was thought 
that if Ss were encouraged to guess, and thereby 
to report more items, the additional response inter- 
ference might ase them to forget the items 
which they did remember, For the same reason, 
in the previous example, Ss were not instructed 
to say blank, blank, 4, 2, even though that type 
of report would have given a more accurate in- 
dication of serial position. 

No attempt was made in this experiment to 
objectively determine if Ss were in fact following 
the attention instructions (i.e., if Ss were attending 
to only one ear during stimulus presentation), 
Order of report instructions, however, was closely 
monitored. If Ss reported in reverse order (e.g, 
AU in a UA condition), the tape was stopped and 
Ss were reminded that order of report was im- 
portant. This procedure proved to be successful, 
as reversals of this type occurred in less than 1% 
of the total observations. 


Results 


While Ss were instructed to recall as 
many of the digits as they could in the 
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attention (at- 
order of report (attended— 
; unattended-attended, UA) and 


order in which they were received (serial 
recall), incomplete reports occurred in the 
majority of trials. As there is no adequate 
Way to deal with omissions in serial recall, 
à free-recall scoring procedure was used. 
Items were scored as correct in the serial 
position in which they were presented if 
they were recalled in the correct ear. 
Consider the following examples in which 
the attended message was 4, 5, 9, 1, and 
the unattended message was 2, 6, 3, 8. 
If, given an AU report instruction, S$ re- 
Ported 4, 5, 9, 1, 2, 6, 3, 8, he received 
à perfect score on both his attended and 
unattended ears. If an S reported 4, 5, 
91,73, 8, he received a perfect attended- 
ear score and 3 and 8 were scored as cor- 
rect in the third and fourth serial positions, 
respectively, on the unattended ear. If an 
S reported 4, 9, 5; 1, 2, 8; he still received 
a perfect attended-ear Score, even though 
9 and 5 were transposed, and 2 and 8 were 
scored as correct responses in the first 
and fourth serial positions, respectively, 
on the unattended ear. ^ 

Raw scores were converted to percentage 
correct recall and a 2 Between X 3 Within 
analysis of variance Was computed. In 
this analysis, the ear of attention (left or 
right) and the Sequence of report instruc- 
tions (AU-UA or UA-AU) 


Were between 


factors while report instructions (AU and 


UA), 


attention (A and U), and serial posi- 
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tion (1, 2, 3, 4) were within-groups facto: 
The results obtained in this experime 
(see Figure 1) were remarkably similar + 
those obtained by Bryden (1971, Figure : 
Neither of the between factors 
tention or sequence of report instructions 
produced reliable differences; however, e 
of attention (right ear superiority) ap 
proached significance, F (1, 8) — 2.52 
b = .149, MSprror = 1,283.86. 
Performance on the attended 
superior to performance on the una 
ear, F (1, 8) = 27.47, p < .001, M 
— 1,282.78, and as in the Bryden (19 
study there was a significant interaction |) 
tween attention and serial position, F (3, 24 
= 1343, p < .001, M Seni = 322.41. The 
serial position functions for theattendedear, 
whether reported first or second, were 
relatively flat, while positively increasin: 
serial position functions were obtained fc 
the unattended ear. The Order of Repo: 
X Attention interaction was significan 


ear of : 


Te (1553): 25:26:65, pre 001, Sx, 
= 230.91, indicating a greater differe: 
between attended-ear scores (Ay — Aj) 
than between unattended-ear scores 
(Ui — U2). 


The group data are fully consistent with 
the findings of Bryden (1971). The failure 
to detect a difference in unattended-ear 
recall as a function of order of report is 
compatible with the hypothesis that echoic 
Storage lasts for several seconds. The 
positively increasing serial position func- 
tions obtained with unattended reports 
Suggests that the echoic store is limited in 
capacity. 

However, while the group data were 
supportive of the Bryden (1971) findings 
and interpretations, there -was extensive 
between-Ss variability in the present ex- 
periment. The individual functions of all 
12 Ss are shown in Figures 2 and 3. With 
regard to the unattended ear, it can be 
seen that the 6 Ss in Figure 2 have indi- 
vidual functions which reflect the group 
performance shown in Figure 1. These 
6 Ss all show unattended-ear performance 
to be a positively increasing function of 
serial position, The 6 Ss shown in Fig- 
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Individual data, Experiment I: percentage correct recall as a function of attention (attended 


FIGURE 2, 


unattended ear), order of report (attended-unattended, AU; unattended-attended, UA), and se 


position. 


ure 3, however, show individual unat- 
tended-ear functions which are widely 
divergent from the group average. 

In an attempt to determine the source 
of variability between Ss, a more detailed 
analysis of the response patterns was made. 
In the AU order of report, all 12 Ss re- 
sponded with 4 digits from the attended 
ear before switching to the unattended ear. 
This response pattern was obtained in 190 
of the 192 trials, 

In contrast to the consistency found 
between Ss in AU reports, there was con- 
siderable variability in UA reports. Five 
Ss responded with an average of 2 or fewer 
U items in UA reports, while the remain- 
ing 7 Ss responded with an average of 4 U 
responses, All of the 5 Ss responding with 
2 or fewer U responses showed positively 
increasing serial position functions on the 
unattended ear, while 6 of the 7 Ss re- 


al 


sponding with 4 U items showed a pro- 
nounced absence of that effect. Whether 
or not recall accuracy on the unattended 
ear is a positively increasing function of 
serial position is dependent on the number 
of items reported from that ear. 

The variability in unattended-ear per- 
formance in this experiment precludes any 
definitive theoretical statement about the 
properties of echoic storage. What ex- 
perimental operations or designs might be 
employed in order to reduce or to account 
for this variability? 

EXPERIMENT II 

In the second experiment Ss were again 
instructed to attend to only one ear 
(balanced left or right over Ss) during 
stimulus presentation. In order to insure 
more objectively that Ss were following 
the instructions, Ss were told to either 
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Ficure 3. Individual data, Experiment I: percentage correct recall as a function of attention (attended 
unattended ear), order of report (attended-unattended, AU; unattended-attended, UA), and serial 


position. 


write down or verbalize the message re- 
ceived by the attended ear as it was 
presented. 

If failure to follow the attention instruc- 
tions contributes to the variability found 
in this type of task, then in Experiment II 
à disparity in recall functions should exist 
between conditions in which Ss follow in- 
structions (attended message perfectly 
monitored during stimulus presentation) 
and when they do not (imperfect monitor 
of attended-ear message). 

Another reason for conducting the second 
experiment concerns attended—unattended 
performance as a function of type of 
monitor. Both the overt acts of verbal- 
izing and writing a message during pre- 
sentation constitute types of rehearsal. 
Given that attended-ear performance is 
mediated by an active storage system, 
comprised of attention and rehearsal pro- 
cesses, both monitor types should result in 


uniformly high attended-ear performance. 
If unattended-ear performance is mediated 
by an echoic store in which information is 
relatively unanalyzed, subsequent auditory 
stimulation might be expected to interfere 
with the maintenance of and/or recall from 
this system. A monitor which produces 
further auditory stimulation by way of 
auditory-verbal feedback (verbal monitor) 
might, therefore, result in more interference 
than a monitor which does not contribute 
auditory stimulation (written). 


Method 


Subjects and apparatus. Twenty-four students 
enrolled in an introductory experimental psychology 
class served as Ss in this experiment. The mate- 
rials and apparatus employed were identical to 
those of the first experiment. 

Procedure, Subjects were randomly assigned to 
one of 2 groups: verbal monitor and written moni- 
tor. Each S was told to attend to only one ear 
during stimulus presentation. One half of the Ss 
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TABLE 1 


PERCENTAGE CORRECT RECALL, EXPERIMENT Il, 
FOR WRITTEN AND VERBAL GROUPS, WITH 
AND WITHOUT PERFECT MONITORING 


Attended Unattended 
Group = | Se 
Perfest | errors Hn pus 
Written | 90.81 | 88.74 | 6728 | 65.66 
Verbal 93.62 91.40 | 0 | 39.30 


in each group were instructed to attend to the 
right ear and one half were told to attend to the 
left ear. As in the first experiment the ear of 
attention for each S remained constant during the 
course of the experiment. 

Subjects received the same 2 blocks of 21 trials 
as those used in the first experiment. The sequence 
of instructions for recall, AL and UA-AU, 
was balanced for both groups. 

Written monitor, Subjects were instructed to 
write down the attended-ear message as it was 
presented. During this task S inserted his right 
arm through an aperture in a wooden stand. 
A piece of black opaque pla around the aperture 
prevented S from seeing his written response. 
While .S was writing his attended-ear response, 
E observed both the S and the VU meter in the 
preamplifier to insure that the written monitor was 
synchronized with the presentation of the messa 
After a few practice trials Ss started in the 2 blocks 
of 21 experimental trials. 

Verbal monilor. Subjects were told to verbally 
monitor the attended-ear message. The E wrote 
down the verbal monitor as well as the verbal 
recall response for the attended and unattended 
ears, Again E observed the VU meter in an at- 
tempt to insure synchrony of verbal monitor and 
stimulus presentation. 


Results 


The first question posed with regard to 
this experiment asked if failure to follow 
instructions (shown by an imperfect moni- 
tor) were a factor contributing to per- 
formance variability. If it were, a disparity 
in recall functions should be observed when 
Ss followed instructions (perfect monitor) 
and when they did not (imperfect monitor). 
'These scores (collapsed over serial posi- 
tion) were obtained for each individual, 
and the group means are shown in Table 1. 
Correlated ¢ tests showed all of these dif- 
ferences to be unreliable, all ps > .05. All 
trials with perfect and imperfect: monitors 
were included in further analyses. 


As in the first experiment, a free-recall 
scoring analysis was used. Raw scores 
were converted to percentage correct recall 
and a 3 Between X 3 Within analysis of 
variance was conducted. This analysis was 
the same as that in the first experiment, 
with the addition of type of monitor as 
a third between-Ss variable. 

The results shown in Figures 4 and 5, 
depicting performance for the written and 
verbal monitor groups, respectively, indi- 
cate that as in the first experiment (a) 
attended-ear recall was superior to unat- 
tended-ear recall, F (1, 16) = 112.60, p 
< .001, MSg;o = 1,131.48; (b) the At- 
tended-Unattended Ear X Serial Position 
interaction was significant, F (3, 48) 
= 76.94, p < .001, MStrror = 130.66, with 
attended-ear serial position functions flat 
and unattended-ear functions positively 
increasing; and (c) the Order of Report 
(A-U vs. U-A) X Attention (A vs. U) 


interaction was reliable, showing a greater 
difference between attended-ear reports 
than between  unattended-ear reports, 
F (1, 16) = 20.95, p «001, MStror 
= 141.94. 


Of more interest in the present results 
are the effects of type of monitor on 
various aspects of recall patterns. While 
the written monitor group was superior to 
the verbal monitor group, F (1, 16) 
= 18.70, p < .001, MStrror = 553.29, the 
difference was due solely to the unattended 
ear. The interaction (shown in Figure 6) 
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Ficure 4. Written monitor group, Experiment 
II: percentage correct recall as a function of at- 
tention (attended-unattended ear), order of report 
(attended-unattended, AU; unattended-attended, 
UA), and serial position. 
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FIGURE 5. Verbal monitor group, Experiment 
Il: percentage correct recall as a function of at- 
tention (attended-unattended ear), order of report 


(attended-unattended, AU; unattended-attended, 
UA), and serial position. 


between type of monitor and ear of report 
was reliable, F (1, 16) = 17.49, p < .001, 
MStrror = 1,131.48, as was the triple order 
interaction between type of monitor, ear 
of report, and serial position, F (3, 48) 
E41. 5.008, MSi = 130.66, 
These results, showing both differential 
recall functions for attended and unat- 
tended ear and selective impairment of 
unattended-ear recall with a verbal moni- 
tor, are compatible with a dual storage 
model in which attended-ear performance 
is mediated by an active store and un- 
attended-ear performance is mediated by 
an echoic store. While interesting, how- 
ever, the results of this experiment did not 
contribute to understanding of the mecha- 
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FIGURE 6. Percentage correct recall as a fi unction 
of type of monitor and attention, 
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nisms underlying variability in dicho: 
memory performance. 

On reexamination of the individual d: 
shown in Figures 2 and 3, it was obvi 
that some Ss were recalling more ite 
than were others. Perhaps this disparit 
in number of items recalled was a functi 
of capacity, with variability in dicho: 
memory performance being determined 1 
individual differences in “immediate men 
ory span” (Miller, 1956). 

In order to test the capacity hypothesi 
digit spans were assessed for each .S 
Experiment IlI and Ss were grouped . 
cording to digit span. 


ExrERIMENT III 
Method 


Tests of digit span were administered to 
potential Ss entering the laboratory. The stin 
for measuring spans consisted of 5 sets of 
each set being comprised of 2 lists of each len 
ranging from 3 to 15 digits, ch S was te 
individually and received all 5 list sets. Stim: 
Presentation was binaural and presentation r 
at 2 digit/sec. During each span a s 
received an ascending series of lists with 


Was se 


ment, 
instructions to reproduce each list in the cor 
order immediately following its termination. This 
procedure continued until an S erred on both lists 
of a given length, at which time his digit span 


wa ned as one less than that number (e.g, 
if an S missed both li with 8 digits, his digit 
span for that list set was 7). Each § was assigned 


an average digit span following 5 such ; 


ssments. 
In order to be included as an S in this experiment, 
certain criteria had to be met. The 5 digit spans 
for an S had to exhibit a clear mode; 3 of the 5 
span estimates had to be identical. In addition, 
no single span estimate could vary by more than 
one unit from the mode. "Thus, while scores of 6, 
5, 6, 7, 6 resulted in a digit span of 6, a set of 
Scores such as 5, 7, 6, 7, 7 resulted in exclusion 
from the experiment, as 5 was 2 units less than 
the mode of 7. This criterion is rather strict and 
it resulted in the exclusion of many individuals, 
but in a first approximation towards determining 
the effects of digit span in dichotic memory per- 
formance, a strict criterion was thought to be 
advisable. 

It seemed essential to cover a wide range of 
spans, so the design was set for groups with span 
estimates of 5, 6, 7, 8, 9, and 10 digi The at- 
tempt to get 8 Ss per group was successful for 
spans of 6, 7, 8, and 9; however, only 4 Ss were 
obtained for the 5- and 10-digit groups. Therefore, 
à total of 40 Ss was included in this experiment. 

Subjects within all digit-span groups were bal- 
anced with regard to both ear of attention (left 
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or right) and order of report sequence (AU-UA, 
UA-AU). 

The instructions and experimental procedure 
were identical to those of the first experiment 
with the following exceptions: (a) each S received 
10 practice trials prior to a test block, and (b) 
each test sequence included 30.trials. Therefore, 
in this experiment each S received 20 practice 
ists and 60 test lists. 


Results 


The results for all Ss are shown (a) col- 
lapsed over digit-span groups for both AU 
and UA reports in Figure 7, and (b) as 
a function of digit-span group for AU 
and UA reports in Figures 8 and 9, re- 
spectively. While all of the digit-span 
groups are included in the figures, in order 
to retain equal group size only the groups 
comprised of 8 Ss each (ie., Digit-Span 
Groups 6, 7, 8, and 9) were included in 
the analysis of variance. It is obvious 
from Figures 8 and 9 that any difference 
detected between digit-span groups would 
be accentuated by the addition of Groups 
5 and 10. 

Examination of Figure 7 reveals results 
comparable to those obtained in Experi- 
ments I and II. Indeed, as in the other 
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Ficure 7. Group data, Experiment lll: per- 
centage correct recall as a function of attention 
(attended-unattended ear), order of report (at- 
tended-unattended, AU;  unattended-attended, 
UA), and serial position, 


experiments the following results were 
reliable: (a) the AU order produced higher 
recall than the UA order, F (1, 16) — 24.52, 
p «.001, MStrror = 245.01; (b) attended 
ear was superior to unattended ear, 
F (1, 16) = 258.90, p< .001,. MS 
— 434.48; (c) terminal serial positions 
were more accurately recalled than initial 
serial positions, F (3, 48) = 55.43, p < 
001, MSprror = 196.66, a result which is 
due primarily to the positively increasing 
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FiGURE 8. Percentage correct recall with the attended-unattended report 
order as a function of digit span (DS), attention (attended-unattended ear), 


and serial position, 
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FIGURE 9, 
order 
and serial position. 


serial position functions in the unattended 
ear, F (3, 48) = 58.18, p < .001, MStrror 
= 193.51; and (d) there was a greater 
disparity between attended-ear scores when 
. reported first and second than between 
unattended-ear reports as a function of 
report order, F (1, 16) = 7.45, p < .01, 
MSterror = 215.60. 
"The effect of digit-span groups as a be- 
ween-Ss variable is highly reliable, 
F (3, 16) = 2324, p < 001, MSg,, = 
742.74; and while there is a small effect 
of digit span on the attended ear when 
reported second, the most substantial ef- 
fect of digit span is on unattended-ear 
= performance. This was confirmed in sta- 
tistical analysis by a Digit Span X At- 


TABLE 2 


COEFFICIENTS OF CORRELATION AND DETERMINA- 
TION BETWEEN ATTENDED-EAR AND UN- 
ATTENDED-EAR RECALL ACCURACY 
AND DIGIT SPAN 


Unattended 


ear ear 


| Attended 


Reports 


r n | r n 
Attended-unattended | 474 | .225 | .881 | .776 
E Unattended-attended | .625 | .391 | 844 | .712 


UNATTENDED EAR 


POSITION 


Percentage correct recall with the unattended-attended report 
a function of digit span (DS), attention (attended-unattended ear), 


tended-Unattended Ear interaction 
F (3, 16) = 23.88, p < .001, MS... 
434.48, a. Digit Span X Serial Position i: 
teraction, F (9, 48) = 5.76, p< .001, 
MSerror = 196.66, and a Digit Span > 
Attention X Serial Position interaction, 
F (9, 48) = 7.21, p < 001, MS. 
= 193,51. 

While reliable, the interactions do not 
provide us with any indication of the 
strength of the relationship between digit 
span and dichotic memory performance. 
The total number of items correct in the 
dichotic task was calculated for each S 
(N — 40), and the correlation between 
digit span and dichotic performance was 
determined. This analysis yielded a very 
high correlation coefficient, r = .891, The 
coefficient of determination (Hays, 1963), 
r, gives an indication of the strength of 
the linear relationship between 2 measures. 
The coefficient in the present experiment 
indicates that 79.39% of the variance in 
dichotic memory can be accounted for by 
à linear rule and digit span. In Table 2 
separate coefficients of correlation and de- 
termination are given for each measure of 
dichotic performance, ie., for attended- 
and unattended-ear recall in both AU and 
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UA reports. These figures indicate that 
most of the variance accounted for by 
digit span is in unattended-ear recall. 

In the first experiment, the nature of 
the unattended-ear serial position func- 
tions was shown to be due in part to dif- 
ferential report strategies. Positively in- 
creasing serial position functions were 
obtained when the mean number of un- 
attended items reported was 2 (Figure 2), 
while. more variable functions were ob- 
tained when an average of 4 unattended 
items was reported (Figure 3). The Ss 
in Figure 2 were therefore making errors 
of omission, while the Ss in Figure 3 were 
making primarily errors of commission, 
either in the form of interlist or extralist 
intrusions. An error analysis for the Ss 
in the present experiment is provided in 
Table 3. In this table the number of 
errors is shown as a function of digit-span 
group and attention (attended-unattended 
ear). Errors were divided into 3 types: 
omissions—items presented on either the 
attended or unattended ear which were 
not reported during recall; interlist in- 
trusions—items presented on the at- 
tended ear which were recalled as un- 
attended items and items presented on the 
unattended ear which were reported as 
attended ear items; extralist intrusions— 
items reported during recall which were 
not members of the stimulus set. 

As shown in Table 3, while low-span Ss 
made more attended-ear errors than did 
high-span Ss, no apparent differences were 
evident in the pattern of errors made by 
Ss in the various digit-span groups. How- 
ever, on the unattended ear there was an 
orderly trend of type of error as a func- 
tion of digit span. Approximately 90% 
of the unattended errors made by Ss with 
digit spans of 5, 6, and 7 were errors of 
omission, i.e., items were simply not re- 
ported during recall. This tendency was 
still prominent for Ss with a digit span 
of 8, as 7496 of the errors were omissions. 
Between Spans 8 and 10, however, there 
was a shift in the type of errors, with a 
reduction in errors of omission and an 
increase in errors of commission, so that 
by Span 10 the majority of errors were 


interlist intrusions. 


TABLE 3 
Tyre OF ERROR (OMISSIONS, INTERLIST INTRUSIONS, 
'T INTRUSIONS) AS A FUNCTION 
ON AND DIGIT SPAN, 
PERIMENT III 


Extralist 


Interlist 
i intrusions 


Omissions | intru 


Digit span 


No. YS 


No. | % No. 


Attended ear 


134 | 44,37 | 156 | 51.66 


12 | 3.97 
| 213 | 193. 46.06 | 13 | 3.10 
167 170 | 48.57 | 13 | 3.71 
116 135 | 50.00 | 19 | 7.04 
9 (n 73 86 | 50.00 | 13 | 7.56 
10 (n =4)| 12 235610] 6 | 14.63 
Unattended ear 
576 | 89.44 | 59| 9.16) 9 | 14 
| 932 | 91.46 | 78| 7.66| 10 | .98 
904 | 89.33 | 93| 9.19) 15 | 148 
482 | 74.04 | 150 | 23.04 | 19 | 2.92 
219 | 54.89 | 154 | 38.59 | 26 | 6.52 
31| 40.79 | 38] 50.00} 7 | 921 


This analysis raises the interesting prob- 
lem that response strategy (which is cor- 
related with digit span) and not digit-span 
ability per se might be the factor which 
produced the results of these experiments. 
Due to the large number of incomplete 
reports (errors of omission), I was forced 
to use a free-recall scoring procedure; and 
as high-span Ss responded with more un- 
attended-ear items than low-span Ss, their 
recall functions might have been inflated 
simply due to guessing. Two questions 
then arise: (a) What would the recall 
patterns of high-span Ss look like if a 
scoring procedure (serial recall scoring) 
less sensitive to guessing were employed? 
and (b) What would happen if low-span 
Ss were coerced to recall 8 digits per trial? 
With regard to the second question, would 
the recall patterns of low-span Ss become 
more like those shown typically by high- 
span Ss (which would mean that the 
results obtained were primarily a function 
of response strategy and not digit-span 
ability per se), or would they remain much 
the same as when these Ss recalled in a 
manner of their own choice (which would 
be more indicative of digit-span ability)? 
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FIGURE 10. 


Percentage correct serial recall in Experiment IV as a function 


of digit span (10 or 6), attention (attended-unattended ear), order of report 
(attended-unattanded, AU; unattended-attended, UA), and serial position. 


In the next experiment 4 Ss with low 
digit spans were forced to respond with 4 
attended-ear and 4 unattended-ear digits 
in both AU and UA reports. This pro- 
cedure permits a serial recall scoring anal- 
ysis, the results of which can be compared 
with data from the high-span Ss in the 
last experiment. 


EXPERIMENT IV 
Method 


Subjects and apparatus. Digit spans were as- 
sessed as in Experiment III and 4 Ss with digit 
spans of 6 were selected for participation as Ss in 
the present experiment. The apparatus was the 
same as that used in the previous experiments, 

Procedure. Each S participated in 2 test ses- 
sions. In both sessions Ss received the test tape 
used in Experiment II, which consisted of 2 prac- 
tice blocks (10 trials each) and 2 test blocks 
(30 trials each) Headphones were reversed in the 
second test session for all Ss, 

In the first test session 2 Ss were instructed to 
attend to their right ear and 2 to their left ear, 
One S attending to each ear was given the report 
sequence AU-UA while the other S was given 
UA-AU. The instructions were the same as those 
previously administered, Le. to report as many 
items as they could remember from both ears. 
As in the previous experiments Ss were instructed 
to report the items in the order in which they 

- Were presented. This test session was used to 
establish that the performance of these Ss was 


comparable to that of the 
Experiment III, 

In the second test session, Ss received the same 
test tape but with headphones reversed so that 
attended and unattended messages were different 
from those in the first session. Each S attended 


low-span Ss in 


to the same ear (right or left) and red the 
same report sequence (AU-UA or UA-/ is he 
did in the first session. The only change in the 
second session was in the report instructions. 


Each S was told that he was to report 4 attended- 
ear digits and 4 unattended-ear digits in both AU 
and UA reports, Subjects were told to guess if 
they did not remember the numbers. In addition 
they were instructed to make sure to report the 
numbers that they did remember in the correct 
serial position. 


Results 


The purpose of the first test session was 
simply to establish that the performance 
of Ss in the present experiment, when they 
were allowed to report in the manner of 
their choice, was comparable to that of the 
low-span Ss in Experiment Ill. The Ss 
in the present experiment responded with 
only 1 or 2 unattended-ear items in both 
AU and UA reports, Collapsed across Ss 
and order of report (AU and UA), their 
mean percentage correct recall on the un- 
attended ear was 11.67%, 12.91%, 35%, 
and 83.33%, respectively, for Serial Posi- 
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tions 1-4. Comparison of these percent- 
ages with those shown in Figures 8 and 9 
indicates that Ss in the present experi- 
ment were comparable to the low-span Ss 
in the previous experiment. 

A serial recall scoring procedure was 
used for data in the second session, in 
which Ss were required to report 4 at- 
tended-ear and 4 unattended-ear items on 
each trial. In this analysis a number was 
scored correct only if it was recalled in 
the correct serial position. Given an at- 
tended-ear message, 5, 9, 1, 3, and an 
unattended message, 4, 2, 6, 8, with an 
AU report instruction, if S recalled 5, 1, 
9, 3, 4, 6, 2, 8, he was given credit only 
for the first and fourth serial position 
digits in both ears. In the free-recall 
scoring procedure used previously, Ss 
would have received perfect credit for 
this report. 

Raw scores from the second session were 
converted to percentage correct recall, and 
they are plotted in Figure 10. Also shown 
in Figure 10 is a serial recall analysis of 
the data for the 4 Ss with a 10-digit span 
who participated in Experiment HI. As 
these 4 Ss all responded with 4 attended- 
ear and 4 unattended-ear digits on each 
trial, their data were simply rescored. 
Comparison of the data in Figure 10 with 
those shown in Figures 8 and 9 reveals 
2 interesting findings. First, the per- 
formance of high-span Ss remained un- 
changed with a serial recall scoring anal- 
ysis, indicating that these Ss were not 
simply guessing more items than low-span 
Ss. Second, low-span Ss were able to 
recall only 1 or 2 unattended ear digits 
in the correct position, even though they 
responded with 4 digits on each trial. 
Together these findings suggest that it is 
digit-span ability, and not a response 
strategy correlated with digit span, which 
is the important variable underlying dif- 
ferential performance. 


DISCUSSION 


Several group findings were revealed reli- 
ably by analyses of variance across Experi- 
ments I, II, and III: (a) attended-ear recall 
was superior to unattended-ear recall; (b) at- 


TABLE 4 


Mean NUMBER or UNATTENDED tems RECALLED 
CORRECTLY PER List (COLLAPSED OVER 
ORDER or REPORT) AS A FUNCTION 
or DiGiT SPAN 


M no, of unattended 
digits recalled 


Digit span 


1.14 


5 

6 1.78 
7 | 1.78 
8 2.80 
9 3.46 
10 3.80 


tended-ear serial position functions were rela- 
tively flat, while positively increasing serial 
position curves characterized unattended-ear 
performance; (c) attended-ear recall was im- 
paired when reported second relative to 
when reported first; (d) unattended-ear recall 
accuracy did not vary as à function of order 
of report; and (e) AU reports were superior 
to UA reports. 

However, of these experimental findings, 
only 3 were found to be reliable across Ss of 
all digit-span groups: (a) impairment of at- 
tended-ear recall when reported second, (5) in- 
variance of unattended-ear recall accuracy às 
a function of report order, and (c) superiority 
of AU reports. There was a pronounced 
absence of Serial Position X Attention inter- 
actions in high-span groups, and the tendency 
for attended-ear superiority was diminished, 

What are the mechanisms or processes 
underlying digit span which result in more 
accurate unattended-ear performance when 
digit span is high (8, 9, and 10) than when 
it is low (5, 6, and 7)? There are at least 
2 alternatives. First, we have defined active 
storage in terms of attention and rehearsal 
processes and echoic storage as an auditory 
buffer in which information is relatively un- 
analyzed. We have also identified active 
storage with attended ear and echoic storage 
with unattended ear. In terms of this model, 
the more accurate unattended-ear performance 
of Ss with high digit span reflects echoic 
stores of larger capacity. The mean number 
of unattended items reported correctly per 
list (collapsed over report order) was cal- 
culated for Ss of all digit-span groups. These 
figures, shown in Table 4, indicate values of 
echoic storage capacity ranging from 1.14 
digits (Span 5) to 3.80 digits (Span 10). 
Bryden (1971) hypothesized that echoic 
storage was limited to 1-2 items. While this 
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estimate adequately describes performance for 
Ss with digit spans of 5, 6, and 7, it is too 
low for Ss with spans of 8, 9, and 10. 

The identification of active storage with 
attended ear and echoic storage with unat- 
tended ear implies that no attention is directed 
towards the unattended ear. If, as a second 
alternative, we modify our working model to 
include the possibility that some attention is 
directed towards the unattended ear, a dif- 
ferent interpretation of the more accurate 
unattended-ear performance of high-span Ss 
arises. Consider models in which "attention" 
is used synonymously with “central processing 
capacity,” “energy,” and/or “mental effort” 
(e.g., Kahneman, 1973; Moray, 1967; Posner 
& Boies, 1971). In models of this type a finite 
central processing capacity or amount of en- 
ergy is available for information processing. 
Any mental operations such as "encoding," 
“retrieval,” “rehearsal,” and the like, which 
are thought to require central processing 
capacity, must share from a common source, 
Consequently, if at any particular moment 
there is a large investment of central processing 
capacity to one mental operation (e.g. re- 
- trieval the amount of capacity remaining 
= for other operations (e.g., encoding and re- 
hearsal) is reduced. If the capacity available 
for additional mental operations is not suffi- 
cient, impairment of performance will result. 

Consider the implications which this type 
of conceptual analysis has for dichotic memory. 
All Ss in the 
Structed to "attend" (invest central processing 
capacity) to a 4-digit message presented to 
One ear. If we assume that the amount of 
central processing capacity required by this 
task is a function of the relation between 
length of message and digit span, Ss with a 
low digit span (S or 6) would, in order to 
maintain accurate performance on the task, 
be required to invest more processing capacity 
than would Ss with a high digit span (9 or 10). 
More spare capacity would thus be available 
to high-span Ss. Given instructions to attend 
to one ear, but to recall messages from both 
ears, an S might be expected to direct his 
Spare capacity to his unattended ear. Thus 

in terms of the original model, unattended-ear 
performance would be mediated by both ac- 
tive and echoic storage systems. Viewed in 
this way, the difference in unattended-ear 
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performance detected between Ss with 
and low digit spans reflects not a diffe: 
in echoic storage capacities, but rathe: 
greater contribution of active 
with high digit span. 

It is not possible to decide at present 1 
of these theoretical alternatives i« mo: 
sible. While the knowledge that 79' 
of the variability in dichotic memo: 
accounted for by digit span does no 
the theoretical issues, it does increas: 
mental precision and it points out d 
for future research. 


storage i 
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AROUSAL AND THE RANGE OF CUE UTILIZATION! 


STEPHEN J. BACON? 


Indiana University 


The present experiment explores the effects of arousal on the range of stimulus 
responsiveness in terms of sensitivity and response criterion changes, the 
amount of attention initially focused on the stimulus, and the locus of effect 
of arousal: The Ss simultaneously performed a pursuit-rotor tracking task 
and an auditory signal detection task under different levels of arousal, temporal 
locus of effect conditions, and degrees of attention being focused on the auditory 
cue. Results indicate that arousal narrows the range of cues processed by 
systematically reducing responsiveness to those aspects of the situation which 
initially attract a lesser degree of attentional focus. This stimulus loss under 
arousal represents, independently of any response criterion changes, an actual 
diminution in the Ss' sensitivity. In addition, it seems that arousal mediates 
its effect not so much by impeding the initial sensory impression as by affecting 
the capacity limitations and attentional control processes operating within 


short-term memory. 


Discussions of the effects of emotional 
arousal on the breadth of attention are 
assuming an increasingly prominent role 
in accounts of the experiential and be- 
havioral consequences of arousal (Easter- 
brook, 1959; Korchin, 1964; Wachtel, 
1967; Wine, 1971). The basic theoretical 
formulation is provided by Easterbrook, 
who suggests that the effect of emotional 
arousal on attention is to narrow and focus 
the attentional field by systematically re- 
ducing the range of cue utilization. More 
specifically, it is proposed that responsivity 
to peripheral or less relevant stimuli is 
diminished, while responsivity to central, 
immediately relevant or dominant cues is 
maintained, if not in fact augmented. 
Empirical support for the Easterbrook 
hypothesis may be found in the experi- 
ments of Bahrick, Fitts, and Rankin 
(1952), Bursill (1958), and Wachtel (1968), 
which have shown that, while simultane- 
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ously performing a central pursuit-meter 
tracking task and a peripheral visual de- 
tection task, aroused Ss evidence consid- 
erable decrement in responding to the 
visual stimuli. 

Perhaps the primary question that can 
be put to the arousal-reduced cue-utiliza- 
tion formulation concerns the nature of 
the stimulus loss. The implicit assump- 
tion being made is that this involves a 
reduction in the range of actual sensitivity 
tostimuli. The possibility exists, however, 
that emotionally aroused Ss do not really 
experience attenuation of sensitivity to 
peripheral or less relevant cues, but rather 
are more cautious in reporting the occur- 
rence of these stimuli. The present ex- 
periment employs a signal detection 
methodology (Green & Swets, 1966) to. 
determine whether reduced responsivity 
under arousal to peripheral stimuli reflects 
an actual attenuation of sensitivity to the 
signal, or merely an increase in the sub- 
jective decision criterion, so that it be- 
comes less likely that the signal will meet 
the stricter criterion. 

It should be noted that the main thrust 
of the literature has been directed towards 
investigating the effects of arousal in re- 
ducing responsivity to peripheral cues. 
Easterbrook (1959), however, had postu- 
lated that the narrowing and focusing of 
the attentional field under arousal involves 
not only diminished utilization of periph- 
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eral cues, but also the maintained, if not 
improved, use of central, immediately rele- 
vant cues. This suggests 2 related hy- 
potheses to be tested: (~) There is a dif- 
ferential shift in responsivity under arousal 
depending upon the initial amount of at- 
tention being focused on a cue; i.e., cues 
which initially attract less attention should 
show further diminished attention under 
arousal, while cues which occupy the 
primary focus of attention should perhaps 
show enhanced attention under arousal. 
(b) There is a heightened sensitivity under 
arousal to those cues on which a high 
degree of attention was initially focused. 

In general, the evidence that arousal 
increases the performance of a central task 
is highly inconsistent. Moreover, the type 
of task that has been used to measure the 
proposition that arousal might augment 
awareness of central, primary stimuli (e.g., 
pursuit meter) typically involves complex 
perceptual-motor coordination and higher 
order cognitive organization, and is far 
from testing simple stimulus responsivity. 
Another purpose, then, of the present in- 
vestigation is to study, under arousal, dif- 
ferential shifts in stimulus responsiv y to 
task cues which diverge along the dimen- 
Sion of amount of attention (high vs. low) 
being focused on them. 

Finally, the research reported here is 
also directed towards the question of the 
locus of the effect of arousal on the range 
of stimuli that are processed. The Easter- 
brook (1959) hypothesis essentially asserts 
that the effect of arousal involves capacity 
limitations in information processing and 
the activation of attentional control mech- 
anisms. One of the fundamental issues in 
information processing concerns whether 
such capacity limitations and attentional 
effects operate during the initial perceptual 
encoding stages of processing, or whether 
they operate exclusively in short-term 
memory store following perceptual pro- 
cessing (Shiffrin & Geisler, 1973). The 
present study, by manipulating whether 
the response to target stimuli is given 
immediately or is given after a delay, 

explores whether the effects of arousal on 
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stimulus responsivity occur at the 
perceptual or memory processes. 


locus of 


METHOD 
Subjects 


The Ss were 20 students at the | niversity of 
California, Los Angeles, who were paid for their 
participation in the experiment. They ranged in 
age from 19 to 25 yr. and had an aver ige age of 
21.6 yr. (SD = 1.82), 


Apparatus and Instrumentation 


The experimental task consisted of the simul- 
taneous performance of pu rotor (PR) tracking 
and auditory signal detection, The fluorescent light 
target of a Lafayette pursuit rotor, set at 40 rpm, 
was tracked by means of a stylus containing a 
photoelectric cell, A La ayette timer clock mea- 
sured time on target (TOT) to the nearest .01 sec, 

The auditory signal consisted of a 1,000-cps tone 
generated by a Hewlett-Packard audio oscillator 
and monitored through a He lett-Packard variable 
attenuator, The signal was activated by a 60-msec. 


pulse former triggered by a Davis timer. This 
timer was in turn programmed by a trial stepping 
device which controlled hether or not the signal 
was to be presented. On trials in which the signal 
Was presented, the tone was then fed through a 


40-db. fixed attenuator into a Grason-Stadler noise 
generator and mixed there with white ne vise. The 
output from the mixer, con isting then of either 
a signal embedded in the noi "ackground or just 
the noise background, was then fed through a 
30-db. fixed attenuator into à relay system. and 
then into a set of Sharpe stereo headphones, The 
purpose of the 2 fixed attenuators was to shut out 
switching clicks from the mechanical program 
equipment. The programming equipment con- 
sisted of a series of interconnected. Davis timers 
which automatically set trial intervals (7 sec), 
time of signal presentation, and intertrial intervals 
(S sec.). A switch selected whether the trial timer 
pulsed the noise or the PR. It should be noted 
that when the timer triggered the relay to allow 
the noise (or signal plus noise) to be on for a 7-sec. 
trial, the PR ran continuously, In this case, the 
TOT counter was connected to the relay so that 
it marked PR tracking time only when the noise 
was on (i.e, only during the 7-sec. trial interval). 
On the other hand, in those conditions in which 
the trial timer pulsed the PR for 7 sec., the noise 
Was continually fed through to the headphones. 
In this case, TOT could still only be scored for 
7 sec, since with the PR light turned olf, the 
photoelectric cell in the stylus was not activated. 
The stimulus delay timer was controlled by a 
device which selected the particular delay interval 
for the stimulus Presentation. In this experiment, 
delays of .5, 1,0, 2.0, 6.0, and 6.5 sec. were possible. 

A public address system permitted 2-way com- 
munication between the S's room and the room 
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containing the program equipment in which Æ was 
located. A Beede electrical stimulator generated 
electric. shocks, which were delivered via penny 
electrodes taped to the fingers of the S’s non- 
preferred hand. After each experimental condition, 
the Ss filled out a 20-item, modified version of 
Spielberger's State Anxiety Inventory (SAI; Spiel- 
berger, Gorsuch, & Lushene, 1970). 


Procedure 


Determination of signal intensity. At the be- 
ginning of the session, the auditory threshold for 
each S for a 1,000-cps, 60-msec. tone, embedded 
after a .5-sec. delay in a 1.0-sec. noise burst, was 
determined by the method of limits, Ten alter- 
nating descending and ascending runs were given. 
Only the last 6 runs were used in the final calcula- 
tion of the threshold. The signal level was then 
increased 3 db. above the established threshold. 

Signal detection and PR tasks. In each condi- 
tion, each S was given 100 trials lasting 7 sec. 
each, with an intertrial interval of 5 sec. During 
each 7-sec. trial, the S was instructed to perform 
the PR tracking task while simultaneously listen- 
ing through a set of headphones for the occurrence 
of an auditory signal embedded within a white- 
noise background. The S was told that on each 
trial the signal might or might not occur, but 
that if it did occur, it could do so anywhere within 
the trial, He was informed that this was all totally 
randomized, was advised of the “gambler’s fallacy,” 
and was told to make an independent judgment 
on each trial. When the trial was over, the S 
rated his subjective confidence that a signal did 
or did not occur on a 4-point rating scale (signal 
positively there, probably there, probably not there, 
positively not there). The S was also asked to 
rate from 0% to 100% how hard he was con- 
centrating on doing the PR tracking. The program 
for the signal presentation was as follows: ran- 
domly, but equally, distributed over 2 blocks of 
50 trials each were 13 signals presented after a 
delay of 1 sec. into the trial, 12 signals after 2 sec., 
12 signals after 6 sec., and 13 signals after 6.5 sec. 
Randomly interspersed within this program were 
50 noise-only trials. Half of these were randomly 
designated as the noise trials to be used with the 
early signal presentation, and half were randomly 
designated as the noise trials to be used with the 
late signal presentations. At the beginning of each 
ssion, each S received 62 practice trials. Signal 
previews at each of the temporal delays were 
presented well above threshold before each block 
of trials, Rest periods of 5 min. each were given 
after the practice trials and after each block of 
50 experimental trials, and a 15-min. rest period 
separated the nonarousal and arousal conditions 
for each S. 

Level of arousal conditions. Each S performed 
the PR-signal detection task under counter- 
balanced no-shock and shock conditions. The non- 


arousal condition consisted of a no-shock PR- 


signal detection task. In the arousal condition, 
the S was informed that E was interested in studying 
the effects of stress, and that shock applied to the 
fingers of the nonpreferred hand would be used to 
induce stress. The level of shock that was per- 
ceived by the S as the most painful he was willing 
to endure was determined. The S was told that 
shocks would be randomly delivered while he was 
performing the task, so that the longer the interval 
between successive shocks, the more intense the 
shock would be. In each block of 50 trials, 20 shocks 
were delivered according to a fixed random schedule, 
Shocks were thus delivered during both intertrial 
interval and experimental trials, but never at the 
time of signal occurrence. Actual delivered inten- 
sities varied around the predetermined level as a 
function of intershock interval. The Ss were 
asked to complete the 20-item SAI after both 
no-shock and shock conditions. 

Focus-of-attention conditions. The Ss were ran- 
domly assigned to one of 2 groups which differed 
in the amount of attention that was being focused 
on the auditory task. Degree of focus of attention 
on the auditory cue was manipulated by physical 
and instructional means designed to pull attention 
either towards or away from the auditory signals. 
When the auditory cue was to occupy the primary 
focus of attention, the PR was run continuously, 
and the noise coming over the headphones was 
pulsed for 7 sec, The S was told that when the 
noise goes on he w: in PR tracking and 
listening for the auditory signal, and as soon as 
the burst ended he was to stop tracking and make 
his signal detection and concentration ratings. 
Thus, the auditory channel gained further im- 
portance in that it set the stage for task perfor- 
mance, The S was also told that, while he was 
required to perform both tasks simultaneously, 
the more important task was the auditory signal 
detection task, Furthermore, if he performed 
above a certain criterion level on the auditory 
task, he could receive a bonus of $1.00. The only 
qualification to this was that he had to perform 
the PR task to a criterion, although he was assured. 
that the criterion was fairly low. All Ss were, 
in fact, rewarded. Following the practice trials, 
S was informed that his PR performance was 
adequate, and he was encouraged to maintain this 
level. After experimental trials 25, 50, and 75, 
S was again assured that his PR performance was 
adequate, was again reminded of the importance 
of the auditory task, and was encouraged to con- 
centrate on listening for the signal. When the 
auditory cue was to attract less attention, the 
manipulations favored the PR task. Thus, the 
auditory noise background was on continuously, 
and the 7-sec. trial interval was determined by the 
pulsing of the PR, i.e., the target was illuminated 
and rotated for 7 sec. and then stopped and turned 
off. Pursuit-rotor tracking was emphasized, and 
a $1.00 bonus could be received for performing 
PR tracking above a criterion level, provided that 
minimal criterion performance be maintained on 
the auditory detection task. Following the practice 
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TABLE 1 


SENSITIVITY AND MEAN SLOPE SCORES FOR 
No-SHOCK AND SHOCK CONDITIONS OVER HIGH 
AND Low DEGREE or Focus OF ATTENTION ON 
THE AUDITORY C (HF, LF), AND IMMEDI- 
ATE AND DELAYED Response (RI, RD) 
CONDITIONS 


Sensitivity scores Slope scores 


Conditions - 


Shock 


No shock Shock | No shock 
HF-RI 3.02 2.63 208 .80 
HF-RD 3.32 2.61 44 | .61 
LF-RI 3.13 2.30 -65 -68 
LF-RD 3.61 1.60 42 | -98 


trials and at the given points during the experi- 
mental sessions, S5 was advised that his auditory 
detection performance was adequate, was encour- 
aged to maintain this level, and was reminded of 
the primary importance of the PR task. 

Temporal locus of effect condition. "Temporal 
factors were manipulated by varying the time 
between auditory signal presentation and S’s report, 
so that the response was either immediate or 
delayed. The trials containing the signals presented 
after 6- and 6.5-sec, delay, along with their noise- 
only trials, were collapsed together to make up 
the response-immediate condition. The signal 
presentations occurring 1 and 2 sec. into the trial, 
requiring S to delay his response 5 or 6 sec., taken 
together with their nois only trials, comprised the 
response-delayed condition. 


RESULTS 


The interpretation of the effects of 
arousal on the range of cue utilization 
depends upon the effectiveness of the no- 
shock vs. shock manipulation in producing 
differential levels of arousal, Raw scores 
for each S on the SAI were tabulated and 
then translated into the appropriate per- 
centile rank scores given from the norms 
on college undergraduates (Spielberger et 
al., 1970). An analysis of variance per- 
formed on the arousal level scores revealed 
the mean difference between no-shock and 
shock conditions (Percentile Rank 58 
vs. 91) to be hightly significant, F (1, 16) 
= 37.08, p < .001. No other differences 
between treatment conditions were found. 

The actual sensitivity of the various 
treatment groups to the auditory signal 
is expressed in terms of a linear function, 
Am, representing the measure of under- 
lying sensory capabilities, independent of 


response bias. Green and Swets (1966) 
have proposed this index of sensitivity to 
be used instead of d’ when the assumption 
of equal variance in the signal and noise 
distributions may not be met. Values of 
Am were obtained for each S based on the 
rating data for each of the conditions in 
which S participated. Cumulative condi- 
tional probabilities of hits and false alarms 
at each of the decision criterion points 
were calculated, transformed to their cor- 
responding Z score equivalents, plotted 
against each other, and, the method 
of least squares, they were with a linear 
function. The Z score values of the X 
intercepts of the linear functions served as 
the estimates of Am (Green & Swets, 1966). 
The analysis of variance conducted on these 
data showed that arousal clearly diminishes 
sensitivity to the auditory cue, (1, 16) 
= 11.32, p< .005. Table 1 shows the 
mean sensitivity scores for the focus-of- 
attention, temporal factors, and arousal- 
level treatment combinations. It can be 
seen that arousal produces greater sen- 
sitivity loss when the auditory cue attracts 
less attention, and particularly when the 
response to it is delayed. The planned 
comparisons performed to test whether 
arousal increases sensitivity to the audi- 
tory cue when it occupies th primary 
focus of attention and response is imme 
diate or delayed, and whether arousal 
reduces sensitivity to the auditory cue in 
the low-attention condition for immediate 
and delayed responding, revealed that the 
primary effect of shock was to reduce 
sensitivity in the low-attention — response- 
delayed condition, F (1, 16) = 9.61, $ 
< .01. The other comparisons fell well 
Short of significance: F (1, 16) d 
for the primary-attention — response-imme- 
diate condition; F (1, 16) = 1.17; pie 
-25, for the primary-attention-response-de- 
layed condition; and F (1, 16) = 11659 
35 > p> .10, for the low-attention — re- 
sponse-immediate condition, 

Another independent sensory parameter 
is the slope value of the linear function. 
The traditional interpretation of the slope 
in signal detection theory is that it equals 
the ratio of the variance of the noise dis- 
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tribution to that of the signal distribution. 
in analysis of variance was carried out on 
> slope values. A significant main effect 
is found for arousal level, F (1, 16) 
4.91, p < .05, the mean value of the 
ope increasing from .58 in no-shock con- 
ditions to .77 with shock. A significant 
interaction for arousal level and order 
effects, F (1, = 6.60, p < .025, sug- 
gests that the main effect of increased 
slope for aroused Ss was due primarily to 
those Ss in the no-shock-first, shock-second 
order condition. A significant 3-way inter- 
action of focus of attention, temporal 
factors, and arousal level was also ob- 
tained, F (1, 16) = 5.29, p < .05, and the 
mean slope values for each of these treat- 
ment combinations are also presented in 
Table 1. The planned comparisons indi- 
cated that shock significantly increases the 
slope value only in the low-attention — re- 
sponse-delayed condition, F (1, 16) = 7.66, 
p < .025. 

Typically, changes in the ratio of the 
variances of the noise to the signal dis- 
tribution have been taken to indicate 
changes in the way S is processing in- 
formation about signal parameters. How- 
ever, while it is of interest to know in the 
present experiment that not only sens 
tivity changes but also underlying infor- 
mation-processing strategy changes occur 
under arousal, no additionally meaningful 
psychological information can be ascer- 
tained from the slope statistic concerning 
the locus of arousal's effect. There will, 
therefore, be no further discussion of the 
slope data. 

Another measure which further describes 
S's behavior on the auditory detection 
task is the hit rate N), where 
y = yes, signal was present, and SN = 
signal was added to noise background, 
which provides fairly direct estimates of 
shifts in response criterion when sens y 
values do not differ. In addition, it is the 
hit rate scores in this study which are most 
closely analogous to the perceptual per- 
formance measures of previous central- 
tracking — peripheral-light studies. Table 2 
shows the mean hit rate Z scores at each 
of the cumulative subjective criterion 
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TABLE 2 


Mean Hir Rare Z Scores AT EACH CRITERION 
Pornt FoR No-SHOCK AND SHOCK CONDITIONS 
OVER HIGH AND Low DEGREE oF Focus or AT- 

TENTION ON THE AUDITORY Cue (HF, LF) 
AND IMMEDIATE AND DELAYED 
Response (RI, RD) Conpitions 


No shock Shock 
Conti aes — 
tions 
strict [Mod | Lax | strict | Mod. | pax 
HE-RI | —.531 |.512 | 1.296 | —.260 | .592 | 1.320 
HE-RD | —348 | {690 =1179 |.756 | 1487 
—305 |573 —491 |371| "848 
—229 |546| 925 | —.689 |.620 | ‘962 


points as a function of focus of attention, 
temporal factors, and arousal level. ‘The 
analysis of variance and the planned com- 
parisons performed on the overall mean 
hit rate Z score, Z(y|SN) revealed no 
significant differences between treatment 
conditions. This finding is somewhat sur- 
prising in that one might expect that, 
given the observed reduction in sensitivity 
under shock, an overall lowering of the 
average number of hits should follow. 
An explanation of this finding, which seems 
to be supported by reference to Table 2, 
is that treatment manipulations had op- 
posing effects at different criterion points, 
thus leaving the overall average number 
of hits unchanged. 

"articularly noteworthy in Table 2 are 
the hit rate Z score data associated with 
the most stringent criterion point, in that 
the previous central-tracking — peripheral- 
light experiments, which required .S to 
switch off the peripheral light when he 
saw it was on and recorded the amount 
of time the light remained on, were prob- 
ably dealing with a criterion range covered 
by the most stringent criterion point in 
the present investigation. Table 2 shows 
that, at the most stringent criterion level, 
under arousal there is a moderate increase 
in hit rate when the auditory cue occupies 
the primary focus of attention, but a 
marked decrease in the hit rate when the 
auditory cue attracts less attention, most 
dramatically when response is delayed. 

Time on target per trial performance on 
the PR was also evaluated by an analysis 
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of variance. The results that merit con- 
sideration here are the significant Tem- 
poral Factor X Arousal Level interaction, 
F (1, 16).— 4.99, p < .05, which indicates 
that only when the response to the auditory 
cue is delayed did shock produce decre- 
ment in PR performance. The planned 
comparisons revealed that shock produced 
significantly lower PR scores only in the 
low attention to the auditory cue (hence 
centered attention on the PR)- response- 
delayed condition, F (1, 16) = 8.42, 
p < 025. 


DISCUSSION 


The results of the present experiment sup- 
port Easterbrook’s (1959) hypothesis that the 
effect of arousal is to reduce the range of cue 
utilization, and the results are also consistent 
with the conclusions of Bahrick et al. (1952), 
Bursill (1958), and Wachtel (1968) that emo- 
tional arousal decreases responsiveness to pe- 
ripheral stimuli. More important, the data 
indicate that the stimulus loss under arousal 
involves a diminution in S's sensitivity, in- 
dependently of any response criterion changes. 
Moreover, arousal seems to have differential 
effects depending upon the degree of atten- 
tion the stimuli attract, sensitivity loss sys- 
tematically occurring to those cues which 
initially attract less attention. On the other 
hand, the present study offers no support for 
the suggestion that arousal might augment 
Sensitivity to cues which occupy the primary 
focus of attention. One must note, however, 
that the PR is such a demanding and com- 
pelling task that, despite the manipulations 
designed to focus attention on the auditory 
task, it is quite likely that the auditory cue 
never achieved attentional centrality. Thus 
the present study may not have provided 
the optimal conditions for a central enhance- 
ment effect to emerge. 


Some additional evidence emerges from this 
study which may be viewed as an extension 
of the findings that arousal reduces the range 
of cue utilization, A significant decrement 
in both PR performance and Sensitivity to 
the auditory cue was obtained under arousal 
in the low-attention — response-delayed con- 
dition. This seems to suggest that when Sis 
required to perform the PR task while also 
retaining the memory trace of the auditory 
Signal, arousal reduces S's ability to attend 
to both tasks simultaneously. This provides 


support for the notion that arousal reduce 
the number of cues S can handle at 
time, the capacity, in Korchin's (1964) term: 
for dual activity. In addition, these resu 
indicate that arousal introduces further 
pacity limitations within the 
memory system. 

The response-immediate and response-d 
layed manipulations in the present study y 
directed at determining the locus of ef 
arousal in reducing the range of stimu 
are processed. The response-immediate c 
dition was designed to test the elects 
arousal on perception. [t is 
response delays of .5 and 1.0 sec. were 1- 
volved here and thus 
may be implicated. Nonetheless, if aros! 
affects perception of external stimuli by 
tenuating signals arriving over sensory cl 
nels, this would surely have resulted in a 
sensitivity loss in the response-imiiediate 
condition. While a trend towards dimin 
sensitivity was found in the low-attentic 


short-te 


realized that 


memory mechanis s 


response-immediate — shock condition, it cid 
not attain statistical significance. Thus, there 
is little evidence from the present investiga 
tion that arousal acts effectively to impede 
perception of stimulus input. On the other 
hand, the response-delayed condition, designed 
to evaluate the effects of arousal on memory 
processes, did show reduced sensitivity to the 
auditory cue under low-attention — shock con- 
ditions. This suggests that arousal narrows 
the range of stimuli that are processed by 
impairing the memory traces of those signals 
which initially attract less attention. Arousal 
may exert its effect in this condition by both 
overloading the system and thereby exceeding 
the capacity limitations of short-term storage, 
and by interfering with the attentional control 
Processes, such as rehearsal, that aid in 
maintaining information in short-term mem- 
ory. Within this context, it is interesting 
to note that the differential effects of the 
focus-of-attention manipulations in the present 
experiment become evident only within mem- 
ory and under the capacity limitations imposed 
by arousal. The results and conclusions 
presented here are consistent with those ad- 
vanced by Shiffrin and Geisler (1973), that 
attentional effects and capacity limitations 
operate solely within short-term memory 
storage following perceptual processing. 
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When Ss are required to utter rapidly a sequence of visually pres 


ented 


which are closely similar in spelling, but not in pronunciation, such as m 
DOWN or HOME-SOME, hesitations and confusions may result. These 
were found to be less apparent when the same items are so interleaved 
members of “theterophone”’ pairs such as the above are no longer adjac: 
each other, It was shown that this heterophone effect was unlikely to de 
on the Ss scanning ahead or perceiving later words in peripheral vision 


phenomenon was studied further in terms of the times for initiating and 


pleting the articulation of items either separately or in pairs. By varying ‘lie | 


contingencies of presentation and utterance of paired items (heterop! 


a 


and neutrals), it was found that a pronunciation set could be establish: 


cause subsequent interference. 


grapheme-phoneme recoding. 


A phenomenon in English orthography is 
the existence of large groups of words 
such as MOWN-DOWN, QUART—PART, HORSE-— 
WORSE, HOME-SOME, and TOW-COW. Such 
heterophone pairs share largely the same 
spelling, but with some considerable pro- 
nunciation differences. When we encounter 
any one such word in isolation, there is 
usually little difficulty in pronunciation. 
When a number are juxtaposed, the speaker 
may become confused. 

A number of experiments are reported 
where this phenomenon is investigated. 
The findings are later considered within 
the context of phonemic recoding of visually 
presented stimuli ("indirect access" theo- 
ries, Smith, 1971), Morton's logogen theory 
(Morton, 1969), sensory and motor “buffer” 


stores in reading, and the role of peripheral 
vision in reading. 


EXPERIMEN’ | 


The aim of this experiment was to ex- 
amine the possibility that Ss may have 


1 This research was Supported by an award from 
the Australian Research Grants Committee. We 
wish to acknowledge the assistance of Kevin 
Donahoo and Robert Wood in the design and con- 
struction of apparatus. 

* Requests for reprints should be sent to John L. 
Bradshaw, Department of Psychology, Monash 
University, Clayton, Victoria 3168, Australia, 


88 


However, overt articulation was nece 
for this, not merely covert reading or memorizing, and the effect was 

fested in the time taken to initiate articulation, not its duration. The fin 
were discussed within the context of word recognition, pronunciation set 


more difficulty in correctly pronouncing & | 
sequence of words where the members of 
heterophone pairs such as ROOT-FOOT, 
RAID-SAID, BOWL-FOWL, etc., are adjacent 
to each other, than where they are sepa- 
rated from each other by single members 
of other heterophone pairs. 


Method 


Material. Dictionaries were searched to obtait 
48 pairs of heterophone items. The words varie 
from 3 to 8 letters in length, and all except 9 pair 
were single syllables, the remainder being bisyllabie, 
The items were arranged in 4 vertically organize 
lists. In the first list the 2 members of a pair were 
always one immediately above the other, i.e; there 
were no intervening items (List 0). In the next 
the same words were arranged in an interleaved 
form, so that there was always one item between MH 
2 members of a heterophone pair—these mem? 
themselves serving to separate other heterophont 
pairs—thus: HOME-NORTH-SOME- WORTH-SOUGH 
WEAR-DROUGHT-FEAR, etc. There was there 
one intervening item (List 1). In the third list, 
same words were arranged so that there were 3 Es 3 
vening items (List 3), thus: HOME-NORTH-SOUGH 
WEAR-SOME-WORTH-DROUGHT-FEAR, etc. ee 
in the last list (List 7), the words were so interlea 
that there were always 7 intervening items: ‘ea 
HOME-NORTH-SOUGHT-WEAR-ROLL—WEIGHT ROO" | 
THOUGH-SOME-WORTH-DROUGHT, etc. in 24 

Procedure. These 4 lists can be arranged vhich 
possible orders. Twelve of these were selected wht 
permitted 3 different complementary designs 9 rH 
Latin square type. The 12 Ss were all requ nce | 
read aloud each of the 4 lists twice, i.e., in accorda 
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the particular specified sequence and also in the 
of that sequence, thus: 3, 0, 1, 7, 7, 1, 0, 3. 

were timed with a stopwatch while reading, 
were encouraged to read as fast as possible, 
ing mistakes of pronunciation and hesitations 
minimum. They were instructed to continue 
ing if they did make a mistake, and not to go 
and correct it. The Ss, 4 males and 8 females, 
wee staff or postgraduate student members of the 
de; artment of psychology of Monash University. 


esulls and Discussion 


The means of the reading times for the 4 
ists, across the 12 Ss, were: List 0, 41.5 
ec.; List 1, 38.3 sec.; List 3, 36.7 sec.; 
nd List 7, 37.0 sec. A 2-way analysis of 

variance (within Ss) showed a significant 
difference between the lists, F (3, 33) = 
34.4, p < .001. In fact List 0 was the 
longest for all 12 Ss, without exception, and 
with only one exception, List 1 took a 
longer time to read than List 3. 

It seems therefore that when Ss are 

,uired to read lists of such heterophone 
items, as fast as possible consistent with a 
minimum of mistakes, there is considerable 

interference both when the items are ad- 

jacent, and even, although to a lesser ex- 
tent, when they are interleaved so that 
they are each separated by a "neutral" 
item. 

A number of possible explanations will 
be discussed later. One possibility, other- 
wise of little theoretical interest, is simply 
that Ss were either scanning ahead or, 
what comes to the same thing, able to see 
the next member of the pair in peripheral 
vision, This might have caused interfer- 
ence in the case of adjacent heterophones. 


EXPERIMENT II 


There is perhaps a fairly simple test to 
see whether a word beyond the one cur- 
rently being processed can interfere with 
the current one. In English there are a 
number of heteronyms, such as LEAD 
(rhyming with either BEAD or DEAD, de- 
pending upon meaning), ROW, SOW, and 
BOW (rhyming with either HOW or LOW), 
TEAR (rhyming with either PEAR or FEAR), 
WIND (TINDER or BINDER), and WOUND 

ÐUNDED Or BOUNDED). If S encounters 
.EAD followed by BEAD (or DEAD) in a list, 
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is he more likely to pronounce it in rhyming | 
fashion? Lists were constructed, contain- 

ing filler and critical items such as the 

above, to see whether scanning ahead (or 

peripheral vision) tends to influence the 

pronunciation of ambiguous heteronyms. 


Method 


Material. Two vertical lists of single words were 
constructed. Both contained identical filler items— 
rhyming pairs such as SINK-DRINK, STONE-BONE, 
etc., interspersed with heterophone pairs such as 
OUR-FOUR, NEVER-LEVER, etc. In this way it was 
hoped to avoid the establishment of a set or expect- 
ancy for complete or partial rhyming. At certain 
positions 6 critical items were located. In one list 
these were BOW-COW, TEAR-FEAR, LEAD-BEAD, ROW-— 
LOW, WIND-BINDER, WOUND-BOUNDED. In corre- 
sponding positions in the other list were found BOW- 
TOW, TEAR-PEAR, LEAD-DEAD, ROW-HOW, WIND- 
TINDER, WOUND-WOUNDED, 

Procedure. Each of the 9 Ss, 3 males and 6 
females, was required to read the 2 lists aloud, as 
though he was being timed. In fact it was noted 
whether or not the Ss pronounced the critical, 
ambiguous heteronym item to rhyme with the 
following word. The Ss were given the 2 lists one 
before and one after another experiment lasting. 
approximately 1 hr. Half of the Ss were given one 
of the 2 lists first, and the others were given the other 
list first. 


Results 


The critical items were scored as rhyming 
or not rhyming. If S pronounced either 
the first or second word of a pair incor- 
rectly, this was scored as an error. The 
total number of rhymes, nonrhymes, and 
errors are shown for each list in Table 1, 
as are the totals over the 2 lists. Table 2 
shows the number of times Ss pronounced 
the given test word the same or a different 
way in each list. Table 3 shows how each 
test word was pronounced, if pronounced 
correctly, on a total of 18 presentations. 


TABLE 1 


Pronunciation ae First list T Second list Total 
Rhymed | 2i 22 51 
Not rhymed |a: 31 52 
Errors | 


1 | 5 
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TABLE 2 


TOTAL NUMBER OF TIMES SUBJECTS PRONOUNCED A 
WORD THE SAME WAY IN Born Lists 


Test word Same Different Errors 
BOW 6 2 1 
TEAR 8 1 0 
LEAD 6 2 1 
ROW 6 2 1 
WIND 8 1 0 
WOUND 6 1 2 

Discussion 


The results shown in Table 1 suggest that 
scanning ahead did not play an important part 
in the reading of these lists. Overall, Ss were 
equally likely to rhyme or not rhyme the 
critical items. However, this may merely be 
due to a strong overriding tendency for Ss to 
pronounce the ambiguous test word in one 
particular way. Table 2 shows that this was 
the case—the test words were pronounced the 
same way in both lists on 40 occasions and 
differently on 9. This does not mean that Ss 
agreed on their preferred pronunciation. Table 
3 shows that only 2 words—TEAR and winp— 
were pronounced in the same way by a ma- 
jority of the Ss. 

If scanning ahead does occur it does not 
appear to have sufficient influence to override 


TABLE 3 
PRONUNCIATION AND ERRORS FOR EACH Test WORD 
FOR THE TWO Lisrs COMBINED, OVER NINE 


Supjects 
Test word | Pronounced as in: | Noof | errors 

BOW ae 2 1 
Cow 9 

ian Pear 15 0 
Fear 3 

LEAD Deag E 1 
Bead 

x How 8 j 
Low 9 
Ti 

SIN deg [5 ^ 
Binder 
Wounded 9 

WOUND 2 
Bounded ' 7 

| 


Ss' tendency to pronounce a given test word in 
their own preferred way. It therefore seem-^ 
unlikely that the heterophone effect in Experi 
ment I was due to Ss' scanning ahead, or per 
ceiving a later word in peripheral vision. 

This experiment could be repeated with th: 
ambiguous words (BOW, TEAR, LEAD, etc.) 
occurring after instead of before the word 
with which they could be rhymed (cow—Bow 
TOW-BOW, etc.). This would show whether 
the first word encountered influences the sec- 
ond. However, there is evidence for this in the 
present data, because all 5 errors were due to 
Ss' choosing a particular pronunciation of the 
first word and then making a nonword of the 
second word by rhyming it with the first, e.g 
pronouncing the pair Bow and Tow as if to 
rhyme with cow. Thus the heterophone effect 
in Experiment I may be the influence of an 
earlier word on a later one, rather than the 
influence of one seen below in peripheral vision 
on the one currently being processed. 


EXPERIMENT III 


In the first experiment, it was found that 
it took longer to read material correctly 
when the members of heterophone pairs 
were adjacent than when they were sepa 
rated by intervening items. If only one 
member of a pair is articulated, the first 
being merely observed or subvocally identi- 
fied, does the heterophone effect persist? 
If it does, it would imply that overt articu- 
lation may not be necessary to cause inter- 
ference. Successive pairs of words were 
therefore presented on a computer-con- 
trolled oscilloscope, Ss being required to 
observe the first and name the second, 
vocal initiation times being obtained. 


Method 


Material. Because of equipment and other con- 
straints, words of only 4 letters were employed, A 
total of 25 heterophone pairs were obtained and a 
PDP-8/S computer was programmed to present 
them on a Tektronix RM 503 oscilloscope screen. 
Each item consisted of a pair of words, either a 
heterophone or a neutral pair. The latter consisted 
of one from each of 2 otherwise unrelated hetero- 
phone pairs. Thus the 2 heterophone pairs WART- 
PART and HIVE-GIVE generated the 2 neutral pairs 
WART-GIVE and HIVE-PART. Heterophone and 
neutral pairs were pseudorandomly intermingled, 
with certain constraints to avoid uneven sequencing 
effects. Every word appeared both at the beginning 
and end of a heterophone and a neutral pair, i.e., it 
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occurred somewhere in each of the 4 possible posi- 
tions. Thus the 25 heterophone pairs generated 50 
heterophone items, taking sequencing of members 
into account. There were also, correspondingly, 50 
neutral items, 

Procedure, The Ss sat in front of the oscilloscope 
screen, at a distance of 40 cm., viewing it through 
a tachistoscope-type viewing hood. On each trial 
the E said "Ready" and S pressed a hand-held 
button. The first words then appeared in a faintly 


‘luminous target area for a total of 237 msec. In 


fact it was "written up" 3 times, each left-to-right 
scan taking 79 msec. After a blank interstimulus 
interval of 193 msec., the second word appeared in 
the same location, also for 237 msec. The Ss were 
required to note the first word but to make no verbal 
response. They shouted out the second word as 
soon as they could, closing a voice-operated switch 
which stopped a timing routine begun at the offset 
of the second word. The initiation time thus ob- 
tained was printed out on the teletype. There were 
8 new Ss, 5 females and 3 males, Any stutters or 
articulatory blocks were excluded from analysis. 
However, over all Ss, only 5 neutral and 7 hetero- 
phone items were so excluded. 


Results and Discussion 


The average time to initiate a verbal re- 
sponse (naming), after the offset of the 
second word (duration of 237 msec.) was 
286 msec. for heterophone pairs and 288 
msec. for neutrals. Thus there was a 
minimal and nonsignificant difference in a 
direction opposite to the one predicted, only 
3 of the 8 Ss producing longer vocal initia- 
tion times for heterophone pairs, £ (ny es. 
P > .8. It would seem that under the con- 
ditions of this experiment, the second item 
of a heterophone pair can be initiated as 
readily as the second item of a neutral pair. 
Any slight facilitation for heterophone pairs 
may perhaps stem from easier decoding, re- 
sulting from the shared letters appearing 
spatially superimposed. 


EXPERIMENT IV 

When S covertly identified but did not 
articulate the first member of a pair of 
words, the expected heterophone effect 
failed to materialize in the initiation times 
for articulating the second word. Can it 
be made to reappear, under the conditions 
of separate paired presentations when S 
also articulates the first word and thus 
triggers the presentation of the second? In 
this experiment not only initiation times 


but also duration times were obtained for 
the articulation of the second word, 


Method 


Material and subjects. The same material and 
sequence were employed as in Experiment IIT, 
However, a totally newgroup of 8 volunteer Ss, 4 
males and 4 females who were entirely naive as to 
the material being used, was obtained. 

Apparatus and procedure, The computer was pro- 
grammed to present the first word of a pair for 237 
msec. when S pressed the hand-held “ready” button, 
Exactly 388 msec. after S started to name the first 
word, the second word appeared, also for 237 msec, 
The computer and auxiliary equipment were pro- 
grammed to time both initiation (after offset of the 
second word) and duration of S’s vocalization of the 
second word. The long interval (388 msec.) was 
chosen to insure that vocalization of the first word 
was completed before the second word went off 
and to insure that there was a gap between the 2 
articulations, This permitted the timing of the 
parameters relating to the second word alone, 
Measurement of the duration of the second word was 
achieved by the construction of a special device, 
Since there are silent periods within articulated 
words, a voice-operated switch is not kept closed 
until the offset of the last voiced period, but tends to 
oscillate between its open and closed positions as 
it attempts to follow the intensity changes in the 
articulation, The voice-operated switch was there- 
fore used to start a chronoscope and a specially con- 
structed timer, which was reset each time it received 
à pulse from the voice-operated relay, If it did not 
receive a pulse this timer timed out after a preset 
interval (100 msec, in this experiment) and stopped 
the chronoscope, which thus gave the duration of 
the word plus 100 msec. The computer was pro- 
grammed to reset the chronoscope and the timer at 
the commencement of each trial, and to hold them in 
this mode until the second word had been displayed. 
The initiation time was determined as before by a 
timing loop in the computer and printed out on the 
teletype. The Æ monitored § and the equipment 
and noted any trials on which a preliminary stutter 
was timed instead of the whole word. Over all Ss, 
13 neutral items and 14 heterophones were excluded 
from the analysis of initiation times, and 22 neutral 
items and 19 heterophones from the anal of 
duration times. Some initial technical problems 
led to the exclusion of the extra duration times. 


Results and Discussion 


This time, all 8 Ss showed faster initia- 
tion times to the second word with hetero- 
phone pairs than with neutral pairs, £ (7) = 
4.9, p < .002, again in the direction con- 
trary to that predicted. The mean for 
heterophones was 273 msec. (after stimulus 
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offset), and for neutrals 289 msec. For 
some reason the trend observed in Experi- 
ment IlI became highly significant, al- 
though it should be noted that 5s now had 
to articulate the first word. No difference 
in duration times was observed between 
heterophones (243 msec.) and neutrals 
(245 msec.), 4 Ss showing very small dif- 
ferences in one direction, and 4 in the other. 
Obviously any effects operate, upon the 
setting up of articulatory programs, not 
upon their operation. 

The continued absence of the expected 
heterophone effect could possibly be due 
to the fact that the 2 words, although 
spatially superimposed, were temporally 
separated, as were the 2 verbal responses. 
It is important to see whether the strong 
effect obtained in serial list reading can be 
reproduced in the discrete situation of 
paired presentations. 


EXPERIMENT V 


In this experiment both words were pre- 
sented in immediate succession, one above 
the other. The Ss were required to articu- 
late both together, and as before both initia- 
tion and duration times were measured. It 
was hoped that this way the predicted 
heterophone effect would reappear. 


Method 


Procedure. Using the same stimuli as before, and 
a new group of 8 naive Ss, the 2 words were pre- 
sented in immediate succession. The first word was 
written up twice and thus appeared for 158 msec. 
It was immediately followed by the second word, 
again for 158 msec., but this time directly below the 
first one, to avoid masking and facilitative effects 
which could result from superposition. The Ss were 
required to name both words as fast as possible, 
both initiation and duration times being measured 
as before. In order to insure that the chronoscope 
measuring duration times was not stopped by a 
short pause between the 2 words, the timer was set 
to time out after 400 msec. The chronoscope thus 
gave the duration time for the 2 words plus 400 
msec., a constant subsequently subtracted from the 
data. 


Results and Discussion 


| [In this experiment, with the initiation 
- times the expected heterophone effect re- 
appeared, the heterophones being on aver- 
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age 26 msec. longer than the neutrals, 
t (7) = 2.02, p < .05, one-tailed. Only one 
of the 8 Ss gave a contrary result. The 
mean for heterophones was 276 msec., and 
for neutrals 250 msec. With the duration 
times, there was also an 11-msec. decre- 
ment with heterophones (701 msec.) as 
compared with neutrals (690 msec.), though 
this failed to reach significance, / (7) = 1.7, 
p > .1, being found only with 5 of the 8 Ss. 
Thus the expected heterophone effect can 
be made to reappear with discrete presenta- 
tions of paired stimuli. However, it is much 
weaker than that originally found with the 
continuous list-reading material. |n Ex- 
periment I there was a difference of over 
4.5 sec. between lists at the 2 extremes, each 
being composed of 48 pairs of heterophones 
and 48 neutrals. Thus there was in effect 
a difference of over 90 msec. per ir 
If the initiation and duration differences 
from Experiment V are added, there is a 
difference of little more than 38 msec. per 
pair. Consequently, it seems that much 
of the original heterophone effect stemmed 
from the continuous forced pace of articula- 
tion, not present with discrete presenta- 
tions. 


ore 
I 


Experiment V established some of the 
conditions necessary to demonstrate the 
heterophone effect with single word pairs. 
Whether actual articulation of both words 
is necessary for interference to occur is 
still uncertain, although this was the 
starting point for Experiment III, Experi- 
ment V was therefore repeated without 
articulation of the first word. 


Method 


Procedure. Exactly the same Ss and procedure 
were used as in Experiment V except that instea 
of articulating both words, Ss were required to note 
the first word, articulate the second as quickly as 
possible, and then, after the teletype had finished 
printing out the response time, tell E what the first 
word had been. There was an interval of approxi- 
mately 3 sec. between presentation of the first word 
and S's being free to report it. This depended on 
how long S took to respond with the second word. 
As before, both initiation and duration times were 
measured for the second word presented, i.e., the 
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first word called out by S. The timer controlling 
the chronoscope was set to 100 msec., and this value 
was subtracted from the chronoscope readings. 


Results and Discussion 


While the heterophones (508 msec.) still 
gave rise to longer initiation times than the 
neutrals (493 msec.), the difference (15 
msec.) was smaller and 3 Ss gave results 
in the opposite direction, making the effect 
nonsignificant. The duration times were 
nonsignificantly longer for the neutrals 
(349 msec.) than for the heterophones (342 
msec), ££ 17.5 4." The heterophone 
effect seems therefore to require S to articu- 
late both members of the pair. 


GENERAL DISCUSSION 


When people are required to read aloud a 
continuous list of material consisting of paired 
heterophones such as WART—PART, etc. (Experi- 
ment I), they find it more difficult than when 
the heterophone members are separated by 
“neutral” items (in reality, other heterophone 
members similarly separated). Some consider- 
able residual difficulty persists even when there 
is one intervening item between successive 
members of a heterophone pair. The effect 
can also be found with discretely presented 
pairs of stimuli (Experiment V), when it is 
seen to depend largely upon the time taken to 
initiate the appropriate verbal response rather 
than the duration of that response. However, 
the cumulative, interactive effect of load with 
a continuous list appears important, the effect 
being very much weaker in the discrete mode of 
presentation (Experiment V), and nonexistent 
when the second word is presented only after S 
articulates the first (Experiment IV). 

The effect demands the articulation of each 
item, and is not apparent in the initiation time 
for the second item when S merely notes the 
first item (Experiment III) or temporarily 
stores it for recall after pronouncing the second 
(Experiment VI). Moreover, the greater ef- 
fect in the continuous list-reading task cannot 
be due solely to scanning ahead or to the em- 
ployment of peripheral vision, since the pro- 
nunciation of ambiguous heteronyms such as 
LEAD, WIND, etc., is apparently not influenced 
by subsequent words such as BEAD-DEAD, 
TINDER-BINDER (Experiment II). Kolers and 
Lewis (1972) review the role of peripheral 
vision and eye movements in vision. They 
conclude that peripheral vision may indeed be 


more than simply a ranging instrument guiding 
the fovea. Some content analysis seems possi- 
ble from input presented through several 
degrees of visual angle. However, this is 
successful only when all the information, foveal 
and peripheral, is part of a single word, and 
not 2 or more words distributed between the 
2 areas. Kolers and Lewis conclude that 
"simultaneous processing of different linguistic 
units from center and periphery is not a 
strategy that the rapid reader could profitably 
engage in [p. 123]." 

Failure to produce a heterophone effect in 
either of the 2 experiments where .$ merely 
"noted" or "stored" the first item (Experi- 
ments ITI and VI) seems to go against Morton's 
(1969) logogen theory. This states that any 
act of storage or identification of a word calls 
up a motor (articulatory) program, which 
would interfere with any other such item being 
processed. However, it is possible that in both 
these experiments the material persisted at a 
satisfactory and noninterfering level in some 
form of sensory "buffer" store. 

It is debatable whether phonological recod- 
ing inevitably precedes or accompanies further 
processing—the indirect access theory (criti- 
cized by Smith, 1971). The latter probably 
owes its origins to work on short-term memory 
and acoustic confusions, (Baddeley, 1966; Con- 
rad, 1963, 1964). However, these effects may 
well be memory and load dependent; certainly 
in such studies ample time was always avail- 
able, and the stimuli were frequently single 
letters. A similar comment may be made with 
respect to the visual buffer store studies of 
Posner, Boies, Eichelman, and Taylor (1969) 
and Sternberg (1969). Rubenstein (Ruben- 
stein, Garfield, & Millikan, 1970; Rubenstein, 
Lewis, & Rubenstein, 1971) found that homo- 
phonic nonsense words (BRANE, SPAID, etc.) 
require longer decision times as to whether 
they are real words than nonhomophonic non- 
sense words (BRATE, SPOID), perhaps because 
the former excite more false matches requiring 
rejection. Other support for the inevitability 
of phonological recoding comes from Dick 
(1971)—naming precedes categorization, from 
Klapp (1971)—implicit speech cannot be in- 
hibited even in a same-different matching task 
where it is not required, and from Dalrymple- 
Alford (1972)—it is difücult to suppress the 
reading responses in a Stroop task. 

On the other hand, proponents of the direct 
access theory (cf. Bower, 1970; Kolers, 1970; 
Smith, 1971) would claim that phonological 
representation is not a necessary part of the 
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recognition of a visually presented word, per- 
haps being superfluous in the case of practiced 
readers, familiar material, and high reading 
speeds. 

From the studies reported here, it does seem 
that actual and overt articulation of both items 
is necessary to produce the interfering effect 
with heterophones. Thus an articulatory set 
may be briefly established, which is briefly 
linked to a particular configuration of letters— 
a "grapheme-phoneme correspondence" (Gib- 
son, Pick, Osser, & Hammond, 1962). 
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The positive and negative componen 
were investigated in a differential cl 
a 10-sec. interstimulus interval (ISI). 
Ana 


respons 


comparisons revealed that Ss e: 


t 


of the n potential response (SPR) 
l conditioning paradigm employing 
"he Ss were 30 undergraduate females. 


s of variance indicated that conditioned effects, measured in terms of 
frequency, were present in both 
occurred in early as well as late portions of the ISI. 
hibited a predominant response polarity and 


SPR components, and that these 
Intrasubject response 


displayed both conditioned effects and orienting behavior in terms of that 


polarity. 


In classical conditioning of electrodermal 
responses, a great deal of attention has been 
devoted to skin resistance while only a few 
studies have been concerned with skin po- 
tential. Additionally, the work dealing 
with the latter response has largely been 


characterized by a lack of conditioned 
effects, possibly resulting from inappro- 
priate control and scoring procedures. 


Loveless and Thetford (1966) reported 

failure to observe skin potential response 
(SPR) conditioning. However, their de- 
sign, incorporating test trials interspersed 
among conditioning trials in a single group, 
may have precluded a meaningful assess- 
ment of conditioned effects. Similarly, 
Yamazaki, Watanabe, and Niimi (1969) 
employed test trials during a series of 
simultaneous conditioning trials and evalu- 
ated conditioning in terms of the relative 
amplitudes of positive and negative SPR 
components on test trials, They con- 
cluded that since the negative responses 
increased across test trials while the posi- 
tive component responses decreased, the 
negative component was more likely to re- 
flect a conditioned effect. While interest- 
ing, this interpretation is difficult to sustain 
since the design employed lacks an ap- 
propriate conditioning control (Rescorla, 
1967). Several studies have adequately 
dealt with the control problem but have 
failed to produce unequivocal results. 
Kotses (1969), in a study employing un- 


! Requests for reprints should be sent to Kathleen 
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Procedural considerations in SPR conditioning were also discussed. 


paired control groups where simultaneous 
recordings of skin potential and skin re- 
sistance were made, found no evidence of 
conditioning in either SPR component, al- 
though the skin resistance response (SRR) 
showed a strong conditioned effect. In this 
case, SPR conditioning may have been ob- 
scured by the independent groups design, 
by the scoring system employed,? or possi- 
bly by the fact that pairing effects were 
evaluated in terms of response amplitude. 
Finally, successful conditioning of the posi- 
tive potential, in terms of response mag- 
nitude, was reported by Shmavonian, 
Miller, and Cohen (1968), who utilized a 
differential conditioning paradigm in a 
study designed to examine age and sex 
differences in autonomic conditioning. Dif- 
ferential responding was demonstrated only 
in the young Ss (ages 17-25), and no evi- 
dence for negative potential conditioning 
was obtained. The lack of more general 
conditioned effects in this study may have 
been due to attenuated overall responsive- 
ness resulting from the presentation of a 
series of adaptation trials prior to the 
acquisition session. 

The predominant use of the SRR and 
subsequent neglect of the SPR in electro- 
dermal conditioning have been due largely 


Negative component amplitudes were scored as 
the sum of all negative potentials following onset of 
the conditioned stimulus. These were measured 
between negative response onset and peak negative 
level. Positive component amplitudes were mea- 
sured between peak negative and peak positive 
levels. Only one positive deflection but as many as 
2 negative deflections were recorded per trial. 
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to problems encountered in the evaluation 
and interpretation of the positive, negative, 
and multiphasic potential wave forms 
(Holmquest & Edelberg, 1964). While the 
SRR does offer certain advantages in terms 
of simplified scoring and interpretation, 
these may be outweighed by distortions 
resulting simply from the passage of an 
electric current through the skin which may 
act in some way to alter skin structures. 
The SPR is inherently free of this source 
of artifact and its complex wave forms may 
offer a further advantage in yielding ad-di 
tional information not available in the uni- 
phasic SRR (Edelberg, 1967). With regard 
to the latter, a number of investigators 
have suggested that the component wave 
forms of the SPR may be differentially 
sensitive to conditioning (Shmavonian et 
al., 1968; Yamazaki et al., 1969) and to 
various perceptual and cognitive factors of 
psychological interest (Burstein, Fenz, 
Bergeron, & Epstein, 1965; Forbes, 1936; 
Forbes & Bolles, 1936; Hupka & Levinger, 
1967; Raskin, Kotses, & Bever, 1969). 
Thus, of the 2 electrodermal responses, the 
SPR is potentially a less ambiguous mea- 
sure than the SRR and one which may yield 
additional information concerning complex 
variables in classical conditioning. 

The purpose of the present study was 
twofold. First, an attempt was made to 
determine which components of the SPR 
are modified in classical conditioning. Thus 
the relative effect of conditioning on the fre- 
quency of positive and negative potentials 
was examined. Second, the differential 
effect of conditioning on long and short 
latency responses was assessed. This aspect 
was of interest in light of inconsistent evi- 
dence from SRR studies dealing with the 
temporal locus of conditioned effects in a 
long interstimulus interval (ISI). While it 
isa general finding that responses appear- 
ing late in a long ISI are reliably affected 
by pairing (Gale & Ax, 1968; Gale & Stern, 
1967; Leonard & Winokur, 1963; Mc- 
Donald & Johnson, 1965; Prokasy & Ebel, 
1967; Stewart, Stern, Winokur, & Fred- 
man, 1961), some studies have failed to 
demonstrate conditioned effects for early 
responses (Leonard & Winokur, 1963; Mc- 


Donald & Johnson, 1965; Stewart et al 
1961). This problem appeared to be < 
some significance with respect to SPRs be 
cause of the possibility that potentials « 
different polarity would be associated wit 
early and late portions of the ISI. Finall: 
an attempt was made in the present stuc 
to avoid certain of the problems obser 
in previous SPR conditioning work. Spx 
ically, a differential conditioning paradi; 
not preceded by an adaptation session v 
employed to provide a precise intrasubj: 
control for sensitization and to elimin 
the uncertain effects of prior habituat 
Additionally, response frequencies, rà: 
than magnitude measures, were obtain 4 
since the former would seemingly be ! 
sensitive to artifact introduced by ini 
actions between subsequent positive ai | 
negative wave forms (Lykken, Miller, & 
Strahan, 1968). 


METHOD 


Subjects. Thirty female undergraduate psychol- 
ogy students, ranging in age from 18 to 23 
se Subjects were volunteers and re- 
ceived extra course credit for participation. 

Apparatus. Skin potentials were picked up b 2 
Beckman Ag-AgCl electrodes having a diameter of 
1.5 cm., amplified by a Grass model 7P1B low-level 
de preamplifier, and displayed on one channel of a 
Grass model 7B polygraph operated at a paper speed 
of 25 mm/sec. The electrodes were attached to 
the skin with Beckman electrode collars, and Beck- 
man electrode paste served as the electrolyte. 
Lights serving as conditioned stimuli (CSs) con- 
sisted of 2 7.5-w. white lights having a duration of 
10sec. These were mounted side by side in 5 X 5 X 
10 em. individual metal compartments which were 
completely enclosed with the exception of the side 
facing S. This side was covered by 2 5X 5 cm. 
metal slides whose centers consisted of an open 
circle 4.5 cm. in diameter. Red and yellow sheets of 
translucent plastic were placed over the circles to 
diffuse the light and provide chromatic dillerences 
between the 2 signals. These stimuli were pre- 
sented at eye level approximately 5 ft, in front of 
the seated S. The unconditioned stimulus, a 500- 
msec. 112-db. white noise, was produced by a 
Grason-Stadler model 455B white noise generator, 
amplified by a locally constructed audio amplifier 
and presented to S via Koss PRO-4 headphones. 
Onset and duration of both visual and auditory 
stimuli were controlled by BRS-Foringer solid-state 
control equipment. During recording, S was seated 
in a comfortable armchair located ina 3 X 5m. room 
with an ambient sound level of 45 db. All sound 
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levels were measured with a General Radio sound- 
level meter, model 1565-A. 

Procedure. A differential delay classical condition- 
ing paradigm with a 10-sec. ISI was employed. 
Twenty CS+ and 20 CS— trials, wherein CS color 
was counterbalanced across Ss, were presented in 
random order with the stipulation that no more than 
3 trials of either type occur in succession. Intertrial 
intervals varied between 40 and 80 sec. around a 
mean of 60 sec. 

Upon arrival at the laboratory, S was escorted 
into the experimental room where Æ applied the 
electrodes, Electrode sites were cleaned with a 
70% solution of ethanol, and a Kimwipe was used 
to abrade the skin under the inactive electrode. 
The active electrode was positioned over the thenar 
eminence of the left hand and the reference was 
attached to the volar surface of the left forearm 
approximately 8 cm. below the elbow. A ground 
electrode was attached to the dorsal surface of the 
right hand. Standard instructions read to S stressed 
the importance of refraining from unnecessary move- 
ment and remaining awake and alert during the 
session. Following this, E positioned the head- 
phones on S and entered the recording room to 
initiate the session, 

Scoring of skin potentials. The ISI was 
into 2 scoring intervals, .4-5.0 sec, and 5. 
after CS onset, and all 1 potentials we 
as either first- or second-interval response: 


iccording 
to their onset latencies, Since large changes in the 
tonic skin potential level were rarely observed dur- 
ing the ISI, the base level at stimulus onset on each 
trial was employed as a reference from which the 


polarity of subsequent phasic responses was deter- 
mined. Thus, uniphasic potentials were scored as 
negative or positive responses, respectively, accord- 
ing to whether pen deflection was in the direction of 
increasing or decreasing negativity relative to the 
reference level. When 2 or more responses occurred 
in succession (diphasic and polyphasic responses 
additional criteria were employed. Specifically, if 
all phasic responses were in the region of increased 
negativity relative to the reference level, successive 
negative-going deflections were scored as negative 
responses while positive-going movements of the pen 
were judged as returns to base level. Similarly, for 
phasic responses in the region of decreased negativ- 
ity, successive  positive-going deflections were 
scored as positive respor and negative-going 
movements of the pen were as returns to base 
level. In cases where the peak-to-peak excursion of 
the pen crossed the reference level, successive peaks 
were scored, as were uniphasic responses, according 
to the polarity of the region in which each occurred. 

Only deflections representing a .5-my. or greater 
change in potential and having a slope of at least 
10° were counted as responses. Artifact due to 
movement or to faulty electrical contact was elimi- 
nated on the basis of extremely high slopes (ap- 
proaching 90°) and irregular wave shape. 

The frequency measures obtained included all 
criterion responses occurring on each trial since, at 


present, no strong theoretical basis exists for re- 
stricting attention to any one response and since the 
total frequencies reflected the relative degree of 
positive and negative skin potential activity. 


RE 


Conditioned effects in positive and nega- 
tive potentials were observed in both por- 
tions of the ISI. Separate 4-way repeated 
measures analyses of variance for the 2 
wave forms, wherein intervals, pairing, and 
trials were considered fixed factors and Ss 
a random factor, supported these observa- 
tions, These analyses vielded reliable pair- 
ing effects for both positive and negative 
potentials, F (1, 29) = 28.05 and 43.34, re- 
spectively, p < .001. Furthermore, as 
shown in Figure 1, conditioning appeared 
to be more pronounced in the second in- 
terval for both positive and negative 
potentials. This observation received only 
partial statistical support, however, since a 
reliable Pairing X Intervals interaction was 
obtained only for the negative response, 
F (1, 29) = 9.84, p < .005. Additionally, 
it should be noted that tests of simple main 
effects nesting pairing within intervals for 
the negative response resulted in reliable 
pairing effects for the first, F (1, 58) 
= 11.05, p < .005, as well as the second, 
F (1, 58) = 51.04, p< .001, interval. 
Finally, reliable Pairing X Trials inter- 
actions were obtained for both positive, 
F (19, 551) = 3.58, p < .001, and negative, 
F (19, 551) = 1.89, p < .01, potentials, 
which appeared to reflect the typical curvi- 
linear nature of autonomic conditioning 
functions. 

The overall frequency of responding was 
higher in the first interval than the second 
for both positive, F (1, 29) = 24.95, 
p < .001, and negative, F (1, 29) = 39.43, 
p < .001, potentials. It was also apparent 
that the negative potential evidenced a 
degree of response decrement across trials 
in the first, F (19, 551) = 2.34, p < .001, 
but not in the second interval, despite a 
reliable overall trials effect, F (19, 551) 
= 3.59, p < .001. In the positive response, 
only a trials effect was obtained, F (19, 551) 
= 6.66, p < .001. 

Examination of intrasubject response fre- 
quencies in both ISI intervals revealed 
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additional details regarding the relative 
effect of conditioning on positive and nega- 
tive potentials. These data indicated that 
in terms of overall frequency, Ss tended to 
show a predominant response polarity. On 
the basis of the within-Ss algebraic differ- 
ence between frequency of responses to 
CS+ and CS—, [f(CS+) — J(CS—)]. 
across the entire ISI, pairing effects were 
observed in 29 of the 30 Ss, but these were 
greater across negative potentials in 20 .Ss 
and across positive potentials in 7 Ss, while 
2 Ss failed to show a distinct pattern. 
Individual-data also showed that in spite 
of overall conditioned response latencies, 
Ss showing greatest conditioning across 
positive potentials exhibited such differ- 
ences primarily during the first half of the 
ISI (5 of 7 Ss), while those conditioning 
in the negative response showed greatest 
differentiation in the second interval (15 of 
20 Ss). A Fisher's exact probability test 
performed on these frequencies revealed 
that they were statistically significant, 
p < .05. 

Since Ss exhibited a strong tendency to 
condition primarily in terms of a single 
polarity, individual total responsiveness 


was examined to determine the degrec to 
which each S responded in a consisient 
fashion to conditional and control sti»iuli. 
In 20 of the 30 Ss, at least two thirds of the 
total responses made were of like polarity. 
Of the 20 Ss showing such a degree of re- 
sponse predominance, 16 were negative 
responders while 4 responded positively. 


DISCUSSION 


The results of the present study indicate 
that both components of the SPR are modified 
in classical conditioning, and further that con- 
ditioned effects occur in early as well as late 
portions of the ISI. Previous related research 
dealing with differential behavior of positive 
and negative potentials in classical conditioning 
has led to some inconsistent conclusions. 
Yamazaki et al. (1969) stated that the effects 
of conditioning are observed primarily in the 
negative potential, while Shmavonian et al. 
(1968) argued that the positive potential is 
most sensitive to pairing. "The current re- 
search yielded no evidence that conditioning 
is manifested exclusively in either component. 
Although there appeared to be a greater pair- 
ing effect in terms of the negative component, 
this simply may have been a function of higher 
negative response frequency. 
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Dividing the ISI into 2 intervals and examin- 
ing responses within these intervals separately 
permitted a determination of the temporal 
characteristics of the conditioned activity and 
provided some information with respect to the 
orienting response (OR). A comparison of 
short and long latency conditioned responses 
showed that conditioned effects for the nega- 
tive potential are more pronounced in the 
latter portion of the ISI. This finding is in 
accord with the results of previous SRR studies 
employing long ISIs in that, in these studies 
as a whole, conditioned effects appeared more 
consistently later in the interval (Gale & Ax, 
1968; Gale & Stern, 1967; Leonard & Winokur, 
1963; McDonald & Johnson, 1965; Prokasy 
& Ebel, 1967; Stewart et al., 1961). It should 
be noted, however, that with the negative 
SPR and probably with the SRR, the atten- 
uated pairing effects observed initially may 
result from OR components present primarily 
in the first portion of the interval. In the 
present study this was supported by the in- 
creased frequency of response to the CS- early 
in the ISI for the negative response. Elevated 
CS— response frequencies were also observed 
in the first interval for the positive potential; 
however, in this case, attenuated early pairing 
effects did not result. 

Related to the observations concerning re- 
sponse frequency to the unpaired stimuli are 
the findings of investigators who have ad- 
dressed themselves directly to the relationship 
between SPR polarity and orienting behavior. 
As with the conditioned response, some have 
associated orienting with the negative com- 
ponent (Raskin et al., 1969; Shmavonian et al., 
1968), while others have assigned this property 
to the positive component (Loveless & Thet- 
ford, 1966; Yamazaki et al., 1969). In the 
present study, elevated frequencies were ob- 
served in the first interval for both potentials, 
suggesting that the OR may be manifested in 
either component. 

Intrasubject response comparisons permitted 
a more precise interpretation of the effects of 
pairing on positive and negative skin potential 
components. Most Ss exhibited a degree of 
differential responding in terms of both com- 
ponents. However, there was a tendency for 
Ss to condition predominantly in terms of one 
component or the other. This propensity may 
reflect a stable characteristic of individual re- 
sponsiveness, but probably can be better ac- 
counted for in terms of the interaction between 
individual and situational response deter- 
minants. The latter possibility is more likely 


in view of studies relating SPR components to 
stimuli of varying levels of threat, apprehen- 
sion, or intensity (Burstein et al., 1965; Forbes 
& Bolles, 1936; Raskin et al, 1969). The 
previous work dealing with such relationships 
has been limited somewhat by its failure to 
consider individual differences in response to 
the stimulus. A model of SPR polarity in- 
corporating individual response differences of 
this nature appears to be required in the inter- 
pretation of the results of the present study, 
since the same stimuli elicited positive and 
negative SPR components to varying degrees 
in different Ss. 

An additional finding apparent in the data 
of individual Ss concerned the principal locus 
of differential responding for individuals ex- 
hibiting a predominant response polarity. 
Subjects showing a conditioned effect primarily 
in terms of the negative component displayed 
greater differential responding in the latter 
half of the ISI, while those showing more 
positive component conditioning did so earlier. 
"This effect, which may in part account for the 
lack of differential pairing effects between the 
first and second intervals in the grouped data 
for the positive response, was probably due to 
the habituation rates of responses to the C 
which appeared to be greater for the positive 
than for the negative potential in the firs 
interval. While the reason for differenti: 
rates of habituation of positive and negative 
components is unclear, the effect has been 
noted by other investigators (Loveless & Thet- 
ford, 1966; Raskin etal., 1969; Yamazaki et al., 
1969) who have, partially on the basis of such 
differences, attempted to relate response po- 
larity to orienting behavior. Such an interpre- 
tation, of course, would not be consistent with 
a model of SPR activity emphasizing indivi- 
dual as well as situational determinants of 
responsiveness. 

In summary, on the basis of the current 
research, several conclusions may be advanced 
tentatively regarding SPR conditioning which 
also relate to procedural considerations. First, 
since Ss exhibit a predominant response 
polarity and have a tendency to display con- 
ditioned effects as well as orienting behavior 
primarily in terms of that polarity, a differ- 
ential paradigm which provides intrasubject 
control for sensitization appears to be required. 
Second, because of the differential habituation 
rates of the positive and negative components, 
an adaptation session prior to conditioning may 
complicate matters and probably should be 
avoided. Finally, comparisons between the 
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present study and previous work suggest that 
frequency measures may provide a better index 
of conditioning since such measures presum- 
ably would be less distorted by rapid reversals 
in response polarity than would amplitude 
measures. Of course, further research is re- 
quired to provide more definitive information 
concerning these issues, 
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LEVELS OF PROCESSING IN WORD RECOGNITION 


AND SUBSEQUENT FREE RECALL?! 


JOHN M. GARDINER * 
Birkbeck College, University of London, London, England 


The present experiment explored a situation in which Ss were unexpectedly 
required to recall the target words from a perceptual decision-making task. 
The targets were defined with respect either to their phonemic or semantic 
attributes, and Ss held these attributes in "working memory" for varying 
time intervals prior to target presentation. Semantically defined targets were 
better recalled subsequently than were phonemically defined targets, although 
the latter gave rise to longer decision latencies in the initial task. Also, sub- 
sequent target recall was not affected by the length of time the target-defining 
attributes had been held in working memory. These results were discussed 
within the context of Craik and Lockhart's “levels-of-processing” approach. 


Craik and Lockhart (1972) described a 
framework for memory research in which 
the memory trace is viewed essentially as 
the by-product of perceptual analyses. 
Central to their argument is the notion 
that the stability of the memory trace is 
a positive function of the type and depth 
of processing involved in the encoding of 
perceptual events. Greater depth of proc- 
essing is defined in terms of the degree 
of cognitive involvement in carrying out 
stimulus analyses. Memory is assumed to 
be tied to a continuum of levels of proc- 
essing which range, for example, from 
sensory analyses to the activation of asso- 
ciative semantic attributes. 

As Craik and Lockhart (1972) sug- 
gested, this formulation implies that re- 
search should be directed toward deter- 
mining the memorial consequences of 
various types of perceptual operations. 
Craik (in press, Experiments IV and V), 
for example, reported 2 studies in which 
S was given an initial perceptual decision- 
making task followed by an unexpected 
memory test. The purpose of the initial 
decision-making task was to lead S to 

1This research was supported by a Science 
Research Council Postgraduate Award. The author 
is grateful to Fergus I. M. Craik and Peter Herriot 
for their helpful comments on an earlier draft of 
this article. 

* Requests for reprints should be sent to John 
M. Gardiner, who is now at the Department of 
Social Science and Humanities, The City Uni- 
versity, St. John Street, London ECIV 4PB, 
England. 


encode different words at different levels 
of analysis. For each of a series of words 
presented briefly in a tachistoscope, S was 
asked a question such as (a) Is there a 
word present? (b) Does the word rhyme 
with ___? (c) Is the word a member of 
the following category? or (d) Does the 
word fit into the following sentence? In 
general, Craik's results showed that 
"deeper" decisions about words gave rise 
both to longer decision latencies in the 
initial task, and to better memory per- 
formance subsequently. 

These results are in good agreement 
with the levels-of-processing view. How- 
ever, it could be argued that the differences 
obtained in memory performance were due 
to the corresponding differences in initial 
processing time, rather than due to in- 
creasing "depth" of analysis. Although 
words processed at "sentence level" were 
better remembered than words processed 
at "rhyme level," for example, S also took 
longer in making sentence level decisions. 

The general aim of the present experi- 
ment was to obtain further evidence in 
support of the levels-of-processing ap- 
proach. Again, the strategy was to 
examine the effects of depth of analysis 
in a perceptual decision-making task upon 
subsequent memory for "target" words. 
The present study was also designed to 
provide more information on the effects 
of initial processing time in such a situa- 
tion. In this context, it is important to 
note that processing time refers, poten- 
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tially, both to the time S takes to process 
the target word as such and to the time 
S spends in processing those attributes 
that serve to define the target. The present 
paradigm permits the effects of these 2 
processing times to be evaluated separately. 

In essence, Ss were presented with a 
series of word sets, each containing a unique 
target word. Prior to the presentation of 
each word set, S was given the "defining 
attributes" of the target word. For half 
of the Ss, the target words were defined 
by their containing either 1 or 2 phonemes. 
For the remaining Ss, the targets were 
defined as members of either a single 
category or a nested category. The target 
word POLAND, for example, might be de- 
fined as containing the phonemes /l/ or 
the phonemes /l/ and /n/, on the one 
hand, or as a country or a European 
country, on the other hand. Thus, one 
group of Ss was required to identify targets 
at a "structural" level of analysis while 
the other group of Ss had to identify 
targets at a deeper, semantic level. The 
S's decision latencies in response to targets 
provided a measure of target-word proc- 
essing time. In order to examine the 
effects of processing time for target-defin- 
ing attributes, the number of nontarget 
words preceding the target in each set 
varied across trials. "Thus, the length of 
time the target-defining attributes were 
held in "working memory" was indexed 
by the position of the target in any given 
word set. Following this perceptual deci- 
sion-making task, Ss were unexpectedly 
asked to write down all the target words 
they could remember. 

The experiment had, therefore, the fol- 
lowing 2 main objectives: (a) to compare 
recall performance following either pho- 
nemic or semantic processing of target 
words, and (b) to examine the relationship 
between initial processing times and sub- 
sequent recall. 


METHOD 


Subjects. The Ss were 32 volunteers, mostly 
undergraduate students, who were either paid for 
their services or were fulfilling a course require- 
ment, The Ss were tested individually. 
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Design. The experiment was conducte ng 
a mixed design. Level of processing (pho: 
semantic) was the betweer variable, cal 
for convenience "attribute type." Sixteen 
assigned at random to each between-: 
Within Ss, the design comprised a factori m- 
bination involving 2 levels of "attribute n: or 


and 4 levels of "attribute lag." Attribute ber 
refers to target-word definition by either 2 
phonemes, on the one hand, or by a or 
nested category name, on the other hand, ^: ute 
lag refers to the position of the target wi ch 
word set. 

The Ss were presented with 38 word 


target-recognition session. Prior to th 

tion of the first word in each set, th: 

that set was defined either as a member o! 
or nested category for Ss in the semanti 
tion, or by the word containing either 1 or 
phonemes for in the phonemic conditio 
target word was always the final word in 
On any given trial, it was preceded by 1 
target words. The critical positions of the 
for scoring purposes were 3, 4, 5, and 6. 1 
2 and 7 were included to reduce target prec 
ity by position; 6 of the 38 trials were of t! 
The remaining 32 target words appeared 


often at each of the 4 critical positions an: were 
defined equally often by either 1 or 2 attr: utes 
in each between-Ss condition, For presentation 
purposes, the total number of word sets was divided 
into 4 blocks which were counterbalanced with 
regard to presentation order and the level of at- 
tribute number. In effect, all Ss received the same 


target words, and no target word was differoa tally 
favored with respect to order of presentation or ex 
perimental condition. 

Approximately 2 min. after the final trial in the 
recognition ions, Ss were unexpectedly asked 
to write down, in any order, all of the target words 
they could remember. 

Materials and procedure. Of the 32 critical target 
words, 27 were selected from the Battig and Mon- 
tague (1969) category norms and the remaining 5 
were chosen by Æ. Those drawn from the pub- 
lished norms had a mean frequency of occurrence 
of 16%. This relatively low value was chosen in 
order to minimize S's chances of guessing the 
target word when given category names as the 
target-defining attributes. The frequency count 
for nontarget words ranged between 5-20/million 
(Thorndike & Lorge, 1944). All words, both 
targets and nontargets, were 2 syllables in length. 

Two sets of presentation lists were constructed 
for each of the 4 levels of attribute lag. The dif- 
ference between the 2 sets lay in the nontarget 
words in each; specifically, where target words 
were defined by 2 attributes, the preceding non- 
target words always included 1-3 words that con- 
tained one or the other of the 2 defining phonemes 
and belonged to the single but not to the nested 
category. This procedure was designed to ensure 
that Ss had to utilize both given attributes when 
targets were so defined, 
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The word lists were typed in uppercase letters 
for presentation via an IBM electric typewriter 
and a closed-circuit television em. A Birkbeck 
Laboratory Timer and Signal Source was used to 
both trigger the presentation of each word and 
start, simultaneously, a Venner millisecond 
stopclock. The Birkbeck Laboratory Timer and 
Signa! Source was operated manually by Z, con- 


also to 


tingen: upon S's having responded to the word cur- 
renti esented. In practice, words within a set 
appe ! at about a 2-sec. rate. 

i s vocal response, amplified from a throat 
micro; one, stopped the Venner timer. Each S$ 
was tructed to respond yes or no to cach word 


as rapidly but as accurately he could. In ad- 
dition were instructed that after each yes response 
the cre to report the target word aloud. The 

on latency measure, however, was of course 
from the onset of the target word to S's yes response. 

Fhe Ss were initially familiarized with the ap- 
parat and procedure, They were also fully 
instructed as to the nature of the stimulus ma- 
Prior to the experimental trials, ea 
received 6 practice trials in which the target word 
appeared once at each possible target position, 
Each trial began with Æ reading aloud the target- 

fin \ttributes for the word set, Consider, for 
[ the case where the target word was 
ronLAND. If the target was defined by one attribute, 
E said either "the target contains the sound /1/," 
or “the target is a country," for Ss in the phonemic 
ind semantie conditions, respectively. Where the 

rf is defined by 2 attributes, Æ said either 
“the et contains the sounds /1/ 
ret is a European country.” 
wget words and their corresponding 1- or 
-a definitions were: LEOPARD: /p/, or an 
animal; /p/ and /d/, or a wild animal; DUBLIN: 
/b/, or a city; /b/ and /n/, or a capital city ; RIC 
/k/, or a disease; /k/ and /t/, or a bone dise: 
TEMPLE: /m/, or a building; /m/ and /p/, or a reli- 
gious building, Following target definition, each 
word in the set was presented, one word at a time, 
Examples of 2 presentation sets were: FATHOM, 
METEOR, ATTIC, POLAND, and FATHOM, JAPAN, 
C, POLAND. ‘The latter example includes one 
distractor which contains 1 of the 2 target-defining 
phonemes and belongs to the main but not to the 
nested category; sets including one or more such 
distractors were presented when the target was 
defined by 2 attributes, Each trial terminated 
after 5 had correctly identified and named the 
target; the next trial began immediately, 

The 5s were told that the point of the experi- 
ment was to investigate the effects of defining 
target words by 1 or 2 "values along a dimension” 
on the speed with which they could identify the 
target. No S reported subsequently that he had 
anticipated having to recall the target words. 

Unknown to Ss, only decision latency in response 
to target words was recorded. Errors made by 5 
in the target-recognition session wére noted and 
“corrected” orally by Æ at the time of their 
occurrence. 


RESULTS 


For both the decision latency data and 
the recall data, Target Positions 3, 4, 5, 
and 6 (attribute lag) were collapsed into 
2 levels of 3 plus 4 (short lag) and 5 plus 6 
(long lag) in order to obtain more reliable 
estimates. The E errors and apparatus 
failures accounted for less than 297 of all 
responses in the target-recognition session; 
S errors accounted for 5% of responses in 
the semantic condition and 6% in the 
phonemic condition. Error data were ex- 
cluded from all analyses. The results are 
shown in Figure 1. 

Consideration is given first to the latency 
data. The basic datum was median re- 
action time for correct responses from the 
8 trials each S received in the 2 X 2 X 2 
combination representing attribute type 
(phonemic or semantic), attribute number 
(1 or 2), and attribute lag (short or long). 
The mean median scores are shown in the 
left-hand panel of the figure; each point 
is based on approximately 120 observa- 
tions. These data were subjected to an 
analysis of variance. The main effects due 
to attribute type, / (1, 30) = 14.9, at- 
tribute number, / (1, 30) = 43.5, and at- 
tribute lag, F (1, 30) = 8.8, were all sig- 
nificant at or beyond p < .01. The only 
significant interactions were the following: 
Attribute Type X Number, £ (t, 30) = 
12.3, p < .01, and Attribute Type X Lag, 
F (1, 30) = 4.4, p < .05. 

No detailed interpretation of these re- 
sults will be attempted. From the point 
of view of the subsequent argument, the 
most relevant feature in these data is that 
Ss took considerably longer to process 
target words at the phonemic level than 
they did when targets were defined seman- 
tically. This result might seem surprising 
in view of Shulman's (1970) finding that, 
in a probe-recognition task, homonyms 
were recognized more rapidly than syn- 
onyms. However, it should be noted that 
whereas homonyms can be identified on a 
“holistic” basis, target words in the present 
paradigm were identified on the basis of 
individual, constituent phonemes. 

The right-hand panel in Figure 1 shows 
the percentage of target words which Ss 
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recalled in the final test, with the lag 
variable again collapsed into short and 
long lag. In order to obtain “workable” 
numbers for the purposes of analysis, the 
recall data supplied by each consecutively 
tested pair of Ss were combined to form 
8 macro-Ss in each between-Ss condition. 
An analysis of variance carried out on the 
macro-S recall scores confirmed that the 
main effect due to attribute type was 
highly significant, F (1, 14) = 73.2, p 
< .001. The main effect due to attribute 
number was also significant, F (1, 14) 
= 7.2, p € .05, but the main effect due to 
attribute lag was not (F < 1). The only 
significant interaction was Attribute Type 
X Number, F (1, 14) = 5.7, p < .05. 

Not surprisingly, these results confirm 
that target words were recalled much 
better following semantic processing, as 
compared with phonemic processing, in the 
initial session. Recall of targets following 
phonemic processing is virtually at floor 
level. The results also indicate that the 
number of phonemes involved in proc- 
essing the target had no effect upon recall 
performance, This finding contrasts with 
the comparable result in the semantic con- 
dition, where recall is better following 

. target definition by a nested category than 
by a single category name. 


Finally, it is apparent that the | zth 
of time the target-defining attributes ere 
held in working memory (short or ong 
attribute lag) did not affect subsequent 
recall of the target word. Neither the 
main effect due to attribute lag nor any 
interaction term involving lag approached 
significance, 


Discussion 


These results have several implications for 
the levels-of-processing approach discussed 
earlier. First, it is clear that processing target 
words at a semantic level led to good sub- 
sequent recall of the targets, whereas targets 
Processed at a phonemic level were very 
poorly recalled. Similar differences in memory 
performance following semantic as opposed to 
structural processing have been reported by 
Craik (1973), Hyde and Jenkins (1969), and 
Schulman (1970), Such findings are in good 
agreement with the levels-of-processing view. 

Although further analysis at a semantic 
level resulted in a corresponding improvement 
in recall (nested vs, single category names), 
additional analysis at a phonemic level did 
pot benefit recall. Thus, as Craik (1973) 
notnted out, the persistence of the memory 
trace should not be conceived to be simply 
a function of the number of analyses per- 
formed on the stimulus, but rather, a function 


of the degree of meaningfulness extracted 
from it, 
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The markedly poor recall of target words 
that had been processed at a phonemic level 
was associated with much longer decision 
latencies for these targets in the orienting 
task. Clearly, the pattern of results obtained 
in the recall test cannot be attributed to cor- 
responding differences in the processing time 
for target words. It should be noted that 
following target-word recognition, Ss were 
required to report the target word aloud. 
This procedure ensured that all target words 
were processed to at least a "nominal" level. 
The poor recall of phonemically defined targets 
must reflect the manner in which the words 
were processed, rather than the possibility 
that the target word as a whole was never 
perceived. 

It was also found that the length of time 
the target-defining attributes were held in 
working memory did not affect subsequent 
recall of the target words. This finding 
relates well to Craik and Lockhart's (1972) 
suggestion that maintaining information at 
a "fixed" level of processing does not neces- 
sarily lead to any improvement in the memory 
trace. More generally, the present data de- 
monstrate that duration at encoding is not 
nearly so crucial for registration in memory 
as are the levels of analysis involved. No 
evidence was found to support the notion 
that processing time, whether for target words 
or for target-defining attributes, was a critical 
factor for subsequent recall performance. 
The results imply that processing time may 
only be important insofar as it permits further 
or deeper levels of analysis to be carried out. 

In summary, the present study explored 
a situation in which .$ was unexpectedly 
required to recall the target words he had 
identified in a perceptual decision-making 
task. The main findings were, first, recall of 
the target words was better following semantic 
as opposed to phonemic analysis of the target. 
Second, whereas further analyses at a semantic 
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level led to a further improvement in recall, 
additional analyses at a phonemic level did 
not benefit recall. Third, neither processing 
time for target words, nor the length of time 
for which the target-defining attributes were 
held in working memory, appeared to be 
critical factors in determining subsequent 
recall performance. These results provide 
further support for the levels-of-processing 
view (Craik & Lockhart, 1972). It was con- 
cluded that the memorial consequences of the 
operations carried out in the perceptual deci- 
sion-making task were attributable solely to 
the level of analysis these operations involved. 
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VISUAL AND AUDITORY CODING IN A MEMORY 
MATCHING TASK? 
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"Two experiments were conducted to investigate modality-specific 
(auditory and visual) and recoding in a same-different reaction time task. 
On each trial 2 letters of the alphabet were presented sequentially in the same 
or in different modalities with an interstimulus interval of 150 msec 


'The coding dimension wa 
trasting sets of letter pairs, 
milar or vice versa. 
stimulus, ind: 
of the second stimulus regardle 


interstimulus 
to the modality 


or 3 sec. 

inferred from differences in reaction t to con- 

which were either aurally similar isually 

milarity interacted with modality oí second 

ating that the stimuli were matched on the modalit nension 

of the modality of the first stimulus or the 

nterval. The results suggest that the first stimulus v "coded 

of the second for matching when the two w presented in 

different modalities and that visual coding was as effective as audit coding 


in the task. 


Although early research seemed to in- 
dicate that coding in short-term memory 
(STM) was auditory or acoustic in nature 
(e.g., Conrad, 1964; Wickelgren, 1965), 
later studies (e.g., Parks, Kroll, Salzberg, 
& Parkinson, 1972; Shulman, 1970; Tver- 
sky, 1969) have demonstrated considerable 
flexibility in coding processes and strate- 
gies. A task that has been particularly 
useful in studying coding processes is one 
in which Ss are presented with 2 stimuli 
simultaneously or sequentially and are in- 
structed to indicate as quickly as possible 
whether the stimuli are the same or dif- 
ferent. Reaction time (RT) is the de- 
pendent measure of performance. For 
example, it has been found by several 
investigators (Beller, 1971; Posner, Boies, 
Eichelman, & "Taylor, 1969; Posner & 
Mitchell, 1967) in a variety of conditions 
that Ss are able to respond faster to pairs 
of letters that are physically the same 
(e.g., AA) than to pairs that only have 
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coding 


the same name (e.g., Aa) Ihe results 
seem to indicate that physical matches 
are made on the basis of visually coded 
information, but that name matches are = 
made on the basis coded in- 
formation. That is, before a name match. 
is made, Ss apparently recode the visual 
stimuli to their auditory equivalents, which 
leads to the longer RTs. More direct 
evidence that name matches are based on 
auditory information has been provided 
by Dainoff (1970) and Dainoff and Haber 
(1970), who found that name matches are 
affected by auditory similarity so that 
RTs were longer for similar pairs than for 
dissimilar pairs. 

Tversky (1969) used a sume-different 
RT task to demonstrate that the same 
information can be coded differently 
(either visually or verbally) depending on 
how S expects to use it. The stimuli con- 
sisted of line drawings of faces, each of 
which was associated with a name. Both 
types of stimuli varied on 3 binary dimen- 
sions, and the names and faces were paired - 
so that similarity inversely covaried across 
the 2 types of stimuli. That is, faces dif- 
fering by only one dimension were given 
names differing by 3 dimensions and vice 
versa, so that similar faces had dissimilar - 
names and dissimilar faces had similar 
names, On each trial 2 stimuli were 
shown sequentially, with an interstimulus 


of aurally 
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interval of 1 sec., and Ss were asked to 
decide as quickly as possible whether or 
not the 2 stimuli had the same name. 
In each condition the first stimulus (Si) 
was always either a name or a face, and 
the second stimulus (S;) was randomly 
varied so that one type was shown on 
79% of the trials and the other type was 
shown on the remaining 21% of the trials. 
Tversky reasoned that if Ss could use 
either visual or verbal coding with equal 
ease and could readily recode stimuli from 
one mode to the other, then they should 
tend to code S, in the mode in which S. 
was expected (most frequently presented) 
regardless of the mode in which S, was 
presented. Such a strategy would result 
in longer RTs for those trials on which 
S. was presented in the unexpected mo- 
dality because one of the stimuli would 
have to be recoded to the mode of the 
other before a match could be made. 
The predicted results were in fact ob- 
tained. Furthermore, analysis of the sim- 
ilarity effects showed that RT varied 
reliably as a function of the similarity of 
the S; mode, which indicated that the 
stimuli were matched on the mode 
independently of the S, mode. 

Using a design similar to that employed 
by Tversky (1969), the present investiga- 
tion was concerned with a comparison of 
visual and auditory coding and recoding 
in STM using letters of the alphabet as 
stimuli. The first experiment was per- 
formed to demonstrate that stimulus sim- 
ilarity manipulations could be used to 
infer the coding dimension, and the second 
experiment was an attempt to compare 
the relative effectiveness of visual and 
auditory coding and recoding as a func- 
tion of the interval between stimuli to be 
matched by S. In order to determine the 
dimension on which stimuli were coded, 
the assumption was made that if RT in 
a matching task differed as a function of 
similarity along a specified dimension, 
then the stimuli were coded and matched 
on that dimension. Since the auditory 
and visual dimensions were the concern of 
the present investigation, letter pairs were 
constructed which were either high in 


auditory similarity and low in visual simi- 
larity (HALV) such as ZC, or low in 
auditory similarity and high in visual 
similarity (LAHV) such as OC, The ex- 
pectation was that when stimuli were 
matched on the auditory dimension, HALV 
pairs would yield longer RTs than con- 
trasting LAHV pairs. However, the op. 
posite relationship would obtain between 
the 2 types of pairs when matches were 
made on the visual dimension. 


EXPERIMENT I 


Since differences in RT as a function of 
similarity were crucial in making inferences 
about stimulus coding, the first experiment 
was designed to demonstrate that the 
similarity manipulations described earlier 
(sets of contrasting HALV and LAHV 
stimulus pairs) would yield reliable dif- 
ferences. First, similarity effects are nec- 
essary in order to establish that coding 
and matching are occurring on the auditory 
or visual dimensions. Second, if the coding 
dimension in any given set of conditions 
is to be inferred from a particular pattern 
of similarity effects, it is necessary to show 
that a different pattern of similarity effects 
will obtain with the same sets of stimulus 
pairs and the same Ss under different 
values of another independent variable, 
As discussed below, the independent vari- 
able chosen for this purpose was presenta- 
tion modality. 

Previous research with matching tasks 
has demonstrated that visual similarity 
affects RT with visual presentation of both 
stimuli of a pair (Clement & Carpenter, 
1970) and auditory similarity affects RT 
with auditory presentation of both stimuli 
(McInish & Tikofsky, 1969). If presenta- 
tion modality is a critical factor in coding 
for a matching task, then presentation of 
the same sets of contrasting HALV and 
LAHV letter pairs should reflect auditory 
coding with auditory presentation and 
visual coding with visual presentation. 
That is, with auditory presentation of both 
stimuli of each pair, HALV pairs should 
produce a longer mean RT than LAHV 
pairs, but with visual presentation, HALV 
pairs should produce a shorter mean RT 
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m LAHV pairs, Furthermore, the simi- 
arity effects should hold for the same Ss 
Serving in both modality conditions. 


Method 


B The Ss were 10 female students from 
OR ocn Psychology courses at the Uni- 
ind rn owa. They were tested individually 
>- min. Sessions on 2 successive days. 
Design. Similarity (HALV and LAHV) 
eee nation modality (auditory and visual) were 
nipulated in a 2 X 2 factorial design with all Ss 
serving in all conditions, 
B zs As previously discussed, the predicted 
peut t research were dependent upon 
kastr Ciierencés in mean RT between con- 
or ENTIS of letters conforming to a HALV 
attempt ee Classification. Therefore an 
n as made to 
Which an 
both simil, 
similarity 


and 


tween 2 uppercase letters, and 
was defined in terms of com- 
Details 
an be found in Wood 
so that the 2 E of letter pairs were constructed 
Second letter EM im each set contained a common 
the appropria LE different first letters making up 
Sets of HALA * similarity classifications. The four 
7 V and LAHV pairs were ZC and OC, 


VA 
T B e VB and RB, and TP and RP. Both 
letter in orde ii Were terminated with a common 
among DX " equate for perceptual differences 
modality, ct AE letters asa function of frequency, 
pairs could a Pa that differences in RT between 
similari tmambiguously be attributed to the 
arity manipulations, 

Tt should be noted tha 
a different fi 
Predict the sı 
after being 


1n order to creat, * T 
for the different € greater uncertainty regarding Ss 


D ; Twelve of those pairs were con- 
ESSI combining each of the 6 different initial 
(Z, V, T, O, Q, R) with 
Been o RET G, B, i xe NES 
restrict; previously. e only 
CHAINES s each of the 4 terminal letters 
Pairs were peek the pairs, The other 12 different 
initial test- y pairing each of the 6 

ith the letters A and S as 


© test pairs 
rminal letters 
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Apparatus. The stimuli were presented visually 
on an Industrial Electronic Engineers Binaview 
display. For auditory presentation, the stimuli 
were tape-recorded and presented to S through 
earphones. The Binaview display was pr ogrammed 
bv means of a punched-paper-tape reader which, 
in turn, was controlled by a BRS Electronics. 
digital logic control system. The output of the 
tape recorder to be used for aural presentation 
was connected to the earphones through a set of 
relay contacts and to the BRS system for con- 
trolling the interstimulus interval (ISI) and in- 
tertrial interval for both auditory and visual 
presentation. A control signal from the paper- 
tape reader turned off the Binaview and closed 
the relay contacts to the earphones for aural 
presentation of a stimulus or turned on the Binaview 
and opened the relay contacts for visual presenta- 
tion. Duration of the visual stimuli was 360 msec., 
the mean duration of the auditory stimuli as mea- 
sured on an oscilloscope. The Ss responded to the 
stimuli by pushing one of 2 telegraph keys marked 
“same” or "different" and spaced 12 apart on 
a horizontal plane. The RTs (measured to the 
nearest millisecond) and the responses were re- 
corded and punched onto paper tape. 

Procedure. Each S received 5 blocks of tria 
each consisting of a random order of the 60 s i 
ulus pairs plus 6 practice pairs at th beginning 
of each block. Thesmembers of each ulus pair 
were separated by 1 sec. measured from onset, of 
the first stimulus (Sı) to onset of the second stim- 
ulus (S2) and the pairs were separated by 3 sec. 
The order of the modality conditions was counter- 
balanced such that 5 of the Ss received visual 
presentation in the first session and 5 received 
auditory presentation first. 

The S rested the index finger of each hand on 
one of the keys and was instructed to make a cor- 
rect response as quickly as possible on each trial, 
indicating whether S; and S; were the same or 
different letters. A 2-min. rest period was given 
between each block of trials. The same response 
was always assigned to the dominant hand since 
there was no interest in directly comparing same 
and different responses, 


Results 


The first block of trials, the 6 practice 
trials in the remaining 4 blocks, and all 
trials on which errors were made were 
excluded from the analysis. Error rates 
ranged from 2%-5% across conditions. 
Since each test pair was presented twice 
in each block, there was a maximum of 
8 correct trials per stimulus pair. A me- 
dian RT was computed for each pair and 
then the 4 medians for each similarity 
condition (HALV or LAHV) were aver- .. 
aged, which resulted in a mean RT for 
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each S for each of the 4 similarity and 
modality conditions. These scores were 
then submitted to a 2-way analysis of 
variance. 

The mean RT, SE, and percentage error 
for the 4 similarity and modality condi- 
tions are shown in Table 1. The hypothe- 
sis that similarity would have a different 
effect depending on the presentation mo- 
dality was confirmed by a significant 
Similarity X Modality interaction, F (1, 9) 
= 57.70, p < .001. Comparisons of simple 
main effects of similarity within each mo- 
dality condition indicated that with audi- 
tory presentation mean RT for HALV was 
significantly larger than that for LAHV 
(b < .001). However, with visual presen- 
tation the opposite relationship was ob- 
tained, so that the mean RT for HALV 
was significantly smaller than that for 


LAHV (p < .01). 


Discussion 


The finding that similarity did have a reli- 
able effect in both presentation conditions is 
evidence that the stimuli were coded and 
matched on modality-specific dimensions. Of 
prime importance is the fact that the RTs 
for the same contrasting HALV and LAHV 
pairs reliably varied as a function of presenta- 
tion modality for the same Ss in both mo- 
dality conditions. Since similarity was the 
only other variable which was systematically 
manipulated, it seems reasonable to conclude 
that the differences in RT found for auditory 
and visual presentation are attributable to 
the similarity manipulations. The results sug- 
gest that the procedure could be expected to 
produce reliable differences in RT among the 
stimuli from which modality-specific coding 
could be inferred in further investigation. 
Hence, the same stimuli were used in a second 
experiment to study some additional questions 
related to auditory and visual coding. 


EXPERIMENT Il 


As discussed previously, Tversky (1969) 
found that the same information was coded 
visually or verbally in a matching task 
depending upon the mode in which S ex- 
pected S; to be presented. Thus, it ap- 
peared that the information from S; was 
recoded from one mode to the other when 
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TABLE 1 
MEAN REACTION Times, STANDARD ERRORS, AND 
ERROR RATES (IN PERCENTAGES) AS A FUNCTION 
OF PRESENTATION MODALITY AND SIMILARITY 


Similarity condition 


Modality —— 


HALV LAHV 
Auditory 

$ 445 376 

20 21 

| 5 2 

| 406 435 

| 22 25 

2 


Note. Abbreviations: HALV = letter pairs high in auditory 
similarity i er pairs 
low in auditory similar; 


necessary and that the mode of S; had 
little effect on the matching task. How- 
ever, Tversky used an ISI of 1 sec., which 
is at the upper limit of the estimated 
duration of the auditory (Massaro, 1970) 
and visual (Sperling, 1960) sensory stores. 
It is possible that at much shorter inter- 
vals, the residual sensory information from 
S; might have an effect on the dimension 
in which the 2 stimuli would be coded for 
matching. Hence, if the 2 s muli were 
presented in different modalities, S might 
recode S» to the dimension in which S; 
was presented in order to match the two. 
To test this hypothesis in the present 
study, conditions were used in which both 
stimuli of a pair were either presented in 
the same or in different modalities (audi- 
tory or visual) at an ISI of approximately 
150 msec. For comparison, the same con- 
ditions were presented at a longer ISI of 
3 sec., at which sensory information should 
have decayed and Sı would have to be 
maintained for matching through some 
type of rehearsal. The similarity manipu- 
lations described in Experiment I were 
used to assess the coding dimensions. 

If sensory information can be utilized 
in matching, then RT should be faster 
at the short ISI than at the long ISI, 
where the 2 stimuli are presented in the 
same modality. However, in the condi- 
tions where the stimuli are presented in 
different modalities (mixed), the need to 
recode one of the stimuli to the modality 
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of the other for matching would lead to 
larger RTs at the short ISI than at the 
long ISI if the recoding takes longer than 
150 msec. Likewise, the need for recoding 
would produce slower matches at the short 
ISI for the mixed-modality conditions than 
for the same-modality conditions. 

An additional consideration regarding 
the ISI manipulation is related to the 
proposition that S, could be maintained 
for 3 sec. only by rehearsal. Such a need 
may cause S to use auditory coding even 
when both stimuli are always presented 
visually. Dainoff (1970) and Posner et al. 
(1969) did find a tendency toward auditory 
coding as ISI increased from 0-2 sec., al- 
though the mixture of "physical" and 
"name" match conditions may have in- 
fluenced Ss toward auditory coding, as 
indicated by Posner et al. (Experiment 
III). In the present study, a direct test 
of the relative effectiveness of auditory 
and visual coding was provided in the 
conditions where both letters of a pair 
were presented in the same modality at 
2 ISI values and Ss were instructed simply 
to indicate whether or not the 2 letters 
of a pair were the same. 


Method 


Subjects. The Ss were 64 female students from 
the introductory psychology courses at the Uni- 
versity of Iowa. They were tested individually 
in 2 50-min. sessions on successive days. 

Design. The design was a 2 X 2 X 2 X 2 fac- 
torial combination of 2 between-Ss variables and 2 
within-Ss variables. The between-Ss variables were 
modality of S, (auditory or visual) and modality 
of S, (auditory or visual). Sixteen Ss served in 
each of the 4 S, and Sy modality combinations. 
The within-Ss variables were similarity (HALV and 
LAHV) and ISI (short and long). The actual ISI 
values were chosen so that the short ISI (approxi- 
mately 150 msec.) would be well within the esti- 
mated duration of the short-term sensory stores 
but the long ISI (3 sec.) would be well beyond it. 

Materials. "The materials were the same as those 
used in Experiment I. 

Apparatus. The apparatus was the same as 
that used in Experiment I. 

Procedure. In each experimental session, S re- 
ceived 5 blocks of trials, each consisting of a dif- 
ferent random order of the 60 stimulus pairs plus 
6 practice pairs at the beginning of each block, 
The 5 blocks of trials were first recorded for the 
long ISI condition with an ISI of 3 sec. and an 


intertrial interval (ITI) of 4 sec. For the short 
condition, a copy of the original tape was cut 
spliced between the stimuli of each pair result 
in ISI values with a mean of 160 msec, anc 
standard deviation of 20 msec. as measured 
an oscilloscope. The 5 blocks of trials wit! 
each session were presented in a different or 
to each S, and the order of the 2 ISI conditi 
was counterbalanced so that half of the Ss recet 
the short ISI condition first and half received 
long ISI condition first. To aid .5 in discriminat 
between the ISI and the ITI, a light on the sti 
ulus panel was turned on during the ITI. 1 
Ss were randomly assigned to one of the 4 
and S» modality conditions according to their 
pearance at the laboratory, with the restrict 
that the total number of Ss in each group n 
differed by more than one. 

The S rested the index finger of each hand 
one of the keys and was instructed to attempt 
make a correct response as quickly as possibk 
each trial, thus indicating whether or not S; and 
were the same letter. The S was also inforn 
of the S, and S; modalities and that the modali 
would remain constant throughout the exp 
mental session. A 2-min. rest period was given ! 
tween each block of trials. The same response w 
always assigned to the dominant hand since the 
was no interest in directly comparing same a: 
different responses. 


Results 


As in Experiment I, the first block of 
trials, the 6 practice trials in the remaining 
4 blocks, and all error trials were excluded 
from the analysis. Error rates ranged 
1%-7% across conditions. For each S 
à median RT was computed for each test 
pair, and then the 4 medians corresponding 
to each similarity condition (HALV or 
LAHV) were used for computing means. 
These data were then submitted toa 4-way 
analysis of variance. 

The hypothesis that the modality of Si 
would affect coding at the short ISI, but 
not at the long ISI, predicts an inter- 
action of all 4 variables. However, only 
the modality of S, affected coding as re- 
flected by a reliable Similarity X S, Mo- 
dality interaction, F (1, 56) — 143.31, 
b < .001. In the conditions where S, was 
auditory, the overall mean RT for HALV 
was 55 msec. larger than that for LAHV 
(p < .001); however, for the conditions 
where S, was visual, mean RT for HALV 
was 20 msec. smaller than that for LAHV 
(P < .01). Thus in general, the effects of 
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TABI 


Mean Reaction Tim 


TANDARD ERRORS, AND ERROR RATES (IN PERCH 


AGES) AS A FUNCTION OF S} 


Mopa , Sa MODALITY, INTERSTIMULUS INTERVAL (ISI), AND SIMILARITY 
Auditory S: Visual S2 
| ——— — 
Si modality Short ISI Long ISI Short ISI Long ISI 
fl 3 = — = . ar 
HALV LAHV HALV LAHV HALV LAHV HALV LAHV 
Auditory 
x 454 411 522 460 369 402 431 438 
SE 25 24 30 29 15 15 17 17 
Error 3 1 6 1 1 1 1 2 
Visual 
x 426 370 457 395 446 465 47 490 
SE | 23 22 16 16 25 24 32 34 
Error 5 1 7 1 1 1 1 1 


Nole, Abbre 
in auditory simi 


IALV = letter pa 
id high in visual sim 


similarity varied with the modality of S» 
regardless of the modality of S; or ISI. 
Mean RTs, SEs, and percentage error for 
the similarity conditions as a function of 
the other 3 independent variables are 
shown in Table 2. 

One exception to the generalization that 
similarity effects varied as a function of S; 
modality occurred in the auditory Sy, 
visual S», long ISI condition, which led to 
a reliable Similarity X ISI X Sı Modality 
interaction, F (1, 28) = 8.33, p < .01, in 
a separate analysis of the visual S, con- 
ditions. As shown in Table 2, the direc- 
tion of the similarity difference was con- 
sistent with the other visual S» conditions 
but was only 7 msec., which failed to reach 
significance. One explanation for this 
result is that there were substantial simi- 
larity effects for each S, but the direction 
varied across Ss. Individual / tests on the 
raw RTs for each S, however, failed to 
lend any support to this hypothesi: In 
general, the individual differences were 
rather small compared to the other visual 
S, conditions. 

Earlier it was hypothesized that mixed- 
modality conditions might lead to slower 
matches than same-modality conditions at 
the short ISI, if the recoding of one stimu- 
lus to the modality of the other required 
more than 150 msec. To the contrary, 
however, a reliable Sı Modality X S» Mo- 
dality interaction, F (1, 56) = 6.03, p < 
.025, indicated that matches were faster 


s high in auditory similarity and low in visual similarity; LAHV = letter pairs low 


in the mixed-modality conditions than 
where S; and were presented in the 
same modality. The Ss responded 50 msec. 
faster in the visual-auditory conditions 
than in the auditory-auditory conditions 
and 58 msec. faster in the auditory-visual 
conditions than in the visual-visual con- 
ditions. Essentially the same pattern of 
results was also obtained from an analysis 
of same pairs. 

Consistent with the hypothesis that sen- 
sory information can be used in matching, 
RTs at the short ISI were 40 msec. faster 
on the average than at the long ISI, 
F (1, 56) = 18.29, p « .001, The result 
was obtained for both the mixed- and 
same-modality conditions. Analysis of the 
same pairs again yielded an identical pat- 
tern of results. 


Discussion 


In using differences in RT to contrasting 
sets of letter pairs as a basis for making 
conclusions about coding, the assumption was 
made that the stimuli would be coded and 
matched on a common modality-specific di- 
mension. The finding of reliable similarity 
effects in 7 of 8 conditions provides con- 
siderable support for that assumption. Since 
the similarity effects varied as a function of 
the S» modality in both the mixed- and same- 
modality conditions, it appears that the mo- 
dality of Sı had very little effect on the 
coding dimension, even at an ISI as short as 
150 msec. Rather, Ss still seemed to recode 
S, to the modality of S» when necessary, in 
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order to match the stimuli on the S; dimension. 
These results agree with those found by 
Tversky (1969), reviewed earlier, who used 
an ISI of 1 sec. 

The agreement between Tversky's (1969) 
results and those of the present study is in- 
teresting in view of the procedural differences 
between the two other than the length of 
the ISI. First, Tversky randomly varied the 
mode of S» within an experimental session, 
whereas in the present study it remained con- 
stant. Second, Tversky used a considerably 
longer S» duration (4 sec.) than that of the 
present study (360 msec.). As reviewed 
earlier, Tversky's Ss apparently matched the 
stimuli in the S; mode even when S» was 
presented in an unexpected (less frequent) 
mode, thus requiring a recoding of S, on 
those trials. It would seem that a recoding 
of the less frequent S» would be a more effi- 
cient strategy, but perhaps Tversky's use of 
the relatively long S duration had an over- 
riding influence toward matching in the Ss 
mode. That is, a 4-sec. duration of S» would 
be more than enough time to recode S; as 
necessary and then sample the incoming sen- 
sory information from Sə to complete the 
match. Thus, S would only be required to 
recode S; rather than to recode S» and re- 
tain Sı. Although a relatively short Ss dura- 
tion was used in the present study, matching 
in the S; dimension may have been due to 
keeping the S, modality constant during an 
experimental session. Thus the same results 
in the 2 studies may have occurred for dif- 
ferent reasons. 

One way in which a constant S» modality 
could have influenced the coding dimension 
in the present study is by providing S with 
à means to minimize the processing of Ss, 
This in turn would aid his following the in- 
structions to respond as quickly as possible. 
Since the task was simply to decide if S» was 
the same stimulus as that previously pre- 
sented, S could use information from Si to 
prime the perceptual mechanisms specific to 
a given modality for that same stimulus. 
In the Same-modality conditions, the result 
of processing S; would merely have to be 
retained and used as a basis for testing the 
output from the same stage of Processing Sz. 
The priming would be possible even when Si 
was presented in a different modality than S; 
but would require the additional processing 
necessary for recoding Sı. For example, an 
auditory Sı would require a search of long- 
term memory for its visual equivalent, which 


could then be transferred to STM and used 
to prime the visual perceptual mechanisms 1 
for that particularletter. Facilitatory priming 
effects of advanced information have been 
found by Beller (1971) in a slightly different 
matching task for both same- and mixed- 
modality conditions. 

Even though the modality of Sı seems to 
have had very little effect on the matching 


dimension, there is some indication that re- 
sidual sensory information from Sı may have 
facilitated matching in the same-modality 


conditions. It is obvious th 
conditions and the mixed-1 
sensory information from 
played much of a role in 
parison process. In the 
sensory information would 
the 3-sec. ISI and/or wou 
a basis for recoding the stimu 
modality dimension. Hoy the finding 
that matches were faster he short ISI 
than at the long ISI for the same-modality 
conditions is consistent with the notion that 
residual sensory information from S, could 
have been used to aid matching. 

One result which ran cou to expecta- 
tion was that matches in the :ed- modality: 
conditions were faster than those in the same- 
modality conditions. The pattern of simi- 
larity effects indicates that Sı was recoded 
to the modality of Sz for matching in the 
mixed-modality conditions. Since the recod- 
ing would seem to require additional pro- 


in the long ISI 
ility conditions 
ould not have 

actual com- 
conditions the 
e decayed over 
ive only been 

to a different 


cessing of S, the mixed-modality conditions 
were expected to be slower the same- - 
modality conditions rather ti an faster. Rew 


cent studies using memory match or memory. 
search tasks (e.g., Burrows, 1972; Swanson, 1 
Johnson, & Briggs, 1972) have found RT in 
mixed-modality conditions to be either slower 
or no different from Same-modality condi- - 
tons. A tentative explanation for matches 
faster in the mixed- than in the same-modal- ` 
ity conditions at the short ISI is that 150 
msec. may not have been enough time for S — 
to complete the necessary processing of Si 
when both stimuli were presented in the same 
modality, causing a delay in processing Sz. 
However, if it is assumed that the 2 modali- 
ües functioned as independent information 
channels that could be processed in parallel, - 
there would be no delay in processing S2 
When it was presented in a different modality 
than Was Si. Evidence from studies using 
dichotic presentation of auditory stimuli 
(Norman, 1969; Peterson & Kroener, 1964) 
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provides some support for preliminary par- 
allel processing of independent channels. 

A final result of interest is related to the 
relative effectiveness of auditory and visual 
coding in STM. As stated earlier, the same- 
modality conditions provide a direct com- 
parison of auditory and visual coding with 
all conditions equated except presentation 
modality. The finding that visual coding was 
used with visual presentation at both the 
long and short ISI seems to indicate that 
visual coding was as effective as auditory 
coding in the present study. These results 
are in agreement with those of other recent 
studies (Parks et al., 1972; Swanson et al., 
1972) showing that Ss can maintain visual 
information in STM beyond the duration of 
a sensory store, when there is a need to do so. 
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EVENT OBSERVATION IN PROBABILITY LEARNING REVISIT: > 1 
LOWELL M. SCHIPPER,! JACK A. THORPE, DAVID P. JONES, 
AND GARY W. HEIMAN 
Bowling Green State University 
A 1968 study by Reber and Millward investigated the possible differential 
effect of observing as compared with predicting sequences in subsequent prob- 
ability learning. The present study extends the range of probabilities and the 
numbers of observing trials. Spe lly, 0, 10, 35, and 60 observing trials 
and zs of .60, .70, .80, and .90 were combined orthogonally. Additionally, 
different random orders were used for each observing and each predicting 
sequence for all Ss. A rather firm bas made for pointing up the lack of 
difference between observing and predicting procedures in some important 
dimensions. 

Reber and Millward (1968) addressed might be required with higher value- £ r 
themselves to the same basic problem for .S to reach near-terminal lev of 
investigated here: What is the effect of performance than with lower value: T, 
observing sequences of events as compared this arrangement makes it possil to 
with predicting sequences of events on examine this differential effect as ; int | 
subsequent predictions in a traditional function of observation trials and z. " 
probability learning (PL) situation? Reber Additionally, our point of view is hat 7 
and Millward provide a more than ade- of considering PL—unfettered by con plex 
quate theoretical rationale for the problem payoff contingencies or by stimuli with 
and little more need be said here to justify affective or with painful complication. (as 
the research. examples)—as a relatively "pure" form of | 

Unfortunately, the Reber and Millward cognitive learning or information process- 4 
(1968) study is something less than defini- ing, different from other types of learning 
tive in providing an answer to the question such as conditioning or motor skills learn- 
as posed, for a number of reasons. Only ing. That is, there is little involved in the 
a single probability for the most frequently input (stimulus) or output (response) of g 


occurring event, Ai, was used, m = .80. 
Only a relatively gross evaluation of the 
effect of observation sequences as compared 
with prediction sequences could be made 
since either 0, 100, 500, or 2,000 observation 
trials (OTs) were provided for comparison. 
Only a very few (4) sequences were used 
in predicting or testing and another 4 in 
observing or training. Additionally, but of 
minor importance, the observation treat- 
ments omitted the signal light which was 
present during test trials. 

The purpose of the present study is to 
enlarge upon the Reber and Millward 
(1968) study using 4 different levels of + 
for Aj, .60, .70, .80, and .90, and 4 different 
amounts of OT, 0, 10, 35, and 60 trials. 
Since we had speculated that fewer OTs 


1 Requests for reprints should be sent to Lowell 
M. Schipper, Department of Psychology, Bowling 
Green State University, Bowling Green, Ohio 43403. 
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this learning situation that is of any 
consequence compared with the internal 
processing of the information conveyed by 
the simple occurrence of events. This 
point of view, obviously, sees PL as à 
rather unique or atypical form of learning 
requiring little or no perceptual learning, 
little or no response training, and little 
or no associating of special responses with 
novel stimuli. As a consequence, simple 
observations of events seem to provide all 
that. is necessary for learning—or, more 
precisely, for changing proportions of A: 
predictions as a function of trials—to take 
place. 


Mrruop 


Apparatus. Each S sat at a table positioned in 
front of a gray display panel on which were mounted 
2 blue event lights (A; and As) 15.23 cm. apart at 
approximately eye level and a single red signal light 
7.61 cm. above and centered between A, and As: 


dr 


i 


| 
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The response box on the table contained a center- 
off, double-throw lever switch, which was also 
centered between A; and A; Predictions of the 
occurrences of A; and As were made by pushing the 
lever either to the left or right. The switch returned 
to center position when released by .S. 

Modified BRS-Foringer equipment controlled all 
timing, occurrences of Ay, As, and the signal light, 
and recording of all responses. Recording circuitry 
accepted as valid only those responses made during 
the particular 2-sec. interval when the signal light 
on but before A; or Ay appeared. 

Design. Four levels of m for A, (.60, .70, .80, 
.90) were combined with 4 levels of OT (0, 10, 35, 60 
trials) ina 4 X 4 factorial arrangement with 12 Ss 
in each treatment. When appropriate, treatments 
will be identified as z/OT to indicate the particular 

»mbination of respective levels of each variable. 

F'hree Es each ran 4 replicates of the experiment 
in random orders. Of the 4 Ss for each E in each 
cell, 2 saw A, on the left and 2 on the right. 

Procedure, For the 12 treatments with other 
than 0 OTs, each S simply saw either 10, 35, or 60 
trials with either the left or right blue light coming 
on either 60%, 70%, 80%, or 90% of the time.* 
Of course, the complementary light appeared either 
10%, 30%, 20%, or 10% of the time. During the 
OTs the signal light appeared for 1 sec. and was 
followed by an event light for 1.5 sec., which was 
followed by an intertrial interval of 1 sec. Following 
OTs each S received 100 test trials (TT) with the 
same s level that had been observed. Signal-light 
on time was increased to 2 sec. Other times re- 
mained the same, The 4 treatments receiving 0 OTs 
saw only 100 TTs at the appropriate z level. 

All Ss were run individually, one per session with 
E present, Essential parts of the instructions to Ss 
were as follows. 


With observation trials: 


Your job is to keep track of about how often each 
blue light comes on. That's all. Just keep track 
of how often each one comes on. It might seem 
as though the lights follow a pattern or system, 
but they don’t. Just concern yourself with about 
how often each blue light comes on. 


After observation trials were shown: 


Now this is important. For this series of trials, 
the overall proportion of lefts and rights will be 
exactly the same as the short series you just saw, 
although it will not be in the same order as the 
earlier series. Again, there will be no pattern or 
system although sometimes it might seem like 
there is, Your job is to be correct as often as 
possible, Try to be correct as many times as 


? Since 70% and 90% of 35 are 24.50 and 31.50, 
respectively, half of the Ss in each of these treat- 
ments, half with each E, and half with left and right 
event lights received either 24 or 25 presentations 
of A; for .70 and either 31 or 32 presentations of A; 
for .90. 
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youcan. Use what you learned about the propor- 
tions of lefts and rights in helping you predict as 
best you can. 


Without observation trials (the principal portion 
of these instructions after the initial description of 
the procedure): 


Your job is to be correct as often as possible. Try 
to be correct as many times as you can. It might 
seem as though the lights follow a pattern or 
system, but they don't, Just concern yourself 
with getting as many correct predictions as you 
can. 


Every one of the 144 sets of OTs was a different 
random arrangement, and every one of the 192 sets 
of TTs was also a different random arrangement. 

Subjects. Female Ss enrolled in introductory 
psychology courses at Bowling Green State Uni- 
versity during the fall quarter of 1972 participated 
in this experiment as part of the psychology depart- 
ment's curricular requirements. Each S chose the 
experiment voluntarily and was permitted freedom 
to refuse to participate before and during her 
experimental session. None did. 

Only a single sex was used in this sample of Ss 
to prevent possible confounding of treatment. with 
sex differences (Hanson & Schipper, 1971). 


RESULTS AND DISCUSSION 


Figure 1 summarizes that portion of the 
data obtained during the 100 TTs for each 
S. The ordinate is the proportion of A, 
predictions; the successive abscissas are 
the first, second, and third thirds (33, 33, 
and 34 trials) of the 100 TT's; and the 
parameters for each subset of curves are 
the numbers of OTs. As expected, each 
subset fluctuates at or near the appropriate 
m level. The summary curves, one for each 
subset with its corresponding proportions 
of A; predictions attached, show no cross- 
overs. That is, the 3 points for .90 are all 
higher than the 3 points for .80, which are 
higher than .70, which are higher than .60. 
Regularity, vis-à-vis performance on TTs 
as a function of number of OTs, is not in 
evidence if one were to repeat the points of 
Figure 1 in subsets of 0 vs. 10 vs. 35 vs. 60 
OTs with 4 curves in each family, one for 
each value. Nor is there an orderly 
overall progression in response level as a 
function of increasing OTs from 0 through 
60. 

An analysis of variance of the data in 
Figure 1 using m and OTs as between-Ss 
variables and blocks of trials (1-33, 34-66, 


1.00) 


90 


80 


.60 


50 


1 2 3 1 2 3 
BLOCKS OF TEST TRIALS 


Proportion of A; predictions, P(A,), as functions of + (shown at the top of eac 


Figure 1, 


and numbers of observation trials (shown in the lower right). 


attached reflect mean values for each subset. A; = 


67-100) as a within-Ss variable shows (a) 
differences in proportions of A; predictions 
as a function of r to be statistically reliable, 
F (3, 176) = 92.39, p < .001; (b) differ- 
ences as a function of OTs to be reliable, 
F (3, 176) = 3.02, p < .05; and (c) blocks 
of trials, F (2, 352) = 11,54, p < .001, and 
the interaction of blocks and z, F (6, 352) 
— 3.40, p < .005, both reliable. The result 
of the first of these statistical tests is again 
expected and uninteresting. The third 
pair of tests gives statistical support to the 
interpretation of an increasing trend to- 
wards a higher response rate as a function 
of blocks with the caution that this trend 
is not the same for all z levels. The second 
statistical test, while showing that different 
numbers of OTs did bring about differencet 
in proportions of A, predictions, does nos 
expose the unexpected finding that the 
overall proportions are arranged 35 OTs 
(713) < 0 OTs (.714) < 60 OTs (.750) 
< 10 OTs (.762) in increasing order, which 
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1 2 3 1 2 


ibset) 
(Curves with the values of the 


most frequently occurring event.) 


shows no systematic (and little overall) 
variation attributable to numerosity of 
OTs. 

Perhaps the most informative analysis of 
OTs vs. TTs is the use of staggered blocks 
of trials as originally suggested by Reber 
and Millward (1968). Two such analyses 
are presented here. 

At each of the levels of z, Trials 67-100 
for the 0 OT treatments, Trials 57-90 for 
the 10 OT treatments, Trials 32-65 for the 
35 OT treatments, and Trials 7—40 for the 
60 OT treatments were compared. This 
procedure essentially uses the data for the 
last block of 34 TTs in the 0 OT treatments 
as a referent and makes comparisons 
among (a) 66 TTs, (b) 10 OTs plus 56 TT's: 
(c) 35 OTs plus 31 TTs, and (d) 60 OTs 
plus 6 TTs (see top half of Figure 2). If 
observing and predicting have the same 
effect, then response curves for these 
different blocks for all levels of x should be 


dinate 


| 
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TRIALS EXAMINED. 
IN ANALYSIS 


0 vs 10 OT'S 


0 vs 35OT'S 


0 vs 600T'S 


0 20 40 60 


80 100 120 140 160 


TOTAL OT'$ AND TT'S VIEWED 


Ficure 2. Test trials used for comparisons in first and sec 


ond staggered 


analyses. (OT = observation trial; TT = test trial.) 


horizontal. A summary of the results is 
shown in Figure 3. 


dinate in Figure 3 is again propor- 
predictions and the abscissa is 


r of observation tric These 
re rather flat for 7 80 and 
ut rather nonflat for m = .60 and 


Treating these data as entries in a simple 
4 X 4 factorial arrangement (with » = 12 
in each cell), the differences in the 4 levels 
of the curves as a function of ~ yield 
F (3, 176) = 20, p < .001, and once 
again a predictable and expected finding. 
Differences between numbers of OTs yield 
F (3, 176) 3.48, p < .05, which indicates 


35 60 


OBSERVATION TRIALS 


FIGURE 


Staggered analysis showing comparisons among various combina- 


tions of observation and prediction trials. (The dashed curve is fitted to the 4 


means at each level of observation trial. 
A, = most frequently occurring event.) 


P(A,) = proportion of A; predictions; 
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TABLE 1 
Proportion OF A; RESPONSES AS A FUNCTION OF 
PRECEDING OBSERVATION AND TEST 
TRIALS AND AS A FUNCTION OF + 


10 35 60 

i ol Ob 

vation Tet | adem | Test | vation | Test 
.90 | .853 | .910 | .900 | .913 | .850 | .885 
:80 | .788 | .738 | .724 | .825 | .818 | .838 
.70 | .662 | .681 | .630 | .678 | .584 | .666 
160 | .575 | .562 | .560 | .542 | .673 | .571 
M |.T19 | 523 | .703 | .739 | .731 | .740 


Nole. Entries for the Observation cells at the respective x 
values are mean proportion of A; (most frequently occurring 
event) responses for Test Trials 1-33. Entries for the cor- 
responding Test cells are mean proportion of Ai responses for 
Test Trials 11-43 when 10 observation trials were used for 
comparison; for Test Trials 36-68 when 35 observation trials 
were used for comparison; for Test Trials 61-93 when 60 
observation trials were used for comparison. For example, 
the entry in the ,90/10 Observation cell is the mean for Ss who 
had received 10 observation trials prior to the measured 33 
test trials. The entry in the .90/10 Test cell is the mean for Sa 
who had received 10 test trials prior to the 33 measured test 
trials. Each entry for the Observation cells is based on 12 
different Ss. Each entry for Test cells .90/10, .90/35, and 
90/60 is based on only those 12 Ss who received the (90/0 
treatment. Similarly, the .80/10, .80/35, and .80/60 Test’ cells 
are based on the 12 Ss who made up the .80/0 treatment, and 
80 forth, 
an overall deviation from flatness when 
data are averaged over z levels and a 
single point is plotted for each of the OTs. 
The interaction of m and OTs, F (9, 176) 

= 2.05, p < .05, is also reliable and repre- 
sents, of course, the lack of parallelness of 
the 4 curves shown in Figure 3. 

A second staggered analysis, somewhat 
more complicated than the first, is illus- 
trated in the bottom half of Figure 2, 
Essentially what this analysis does is 
provide a look at TT performance shifted 
farther and farther "down the line" in the 
TT sequence as a function of the number of 
OTs substituted for TTs. Table 1 summar- 
izes this analysis, and a complete descrip- 
tion of the nature of each entry is given 
with the table. 

To be noted in Table 1 is the similarity 
between the means for 10 OT and 10 TT 
treatments, 719 and .723; for 35 OT and 
35 TT, .703 and .739; and for 60 OT and 
60 TT, .731 and -740, with 95% confidence 
intervals for the differences between each 
of these 3 pairs of means equal to 54, 
£.048, and -+.049, respectively. Differ- 
ences appear to be truly minute and suggest 
only a slight (if any) difference in a positive 
Eton toward the programmed mean z 
of .75. 
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Statistical tests of differences een 
pairs of columns in Table 1, 10 O . 10 
TT, 35 OT vs. 35 TT, and 60 OT vs ie i 
with the 4 values of z as a second \ ble, 
uniformly show statistically reliab! fer- 
ences for m (again, as would be ex; ted) 
and give F ratios of less than for 
10 OT vs. 10 TT and 60 OT vs. 60 ind 
F (1, 88) = 218, 10 € p< 25 35 
OT vs. 35 TT. All 3 of the int s of 
m and type of trials give F r: vith 
confidence levels exceeding .05. nly 
one of these, that for 60 trials, lose 
to .05. 

Still another analysis, that of tr: can 
be carried out to answer the ques n of 
whether or not the collective set of ves 
in Figure 2 is flat, linear with slop cher 
than zero, quadratic, or cubic in orm. 
According to a hypothesis of no dif «nce 
between OTs and TTs, flatness is ex; ted. 

Using a procedure outlined in -aito 
(1965) for unequally spaced values o. the 
abscissa, only the cubic form of the »oly- 
nomial provides an F ratio which “accounts 


for" a statistically significant portion of 
the variance attributable to numbers of 
OTs, F (1, 176) = 8.71, p < .005. This 
cubic expression reflects the 2 inflections 
which are apparent when plotting only the 
4 means as shown by the dashed curve 
in Figure 3. 
. A final look at what occurred during TTS 
is given in Table 2. This table shows the 
distributions of Ss along all dimensions of 
the study during the successive thirds as 
well as during the complete set of 100 TTs. 
Deserving of special attention are the 
distributions of variances for 0, 10, 35, and 
60 OT treatments at successive stages of 
the experiment. These OTs represent the 
principal variable in the experiment, and 
lack of homogeneity here could invalidate 
tests of significance among their means. 
Cochran's C statistic with k = 4, 47 df, 
and æ = .05 for each appropriate set of 4 
variance estimates shows that there is no. 
basis for rejecting a hypothesis of homoge- 
neous variability at any of the 3 stages of 
TTs, nor among the overall distributions. 
Overall, we feel there is little difference 
between types of trials in terms of subse- 
quent performance in the traditional PL 
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TABLE 2 


FREQUENCY DISTRIBUTIONS OF ALL SS 


0 10 35 60 
% Ay 
responses | 6o | 70 | go | 90 | All| 60 | 70 | 80 | 90 | All | 60 | 70 | 80 | 90 | am 60 | 70 | 80 | 90 | All 
Test trials 1-33" 
91-100 | 0} 1 O2 3| 1 1 5 6|13| 0| 0 1 8 X gend 13172, 5|11 
81-90 Pavan cls 4 6; 0) 0).0 2| 2|0 i34 8) 0) 1) 4 5) 10 
71-80 Beyer E 13-13: 25] 3 E Sl fs K E fae poe a Ole er: ho] o EU 
61-70 3 1/ 4/2/10] 0} 4) 3 1| 8} 2] 3/41] 0 95) 25-127 | d; 2|12 
51-60 0 3 2 1 6) 6. ]-9 1 0/10} 5 1 0; 0 6/41) 3 0 0 7 
41-50 4|4/|0]|0 8) 0) 1 010 BESUA 1] 0 B50 20. 0 07] 0 
31-40 1 0 1 0 2 :2:5].50:370; ON N 1 0; 0 4) 0] 1 0 0 1 
21-30 0; 0} 0} 0 0j 1 0| 0 Ti o Oo OOO: Ue 0) 5042010 0| 0 
11-20 0|0/|0/]|0 0|0/|0]| 0 010/[0]|0/|0] 0 0; 0); 0) 0 0; 0 
0-10 0|0/|0]|0 0| 0 0 0 0| 0; 0} O; 0} 0 0; 0; 0/0 0; 0 
Test trials 34-66" 
91-100 | 0| 0| 4 | 7 | 11| 1 OFS 7) 13) 0 1 2 | 6 9/0) 2) 4 9} 15 
81-90 0} 3} 3 | 4) 10) 1] 3| 3 4|11|0|2|6|4/|12/|,0|0]| 4 3:| T 
71-80 03 y a 1 EPT Tey 2. DX TT 5274 07^ 4| 2 | 2-| 3 QUE 
61-70 5541-9: | eb 0-140] SS ICA A 112121 2 |o O | 8|4|6/]|1 0|1 
51-60 3 2 0,0 el beet 1 0 0| 4] 4 1 3| 0 8] 2 0,0 0 2 
11-50. 3 2 0,0 ent 0] 0 0 14) "a! I 010 2| 1 2 0 0 3 
sie 0. 1 0} 04-0 S OF 1.0 0| 0} 2 0); 0/0 2^3 0,0 0| 3 
21-30 0| 0 0} 0 OUND, 0-130 0; Oo; 1 1 0| 0 2:201 P a A TE o 0| 0 
11-20 0/0/00 0, 0/0) 0 Di 0} 0}; 0;0);0 0|(0|0]|0 0| 0 
0-10 071-07] XX. 1^. 0. 0}; 0) 0) 0 0:1 D^ 4 0; 0] 0 1/0; 0) 0 0 0 
I 
trials 67—100° 
91-100 | 0 1 3-426422: 021 X 4|10|15|0| 0| 4| 6]|10|]0]|0 | 3 | 11 | 14 
81-90 0|2 1|4 7| 1 Hd ye PA 6.) 0. 13-14: 1-6 rts pt 2 | 4 V G i 
71-80 Dial «271.6 1/1} 3) 4) 4 1) 12) 4/3: }53' | 04] 10] 2:| 3) S iad 
61-70 33 1 0 d 13r 5. 1 0} 9) 2] 2 1 0 5) 4) 3-0 (tl er! 
51-60 3 1 0] 0 4|4,0 1 0| 5| 1 1|0)]0 2432:,| 3 0 0] 3 
41-50 2 3 Teal EU, Seah 0] 0 0 1.142 1 0,0 zr 2 0 0 4 
31-40 2 0|0]|0 2| 0 0/0 0 0|2 1 0] 0 ST 1 0 0 2 
21-30. 0-105 707]. 0 0; 0}; 0; 0 0| 0|1 0/0]; 0 T4505 50. 0 0| 0 
11-20 0|0/|0]|0 0; 0/010 0|'0|/0|0/|0]|0 0/0/0440 0 0 
0-10 0 0; 0} 0 0; 0/0) 0); 0} 0] 0 1 0; 0 1/0) 0 0 0| 0 
"Test trials 1-100" 
91-100 | 0| 0| 1| 2 3/0) 0 1 5] 610); 0] 0) 4 4/0/02 7| 9 
81-90 CAP On| 4a) 9-154 EE] Ss 6 5} 15} 0 | 2 7 TPAC LES x re 5113 
71-80 0|3]|4 1 8] 1 n 1429/-07|-2: 1-2 1 SI O5 0| 6 
61-70 Z6 3:507] 13:1 -:5 7 .5. 1 1]12|4|4/|2/|,0/10|5/]|6]| 0 0| 11 
51-60 6 I-550: 190 T| 4 1 0 0,5 12.1 3 1 0 6|3|2],0 9. [035 
41-50. eae SO SIND, 4|1]|.0]|0 0 PP See Oo) O20. SIZ NEZ MO 0j 4 
31-40 0. |.0.|:0'| 0 0|0/|0]0 0| 0| 1 0 0 0 1/0) 0 0 0) 0 
21-30 0|0/0/,0 0|0|0]|0 0| 0| 0 1 0] 0 1/0) 0 0 0 0 
11-20 0|0/|0]|0 0/,0/|0]0 0|0/0|0/|0]|0 0/0[|0]|0 eot 
0-10 0|0/,0]|0 0; 0; 0; 0 0| 0; 0); 0/0) 0 0; 0/0] 0 0| 0 


Nole. The entries in the respective 60, 70, 80, and 90 columns show the distributions of those 12 Ss giving A1 (most fre- 
quently occurring event) responses who had received 0, 10, 35, and 60 observation trials, The 4 All columns show the distributions 


of all 48 Ss with 0, 10, 35, and 60 observation tri 
Variances of the All distributions in the 0, 


228.48, respectively, 


ls. 
10, 35, and 60 observation trials conditions are ø? = 253.78, 338.89, 332.64, and 


b Variances of the All distributions in the 0, 10, 35, and 60 observation trials conditions are a? = 300.20, 199.83, 466.62, and 


335.38, respectively. 


© Variances of the All distributions in the 0, 10, 35, and 60 observation trials conditions are ø? = 320,79, 208.16, 444.40, and 


312.33, respectively. 


3l Variances of the All distributions in the 0, 10, 35, and 60 observation trials conditions are g? = 195.14, 166.49, 295.66, and 


241.49, respectively. 


120 SCHIPPER, THORPE, JONES, AND HEIMAN 
situation. We agree with Estes’ (1972)  Garro, J. Unequal intervals and unequa rend 
interpretation of the Reber and Millward analyses. Psychological Bulletin, | 63, 
(1968) experiment saying approximately 125-127. 
the same thing. Some differences may be — Hawsow, B. L., & Scuiprer, L. M. Num y in 
a function of the particular z levels used, probability learning and decision ma with 
ZA : robably observable multiple equivalent predictors. P mic 
with these differences probably É j Sdan, 1971; 22; 199-201. 
near m = .50 but not near extreme values. 
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RETRIEVAL OF SUPERORDI 


ELIZABETH F. LOFTUS? 
University of Washington 


NATES AND SUBORDINATES? 
MARTIN BOLTON 


New School for Social Research 


Semantic memory retrieval was investigated in 2 experiments. In the first 
experiment, Ss were shown a category name and asked to respond with a word 
belonging to the category (for one block of trials) or a class to which the cate- 
gory name belonged (for another block of trials). Subjects produced a category 
member faster than they produced a superordinate. For example, they pro- 
duced an instance of the category CAR faster than they produced a super- 
ordinate such as vehicle, The time taken to retrieve a superordinate was 
strongly related to the category's hierarchical position, while the time taken 


to retrieve an instance was not so related. 
duced free associates to categories differing in hierarchical level. 


ordinates were given for all but the 
against the notion that the superset is 
or concept. 


The present article deals with relations 
of class inclusion, relations which have 
been of interest at least since Aristotle's 
time. Knowledge of these inclusion rela- 
tions is knowledge that we have learned 
some time ago and know very well; it 
forms a part of our semantic memory. 
Whenever we make a subordinate or 
superordinate response, as when we assert 
that (a) an example of a dog is a collie, 
or (b) the class to which collies belong is 
“dogs,” we manifest our knowledge of 
inclusion relations. 

Class inclusion relations have become 
central in many current theories of memory. 
In the network model of Rumelhart, 
Lindsay, and Norman (1972), for example, 
a statement such as "A car is a vehicle" 
is represented as 2 nodes, one corresponding 
to "car" and the other to "vehicle," and a 
relation between them, the “isa” relation. 
One question that arises is what directional 
effects in information retrieval exist as a 
consequence of these inclusion relations? 
For example, given the stimulus word 
CAR, is it easier to retrieve a superordinate 
response, such as vehicle, object, or thing, 
or it is easier to retrieve an instance, such 
as Ford, Buick, or Ferrari? 

Miller (1969) claims that "a superordi- 


1 The authors are grateful to Sif Wiksten for her 
assistance in data analysis. 

? Requests for reprints should be sent to Elizabeth 
F. Loftus, Department of Psychology, University 
of Washington, Seattle, Washington 98195. 


In the second experiment, Ss pro- 
More sub- 
lowest level categories, The data argue 
the most accessible property of a category 


nate response will be much stronger (more 
frequent and probably faster) than a 
subordinate response [p. 231]." He points 
to results from word association tests to 
support h im. For example, the results 
of a reclassification of the data of Kent and 
Rosanoff (1910) indicated that 7.6% of all 
the responses given were superordinate and 
only 1.6% were subordinate (Woodrow & 
Lowell, 1916). From this evidence, Miller 
concluded that the subordinate-superordi- 
nate direction of association is stronger 
than the superordinate-subordinate direc- 
tion. Collins and Quillian (1972) seem to 
be in agreement with Miller, as they say 
"in many cases the superset is the most 
accessible property of a concept. . .. In 
contrast, the instances of a concept are not 
easily accessible properties in general 
[p. 3207.” 

Rather than trust "indirect" data from 
word association tests to settle the matter 
of directionality, an experiment was per- 
formed in which Ss were shown a word on 
each trial and were asked to name an 
instance (for one block of trials) or a 
superordinate (for another block of trials). 
The Ss were timed while they produced 
their responses. 


EXPERIMENT I 
Method. 


Subjects. The Ss were 22 students at the New 
School for Social Research. Each S took part in one 
experimental session that lasted 30 min. 
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TABLE 1 


MEDIAN Reaction TIME (IN SEC.) TAKEN TO RETRIEVE A SUPERORDINATE AND A SURBORDI^ 
FOR TwENTY SETS or NESTED CATEGORIES 


Level 1 category 


Level 2 category 


Name Superordinate | Subordinate Name Superordinate | Sut 
| 
CLOTHING 1.54 FOOTWEAR 2.10 | 
SCHOOL 1.67 COLL 1,93 
ACTIVITY 1.71 CRIME 3.80 j 
SPORT 1.42 WATERSPORT 3.00 
FOOD 1.99 VEGETABLE 1.95 
VEHICLE 1.30 CAR 1.92 | 
TYPE OF BUILDING 1.89 RELIGIOUS BUILDING 2.82 | 
cITy 1.19 CITY 3.08 | 
TITLE 1.92 MILITARY TITLE 3.79 
PLANT 2.14 TREE 2.01 | 
POLITICIAN 1.88 PRESIDENT 2.05 
SCIENCE 1.59 NATURAL SCIENCE 3.61 
MINERAL 2.79 METAL 2.97 | 
BEVERAGE 1.50 SOFT DRINK 2.30 
STONE® 3.49 GEM 1,66 
TYPE OF MONEY 1.59 co! 1.44 
GEOMETRIC SHAPE 1.54 TRIANGLE* 2.04 
ANIMAL 1.30 DOG* 1.40 1 
WORD 1.73 NOUN^ 2.02 2 
FURNITURE 1.35 CHAIR" 1.44 3 
M 3.08 1.72 2.29 1 
? Categories for which production of superordinates was faster than production of instances. 
Materials. Twenty pairs of nested category terminated the trial. If S did not respond within 
names were constructed, For each pair of categories, 10 sec., the trial was terminated. A w m-up 
the superordinate category (Level 1) included period of 10 trials, using stimuli different [roi those 


everything that belonged in the subordinate cate- 
gory (Level 2). The nested categories used are 
presented in Table 1. The category names were 
printed on 5 X 8 in. index cards, with one category 
per card, Each S received 2 random permutations 
of the 40 cards. Half of the Ss named super- 
ordinates on the first 40 trials, and subordinates on 
the last 40 trials. The reverse arrangement held 
for the remaining Ss. 

Procedure, Each S was told that we were conduct- 
ing a study on how memory works, that he would 
See items consisting of single words, and that he was 
to respond with a word that was a superordinate 
(subordinate) of the stimulus word. He was given 
examples and told to respond as quickly as possible, 
but to avoid errors. 

The S sat in front of a screen in which there was a 
window covered by halí-silvered glass. The index 
card containing the stimulus was placed in a dark 
enclosure behind the mirror and was presented by 
illuminating the enclosure. A microphone was 
placed in front of S and he responded by speaking 
into it, 

A trial consisted of the following events. As a 
card with the item printed in large type was placed 
in the darkened enclosure behind the half-silvered 
mirror, E said “Ready,” and pressed a button that 
illuminated the stimulus. The S's verbal response 
activated a voice key that stopped the clock and 


used for the experimental trials, preceded cach 
block of experimental trials, 


Results 


Only correct responses (96%) to the 20 
pairs of nested categories are included in 
the following analyses. For each of the 
nested pairs, one category is more inclusive 
(higher in the semantic hierarchy) than 
the other. For example, vEniCLE includes 
more instances than CAR; in some sense, 
CAR is lower in the hierarchy than VEHICLE- 
We designated the higher category "Level 
1" and the lower category “Level 2," and 
we computed a Level 1 and Level 2 median 
reaction time (RT) for each S for each of 
the 2 types of responses (subordinate and 
superordinate). 

A 2 X 2 repeated measures analysis of 
variance indicated that the speed of pro- 
ducing a subordinate was significantly 
faster than the speed of producing à 
superordinate, F (1, 21) = 46.46, p « Ot. 
All 22 Ss showed this effect. Furthermore, 
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there was a significant effect of hierarchical 
position of a category on the time taken to 
respond, F (1, 21) = 9.86, p « .01. The 
interaction was also significant, F (1, 21) 
= 12.04, p « .01, indicating that the mean 
time taken to produce a superordinate was 
influenced by category level, while the time 
to produce a subordinate was not so 
influenced. 

Median RTs were computed for sub- 
ordinate and superordinate responses to 
each of the 40 categories and are presented 
in Table 1. For 32 stimulus words, Ss 
produced a subordinate faster than he 
produced a superordinate; for 6 stimuli, the 
reverse result occurred. 


Discussion 


The finding that subordinates are produced 
more quickly than superordinates is incon- 
sistent with Miller's (1969) and Collins and 
Quillian’s (1972) intuitions that a super- 
ordinate is more accessible than a subordinate. 
On the other hand, Woodworth and Wells 
(1911) found that S produces superordinates 
faster than he produces subordinates, which 
is, of course, opposite to our own result. To 
reconcile their data with the present results, 
we must examine the Woodworth and Wells 
procedure more closely. In their experiment, 
13 Ss named superordinates of 20 stimulus 
words (for example, Ss produced a super- 
ordinate for OAK and for CABBAGE, etc.) and 
they named subordinates of 20 different words 
(for example, TREE and VEC ABLE). A 
comparison of the average time taken for the 
2 types of responses indicated a shorter RT 
for naming superordinates. 

Although the present procedure is sounder 
than that of Woodworth and Wells (1911) in 
that we use the same stimulus words for the 
production of both superordinates and sub- 
ordinates, the discrepancy between the 2 
studies must be explained. A closer examina- 
tion of the Woodworth and Wells stimulus 
words indicated that at least 12 of the words 
in the “name a superordinate" test were 
themselves instances of the words in the 
“name a subordinate" test. For example, Ss 
were required to name a superordinate of OAK, 
CABBAGE, and PENNY, but to name subordi- 
nates of TREE, VEGETABLE, and COIN. Not 
only were the words for which Ss produced 
superordinates lower in any assumed hier- 
archical structure than the words for the 
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"subordinate test," but these words were 
lower than the words we used in Experiment I. 
Thus, the words for which Woodworth and 
Wells’ Ss produced superordinates would 
constitute, in our terminology, a “Level 3.” 
These observations suggested the possibility 
that the discrepancy between the Woodworth 
and Wells study and the present study was 
due to item differences (specifically in hier- 


archical level) between the 2 studies, To 
test this possibility, Experiment II was 
designed. 
EXPERIMENT II 
Method 
Subjects. The Ss were 22 college students, none 


of whom had participated in Experiment I. 
Materials. The 20 pairs of nested categories 
shown in Table 1 were used. In addition, a group 
of Level 3 categories was created by selecting one 
subordinate for each of the Level 2 categories. So, 
for example, for the pair FOOD-VEGETABLE, we 
selected the stimulus LETTUCE. Each S was pre- 
sented a random permutation of the 60 unique 
categories, 
Procedure. 


The Ss were told that we were con- 
ducting an experiment on fr ociation. They 
were told that they would see a s of 60 stimulus 
words, and after seeing each word, they were to 
write down the first word that it made them think of. 


Results 


A simple count was made of all response 
words which were superordinate and those 
which were subordinate. In case of any 
doubt, the simple question “Does one of 
these words belong to the class of things 
named by the other word?" would usually 
solve the problem. This technique has 
been used successfully by Peters (1952). 
In Figure 1, the mean numbers of these 2 
types of responses are presented separately 
for the 3 levels of stimulus words. A 2 
(response type) X 3 (hierarchical level) 
X 22 (Ss) analysis of variance indicated 
that more subordinates than superordinates 
were given, F (1, 21) = 43.90, p < .01. 
Also significant were hierarchical level 
F (2, 42) = 32.94, p < .01, and the Level 
X Response Type interaction, F (2, 42) 
—170.37, p < .01. A2 (response type) X 3 
(levels) X 20 (category triplet) analysis of 
variance gave identical results, namely, 
significant response type, level, and inter- 
action effects. 
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Figure 1. Mean number of subordinate and super- 


ordinate responses given in Experiment II. 


GENERAL DISCUSSION 


Whether or not the superordinate or sub- 
ordinate is the stronger depends on the level 
of the stimulus word in the semantic hier- 
archy. At Levels 1 and 2, more subordinates 
are given as free associates (Experiment II) 
and subordinate responses are given faster 
(Experiment I). At the lowest level, Level 3, 
more superordinates are given as free associa- 
tions, and presumably superordinate responses 
would be given faster. At this level, Ss appear 
to have few (if any) subordinate responses 
available. The reader can verify this fact 
by attempting to produce some instances of 
CABBAGE or PENNY, Subordinate responses are 
unlikely as free associates, but, if forced, S 
might reply with red cabbage or copper penny. 
Responses are highly idiosyncratic and correct 
single-word responses rarely occur. 

The finding in Experiment I that the mean 
time taken to produce a subordinate when 
given a Level 1 category was not different 
Írom the time taken to produce a subordinate 
to a Level 2 category replicates an earlier 
study of Loftus, Freedman, and Loftus (1970), 
which showed that the RTT taken to retrieve a. 
member of a given category was not signifi- 
cantly different from the RT taken to retrieve 
a member of a superset of that category. 
Hierarchical position of a category does, 
however, affect the speed of producing super- 
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ordinates. The RT taken to produce a 
superordinate of a Level 2 category such as 
CAR was faster than the RT taken to produce 
a superordinate of a Level 1 category such as 


VEHICLE. Thus, the more specific a word is, 
the faster Ss can produce a superordinate for 
that word. In fact, an examination of the 6 
categories listed in Table 1 for which produc- 
tion of superordinates was faster than produc- 
tion of instances reveals that 5 of the categories 
are the lower member of their respective 


nested pair. Thus, not only do some Level 2 
(or specific) categories have a readily ac- 
cessible superordinate, but they do not have 
as accessible an instance. "There are so few 


of these cases (and in all 6, there is less than 
.8-sec. difference in the RT to produce a 
subordinate vs. a superordinate), that we find 


it difficult to agree with any overall statement 
that “the instances of a concept are not easily ^ 
accessible [Collins & Quillian, 1972, p. 320].” 
In fact, the results of both experiments point 
out the need for recognizing that statements? 
such as “the subordinate-superordinate direc- 
tion of association is stronger" (cf. Miller, 
1969) are, at best, only sometimes true 
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In the first of 2 runway investigations, 2 groups of rats received equal amounts 
of punishment and nonreinforcement during acquisition training. Group PR 
received transitions from punishment to reinforcement and Group NR received 
transitions from nonreinforcement to reinforcement. A control group re- 
ceived unpunished continuous-reinforcement training. allowing training, 
half of the Ss in each group received punished extinction, and the other half 
received unpunished extinction. The results indicated that Group PR was 
superior to Group NR in punished extinction, and the reverse was true in 
unpunished extinction. The control group exhibited the poorest performance 
regardless of the response-decrement procedure employed, In the second 
experiment, Groups NR and PR received unpunished, punished, and un- 
punished extinction following acquisition. Generally, the results were con- 
sistent with those obtained in the first experiment. These results were inter- 
preted in terms of Capaldi's 1967 sequential theory of instrumental learning. 


It is a well-established finding that par- 
tial punishment increases response persist- 
ence to continuous punishment over and 
above that produced by partial reinforce- 
ment (e.g., Banks, 1966a, 1966b; Brown & 
Wagner, 1964). Response persistence is 
increased more readily when punishment 
(P) is administered on nonreinforced (N) 
trials rather than reinforced (R) trials 
(Fallon, 1968, 1969), although persistence 
effects (relative to unpunished controls) 
have been observed in situations where 
punishment has been presented simultane- 
ously with reinforcement (ie, Banks, 
1966b; Brown & Wagner, 1964). 

Persistence effects due to punishment 
have traditionally been interpreted via an 
extension (Wagner, 1966) of Amsel's (1962, 
1967) analysis of frustrative nonreward. 
However, several recent attempts have 
been made to extend Capaldi's (1967) se- 
quential theory to account for the effects 
of punishment upon instrumental perform- 
ance (Campbell, Crumbaugh, Marshall, 
! Requests for reprints should be sent to Dennis 
G. Dyck, who is now at the Department of Psychol- 
ogy, University of Manitoba, Winnipeg, Manitoba 
R3T 2N2, Canada. 


& Sparling, 1972; Campbell, Crumbaugh, 
Massey, & Reed, 1972; Campbell, Wroten, 
& Cleveland, in press; Capaldi & Levy, 
1972). 

According to a sequential analysis, the 
mechanism of increased persistence depends 
upon the sequence of different goal-box 
events and not on the "number" of such 
events per se (Capaldi & Kassover, 1970). 
In a partial-reinforcement situation, resist- 
ance to extinction is increased when the 
memory of nonreinforcement (SN) is con- 
ditioned to the instrumental response (R1). 
Sequence is an important variable due to 
the fact that S is conditioned to Ry only 
on reinforced trials that have been preceded 
by nonreinforced trials. 

Assuming that punishment results in a 
memory (S") like SN, it may be conditioned 
to Ri on reinforced trials that follow pun- 
ished trials (P-R transitions). It follows 
that differential trial sequencing should 
produce differential response persistence. 
This hypothesis has been supported in 2 
separate investigations (Campbell et al., 
in press; Capaldi & Levy, 1972). In both 
studies, it was found that response persist- 
ence to punished extinction was increased 
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by partial-punishment schedules containing 
P-R transitions relative to schedules con- 
taining only R-P transitions. Thus, pre- 
liminary evidence suggests that P-R transi- 
tions increase response persistence in a 
manner similar to N-R transitions, In 
addition, Capaldi and Levy found that 
groups which received punishment prior 
to large reward performed better over a 
series of punished trials than a group which 
received punishment prior to small reward. 
This finding indicates that the sequence of 
punishment-reinforcement events, not the 
ordinal position of punishment in the daily 
series of trials, determined performance. 
The present experiments were designed to 
further explore the effects of sequential ma- 
nipulations using both punishment and non- 
reinforcement. Accordingly, in the first of 
2 experiments, 2 groups of Ss experienced 
partial punishment-partial reinforcement 
training, so that one group (Group PR) re- 
ceived P-R transitions and the other group 
(Group NR) received N-R transitions. A 
third group received continuous reinforce- 
ment (Group RR). It should be noted that 
both experimental groups received the 
same number of nonreinforcement-rein- 
forcement-punishment events; only the 
trial sequence of these events was varied. 
Following training, the 3 groups were split 
in half so that half of the Ss in each group 
received punished extinction and the other 
half received unpunished extinction. The 
unpunished-extinction and punished-extinc- 
tion procedures will be referred to as 
response-decrement procedures. It was hy- 
pothesized that differential group perfor- 
mance would bea function of the interaction 
between the type of training sequence and 
the kind of response-decrement procedure 
used in testing. Specifically, it was as- 
sumed that the stimulus complex in pun- 
ished extinction would be dominated by 
punishment stimuli (S*) and therefore Ss 
trained with theoretical SP-R associations 
(Group PR) should be superior to Ss 
trained with theoretical SN-R ; associations 
(Group NR). Conversely,? the opposite 
prediction was made for Ss receiving un- 
punished extinction. “That is, Group NR 
was expected to show greater persistence 
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relative to Group PR in unpu 
tinction (i.e, since unpunished 
would occasion nonreinforceme: 
stimuli). Of course, Group R 
pected to show the least persi 
gardless of the response-decr: 
cedure employed. 


EXPERIMENT I 


Method 

Subjects. The Ss were 60 experim naive 
male albino rats of the Sprague-D train, 
purchased from the Holtzman Co were 


housed individually and were appr: ly 70 
days old at the start of the experim: he Ss 
were randomly assigned to 1 of 3 : rental 
groups (n = 20/group). 


Apparatus. The apparatus consiste: single 
straight-alley runway manufactured Lunter 
The alley, which was 150 X 15 X 9 cn s con- 
structed of clear Plexiglas with a grid fi It was 
divided into a 30-cm. start section, a run 
section, and a 30-cm. goal section, all « being 
separated by guillotine doors. A teaspo unted 
in the middle of the far end of the goal erved 
as a food cup. The S's progress in th: y was 
measured by 3 .01-sec. Standard Electric Corp. 
timers. The first timer, which measured t time, 
was started by a microswitch at the st x door 


and stopped by a photocell located 11 c: ito the 


alley. The second timer, which measured run time, 
was started by the first photocell and «t d bya 
second photocell located 11 em. in front he goal 
box. The third timer, which measured goal time, 


was started by the second photocell and stopped bya 
third photocell located 9 cm. inside the goal box. 
Start, run, and goal speeds were obtained by con- 
verting the start, run, and goal times to r« iprocals. 
Total speeds were obtained by summing the t 
run, and goal times and reciprocating this measure. 
Punishment consisted of a ,3-ma, scrambled shock 
which was gradually increased to .5 ma. for .5 seca 
administered in the goal section of the alley by 4 
Model 700 Grason-Stadler shock generator, The 
shock was administered manually by pushing à 
button on the apparatus control panel. 

Procedure. For 3 days after arrival in the lab- 
oratory, Ss were allowed free access to ad-lib food 
and water. The Ss were then placed on a 12-gm 
daily food-deprivation schedule, with water con- 
tinuously available. The deprivation schedule was 
established 7 days before the start of the experi- 
ment. During the 3 days before the start of the 
experiment, each S was handled individually for 
3-5 min. daily. Two days of pretraining preceded 
the SxXperiment proper. On the first day all 55 
received 2 reinforced trials, and on the s 
they received 4 continuously reinforced trials. 
cup in the goal box was baited on reinforced 

throughout the experiment with 2 lem- 


trials 
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Purina Hog Starter pellets (approximately 90 mg. 
each). 

Two groups received 66% reinforcement, 17% 

nonreinforcement, and 17% punishment, and an 
additional group received 100% reinforcement. In 
Group NR, all nonreinforced trials were followed by 
reinforced trials, and punished trials were never 
followed by reinforced trials, In Group PR, all 
punished trials were followed by reinforcement, and 
nonreinforced trials were never followed by rein- 
forcement. Group RR received 100%% reinforcement 
throughout training, Punishment was administered 
immediately after S broke the final photobeam. 
The goal box was not baited on punished trials, 
and Ss were removed from the goal box following 
punishment. The intensity of punishment was 
gradually increased from .3 ma, to .5 ma. in .1-ma. 
increments during every fifth day of training. On 
nonreinforced trials, Ss were confined in the goal 
box for 15 sec. 
Acquisition training lasted 15 days with 4 trials/ 
The rats were run in squads of 6, with a 
. intertrial interval. In order to prevent Ss 
from forming a pattern discrimination, 3 different 
sequences were used during acquisition, each being 
used a total of 5 times. The 3 sequences of N, R, 
and P trials for Group NR were NRRR, RRRP, 
and NRRP. For Group PR, the sequences were 
RRRN, PRRR, and PRRN. Following acquisi- 
tion, half of the Ss in each group received punished 
extinction and the other half received unpunished 
extinction. Testing consisted of 20 trials in 5 days, 
4 trials/day. 


Results 


Acquisition. Asymptotic performance 
was evaluated by analyzing the mean daily 
running speeds over the last 5 days of 
acquisition (20 trials). A 3 (Group) X 2 
(Response Decrement) X 5 (Days) re- 
peated-measures analysis of variance indi- 
cated nondifferential group performance as 
a function of training in start, F (2, 54) = 
1.76, p > .05; in run, F (2, 54) = .38, p > 
.05; and in total, F (2, 54) = 2.54, p > .05. 
The main effect of group did account for a 
significant portion of the variance in the 
goal measure, F (2, 54) = 9.64, p < .01. 
Post hoc comparisons (all comparisons re- 
ported in this and the subsequent experi- 
ment used the Scheffé correction procedure 
for post hoc comparisons) indicated that 
Group RR was superior to both Group PR 
and Group NR (p < .01). Unlike the 
acquisition data reported by Capaldi and 
Levy (1972), Groups PR and NR did not 
differ from each other. The response decre- 
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Unpunished 
Extinction 


Punished 
Extinction 


SPEED (ft/sec) 


Claes eer, 5 


Mean running speeds for the last day 
of acquisition (A) and 5 days of punished and un- 
punished extinction for all runway measures. 


ment variable (extinction vs. punished 
extinction), which was a dummy variable 
for acquisition, was nonsignificant in all 
measures (all Fs < 1). The main effect of 
days was significant in start, F (4, 216) = 
6.10, p < .01; in goal, F (4, 216) = 26.02, 
p < .01; and in total, F (4, 216) = 8.68, 
p <.01; but not in the run measure, F 
(4, 216) = 2.16, p > .05. None of the 
interactions in the analysis were significant. 
As the analyses indicated, very little re- 
sponse suppression was observed in Groups 
NR and PR relative to the continuously 
reinforced controls. It should be mentioned 
that at no time did any Ss in Groups NR 
or PR evidence patterning. Mean speeds 
across the last 5 days of acquisition are 
shown as Point A in Figure 1. 

Response decrement. A 3 (Groups) X 2 
(Response Decrement) X 5 (Days) analy- 
sis on the mean daily speeds indicated a 
significant main effect for groups in start, 
F (2,54) = 19.47, p « .01;in run, F (2, 54) 
= 46.21, p < .01; in goal, F (2, 54) = 
31.87, p < .01; and in total, F (2, 54) = 
50.59, p <.01. Post hoc comparisons 
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showed that PR groups were superior to 
NR groups, and both were superior to RR 
groups (p < .01 in all cases). The response 
decrement main effect (punished extinction 
vs. unpunished extinction) accounted for 
a significant portion of the variance in all 
sections of the alley, the smallest F being 
22.15 in start (df — 1, 54 in all cases). 
This finding reflects the fact that punished 
extinction produced greater response decre- 
ments than unpunished extinction. The 
most important finding was the significant 
Group X Response Decrement interaction 
in start, F (2, 54) = 8.20, p < .01; in run, 
F (2, 54) = 11.12, p <.01; in goal, F 
(2, 54) = 10.38, p < .01; and in total, F 
(2, 54) = 14.40, p < .01. Post hoc com- 
parisons indicated that Group PR was su- 
perior to Group NR in punished extinction 
in all measures, On the other hand, Group 
NR was superior to Group PR in unpun- 
ished extinction in all measures (p < .01 in 
all cases) except run, where the 2 groups 
did not differ from each other. Of course, 
Group RR was inferior to all other groups, 
regardless of the response-decrement pro- 
cedure employed. It is notable that Group 
PR performed as well in punished extinc- 
tion as in unpunished extinction in some 
sections of the alley. In start and total 
. measures, performance to the 2 response- 
.. decrement procedures was nondifferential ; 

however, performance in the run and goal 
Sections was better in unpunished extinc- 
tion than in punished extinction (p < .05 
in both cases). Both Group NR and Group 
S were superior in unpunished extinc- 
tion relative to punished extinction. Of 
course, the days main effect was highly sig- 
nificant in all sections, The Fs for start 
run, goal, and total were 55.16, 82.49, 91.29, 


(df = 4, 216 in all 


extinguished at diff, 
rates. These effects for each of the alley 


segments across days are shown in Figure 1 
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Discussion 

The acquisition data re; here dil 
somewhat from previous re Camp! 
et al., in press; Capaldi & 72) in t 
suppression, which was ve t, was ol 
served only in the goal secti both of 
studies mentioned, suppres is subst 


tial during training, and the 
sion was dependent on the 
transitions experienced. It y 
lack of suppression in thi it was di 
to the fact that punishment I ministeri 
on a sparse schedule (i.c 7% of t 
trials were punished). In ad 
ment was given on every thir 
It is notable that suppression 
occurred in the goal section that se 


ex 


ment of the instrumental re chain 
curring in the closest temp id. spati 
proximity to punishment w ippress 
whereas earlier segments of t! mse wel 
not. Evidence from a number perimen! 
(Campbell & Meyer, 1971; ( li &I 


1972; Capaldi & Ziff, 1969) pr 
for the notion that mildly ave: 
the goal box produce differ: 
groups in the goal section, whe € 
tense aversive stimuli produce 'up differ 
ences in the earlier segments of the runway: 
Presumably, differences in the start and run 
sections of the alley between the punished 
and unpunished groups would have been ob- 
served if punishment intensity and/or density 
had been increased in the present experiment, 
The response-decrement results obtained in 
the present experiment reflect the powerful 
influence of "sequence" in determining Te- 
Sponse persistence. More importantly, T 
sponse persistence was shown to be a function 
of the interaction between the type of së- 
quences used in training and the kind of 
response-decrement procedure used in testing 
Consistent with this prediction, it was foun 
that Group PR was superior to Group NR it 
Punished extinction, while the reverse Was 
true in unpunished extinction. ‘Thus, P-R 
transitions increase response persistence rela- 
tive to N-R transitions when testing is done 
with punishment-related stimuli. Conversely: 
N-R transitions provide for greater persistence 
than P-R transitions when testing occurs wit 
nonreinforcement-related stimuli. This inter- 
action is predicted by sequential theory, follow- 
ing the assumption that persistence to response 
decrement testing is increased by the similarity 
tween memory stimuli conditioned to the 


more ind 
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instrumental response during training and the 
stimuli presented during testing (i.e., stimulus- 
generalization decrement). Thus, both Group 
PR and Group NR performed better when 
they were tested with stimuli that had been 
conditioned to the instrumental response dur- 
ing training than when they were tested with 
stimuli not directly conditioned. It is note- 
worthy that, in a similar experiment, Camp- 
bell et al. (in press) found that Ss trained with 
both N-R and P-R transitions were as per- 
sistent in unpunished extinction as Ss trained 
only with N-R transitions. A comparison of 
the present results with those reported by 
Campbell et al. illustrates how subtle varia- 
tions in "sequence" can dramatically alter 
performance. 

It is notable that Group PR was markedly 
superior to Group NR in punished extinction, 
but the superiority of Group NR over Group 
PR was less pronounced in unpunished ex- 
tinction. In fact, the run measure did not 
reflect differential group performance in un- 
punished extinction. This finding may be 
interpreted by considering that punisl 
was administered on nonreinforced trials. 
Group PR, subsequent reinforced trials may 
have resulted in the conditioning of a com- 
pound memory stimulus consisting of both 
nonreinforcement and punishment elements 
(SN plus S?) to Ry. Thus, Group PR was 
provided with some degree of persistence to 
nonreinforcement-related stimuli as well as to 
punishment-related stimuli. This analysis also 
provides an explanation for the group main 
effect finding which showed Group PR to be 
more persistent than Group NR when the data 
were collapsed across the response-decrement 
variable. 


EXPERIMENT IL 


Although the data from Experiment | 
clearly reflect the powerful influence of se- 
quential manipulations in determining re- 
sponse persistence, it was felt that an 
additional experiment would provide more 
information on the interaction between 
training sequence and response-decrement 
procedure. 

Thus, 2 groups were given training 
similar to that experienced by Groups NR 
and PR in Experiment I. Following train- 
ing, all Ss experienced a 3-phase response- 
decrement test, in which a period of pun- 
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ished extinction was administered between 
2 periods of unpunished extinction. 

Due to the brevity of the initial phase, 
it was hypothesized that performance would 
be nondifferential. In the second phase 
(punished extinction) it was hypothesized 
that, due to the reinstatement of punish- 
ment-related cues, Group PR would be 
superior to Group NR. Finally, in the 
third phase of the test it was hypothesized. 
that Group NR might be superior to Group. 
PR. The third phase (unpunished extinc- 
tion) was included to determine whether 
or not the reinstatement of SN would pro- 
duce increased performance in Group NR. 


Method 


Subjects. The Ss were 20 experimentally naive 
male albino rats of the Sprague-Dawley strain, 
purchased from the Holtzman Co. They were 
housed individually and were approximately 70 
days old at the start of the experiment. The Ss 
were randomly assigned to 1 of 2 experimental 
groups (n = 10/group). 

Apparatus. The apparatus was identical to that 
utilized in Experiment I. 

Procedure. Acquisition training was carried out 
in the same manner as in Experiment I, with the 
following exceptions: (a) training was extended to 
112 trials, 4 trials/day for 28 days, and (6) several 
additional sequences were included to prevent Ss 
from forming a pattern discrimination. The addi- 
tional sequences for Group NR were RRNR and 
RNRP. For Group PR, the additional sequences 
were RRPR and RPRN. The percentage of nonre- 
inforcement-reinforcement-punishment events was 
exactly the same as in Experiment I. Furthermore, 
it should be noted that on any given day, all Ss 
experienced the same number of each kind of goal- 
box event. 

Response-decrement testing consisted of 3 phases: 
unpunished extinction, punished extinction, and un- 
punished extinction. Testing consisted of 24 trials, 


4 trials/day for 6 days. Each of the 3 phases lasted 
for 2 days. 
Results 

Acquisition. The mean daily running 


speeds during the last 5 days of acquisition 
were analyzed by means of a 2 (Group) X 5 
(Days) repeated measures analysis of vari- 
ance. The differences between Groups NR 
and PR were negligible at the end of acqui- 
sition training in start, goal, and total 
measures (Fs « 1), although there was a 
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nonsignificant trend for Group PR to be 
superior to Group NR in the run section, 
F (1,18) = 3.58, p > .05. The days main 
effect was significant, indicating a general 
increase in speeds, F (4, 72) = 5.09, p < 
:01; in run, / (4, 72) = 10.19, p < .01; in 
goal, F (4, 72) = 4.83, p < .01; in total, 
but not in start, F (4, 72) = 1.88, p > .05. 
Finally, the interaction did not approach 
significance in any of the runway sections, 
the largest F being in total, F (4, 72) = 
1,69, p > .05. 

Response decrement: Phase 1 (unpunished 
extinction), A 2 (Group) X 2 (Days) re- 
peated measures analysis of variance, per- 
formed on the mean daily running speeds, 
indicated a nonsignificant main effect for 
groups in all runway measures: Fs = .01, 
.69, 1.06, and .55 in start, run, goal, and 
total sections, respectively (df = 1, 18 in 
all cases). The days main effect was sig- 
nificant at the .01 level or better in all 
measures, the smallest F being 9.23 in start 
(df = 1, 18). The Groups X Days inter- 
action was negligible in all runway mea- 
sures (Fs <1 in most sections). Thus, 
the first phase of the response-decrement 
lest produced a nondifferential decline in 
response speed (ie. both groups were 
running more slowly than in aquisition, but 
they were not different from each other). 

Response decrement: Phase 2 (punished 
extinction), A 2 (Group) X 2 (Days) anal- 
ysis indicated a significant main effect for 
groups in start, F (1, 18) = 5.46, p < .05; 
-. Anrun, F (t, 18) = 11.02, P « .01; in goal, 
F (1, 18) = 5.93, b < 05; and in total, F 

(1, 18) — 8.06, 6 < .01. The days main 
effect was significant in all segments of the 
runway with the smallest £ being in start, 
FQ, 18) = 39.03, p — 01. The Group 
E x Days interaction did not account for a 
Significant portion of the variance in any 
of the runway sections, The major finding 
was that Group PR was Statistically su- 
perior to Group NR in punished extinction 
consistent with the finding in Experiment I. 
ý Response decrement: Phase 3 (unpunished 
extinction). A 2 (Groups) X 2 (Days) 
analysis on the mean daily running speeds 
(8 trials) revealed a significant group main 
effect in run, F (1, 18) = 4.72, b «.05; in 
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goal, F (1, 18) = 7.19 05; and 
total, F (1, 18) 4.89 55; but m 
in start, F (1, 18) = 3.32 05. Grou 
PR maintained its superi ver Gro 
NR despite the fact that nforcemel 
stimuli were reinstated goal be 
without punishment. Tiu nain effe 
accounted for a signific. tion of th 
variance only in the start e, F (1,18 
= 12.45, p c .0l. This es that $ 
had reached asymptotic ion in d 
runway segments except Í start med 
sure, in which some dec! s still ol 
served. The Group X I nteractiol 
was negligible in all runw: ons (most 
Fs < 1). 

Phase 2 trial analysis. ler to 
amine Phase II performan 
à 2 (Group) X 4 (Trials) à s was per 
formed on the running sj for the 4 
punished trials on each of > days ú 
punished extinction. The ; sis on t 
first day of punished exti: n showe 
that the group main effect v significant 
in all runway measures: in s F (1, 18) 
= 5.67, b. < .05; in run, F (1 = 13.33 
b < .01; in goal, F (1, 18) p< OL; 
and in total, F (1, 18) 10.19, p < 08 
The Group X Trials interaction also ac 


counted for a significant portion. of the 
variance in start, F (3, 54) = 2.92, p S 
-05; in run, F (3, 54) = 3.13, p < .05; and 
in total, F (3, 54) = 5.87, p < .01; but not 
in goal F (3,54) = 1.29, p > .05. Post hoe 
comparisons indicated that the perform: 
ance of the 2 groups on the first punishe 
trial was nondifferential. However, Group. 
was superior to Group NR on the 
second trial (all ps < -01) and all succeed- 
ing trials. Thus the superiority of Group 
PR over Group NR was evidenced after 
the administration of only one punishe 
trial. Of course, the trials main effect Wa 
significant in all runway measures. On tlie 
Second day of punished extinction, a pat 
tern of results similar to the first day W4S 
Observed except that performance Was 
slower and therefore the magnitude of the 
group differences was smaller. The group 
main effect was reliable in start, F (1, 18) 
rs 4.56, p < .05, and in run, F (1, 18) = 
7.81, p < .05; but not in goal, F (1, 18) = 
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3.49, p > .05, or total, F (1, 18) = 4.19, 
p ».05. The Group X Trials interaction 
was statistically significant in start, F 
(3, 54) = 4.04, p < .05; in run, F (3, 54) 
= 7.14, p; € .01; and in total, F (3, 54) = 
2.91, p < .05; but not in goal, F (3, 54) = 
2.86, p >*.05. Appropriate post hoc com- 
parisons (on start, run, and total data) 
indicated that group performance was non- 
differential on the first punished trial; how- 
ever, by the second trial, Group PR was 
superior to Group NR (p «.01 in all 
cases except goal). The trial-by-trial per- 
formance of the groups during the 2 days 
of punished extinction is shown in Figure 2. 
An inspection of the graphs on the second 
day of punished extinction indicates that 
Group PR actually ran faster on the second 
trial than on the first. This effect was 
statistically reliable in both the start and 
run sections (p < .05 and p < .01, re- 
spectively). On the other hand, it can be 
seen that Group NR ran reliably slower on 
the second trial of both punished extinc- 
tion days (p < .01 in all cases). A related 
finding was that the groups performed non- 
differentially on the first trial of each day, 
thereby indicating that spontaneous re- 
covery was not differently affected by the 
treatment variables. 


Discussion 


The response-decrement findings in Experi- 
ment II are generally consistent with the re- 
sults of Experiment I. Following a block of 
unpunished extinction trials (Phase 1) in 
which performance was nondifferential, the 
sudden introduction of punishment produced 
dramatic between-groups performance differ- 
ences (Phase 2). These differences were non- 
existent on the first punished trial, but ap- 
peared in robust form on following trials. The 
sudden superiority of Group PR over Group 
NR in punished extinction indicates that 
stimulus control (ie. the capacity of S? to 
occasion Ry) of punished reactions is main- 
tained through a block of unpunished extinc- 
tion trials. 

In contrast, the data obtained in the final 
phase of unpunished extinction did not con- 
form to prediction. It was hypothesized that 
the reinstatement of SX would result in in- 
creased. performance for Group NR and de- 


131 


$$ GROUP NR 


SPEED (ft/sec) 


1 2 3 4 1 2 3 4 
DAY 2 


Ficure 2. Mean running speeds for the 2 days of 
punished extinction for all runway measures, 


creased performance for Group PR. This 
hypothesized interaction did not occur; rather, 
both groups seemed to reach asymptotic ex- 
tinction performance with Group PR still 
running significantly faster than Group NR. 
Several possibilities may be considered to ac- 
count for the Phase 3 data. First, it is possible 
that proactive interference occasioned by 
Phase 2 punishment resulted in the disruption 
of the SN-R; association for Group NR. 
This is a plausible assumption, but it is not 
immediately clear why unpunished extinction 
(Phase 1) did not proactively interfere with 
Phase 2 performance for Group PR. A second 
possibility to account for the poor performance 
of Group NR in Phase 3 is that nonreinforce- 
ment memory stimuli are simply not as dis- 
tinctive as punishment memory stimuli, and 
therefore stimulus control established with SN 
dissipates or is disrupted more readily than 
stimulus control established with S”. Finally, 
it is possible that Phase 3 performance merely 
reflects S's ability to recover from the effects 
of response-contingent punishment. Group 
PR had reinforcement associated with the 
aftereffect of punishment during training, 
whereas Group NR did not; consequently S? 
may have been psychologically more intense 
for the latter Ss. If S? was a more "intense" 
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event for Group NR relative to Group PR, 
poorer recovery for the latter group would be 
expected. There is evidence (Boe & Church, 
1967) that recovery from punishment does not 
occur (when punishment is delivered at the 
beginning of extinction) at a wide range of 
punishment intensities, with the extent of 
suppression being inversely related to punish- 
ment intensity. In summary, it is possible 
that a combination of the factors mentioned 
is responsible for the poor performance ex- 
hibited by Group NR in Phase 3, 


GENERAL DISCUSSION 


These experiments suggest that the trial 
sequencing of partial-punishment-partial-rein- 
forcement training controls performance during 
response-decrement testing. Furthermore, the 
level of performance depends not only on the 
trial sequence of nonreinforcement-reinforce- 
ment-punishment events in training, but also 


. upon the kind of response-decrement pro- 


: 


cedure that is used during testing. These 
effects were demonstrated using both a be- 
tween-Ss and a within-Ss response-decrement 
procedure. 

These data confirm and extend the results 
of Campbell et al. (in press) and Capaldi and 
Levy (1972) and lend further support to the 
notion that sequence is a crucial determinant 
of response persistence. It appears that P-R 
transitions function in a manner analogous to 

V-R transitions. However, the theoretical 
similarity between nonreinforcement and pun- 
ishment could be extended even further if 
effects analogous to those obtained in partial- 
reinforcement-extinction situations were dem- 
onstrated using punishment. 
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VERBAL CONTEXT SHIFTS AND FREE RECALL? 
ALAN S. BROWN axp BENTON J. UNDERWOOD* 


Northwestern University 


The number of different stimuli (1, 2, 3, 4, 9, 18, or 36) associated with 36 
response terms was manipulated as a means of simulating variation in con- 
textual shifts in free recall. Recall was inversely related and pair recognition 
directly related to number of different stimuli. When response terms with the 
same stimulus terms were blocked within a list, performance generally de- 
creased within blocks. When the pairs were unblocked, evidence for unlearn- 
ing of the early pairs was found. The interpretation of unlearning was sup- 
ported by the further finding that spontaneous recovery of the initial items 
occurred over a 20-min. interval. Insofar as the present procedures simulated 
context shifts realistically, it must be concluded that a complex set of phe- 


nomena emerge as context shifts in free recall. 


Investigators representing a wide range 
of theoretical persuasions have found it 
convenient, if not necessary, to invoke 
contextual stimuli to account for at least 
some of the responses produced in single- 
trial free recall. Contextual stimuli are 
usually said to be represented by the char- 
acteristics of the physical environment or 
by S's emotional and cognitive activity, 
the latter including the implicit verbal 
responses which may be produced by the 
words presented for learning. Direct evi- 
dence on the role of contextual stimuli in 
memory has sometimes been produced (e.g., 
Turvey & Egan, 1969), but there is nearly 
a complete lack of systematic studies on 
the topic for free-recall learning. The 
original conception of the first study to be 
reported here viewed the conditions as being 
ones which would provide systematic evi- 
dence on the role of context. As will be 
seen, they may also be viewed as dealing 
with other related phenomena. 

Consider a situation. in which S is pre- 
sented a paired-associate list for one trial 
and is then asked to recall only the response 
terms. The formal stimulus terms may be 
conceived of as contextual stimuli, each 
serving as a unique context for its response 
term. Like contextual stimuli, the stimulus 


1 The authors acknowledge with gratitude the 
critical appraisal of this report by John J. 
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terms do not have to be overtly produced. 
If they serve a role in the recall of the re- 
sponse terms, their systematic status is 
essentially the same as that attributed to 
context cues except that, in this case, they 
are under experimental control. If S can 
recall (to himself) a stimulus term, and if 
an association has been established be- 
tween it and the response term, the response 
term will be produced. Of course, the 
critical question is sidestepped, for an 
appeal to some other context would be 
necessary to explain how the stimulus 
term was recalled. Only if the contextual 
stimuli are always present (as would be 
true for the physical properties of the 
room) could this problem be avoided. Any 
notion of a nonstatic context—a shifting 
context or a series of successive contexts— 
requires that in some way that context be 
reinstated (remembered) if it is to serve 
as an effective cue for recall. 

Although the present experiment does 
not resolve the basic issue, it proceeds on 
the assumption that it is quite possible to 
vary the likelihood with which context 
stimuli will be recalled. In the first experi- 
ment, Ss were presented a paired-associate 
list of 36 pairs. There were 36 different. 
stimulus terms at one extreme, and in suc- - 
cessive conditions there were fewer and 
fewer different stimulus terms, each being 
paired with more than one response term, 
until at the other extreme there was a single 
stimulus term paired with 36 different re- 
sponse terms. The pairs with common 
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stimulus terms always occurred in blocks. 
It might be assumed that contextual shifts 
occur on the study trial of a long free-recall 
list of single words. The present manipula- 
tions were intended to simulate such shifts; 
each change in a stimulus term was 
likened to a context shift. Furthermore, 
it can be seen that as the number of con- 
text shifts decreases, the frequency with 
which each stimulus term (context) is pre- 
sented increases. Thereby, the probability 
that the stimulus term will be recalled 
increases. 

If the formal stimulus terms are to 
simulate contextual cues in free recall, 2 
learning conditions must be met. First, S 
must know that he will be expected to 
recall the response terms without the 
stimulus terms (context) present. Second, 
at the same time, he must know that he will 
also be held responsible for associating the 
response terms with the context. There- 
fore, in addition to being instructed. that 
Tesponse-term recall would be required, .S 
was told that his knowledge of the pairings 
would be tested by a recognition procedure. 

If the procedures as described are a 
reasonable representation of context varia- 
tion, it follows that context changes within 
à single free-recall study trial could bring 


into play a number of phenomena, listed 
below. 


1. When more than one response term is 
paired with a single stimulus term, an 
A-B, A-C paradigm is set up within the 
list. This could produce either or both of 
2 outcomes. First, the earlier learned pairs 
within a block could produce negative 
transfer in attempts to learn later pairs. 
Second, and working in the opposite direc- 
tion, attempts to learn later pairs in a block 
could produce unlearning of earlier pairs. 
Furthermore, the magnitude of the effects 
ought to vary as a function of the number 
of different Tesponse terms paired with a 
common stimulus term. 

2. One could view each block of pairs 
with the same stimulus term as a list, 
Thus, when the same stimulus 
curred with 12 different respon 
there would be 3 different lists. 
rOactive inhibition has been s 


term oc- 
se terms 
Since ret- 
hown for 
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free-recall lists, a compa: »henomen 
may emerge within the | 

3. An increase in the 1 
stimuli within the list reduce fl 
interference effects resulti n the Ack 
A-C paradigm but, at | e 
crease the memory load 
ent words) for S. 


of differel 


These potential com; 
might have recommended 
not be undertaken. Ou: 
sulted from the belief tha: 
contextual stimuli cannot 
plexities, so we chose to ac vem in 
initial experiment, Furt e, in 
analysis of the data, atte ave bi 
made to determine if the nentiond 
interference effects were in operatii 
during the single study tria 


these co 


EXPERIMENT | 
Method 


The Ss were given a single study 
list prior to being tested. Before 
sented, they were told that their 
be tested in 2 w: First, resp 
would be required; and second, a fo 
ognition test would be given, in whic 
would be shown with the request to choose u 
correct pair (the pair presented on the study t | 
in each set, Four study-test trials were used. Hall 
of the Ss were tested in each of the following orders 
recall, recognition, recognition, recall; and recogmig 
tion, recall, recall, recognition. The .5 did not kno 
which test he would be given for a trial until after thg 
study trial. The major interest was in the memo! i 
for the items after the first trial. Analysis of score 
beyond the first trial has provided no useful informa 
tion beyond that given by the first trial, so that 
evidence to be presented is derived from the first 
trial performance. 


d-choice ree 


ets of 2 pali } 


f 

Lists. The 7 lists were constructed from 36 | 
3-letter words of minimal letter duplication angi 
from 36 2. | 


-syllable words having concreteness ratings 
of 6.0 or higher in the Paivio, Yuille, and Madig 
tables (1968). The 3-letter words served as stimull) 
terms, the 2-syllable words as response teri 
-àe instructions prior to learning included illusti? 
tions of both types of words (stimulus terms and a 1 
Sponse terms) so that S knew what to expect on th 
* types of retention tests, 1 
The 7 lists represented the context m nipulatio | 
and these conditions will be designated by © 
number of stimulus terms used in the lists. 
Condition 1, a single stimulus term was used: t 
"t was essentially like a true free-recall list in 04 
the stimulus terms could not be used as a differen 
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cue, In successive lists there were 2, 3, 4, 9, 18, and 
36 different stimulus terms. For Condition 36, the 
36 3-letter words and the 36 2-syllable words were 
paired randomly, When the number of stimulus 
terms was less than 36, they were chosen randomly 
from the 36 and paired randomly with the response 
terms, subject to the restriction that each was 
paired an equal number of times. Response terms 
paired with a common stimulus term were blocked 
within the list. Thus, in Condition 9, for example, 
there were 9 blocks of 4 p: each, with the same 
stimulus term used within each block. The ordering 
of the blocks was random. 

Procedure and testing. The data were collected by 
a group procedure. Prior to testing, each group was 
assigned randomly to 1 of the 7 lists. Within each 
group, the recognition test and the recall test were 
assigned randomly for the first trial after the study 
phase, For the recall test Ss were simply asked to 
write down all of the response words (which had 
been identified in the instructions as the right-hand 
2-syllable words). For the recognition test, Ss were 


given a test sheet on which there were 36 sets of 2 
pairs each, both pairs in each set having the same 
response term but a different stimulus term. The 


Ss marked the correct pair in a blank provided for 
this purpose. Both stimulus terms were always 
from the list. In Condition 2 there were only 2 
stimulus terms, so these occurred in all pairs. For all 
other conditions, the incorrect stimulus term was 
chosen randomly, subject to the restriction that it 
was not used a second time until all other stimulus 
terms had been used once. The ordering of the sets 
of pairs on the recognition test sheet was random 
with respect to the order of the pairs on the study 
trial. 

The pairs were presented for study at a 4.5-sec, 
rate. Approximately 3.5 min. were allowed for the 
test period. Small groups of Ss were tested until 
each condition had a minimum of 20 Ss each. Seven 
of these groups of 20 Ss had been tested for recall on 
the initial trial, 6 for recognition, The list desig- 
nated Condition 1, in which only one stimulus terms 
was used, could not be administered as a recognition 
test in a meaningful manner. The Ss given this list 
had 4 successive recall trials. 


Resulls 


Response-term recall. The percentages 
of items recalled for each of the seven condi- 
tions are plotted in Figure 1. The base 
line is described as the log of the number 
of different stimuli involved in the 7 lists. 
Thus, the number of different contextual 
changes increases as the number of different 
stimulus terms increases. In general, recall 
decreased as the number of contextual 
stimuli increased, but the relationship is 
by no means a regular one. The differences 
among the recalls of the 7 groups are 
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of number of different stimuli for the 36 response 
terms. 


statistically significant, / (6, 133) = 4.33, 
p < .01. Performance under Conditions 3 
and 4 is less than that for Condition 9. 
However, the statistical reliability of these 
differences, as well as of the difference 
between Conditions 1 and 2, cannot be 
demonstrated. For the moment then, it 
may be said that as the number of different 
stimulus terms increased, there was a de- 
crease in the number of response terms 
recalled. Stated in another way, the rela- 
tionship shows that as the number of 
different words presented (stimulus words 
plus response words) increased, recall de- 
creased. This might be interpreted in 
terms of memory load, although the data to 
follow make it unlikely that this interpreta- 
tion will suffice. 

Position and block effects. Figure 2 pro- 
vides an initial inspection of recall as re- 
lated to the position of the item on the 
study trial. The 3 lines represent different 
portions of the list, viz., primacy (Items 
1-6), recency (Items 31-36), and all items 
in the body of the list (Items 7-30). First, 
it can be seen that there was a general 
decrease in recall of items from the body 
of the list as the number of different 
stimulus terms increased. Next, there was 
a lack of primacy and recency effects for 
Conditions 3 and 4; that is, recall for the 
initial and final positions did not differ 
appreciably from recall of the items within 
the body of the list. All other conditions 
show primacy and recency effects, although 


PERCENT RECALL 


NUMBER DIFFERENT STIMULI (LOG SCALE) 


Fic. 2. Response-term recall as related to pri- 
macy (Items 1-6), recency (Item 31-36), and all 
other items (7-30), and to number of different 
stimuli, 


the primacy effects are relatively slight 
for Conditions 18 and 36. 

It was pointed out earlier that when S 
tries to learn response terms to a common 
stimulus, interference might occur. ‘This 
in turn could lead to unlearning. If un- 
learning occurs, items early in a block 
having the same stimulus term should be 
more poorly recalled than items occurring 
late in the block. The data have been ex- 
amined to see if they give any support to 
this notion. For Condition 18, where there 
were only 2 response terms for each stimulus 
term, recall of the first and second items 
(combined across the 18 blocks) was essen- 
tially equal. For Condition 9, very clear 
Position effects were apparent within the 
blocks. The first item within the block 
showed the best recall, and the second was 
recalled better than the third and the 
fourth, the latter 2 being about equal. This 
finding provides no support for unlearning 
within blocks. Rather, it appears that each 
new stimulus term served to define a new 
rehearsal block, and as S Proceeded through 
the 4-pair block, rehearsal fell most he: 
on the first item in the block. In Condition 
4 (9 response terms for each of 4 stimulus 
terms) there were no within-blocks differ- 
ences of any consequence. The position 
effects within blocks were indistinguishable 
from those of Condition 1 when the latter 
Was scored comparably. 

For Conditions 2 and 3, the 
changes again. 


avily 


j picture 
The percentages correct 
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for 6 successive blocks of 6 item: have 
been determined for the 2 con: In 
Condition 3, each stimulus was with 
12 response terms, hence each ! ith a 


single stimulus term consisted 
sive sixths. For Condition 2 
items had a common stimul 


cces- 
st 18 
and 


the second 18 had a differe: mon 
stimulus term. Plotting in s eans 
that the first 3 sixths had the s: iulus 
term, and the last 3 sixths th« imu- 
lus term. Figure 3 shows that i; lition 
3, performance increased for cac ssive 
block of 12 items and within « lock, 
with the last 6 items recalled m »oorly 
than the first 6. Thus, the wit locks 
effects are much like those for Cx on 9, 
and again, it would be concluded chat no 
unlearning is apparent within blo j For 
the first block of 18 items in Condition 2 
there is a sharp decline across the 3 sixths: 
This is followed by a sharp increase as the 
stimulus term changes, followed uin by 
a decrease, with what is appasitly a 
recency effect in the final block. r both 
conditions represented in Figure it be- 
comes particularly clear that a change of 
stimulus context had a marked effec: on the 
specific items recalled. Finally, for Condi- 
tion 1, the recall by position was much 


like that for Condition 2 for the first 18 
items, but, of course no marked increase m 
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recall occurred for the fourth sixth as was 
true for Condition 2. 

Recognition. In the recognition tests, 
Ss were required to choose between 2 pairs 
in 36 sets, one pair in each set being correct, 
the other incorrect. The percentages cor- 
rect for the 6 conditions are shown in 
Figure 4. Unlike recall, recognition per- 
formance generally increased as the number 
of stimulus terms increased. The difference 
overall is statistically reliable, F (5, 114) 
=2.49, p < .05, although the maximum 
difference is only about 10%. 

Primacy effects, although small, were 
present in the recognition scores, but there 
were no recency effects. Furthermore, 
differences within blocks, so evident in 
several of the conditions for recall, were 
much less apparent in the recognition data. 
Correct decisions made on pairs in recogni- 
tion might be expected to be correlated 
with the recall of the response terms of 
those pairs when summed across Ss. For 
the 6 conditions, the product-moment cor- 
relations across the 36 items varied between 
.01 and .31. Although even the largest 
value is not significant by conventional 
standards, the fact that all 6 correlations 
were positive indicates a slight relation- 
ship, perhaps reflecting the common pri- 
macy effect found in both recall and recog- 
nition. In any event, these data indicate 
that knowledge of associations, as deter- 
mined by recognition of appropriate pair- 
ing, holds little relationship to the recall 
of the response terms of those pairs. 


DISCUSSION 


Discussion 


As pointed out in the introduction, it had 
been expected that learning of the different 
lists would produce a rather complicated 
picture, and this expectation seems to have 
been realized. Yet evidence for within-lists 
interference produced by the A-B, A-C para- 
digm is not easy to identify. The recognition 
data do suggest that the fewer different re- 
sponse terms associated with a given stimulus 
term, the more intact the association. There 
was no support for the idea that unlearning 
might occur within blocks. Except for Condi- 
tion 4, within-blocks differences were quite 
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marked, with performance on the initial items 
in a block being better than on later items. 
One interpretation that has already been sug- 
gested is that the blocks, at least the smaller 
ones, provide a ready rehearsal set. This 
interpretation is not without some ambiguity, 
since the same type of effect would be ex- 
pected if the early pairs interfered with the 
learning of the later pairs. However, as will 
be seen in Experiment II, it is likely that the 
interference component is relatively small. 

For some lists (Figure 3) there was a sizeable 
increase in recall for each successive block. 
This could be interpreted as retroactive inter- 
ference of later blocks on earlier blocks. Such 
interference would have to result from an A-B, 
C-D paradigm. This may not be plausible. 
An alternative interpretation is that the in- 
crease in recall across blocks simply represents 
differences in recall associated with retention 
intervals of different lengths. 

Context can play a role in recall only if it is 
reinstated at the time of recall. There is little 
doubt that for the lists with relatively few 
stimulus terms, these terms could have been 
recalled. The number of response terms re- 
called was directly related to the likelihood 
that the stimulus terms could be recalled. 
Yet, it should be clear that there is no con- 
vincing evidence that recall of the response 
term was mediated by the recall of the stimulus 
term. So far as we can tell, there is no way 
to design an experiment to make such a deter- 
mination directly. Although not reported 
above, clustering by stimulus terms occurred 
only in Conditions 9 and 18 in amounts greater 
than could be expected for clustering mediated 
by position alone (as determined from Condi- 
tion 1). This makes it likely that stimulus- 
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term recall preceded response recall, but such 
a result does not demand this interpretation. 

It is apparent that the complexity of the 
findings requires more analytical work if the 
processes associated with context shifts are 
to be untangled. Experiments II and HI were 
viewed as first steps toward this untangling, 
with the focus being on the interference (and 
unlearning) for the within-lists A-B, A-C 
paradigm, 


EXPERIMENT II 


In Experiment II the A-B, A-C pairs 
have been unblocked. This was viewed as 
a means of breaking up, or at least mini- 
mizing, the possibility that common stimu- 
lus terms may be used to define rehearsal 
blocks. "There were 3 lists, corresponding 
to those in Conditions 3, 4, and 9 in 
Experiment I. As a second variable, half 
of the Ss were given the stimulus terms at 
the time of recall of the response terms, 
and half were given free recall of the re- 
sponse terms. All Ss were instructed that 
they would be tested for response-term 
recall with and without the stimulus terms 
present. Thus, the instructions again met 
the 2 requirements believed necessary if the 
stimulus terms were to simulate contexts. 


Method 


The pairs used in Experiment 1 for Conditions 3, 
M and 9 were assigned randomly to positions in the 
list, subject only to the restriction that pairs with the 
same stimulus were not allowed to occupy adjacent 
positions. Three groups of 20 Ss each were given 
response-term recall (as in Experiment 1), and 3 
additional groups of 20 Ss each were given the 
stimulus terms on the recall sheet and were asked to 
pair the response terms with the appropriate stimu- 
lus terms, The Ss in all groups were instructed that 
on successive trials they would be given both free 
recall and cued recall, the particular type of recall 
to be identified after the list had been presented for 
learning. As with Experiment I, trials beyond the 
first are not included in the present analysis, these 
extra trials being given merely to fulfill the obliga- 
tion implied by the instructions. All other pro- 
cedures were the same as those in Experiment I. 


Results 


Recall. The recall for the 6 groups varied 
between 17.2% and 23.995. Mispaired re- 
sponses for the cued condition were counted 
as correct. Neither condition, as defined 
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by number of stimuli (F nor 


cuing vs. noncuing (F = 1.61), w: ifi- 
cant source of variance. An is of 
the free recall, which included the from 
the 3 conditions in Experiment wed 
no statistical difference, althou; lack 
of a difference is difficult to i et as 
the 2 experiments were perform: iffer- 
ent times in the school vear. 

Position and recall. There w sys- 
tematic differences in recall as tion 
of position of the items on the trial 
between cued and free recall fore, 
the data have been combined | luate 
the influence of input position ecall. 
Some general observations will made 


initially. All 3 conditions show: irked 
primacy and recency effects, in c ist to 
Experiment I, where Conditions . and 4 
showed little evidence of sych effc For 
the 3 conditions of Expatillent JI, primacy 


(Items 1-6) increased directly as number 
of stimulus terms increased, the values for 
recall being 28%, 35% and 45% for Condi- 


tions 3, 4, and 9, respectively. — i:ccency 
effects (Items 30-36) were gres «r for 
Condition 3 than for Conditions 4 and 9, 
the values being 33%, 25%, and 20%, re- 


spectively. For items in the body of the 
list (7-30), recall decreased by 8% between 
Condition 3 (20%) and Condition 9 (12%), 
with Condition 4 intermediate. 

The major interest was in the recall of 
successive response terms having the same 
stimulus term. There is no satisfactory 
way to remove the overall primacy and 
recency components from this analysis, 
since buffer items were not used. In fact, 
these effects should probably not be re- 
moved, since they may reflect in part the 
interference which is the focus of the 
analysis. The approach used was to ex- 
amine recall as a function of position in the 
series of items having the same stimulus 
term and to compare this recall with the 
results of Experiment I. In Experiment ! 
the pairs were blocked by stimulus ternis: 
whereas in Experiment II the pairs with the 
same stimulus term were randomized 
throughout the 36 positions. The data 
were summed across different stimuli lor 
the first occurrence of the stimulus terms, 
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the second occurrence, and so on. For 
Condition 9, each stimulus term occurred 
4 times, hence each of the 4 positions was 
represented by the 9 different stimulus 
terms. For Condition 4 each stimulus 
term occurred 9 times, therefore, each of the 
9 positions was represented by 4 different 
stimuli. Finally, for Condition 3, each 
stimulus occurred 12 times, so each of the 
12 positions was represented by 3 different 
stimulus terms. The percentage recall was 
determined for each position in the se- 
quence for each of the conditions of the 2 
experiments. These values are plotted in 
Figure 5. 

The first finding to be noted is the initial 
drop which occurs, this being greater for 
Experiment lI than for Experiment I. 
The second finding of note is that following 
the initial drop, recall in Experiment I] 
tended to increase across successive posi- 
tions, whereas no such trend was apparent 
in Experiment I. This was particularly 
clear for Condition 3, where each stimulus 
occurred 12 times. The data for Condition 
3 may be related to the plot for Experi- 
ment I, shown in Figure 3. For each of the 
thirds in Figure 3, the first 6 items within 
a block were better recalled than the second 
6 items. This is reflected in Figure 5 for 
this condition, where recall for the first 6 
positions (31%) is greater than recall for 
the last 6 positions (21%). In contrast, in 
Experiment II recall for the first 6 posi- 
tions (2095) is less than for the last 6 
positions (27%). 


Discussion 


The results in Figure 5 for Experiment II 
suggest that unlearning occurred within the 
list when the same stimulus term was used for 
successive response terms. The first pair 
seems to be “protected” from unlearning, but 
beyond this there is a clear trend for each 
successive response term to be recalled better 
than the preceding ones, although the second 
position is somewhat ambiguous. Increasingly 
better recall with each successive stimulus 
should happen if each successive pairing pro- 
duced more and more unlearning of previous 
items. 

There are undoubtedly many other expla- 
nations in addition to unlearning which might 
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Nic Reeall of response terms having the same 
stimulus terms as a function of position in the 
sequence. (In Experiment | the pairs with the 
same stimulus terms were blocked, while in Ex- 
periment II they were not. The conditions with 3, 
4, and 9 different stimulus terms are represented 


in the 3 panels.) 


be offered to account for the gradual increase 
in recall across positions (e.g., shorter retention 
intervals for later items). If unlearning is 
occurring, one expectation might be that abso- 
lute recall should have been better on, say, 
Positions 3 and 4 for Condition 9 than for 
Condition 3. This might be expected on the 
grounds that in the latter condition there were 
more successive items to produce unlearning 
than in the former. However, a complicating 
factor may arise from different amounts of 
retroactive interference between items with 
different stimuli, This possibility is greater 
in Condition 9 than in Condition 3. 

When pairs are blocked, as in Experiment I, 
it seems likely that the blocking provides a 
ready rehearsal set which would counter un- 
learning, but which would lead to poorer recall 
for items late in the block than for those early 
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in the block. The purpose behind the un- 
blocking in Experiment II was to remove the 
ready-made rehearsal blocks. It was believed 
that evidence for unlearning might be ob- 
tained under these circumstances. If unlearn- 
ing did occur, as presumed in Experiment II, 
spontaneous recovery might be expected. 
Since unlearning is assumed to be greater for 
items in the initial positions, spontaneous re- 
covery should be greater for the items in these 
positions than for items in the late positions 
sharing the same stimulus term. The purpose 
of Experiment III was to determine if spon- 
taneous recovery could be detected in a delayed 
retention test. 


o EXPERIMENT III 


"There is no evidence as to the course of 
recovery over time following free-recall 
“learning. Therefore, a 20-min. interval 
was chosen on the basis that spontaneous 
recovery has been observed within this 
range when the A-B, A-C paradigm holds 
between lists (e.g, Postman, Stark, & 
Henschel, 1969). 


k 
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Method 


__A single list of pairs was used, this being the list 
identifying Condition 3 in the first 2 experiments. 
There were only 2 basic conditions, an immediate 
y. cued recall (the same as given in Experiment II), 
- and for another group of Ss, a cued recall after 20 
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_, Fic. 6. Response-term recall as a function of the 
length of the retention interval (immediate or 20 
min.) and position in sequences of response terms. 
aving the same stimulus term. 
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min. Two random orders of the 36 jx e en 
ployed, but since the results showed tha were 
equivalent, no further mention will be f this 
factor. Each group was made up of 34 1e Ss 
were instructed prior to the presentat e list 
that they would be tested for recall t the 
response terms were to be listed unde: »pro- 
priate stimulus term, The Ss were t« vidus 
ally, and the words were presented at rate, 
Those Ss given the 20-min. recall na 
pyramid puzzle during the interval 

Results 

The immediate recall was 25.4‘ ly 
higher than that for the compar: ndi- 
tion in Experiment II (23.9* rhe 
group given recall after 20 min iced 
18.4% of the response terms. 

The critical data are shown in e 6. 
The plot corresponds to that for e 5. 
Looking first at the immediate , it 
can be seen that except for the tial 
positions, the recall by position w uch 
the same as in Experiment II and = dis- 
played in the lower panel of Figore 5. 


Why the primacy effect was greater in 
Experiment II than in Experiment !II is 
not known. Recall by position after 20 


min. was quite different from the corre- 
sponding curve for immediate recall. For 
the 4 initial positions, recall was actually 


higher after 20 min. than on the immediate 
test. On all subsequent positions ii was 
less. In short, the results are not unlike 
those to be expected if unlearning and 
spontaneous recovery occurred. The inter- 
action between position and length of reten- 
tion interval is highly reliable, F (11, 726) 
= 11.36, p < .01, using an arc sine trans- 
formation of the data. Evaluated inde- 
pendently, recall for the first 4 positions 
is statistically greater for the 20 min. group. 
than for the immediate group, / (1, 66) 
= 4.01, p « .05. 

: If there is output interference and if Ss 
in the 20-min. group gave priority to the 
recall of the early items, while those with 
immediate recall gave priority to the recall 
of items late in the list, the results shown 
in Figure 6 might be produced. However, 
an analysis showed no difference between 
output order for the 2 groups. The data 
appear to be consonant with the notion that 
Spontaneous recovery occurred and that 
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this retarded the forgetting of items. The 
greatest retardation occurred early in the 
list, where the greatest amount of unlearn- 
ing is presumed to have taken place. Taken 
at face value, the data indicate that re- 
covery even occurred for the first pair in 
the sequences. 


GENERAL DISCUSSION 


The procedures of the present experiments 
were intended to simulate context shifts or 
changes. In the first experiment, recall was 
inversely related to the number of context 
shifts, although this relationship was neither 
strong nor regular. Number of context shifts 
was directly related to number of different 
stimulus terms. The relationship between 
context shifts and recall may be complicated 
by internal interference when the same con- 
text serves as a stimulus term for more than one 
response term. The fact that recognition 
showed that the association between stimulus 
and response terms remained more stable with 
increasing numbers of stimulus terms suggests 
that interference may counteract to some 
extent the positive effect of a relatively few 
context stimuli, Nevertheless, the positive 
and negative factors which seem to be oper- 
ating jointly to produce recall performance 
may be quite realistic and applicable when 
notions concerning the role which context 
plays in free-recall learning are advanced. 
This is to say, simplified theoretical notions of 
the role of multiple contexts may not be 
appropriate. 

As a context shift occurred—as pairs with 
the same stimulus term were followed by a 
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pair with a new stimulus term—performance 
on the initial pairs after the shift was markedly 
enhanced under some conditions. When the 
pairs of words with the same stimulus terms 
were randomized within the list rather than 
being blocked, the learning pattern within 
the sequence of response words with the same 
stimulus terms changed, It is possible con- 
ceptually to think of contexts, particularly 
internal contexts, which may be present for 
only one word, and then reoccur at a later 
point in the list fora new word. Experiment II 
simulated this reoccurring context. The evi- 
dence from this experiment was interpreted as 
showing unlearning for pairs having the same 
context, with the earlier occurring pairs being 
unlearned as a consequence of the occurrence 
of the later pairs. Experiment III gave some 

support to this interpretation by showing that 

spontaneous recovery of unlearned associa- 

tions may occur over a retention interval. 

Such data again suggest that contextual stimuli 

may produce a rather complicated set of 

relationships. 
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DICHOTIC LISTENING AND SEQUENTIAL ASSOCIATIONS 
IN AUDITORY SHORT-TERM MEMORY ' 
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Two experiments are reported in which a dichotic-bisensory stimulus presenta- 
tion was used. In one experiment, 4 stimulus pairs were presented in a trial, 
2 dichotic pairs being alternated with 2 visual pairs. The preferred recall order 
was mode by mode when all words were to be recalled, and ear by ear for audi- 
tory words, In Experiment II, 8 words were presented in a trial, but 4 words, 
2 auditory and 2 visual, were simultaneous. The S was instructed to recall 
either the word presented before or after the probe but in the same ear or same 
part of the screen, or the word presented at the same time and in the same 
modality as the probe. The results were nterpreted as evidence for the exis- 


tence of strong associations between successive auditory stimuli. 


In dichotic listening, auditory stimuli 
are presented 2 at a time, one to each 
ear, and S is asked to recall. If S is not 
instructed to report the items in any par- 
ticular order, he tends, at fast presentation 
rates, to report all those items presented 
to one ear and then those presented to 
the other ear (Broadbent, 1954, 1956). 
At slow rates, he generally reports the 
first pair of items, then the second pair, 
and then the third pair. 


Broadbent (1958) interpreted this “ear- 
by-ear" order of report in terms of the 
difficulty of switching attention between 
channels of information. He hypothesized 
that a selective filter mechanism can be 
Set to select 1 channel, distinguished on 

the basis of some physical characteristic, 
and to reject all other channels. Because 


of the time involved in switching channels, 
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ear-by-ear order of report is more ei nt 
at fast, but not at slow, presentation es. 
Subsequent work has indicated t! at 
least 1 main assumption of Broad. :t's 


model, i.e., that only simple physical «:ar- 
acteristics could be analyzed prior to the 
filter, is incorrect. Several studies g^ 
Gray & Wedderburn, 1960; Yntema & 
Trask, 1963) have indicated that mea: ing- 
fulness of input items could define a chan- 
nel, and that meaning therefore must be 
decoded prior to or at the level oí the 
filter. 

Yntema and Trask (1963) suggested 
that S can perceive both items presented 
simultaneously and can tag items accord- 
ing to either their meaning or car of 
arrival at input. The presence of the tags 
provides S with a way to systematically 
retrieve the items. With homogeneous 
stimuli, ear of arrival provides the basis 
for output organization, but with distinct 
classes of stimuli, recall by stimulus type 
is apparently preferred. It is not clear, 
however, why time of arrival is not equally 
useful. 

Contrary to Yntema and Trask (1963), 
Bryden's (1964) data show that in some 
cases meaning alone is not a sufficient 
basis for a retrieval plan. He presented 
highly associated pairs of words where ! 
word of each pair was presented to each ear. 
At fast presentation rates of 2 pairs /sec, 
he found that Ss preferred to recall in an 
ear-by-ear order and not, as might be 
expected from the findings of Yntema and 
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Trask, in the terms of the associated pairs. 
Whitmore (1972) presented words dichot- 
ically in such a way that if S reported 
items in temporal pairs, he would have 
a meaningful sentence. Without specific 
instructions to the contrary, Ss tended 
to report in an ear-by-ear order. It seems 
that Yntema and Trask's retrieval expla- 
nation is not entirely adequate, as .Ss 
appear to freely report according to mean- 
ing only when the meaningfully related 
items are not simultaneous. 

A further complication is presented by 
the work of Savin (1967) and Rollins, 
Everson, and Shurman (1972). When 2 
digits are presented simultaneously over 
a loudspeaker and do not differ in their 
physical characteristics, S presumably has 
no basis on which to channelize the in- 
formation. One would therefore expect the 
items to be recalled in pairs. However, 
Rollins et al. and Savin found that S still 
preferred to recall the items in a sequential 
order, that is, an item from the first pair, 
one from the second pair, and the remain- 
ing item from the first pair and the 
last item. 

When, instead of the 2 ears, the sensory 
modalities of audition and vision define 
the channels used, the picture becomes 
even more complex. The Ss prefer to 
recall 3 audiovisual pairs by modalities at 
fast presentation rates, and as pairs at 
slow rates (Madsen, Rollins, & Senf, 1970). 
Output by modalities also results in higher 
recall than output by pairs (Broadbent 
& Gregory, 1965). 

However, other findings indicate that 
the dichotic and bisensory situations ar 
not strictly analogous. In the bisensory 
analogue of the Yntema and Trask (1963) 
study, Madsen et al. (1970) found that 
recall by type of stimuli resulted in a low 
level of performance at both fast and slow 
presentation rates. Broadbent and Greg- 
ory (1965) also found that recall according 
to types was worse than recall by modali- 
ties. Moreover, when auditory stimuli are 
presented alternately to the 2 ears, Ss have 
no difficulty in recalling them in their 
sequential order (Moray, 1960). This find- 
ing does not hold when stimuli are presented 
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alternately to the ear and eye (Broadbent 
& Gregory, 1961). 

Finally, in contrast with auditory studies, 
an examination of the visual analogue of 
dichotic listening shows that the channel- 
by-channel report is not a preferred output 
order for recall of simultaneous stimuli. 
When separate digits are presented to each 
eye (Sampson & Spong, 1961) or 2 super- 
imposed digits are presented to both eyes 
(Schurman, Everson, & Rollins, 1972), the 
preferred recall order is in terms of tem- 
poral pairs. However, Schurman et al. 
showed that Ss were equally accurate 
with a temporal-pairs or a sequential 
recall order. 

In summary then, a channel-by-channel 
recall order is obtained with both a di- 
chotic and bisensory stimulus presentation, 
but not with a visual presentation. The 
variables of type of stimulus material, 
associative relations between items, and 
whether items on the 2 channels are 
simultaneous or successive do not have 
similar effects on recall order for the di- 
chotic and bisensory task. These findings 
suggest the need for further investigation 
of the differences between audition and 
vision as input channels. These differences 
were explored in the present experiments 
using a task that combines dichotic and 
bisensory presentation. In Experiment I, 
pairs of dichotic auditory stimuli were 
alternated with pairs of simultaneous visual 
stimuli, and S recalled either the auditory 
stimuli, the visual stimuli, or both. With 
this type of stimulus presentation, one 
would expect recall to be organized by 
modalities, with either a temporal-pairs 
organization within each modality, or no 
strong organization at all. 


EXPERIMENT | 
Method 


Design. On each trial, 8 stimulus words were 
presented as 4 successive pairs. On half of the 
trials the first and third pairs were presented 
dichotically, while the second and fourth pairs 
were visual (AVAV). On the other trials, the 
order of the word pairs was visual, auditory, visual, 
auditory (VAVA). At the end of each list, a cue 
was presented, indicating 1 of 3 recall conditions: 
recall of all of the words, only the auditory, or only. 
the visual. 


For each of these recall conditions, the postlist 
cue was auditory on half of the trials and visual 
on the other half. The 12 treatment combinations 
(obtained by factorially varying the cue modality, 
the recall instruction, and the order of the auditory 
and visual word pairs) were presented in a random 
order within a session with the restriction that 
each condition was tested once in each block of 
12 trials. There were 6 blocks of experimental 
trials, giving 6 replications of each treatment 
combination for each S. To ensure that the results 
obtained would not be limited to a specific set 
of stimuli, 2 different sets of stimulus material 
were constructed. Six Ss were tested on each set. 

Subjects. The Ss were first-year male and female 
University of Toronto students who participated 
in psychology experiments for credit. All Ss were 
native speakers of English, were aware of no hear- 
ing impairment, and had normal or corrected vision. 
. Three male and 3 female Ss were individually 
tested on each set of stimulus materials, 

Stimulus material. For each set of stimulus 
materials, words were randomly selected without 
eplacement from a pool of 1,430 1-syllable com- 
mon English words of all grammatical classes, 
- The visual words were typed on a roll of plain 
white computer paper and presented over closed- 
circuit television. To construct dichotic tapes, a 
male and a female graduate student were isolated 
in soundproof booths which contained a microphone 
and a device set by E to click at a -T5-sec. rate. 
- The speakers uttered the stimulus words in time 
with every second click, starting after 4 or 5 warn- 
ng clicks. The clicks were recorded on the tape 
to provide 5 with a signal before the start of each 

trial. The auditory and visual stimuli were syn- 

chronized by means of a Uher diapilot and a Hunter 
timer, The timer caused a video control monitor 
— to transmit the signal to S's television monitor 
for .75 sec. Except when a stimulus was being 

shown, S's screen was bright but blank, The 
postlist cue was timed to occur 1.5 sec. after the 
start of the last stimulus pair. 
j Procedure. The S was given a sheet of instruc- 
ions that described the simultaneous presentation 
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FIGURE 1. Recall of auditory and visual words 
asa function of serial position, modality order 
A = auditory, V = visual), and type of recall trial. 
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of 2 words in 1 sensory modality and tert 
tion of the 2 modalities. The single it thi 
end of each list served as a recall cu s last 
word was one, two, or three and w ented 
either auditorily or visually. The 5 w ructed: 
to ignore the modality of the cue ittend 
to the number. One signaled recall words, 
two indicated only auditory words è, the 
visual words, The S could recall ds im 
any order and was instructed to when 
uncertain. 

The S's performance was mor Y ng 12 
practice trials to ensure that the inst were 
followed. If S experienced any diffi a few 
extra practice trials were given. No 5 cjected 
for inability to do the tà After ractice 
trials, any questions pertaining to thi cedure 
were answered, and the 72 experiment s then 
followed. 

Results 


The criterion for scoring was lenient, 
and a different criterion was used for audi- 
tory and visual items. The reason for 


this was that, particularly for © :ditory 
items, there was a large number of what 
were possibly auditory perceptua! errors. 
For example, S might report bloc! or blot 
when the recorded word was ploi. In an 


earlier experiment (Penney, 1973, Experi- 
ment I) with a similar stimulus presenta- 
tion, a comparison of strict and lenient 
scoring criteria showed the pattern of re- 
sults to be the same for both, with the main 
effect of using the lenient criterion being 
the raising of the level of recall of auditory 
items to approximately that of the visual 


items. The scoring criteria used were as 
follows. An auditory item was counted 


correct if the word spoken by S had no 
more than 2 incorrect phonemes. A visual 
word was counted correct if a possible 
spelling of S's spoken recall contained no 
more than 1 incorrect, omitted, or trans- 
posed letter. For example ace would be 
accepted for age or through for though. 
Figure 1 shows the mean number of 
words recalled from the pair as a function 
of serial position and modality. The sepa- 
rate panels are for the AVAV and VAVA 
presentation orders. The 2 curves in each 
panel represent the total- and partial-recall 
trials. 
A2X2X2X2xX2X2 X 6 analysis 
of variance was conducted with the follow- 
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ing factors: modality (auditory or visual), 
serial position within a modality (first or 
second pair), type of recall trial (total or 
partial), modality of the recall cue, order 
of the auditory and visual word pairs 
(AVAV or VAVA), set of stimulus mate- 
rials, and Ss. The dependent variable was 
the proportion of words recalled out of the 
6 replications. As arc sine transformation 
was applied to these proportions (Murdock 
& Ogilvie, 1968). 

Because of the different scoring criteria 
for auditory and visual items, a direct 
comparison of auditory and visual recall 
cannot be made. However, the inter- 
actions with presentation modality are 
independent of the main effect and can be 
interpreted. Figure 1 indicates that there 
is very little difference in recall as a func- 
tion of serial position for auditory items. 
For visual stimuli, on the other hand, 
words from the second pair are generally 
recalled much. better than those from the 
first pair. The serial-position effects are 
smaller for the VAVA order, with a flat 
curve in the total-recall condition. The 
Serial Position X Modality interaction was 
significant, F (1, 5) = 20, p < .01. These 
serial-position effects were large enough to 
cause the main effect of serial position to 
be statistically significant overall, F (1, 5) 
= 8.9, p < .05. There were, however, no 
reliable differences in recall of auditory 
and visual stimuli. 

Recall of the visual words in the last 
serial position was lower in the VAVA 
condition than in the AVAV condition, 
but recall of the first pair was slightly 
better. Recall of auditory words did not 
seem to be affected by order. The Serial 
Position X Order interaction was signifi- 
cant, F (1, 5) = 63, p « .005, and the 
Serial Position X Modality X Order inter- 
action was also significant, F (1, 5) — 17, 
p < 01. 

Considerably more auditory words were 
recalled on partial-recall trials than on 
total-recall trials. For visual words, the 
effect of the type of recall trial was less 
marked, Overall, the main effect of the 
type of recall trial was highly significant, 
F (1,5) = 75, p < .005, and the Modality 
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X Type of Recall Trial interaction was 
marginally significant, F (1, 5) — 6.6, 
p «.05. 

The total-recall trials were further ex- 
amined to discover the order of recall of the 
different modalities. On 198 of 288 total- 
recall trials (68.8%), S recalled some words 
from both modalities and switched be- 
tween modalities during output only once. 
On 44 of the remaining 90 trials, ,S recalled 
words from only 1 modality. Thus, there 
is still a strong preference for modality- 
by-modality recall even when auditory and 
visual stimuli are presented alternately 
(Broadbent & Gregory, 1965). 

If only those total-recall trials on which. 
S made exactly one modality switch are 
considered, 77.3% of the time S recalled 
the visual words first. The output priority 
given to visual items on total-recall trials 
explains why there is so little difference 
for visual stimuli on total- and partial- 
recall trials. The main effect of cue 
modality was not significant, nor was any 
interaction with cue modality. 

The data of primary interest in this 
experiment are those concerning the order 
of recall of the auditory stimuli. Only 
partial-recall trials were examined. The 
first 2 words recalled were classified as 
having been presented at the same time, 
in the same place, or neither. For visual 
words, the "same-time" recall order oc- 
curred on 57.3% of the trials. A "same- 
place" output order was found on only 
10.8% of trials. For auditory words, on 
the other hand, the first 2 words recalled 
had been presented to the same ear on 
39.9% of the trials and at the same time 
on 21.2%. A sequential output order, in 
which the first 2 words recalled were 
presented to different ears and at different 
times, occurred on 18.1% of the trials. 
There is still a substantial preference for 
a sequential recall order over a temporal- 
pairs output order even when there are 
visual stimuli intervening between sub- 
sequent items presented to the same ear. 

The finding of a preference for a se- 
quential order of report in the present 
experiment would seem to pose a problem 
for Broadbent's (1958) model. A strategy of 


M 


A 


attending to only one ear is unlikely, as on 
131 of 288 auditory partial-recall trials, 
S was able to recall 3 or 4 auditory words. 
Of these trials, an ear-by-ear order of 
report occurred on 50.4%, a temporal- 
pairs order on 32.06%, and a sequential 
order on 17.56%. 

In fact, many Ss reported that they 
tried to attend to the visual stimuli during 
presentation, as they found these more 
difficult to remember, and they certainly 
recalled them first on total-recall trials. 
If the auditory stimuli actually constituted 
the rejected channel, as Ss’ introspective 
reports would suggest, the finding of an 
ear-by-ear report order is puzzling. The 
auditory items must stay in the sensory 
buffer while the visual information is passed 
through the limited-capacity system. When 
S is free to switch attention to the audi- 
tory mode, the most logical strategy would 
seem to be to process the 2 oldest items 
first. 

Itis clear that Broadbent's (1958) model, 
even in its revised form, has difficulty with 
these data. Equally, Yntema and Trask 
(1963) provided no explanation for the fact 
that Ss do not use time of arrival as a 
basis for organization. In the present ex- 
periment, time would seem to be as good 
as or better than ear of arrival as a basis 
for a retrieval plan. However, it may be 
that under different retrieval conditions, 

time of arrival could be an effective cue. 

In the second experiment, a partial- rather 

than a total-report procedure was used in 

order to see if S’s encoding and retrieval 
strategies were affected. 
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EXPERIMENT II 


Most experiments on dichotic split-span 
memory have used a whole-report 
dure and measured either the relative ef- 
ficiency of, or Ss preference for, the ear- 
by-ear or temporal-pairs output order. In 
the second experiment, a Partial-report 
technique was used, with S$ reporting only 

— d word on each trial. The probe was either 
_ the word that had occurred at the same 
_ time as the to-be-remembered (TBR) word 

A but in the other ear and voice (same- 
4 Probe) or a word that had been prese 


proce- 


time 
nted 
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either before or after the T1 ord 

the same ear and voice (same-p! probe) 
The requirement in earlier stuc that 9 
report every item on each tria y ha 
necessitated, as Yntema and 1 (1963) 
suggested, some kind of taggin orclisxag 
to ear of arrival or stimulus t orda 
for S to have an organized r« i| plz 
This tagging strategy may n neces 


sary when a probe techniq used 
When S knows in advance 
presentation whether he is t eive 
same-time or same-place prolx shou bb 
be able to encode the items dingzds— 
Therefore, one would expect : rence 
in overall recall in the 2 prolx litioaass 
The occurrence of any differen recall 
would suggest that Ss are not vs able 


to optimally encode informatio. in this 
situation. 
Method 

Design. The second experiment inv a slig h€ 
modifica ion of the stimulus-presentation, procedta 
used in Experiment I, in that the pair adito y 
and visual stimuli were presented simwulcaneous yz 


m each 
presented 


The S was still presented with 8 wor 
trial, but 4 rather than 2 words wer 
ata time. The probe word was rando: 
from the 8 words with the restriction t h 
block of 8 trials each of the 8 positions was teste ct 
once. There were 8 blocks of experimental trials 
and 2 blocks of practice trials, giving a total Of 
80 trials. All Ss were tested on both the same 
time (ST) and same place (SP) conditions, ha*f 
receiving the ST condition first and half the SIP. 
Two sets of stimulus material were used. Of those 
Ss receiving the ST condition first, half had Set A 

for this condition and Set B for the SP condition, 

while the remaining Ss had Set B for the ST con- 

dition and Set A for the SP condition. In this 

way, there was no confounding of probe conditions, 
stimulus set, and practice effects. The design J5 
therefore a fully crossed factorial with position 
tested, replications (blocks), and instruction con- 
ditions varied within Ss, and presentation order of 
the instruction conditions and of the stimulus 
materials varied between Ss. k 

Subjects. Eight male and 8 female University 
of Toronto graduate students with normal or cor 
rected vision and no known hearing defects DAS 
tested individually and were paid $4 for the 
sessions. 

Stimulus materials. Two sets of simulus ma 
terials were used. For each set, 640 words were 
tandomly selected from the word pool used "i 
Experiment I. Recording of auditory stimuli ana 
Presentation of visual stimuli were similar to P* 
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periment I. The presentation rate was 1 set of 
4 words per sec., with the actual presentation 
time of visual stimuli being .5 sec. Immediately 
following and in rhythm with the stimulus words, 
a random 1- or 2-digit number which S had to 
repeat was presented visually. This was necessary 
because a pilot study had indicated that recall of 
items in the second set of 4 words was very high. 
Following the number, the probe was presented in 
the same modality, and in the same voice and 
ear or to the same part of the screen as in the 
original presentation. 

Procedure. The S read a sheet of typed in- 
structions that described the stimulus presentation. 
Two sets of instructions were devised, one con- 
taining the ST instructions and the other the SP 
instructions. The ST instructions told S$ to recall 
the word presented in the same modality and at 
the same time as the probe word. The SP in- 
structions asked for recall of the word presented 
in the same place as the probe. It was explained 
that "same place" referred to the same ear for 
auditory words and to the same part of the screen 
for visual words. The Ss were further instructed 
that if they could not remember the required word, 
they were to report any other word from that 
trial. The S was given 16 practice trials, any 
questions about procedure were answered, and the 
64 experimental trials then followed. 

The same procedure was followed for the second 
session. When S entered the laboratory, he was 
given whichever instruction sheet he had not 
received the first day and instructed to read it 
carefully, He then received the 16 practice trials 
and 64 experimental trials, as on the first day. 


Kesulls 


'The following factors were included in 
the analysis of variance: serial position, 
modality, instructions, Ss (nested under 
the following 2 factors), order of instruc- 
tion conditions, and order of stimulus 
material. The dependent variable was the 
arc sine transformed proportion of words 
recalled out of the 8 replications (Murdock 
& Ogilvie, 1968). 

Figure 2 shows the probability of recall 
as a function of serial position, presenta- 
tion modality, and instruction condition. 
Serial position refers to the position of the 
recalled word, not to the position of the 
probe word. For both auditory and visual 
items, there is a large effect of serial posi- 
tion, with words in the second set being 
recalled at a higher level. The probabili- 
ties of recall of auditory words in the first 
and second serial positions were .473 and 
.617, respectively. For visual words, the 
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FIGURE 2. Probability of recall as a function 
of presentation modality (auditory or visual), 
serial position, and probe type. 


corresponding probabilities were .391 and 
.586. The main effect of serial position 
was significant, F (1, 3) = 13.6, p < .05, 
as was the Serial Position X Modality 
interaction, F (1, 3) = 11.0, p < .05. The 
effect of serial position was slightly larger 
for visual items than for auditory. 

The main effect of the instruction con- 
dition was not significant overall. How- 
ever, the instructions did interact signifi- 
cantly with other variables. As Figure 2 
indicates, the serial-position effects were 
quite different for the 2 instruction condi- 
tions. This Serial Position X Instructions 
interaction was confirmed by the analysis 
of variance, F (1, 3) = 12.6, p « .05. 
When SP instructions were given, the level 
of recall was only slightly better for the 
second serial position than for the first. 
With ST instructions, the difference be- 
tween recall of words in the first and 
second serial positions was much larger. 

The most interesting aspect of these 
data is the interaction between presenta- 
tion modality and instructions. Compare 
recall of auditory and visual words under 
the ST instructions. For the first serial 
position, recall is about equal for auditory 
and visual items. For the second position, 
recall is slightly higher for visual items. 
The SP instructions result in quite a dif- 
ferent picture. For both serial positions, 
recall of auditory items is clearly superior 
to that of the visual stimuli. The analysis 
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of variance indicated that the Modality 
X Instructions interaction was significant, 
F (1,3) = 18.7, p < .025. 

The direction of the Modality X Instruc- 
tions interaction is consistent with the 
hypothesis of strong associations between 
successive and auditory stimuli. The for- 
ward and backward associations between 
auditory stimuli are clearly stronger than 
the corresponding associations for the vis- 
ual stimuli. These sequential auditory as- 
sociations are so strong that, for the first 
serial position at least, recall cued by a 
backward probe is actually better than 
that when a simultaneous stimulus is used 
as a retrieval cue (Duncan's range test, 
p < .01; Edwards, 1968). 

That there is no difference in auditory 
recall for the 2 instruction conditions when 
the second serial position is tested does 
not weaken the argument. In the ST 
condition the item given as a probe was 
a recent item, whereas in the SP condition 
the probe was an earlier item. If there 
were retrieval difficulties involved in probe 
recognition, these should be greater for 
the SP probe. Recall of the tested. item 
would then be slightly lower than it would 
be if the SP probe were as available as 
the ST probe. 

The critical comparison, however, is be- 
tween auditory and visual presentation 
under the SP instructions. The large dif- 
ference found provides evidence for the 
efficacy of the sequential associations in 
auditory short-term memory. The dif- 
ference in auditory recall under the 2 in- 
struction conditions Suggests that these 
sequential associations may, under certain 
conditions, be stronger than associations 

based on simultaneity. 

The effect of the instruction conditions 
upon auditory recall becomes even more 
surprising when possible rehearsal strate- 
gies are considered. When the first 4 words 
were presented, S would probably attempt 
a quick rehearsal of all 4. Upon presenta- 
tion of the second set, S would also re- 
hearse those. If an ST Probe is given, 
S only has to retrieve the word he had 
rehearsed either immediately before or 
after the probe. However, if S knows he 


CATHERINE G. PENNEY 


is to get an SP prolx 
He cannot wait until 1 
sented and then atte: chearse 
appropriate pairs, as ! Il 
forget many items |» 
them. Also, the num! e 
appears quickly and do 
time for rehearsal. Tix 
is to rehearse each set 
sented and to then atte: 
recode into the appropri 8. 
less of which strategy .5 
condition should give sup 
does with the visual st 
that the SP condition is lx 
stimuli is evidence for th 
sequential associations. 
Errors. The Ss were ins 
| word on every trial, a 
word from the list if the 
the correct word. The er 
were classified into 4 categ 
place error—S responded 
that had been presented ir 
as the probe when he shc have givelils 
the word that occurred at th- same time 
(^) same-time error—S shou! have givellf? 
the word that had occurred in the sa 1 
place as the probe, but ga instead tht 
word that had occurred at same timed | 
(c) same-modality error—S responded wi 
a word in the correct modal 
had been presented neither in the sa 
place nor at the same time as the prox 
word; and (d) miscellaneous—omissions 
and all other errors. The proportion @ 
each kind of error was tabulated, and f 
results are shown in Table 1. Proportion)” 
are reported rather than írequencies 99 
cause of the different number of errors ™ 
the different conditions. When recall 
auditory items was tested, there was į 
larger proportion of SP errors when an 
Probe was given than of ST errors whe 
an SP probe was given. The number 9. 
same-modality errors did not differ à$ 
function of instructions. When a visu 
word was tested, there was a sligh 
greater proportion of ST errors. The 
was also a larger proportion of Same 
modality errors with the ST than with 
instructions, 


s it is pr 


or auditor 
igth of 
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The data on the types of errors made 
are consistent with the accuracy data in 
that they provide evidence for the hy- 
pothesized sequential associations in audi- 
tory short-term memory. If S knew he 
could not recall the correct word, one 
would expect him to report at random 
any word he could remember. Overall, 
the proportion of ST and SP errors should 
be approximately the same. The greater 
number of SP than ST errors and the 
high frequency of the same-modality er- 
rors suggest that it is somehow easier to 
retrieve an item connected to the probe 
by means of a forward or backward asso- 
ciation than an item that occurred at the 
same time as the probe. 

The occurrence of the sequential asso- 
ciations in auditory short-term memory 
cannot be regarded as resulting solely from 
a conscious decision on the part of S to 
process the stimuli in this way. In the 
present experiment, Ss knew well in ad- 
vance which type of probe they would 
get. They served in only 1 condition per 
day and had several practice trials before 
the experiment started. This should have 
enabled them to encode the stimuli ap- 
propriately. In spite of their foreknowl- 
edge of the probe condition and the 
rehearsal opportunities in this task, the SP 
probe was more effective for at least the 
first serial position. This suggests that in 
the time allowed to them, Ss found it 
difficult to encode the 2 simultaneous audi- 
tory items as a pair, and the sequential 
associations predominated. 


DISCUSSION 


The results of the present experiments, 
which used a bisensory-dichotic presentation, 
extend the findings of earlier studies that 
employed either the bisensory or dichotic 
presentation alone or a visual presentation of 
2 simultaneous items. The Ss clearly pre- 
ferred to recall simultaneous or successive 
auditory and visual stimuli separately by 
modality. Two simultaneous visual stimuli 
are recalled as a pair, but sequences of di- 
chotic auditory stimuli are recalled as 2 se- 
quences, not in pairs. The sequential recall 
of dichotic stimuli occurs even when visual 
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‘TABLE I 
PROPORTION OF DIFFERENT TYPES OF 
ERRORS IN EXPERIMENT II 


I 


Presentation modality 


Type of error 


Auditory Visual 
Same time instructioi 
Same place .304 .281 
Same modality 467 302 
Miscellaneous .229 


Same place instructions 


.235 


Same time | 
Same modality A51 
Miscellaneous 


| 314 


stimuli intervene between successive auditory 
stimuli. When a probe technique is used, the 
SP condition, which tested memory of a for- 
ward or backward association, was found to 
give higher recall for auditory than for visual 
presentation. For the ST condition, the dif- 
ference between auditory and visual presenta- 
tion was smaller and in the reverse direction. 

The results of the 2 experiments reported 
also have implications for a theory of auditory 
and visual short-term memory. Recently, 
a number of investigators have hypothesized 
the existence of 2 separate short-term memory 
stores, one for auditory items and another 
containing information in verbal format (e.g., 
Crowder & Morton, 1969), An alternative 
form of the hypothesis is that the 2 stores 
are modality specific, one containing informa- 
tion about auditory items and the other con- 
taining visual information (e.g, Murdock & 
Walker, 1969). 

The present data do not support one of 
these alternative hypotheses over the other. 
They do, however, suggest an explanation of 
one of the more puzzling findings in the area 
of auditory-visual differences, In a serial- 
probe experiment with lists containing both 
auditory and visual items, Murdock (1967) 
found that the best performance was obtained 
when both the probe and the TBR word 
were auditory. When either or both were 
visual, recall was considerably lower. The 
results of the present experiments indicate 
that successive auditory stimuli are rather 
strongly associated, and that this occurs 
whether or not S tries to encode in this 
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manner. When either or both of the probe 
and TBR word are visual, this strong asso- 
ciation is absent and recall is lower. Thus, 
the hypothesis of sequential associations be- 
tween successive auditory stimuli, in addition 
to accounting for the dichotic and bisensory 
split-span data, would appear to be useful in 
integrating within a unified theoretical frame- 
work the results of experiments on auditory 
and visual presentation in short-term memory 
and those on selective attention. 
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TY, AND STIMULUS ENCODING? 


ROBERT E. WARREN * 


Columbia University 


The degree of interference with color 


carrier for the color in a discrete-trial Stroop task 


associatively related word was concu 


naming produced by the word used 
1s found to increase if an 
rrently held in memory. The degree of 


interference varied directly with the associative strength of the connection 


between the memory word and the color word. 


unidirectional associates showed that 
only if the word being held in memory 


A second experiment using 
interference with color naming increased 
elicited the color word and not vice versa. 


It is suggested that the results support the hypothesis that activation of the 
memory representations of a word's associates is part of its encoding process. 


Attempts to determine the structure of 
memory have recently centered on the 
process of stimulus encoding. Encoding 
as a general term includes all those trans- 
formations performed on the nominal stim- 
ulus before it assumes its final representa- 
tion in memory. These transformations 
may theoretically include both additions 
to and loss of stimulus information. En- 
coding is of interest to the memory theorist 
because of its presumed relation to the 
structural properties of memory. It has 
been argued (Wickens, 1970) that the 
transformations employed in encoding are 
a function of the structural characteristics 
of memory. Thus, if one could specify 
what factors are involved in the initial 
encoding of words, this information would 
allow the inference of verbal memory 
structure. 

Various models of memory have sug- 
gested different conceptions of the encoding 
process. Bower (1967) has proposed that 
encoding involves the specification of val- 
ues along semantic dimensions and that 
words are represented in memory as groups 
of these values. Similarly, Conrad (1967) 
and others (e.g., Yntema & Trask, 1963) 
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have suggested that encoding involves the 
amalgamation of particular components or 
the tagging of a central word unit to in- 
dicate specific characteristics. Operation- 
ally, however, the nature of encoding has 
often been assessed in terms of S's reaction 
to some characteristic of “words as wholes.” 
The proactive interference paradigm used 
by Wickens (1970), for example, appears 
to rely on binary decisions regarding the 
list membership of individual items (Loess 
& Waugh, 1967). Similarly, false recogni- 
tion studies (Underwood, 1965) utilize the 
identification of individual words as old 
or new as an indication of previous en- 
coding patterns. 

The use of whole-word measures is often 
based on the theoretical position that 
specification of a particular component or 
dimensional value affects the representa- 
tions in memory of not only the presented 
stimulus, but also of those words sharing 
the characteristic in question. In the false 
recognition paradigm, for example, pre- 
sentation of the word ICE is said to produce 
implicit activation of words sharing a com- 
monsense impression such as COLD (Kim- 
ble, 1968). This position suggests that 
one way to map out the extent and degree 
of encoding is to present a word and then 
test the activation of other words which 
bear some nominal relation to the pre- 
sented item. If the words are found to 
be activated, the common nominal rela- 
tion could be assumed to be part of the 
encoding process, 

Changes in performance on a number 
of tasks as a function of word activation 
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can be predicted from a system of word 
recognition units or logogens proposed by 
Morton (1970). The logogen system as 
à whole combines sensory, contextual, and 
semantic information to identify which 
word has been encountered in any given 
situation. The system relies on an initial 
analysis of sensory information into com- 
ponents which are then routed to logogens 
which have been activated by similar in- 
formation in the past. One logogen is as- 
sumed to exist for each word in a person's 
vocabulary, and each is composed of the 
following 3 elements: (a) a counter that 
increases in value each time stimulus in- 
formation that corresponds to the word 
the logogen represents is encountered, (b) 
à motor program that represents the verbal 
production of the word, and (c) a threshold 
value that is fixed by the past frequency 
of occurrence of the word. Upon presenta- 
tion of a word, counter values at logogens 
which correspond in whole or part to the 
physical representation of the word begin 
to increase as components are identified 
by the sensory analysis system. Counters 
at those logogens which correspond best 
to the actual word given increase in value 
at the greatest rate. When one of the 
logogens' counter values exceeds its thresh- 
old, its motor program is said to become 
available and is loaded into a response 
buffer by a central processing unit where 
it is held for output. Only one program 
can pass to the buffer at a time, so that 
if one program becomes available before 
another, the second will be delayed until 
the buffer is cleared, 
Presentation of a word, then, 
what might be termed primary activation 
of a logogen. Besides reacting to sensory 
input, logogens can theoretically also be 
activated indirectly, as suggested earlier, 
when words sharing some common seman- 
tic feature or other relation are presented, 
This secondary activation is produced by 
what Morton (1970) termed the cognitive 
system. Secondary activation produces 
counter increments at related logogens, 
but these increments are thought to be of 
lesser magnitude and to begin longer after 
Word presentation than those produced by 


produces 


ROBERT E. WARREN 


primary activation at thi 
word actually encounter: 
In the present experiri 
with color naming in a dis 
(1938) task was used to in: 
of activation of the 
In the Stroop task, S is 
the color of ink in which a: 
while attempting to ignor 
In the traditional Stroop : 
items printed in colors ar 
antagonistic color words 
word RED would appear i: 
word YELLOW in red ink, a: 
typical finding is that it t 
to name the colors when 
are antagonistic color nanm 
they are rows of Xs (Jense: 
1966). In an extension of S 
Klein (1964) has shown that 
latency is also increased ove: 
trol if common words unrela 
are used as base items in the t 
For the logogen system, in 


wor 


erence in 


the Stroop situation occurs because stimu- 
lus information activating 2 :ogens 18 
present, i.e., those of the base iton and thë 
ink color name. If the motor piogram for 


vefore the 
he produc- 


the base item becomes available 
program for the ink color name, 
tion of the color name will be delayed be: 
cause of the response-buffer space limita- 
tions mentioned earlier. The greater the 
level of activation of the logogen for the 
base word prior to stimulus presentation, 
the more likely it is that this will occur. 
This means that any operation that in- 
creases the base word's level of activation 
prior to its presentation will produce alt 
increase in color-naming latency. It follows 
that if secondary activation occurs during 
the encoding of a word, the degree and pat- 
tern of activation can be assessed by pre- 
senting other words as base items in à 
Stroop task subsequent to presentation. of 
the word in question and ol serving whether 
or not color-naming delays are produced. 
Previous work using this technique 
(Warren, 1972) has shown that the name 
of the taxonomic category from which 4 
triad of words is drawn, as well as any 
of the words themselves, produces an it 
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crease in color-naming latency when used 
as the base word in a Stroop task given 
subsequent to the presentation of the triad. 
Experiment I tested whether words which 
appear as responses in association norms 
produce similar increases in color-naming 
latencies when used as color base words 
after their associative stimuli have been 
encountered by S. 

To review the argument presented so 
far, color-naming latency in the Stroop 
task is thought to vary with the degree 
of activation of the logogen of the word 
used as a carrier for the color. If the 
level of activation of Word B's logogen is 
somehow increased during the presenta- 
tion of Word A, the degree of interference 
with color naming produced when Word B 
is used as a base item in a subsequent 
Stroop task should also increase. Experi- 
ment I tested whether such an increase 
in activation occurs. when Word A and 
Word B are related associatively. That is, 
the following 2 questions are posed: (a) 
Does the associative relationship indicated 
by standard association norms parallel an 
underlying pattern of activation in the 
logogen system such that when a stimulus, 
Word A, is encountered the logogen of its 
primary response, Word B, is activated? 
(b) Does the frequency of a response, 
Word B, to a stimulus, Word A, in an 
association test reflect the degree or prob- 
ability of activation of the logogen for 
Word B when Word A is given? 


EXPERIMENT [| 
Method. 


Subjects. Five male and 5 female students at 
the University of Oregon were run individually for 
a single 45-min. session, They were paid $1.50 for 
their participation. 

Procedure, The Ss were seated at a 2-channel 
tachistoscope used to present the visual stimuli 
in the experiment. Prerecorded auditory stimuli 
were presented to Ss over earphones, 

Each trial in the experiment consisted of 3 phases. 
In order of occurrence, these were (a) auditory 
word presentation, (b) visual word presentation 
and color naming, and (c) recall of the auditory 
word. In the first phase, the word READY was 
presented over the earphones followed by a pre- 
recorded auditory word. 
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Immediately after the presentation of the audi- 
tory word, a word printed in colored ink and 
centered on a white field was shown in the ta- 
chistoscope. The word remained visible until § 
had named aloud the color of ink in which it was. 
printed. Naming the color activated a voice- 
operated relay which terminated the visual display 
and stopped a Hunter millisecond clock that had 
been started at the onset of the visual word. The 
Ss were instructed to name the color of ink in 
which a word was printed as rapidly as possible | 
and to ignore the word itself. 

In the final phase, which followed color naming. 
directly, S recalled aloud the word given auditorily 
in the first phase. 

Both color-naming latency in milliseconds from 
the onset of the visual item and recall accuracy 
were recorded. There was a 15-sec. interval be- 
tween trial onsets. 

Design and materials. Thirty pairs of associated 
words were selected from Palermo and Jenkins 
(1964). The word pairs and their associative 
strengths appear in Table 1. Three groups of 10 
pairs each were formed with mean associative 
strengths of .630 (high associative strength), .334 
(medium associative strength), and .152 (low 
sociative strength). Associative pairs were chosen 
so that the range and average frequency of occur- 
rence in English (Kučera & Francis, 1967) of the 
response members of the 3 groups were approxi- 
mately equal. 

During the experiment, the logogen activation. 
of the response words was tested by presenting 
them in the color-naming phase of a trial, Inter- 
ference with color naming produced by each re- 
sponse word was assessed under 2 conditions: 
(a) after the stimulus to which it was a primary 
associative response had been given auditorily (test 
condition) and (b) after auditory presentation of 
an unrelated word (control condition), For exam- 
ple, the response word GIRL was preceded in the 
test condition by the auditory word mov, while in 
the control condition it was preceded by the audi- 
tory word case. The unrelated words used as 
auditory stimuli in the control condition are 
shown in Table 1 matched with the assigned 
response word. 

The experiment thus consisted of 6 conditions, 
resulting .from test and control assessment. of the 
color-naming interference produced by each of the 
response words in the high, medium, and low 
associative strength groups. 

Auditory stimuli in the experiment were pre- 
recorded and presented over earphones. Words 
presented visually were printed in colored ink in 
lowercase letters and subtended an average visual 
angle of less than 3°. The 5 colors of ink used 
were red, blue, yellow, green, and purple. Five 
decks of stimulus cards were made up. In each 
deck, one-fifth of the responses in each associative 
strength group was printed in each color. Across 
the 5 decks, each response word appeared in each 
color. Within a single deck, a response word was 
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TABLE 1 
AUDITORY AND VISUAL STIMULI: EXPERIMENT Í 


: Auditory al | Asocia- | Auditory 
Group em | sinning BRUN control 
| | | 
| | 
High | LAMP LIGHT 706 ADD 
BOY GIRL | .704 | CASE 
SCISSORS | CUT ,078 | BOOK 
KING QUEEN 651 | roop 
SLOW FAST | 634 | ART 
BLOSSOM is | .629 | HOT 
577. | TIGE 
S576 | COFFEE 
«575 | ACTIVE 
.S68 | QUESTION 
Medium | TOBACCO A82 | povere 
CHAIR 28 | HAPPY 
DOORS WINDOWS | .358 | JACKET 
BATH | CLEAN | 354 | LAW 
BIBLE | GoD | 316 | sHoRT 
SQUARE | ROUND |. ..315 | HOME 
SOFT | HARD „304 | RICH 
CRY | BABY | .284 | THREAD 
SPEAK | TALK | .252 | army 
SMOOTH | ROUGH | .243 | LosE 
Low envy TOWN 332. | BREAK 
PRIEST | CHURCH | 225 
GUNS SHOOT 
WISH | DREAM 
EARTH. DIRT | 
PEOPLE | CROWD | 
STREET | ROAD 
SALTY SWEET 
WHISTLE | TRAIN | 
: CARRY HOLD 


presented in the same color on both its test and 
control condition appearances. Each deck was 
pes with one-fifth of the Ss. 
rials were ordered and presented in 2 35-trial 
blocks, Blocks A and B, with a 2-min. break be- 
tween blocks. Each block began with 5 filler 
trials followed by 30 experimental trials. Within 
each Consecutive set of 6 experimental trials, 1 trial 
representing each condition was given. Colors were 
no color was repeated on con- 


222 


“assigned so that 
secutive trials, 
All of the res; 


ponse e 
block. Half ae words were shown in each 


the response words in each asso- 


TABLE 2 


MEAN COLOR-NAMING LATENCIES (IN MSEC): 
EXPERIMENT 1 í 


Associative strength group 


Condition 
pe High Medium Low 
È Test 929 906 
: Control 834 856 Hn 
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ciative strength group were prese the t 
condition and half in the 


contr 


Block A. The words were ther ged a 
presented in the opposite condit Block 
Half of the Ss received Block nd half, 
Block B first. Prior to the beg the ex: 


nples 
of row 


perimental trials, Ss were show 
each of the 5 colors being used ir 
of 5 lowercase Xs printed in colo 


Results 

Recall of the auditory wor «s good 
with only 1.5% of the poss emory: 
errors occurring (9 out of 6 »ssible). 
All but 1 of these errors s to be 
acoustic in nature, e.g., recall AD for 
THREAD. The 1 remaining er is the 
intrusion of the color word s! on the 
same trial. Because correct ||. per- 
ception of the auditory word : not be 


errors 
| from 


assumed for trials on which me 
occurred, these trials were ex: 
further analysis. 
Color-naming accuracy was 
with errors occurring on only 1 
trials given (8 out of 600). Coi 
errors consisted of calling out 
color name (4/8), or premature! 
the voice key with a hesitation «^ 
as "ah" or "er" (4/8). These i 
also discarded. 7 
Mean color-naming latencies in milli- 
seconds for each of the 6 experimental 
conditions were determined for cach S. 
Overall means are shown in Table 2. 
A repeated measures analysis of variance 
was performed on the data with Ss, con- 
dition (test, control), and associative 
strength (high, medium, and low) as 
factors. The analysis showed a significant 
main effect for condition, F (1, 9) = 15.44 
b < .005, and a significant Condition X 
Associative Strength interaction, F (2, 18) 
= 6.13, p < .01. The main effect for as- 
sociative strength did not approach sig- 
nificance, F (2, 18) = 2.30, p > .05. 
Winer's (1962) method for a priori com- 
parisons of treatment totals showed laten- 
cies in the high-test condition to be greater 
than in the high-control condition (P < 
001). Similarly, medium-test latencies 
were greater than medium-control laten- 
cies (^ < .025). Latencies in the low-test 
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and low-control conditions did not differ 
significantly (p > .05). 

In addition to the analysis of variance 
on within-Ss means, mean color-naming 
latenciés across Ss for each stimulus- 
response pair were determined for both 
test and control conditions. The ratio of 
the test to the control mean was then de- 
termined for each pair and the correlation 
between these values and the correspond- 
ing associative strengths of the word pairs 
was calculated. A positive correlation was 
found, r = .41, p « .05. 

The measure of associative relation used 
to form the associative strength groups in 
the experiment was that of forward asso- 
ciative strength, i.e., the frequency with 
which the auditory word elicted the color 
word as a response. Backward associative 
strength was not controlled for, and exami- 
nation of bidirectional association. norms 
(Keppel & Strand, 1970) shows that a 
positive correlation existed between the 
forward and backward associative strengths 
of the pairs used, r = .36, p < .05. To 
examine the influence of this backward 
associative relation on color-naming laten- 
cies, a partial correlation analysis was done 
on the test-control ratios and the 2 sets 
of association values for the word pairs in 
the experiment. The analysis showed the 
partial correlation of the interference ratios 
with forward associative strength to be 
significant, r = .36, p < .05, while that 
with backward associative strength was 
not, r-= 12; p > 05: 


Discussion 


The findings of Experiment I appear to 
confirm that secondary activation of words 
given frequently as responses in association 
tests does occur during the encoding of the 
appropriate stimulus word. The degree or 
probability of activation seems to follow quite 
closely the frequency with which the word 
is given as a response. The greater the re- 
sponse frequency, the larger the interference 
with color naming produced in the test 
condition. 

The word pairs in Experiment I were chosen 
without regard to the type of associative 
relation between the words. That is, associa- 
tions between synonyms (SPEAK-TALK), anto- 


nyms (sMOOTH-ROUGH), and words joined by 
continguity or function (BED-SLEEP) were all 
included. It is probably the case that these 
different associative types reflect different 
patterns of activation. Differences of this 
sort have been found in several studies of 
false recognition (e.g, Grossman & Eagle, 
1970). It should also be noted that for all 
of the pairs of associates used, only primary 
responses were selected, There is some indi- 
cation from other studies (e.g., Cramer, 1969) 
that nonprimary responses do not function in 
the same manner as primaries. 

It has been assumed that it is secondary 
activation taking place during the presenta- 
tion of the auditory word in Experiment I 
that is producing the subsequent interference 
in the color-naming phase. Since the second- 
ary activation produced by the color word 
does not become effective until sometime after 
its presentation, any activation of the recall- 
word logogen is delayed and unlikely to in- 
terfere with color naming. This view is sup- 
ported by the partial correlation results. 
Because of the critical nature of the assump- 
tion, however, a second experiment which 
tested it directly was carried out. In Ex- 
periment II, word pairs with strong unidirec- 
tional associative relations (AB but B-A) 
were selected and the pair member which 
strongly elicited its mate was assigned to 
either the recall-word or color-word role in 
trials similar to those used in Experiment I. 


EXPERIMENT lI 
Method. 


Subjects. ‘Twelve male and 12 female Ss re- 
cruited from the staff and undergraduate popula- 
tions of Columbia University were run individually 
for a single J-hr. session. They were paid $1.75 
for their participation. 

Procedure, The Ss were seated in front of a rear 
projection screen on which all the stimuli in the 
experiment were shown. A Gerbrands 2-channel 
projection tachistoscope was used to present the 
stimulus material. 

Each trial in the experiment consisted of 3 phases. 
In order of occurrence, these were (a) recall-word 
presentation, (b) color-word presentation and color 
naming, and (c) recall of the word shown in the 
first phase. 

The word given for later recall was printed in 
white capital letters on a black background and 
shown for 800 msec. ‘This exposure was followed 
by a dark field which lasted 1,000 msec. At the 
end of this interval, a word printed in colored 
lowercase letters on a black background was shown 
and remained visible until S had named aloud the 


color in which it appeared. Naming the color 


activated a voice operated relay which terminated 
the visual display and stopped a Gerbands milli- 
second clock that had been started at the onset 
of the colored word. The Ss were instructed to 
name the color as quickly as possible and to ignore 
the word itself. After naming the color, S recalled 
aloud the word shown in white during the first 
phase of the trial. 

Color-naming latency in milliseconds from the 
onset of the colored word and recall accuracy 
were recorded. There was a 10-sec. interval be- 
tween trial onsets. 

Design and materials. Two sets of 12 associa- 
tively related word pairs were chosen from the 
bidirectional association norms collected by Keppel 
and Strand (1970). In both groups, one member 
of the pair elicited the other as a response, while 
the opposite relation was weak or nonexistent. 
For one of these groups (forward association 
group), the pair member which elicited its mate 
was assigned to be a recall word during the experi- 
mental trials while the other member of the pair 
was assigned to appear as a color word. For the 
other group (backward association group), the 
words were assigned to opposite roles. For the 
forward association group, the average associative 
strength for the reeall-word-color-word relation 
was .397 and for the color-word-recall-word rela- 
tion, .020. For the backward association group, 
these same values were .021 and .404, respectively. 
"The sets of word pairs are shown in Table 3, along 
with the individual pair associative strengths. The 
range and average frequency of occurrence in 
English (Kučera & Francis, 1967) of the pair 
members assigned to the color-word role in the 
2 groups were approximately equal. 


TABLE 3 
STIMULI: EXPERIMENT II 


edd Td S 

ecal olor strengt! 

Group word word San ok 

(A) (B) ue 
AB | BoA 

Forward | xirrens | cars 23 | 02 |van —— 
BLOSSOM | FLOWERS | .640 011 STRAL 
EAGLE BIRD 005 | COLDER 
DREAM | SLEEP .082 | GRASS 
EASIER | HARDER 4049 | winpows 
BATH CLEAN .000 |suoor 
WHISKEY | DRINK -011 |DiRT 
MUTTON | LAMB .000 | wes 
MOON STAR. 011 | CHAIR. 
BROADER | WIDER -016 | NURSE 
STOVE HOT 016 | THREAD 
PRIEST CHURCH 016 | PEPPER 

Backward | APPLE FRUIT 454 | TIGER 
FOOT HAND 235 | SONG 
SWEET BITTER 5440 | uir. 
LAW JUSTICE. 3243 | QUESTION 
SOFT Loup 423 | sky 
LIGHT LAMP -106 | ORDER 
FAST SWIFT 4452 | BABY 
GOD BIBLE 316 | ALTHOUGH 
HOUSE COTTAGE | .005 | .264 | HIGH 
WATER | OCEAN 005 :362 | WAR 
DOWN SIT 000 | .271 | BECAUSE 
SMOKE | TOBACCO | .000 | .482 | QUEEN 
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The logogen activation of the color words 
tested by measuring color-naming latency 
Experiment I.  Color-naming latency for 
word was determined under the following 2 co 
tions: (a) after presentation of the other men 
of the associative pair as the word to be rec: 
in the first phase of a trial (test condition) 
(b) after pre of an unrelated word 
Phase 1 (control condition). For exampl 
word STAR from the forward association grou; 
tested after its associate MOON in the test condi 
and after CHAIR in the control condition. Simi! 
Loup, from the backward association grou; 
tested after sorr in the test condition and 
SKY in the control condition. The unrelated v 
presented as recall words in the control con 
are shown in Table 3 matched with the appro; 
associative pair. 

The experiment, then, consisted of 4 conditi 
resulting from assessment of the color-naming 1 
cies produced by the color words from the forw 
and backward association groups under test 
control conditions. 

Recall words presented in the first phase < 
trial and the colored words presented in the sec 
phase subtended average visual angles of 2 
The 4 colors used for the color words were : 
purple, yellow, and green. Four decks of color-y 
slides were prepared. In each deck, one-fourth 
the words in each experimental condition appea 
in each color. Across the 4 decks, each word 
peared in each color. Within a single dec! 
was presented in the same color on both 
control appearances, Each deck was used for o: 
fourth of the Ss. 

Trials were ordered and presented in 2 28-trial 
blocks, Blocks A and B, with a 1-min. break bc- 
tween blocks. Each block began with 4 filler trials 
followed by 24 experimental trials. Within each 
consecutive set of 4 trials, 1 trial from each con 
dition was given. Colors were assigned so that no 
color was repeated on consecutive trials. 

All of the color words were shown in each block 
Half of the color words in the forward and bac 
ward groups were presented in the test condition 
and half in the control condition in Block A, The 
words were presented in the opposite condition in 
Block B. Half of the Ss received Block A first 
and half, Block B first. Prior to the beginning of 
the first block, Ss were shown 2 examples of each 
of the 4 colors being used. Colors were presented 
in the form of rows of 5 lowercase Xs printed on 
a black background. 


Results 


Retention of the recall words was quite 
good, with only 1.095 of the possible 
memory errors occurring (11 out of 1,152). 
The following 4 types of memory errors 
were noted: (a) omissions (4/11), (b) in- 
trusion of the color word from the same 
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TABLE 4 


MEAN COLOR-NAMING LATENCIES (IN MSEC.): 
EXPERIMENT IT 


Associative direction group 


Condition n 
Forward | Backward 

Test 964 | 930 

Control 892 | 930 


trial (4/11), (c) intrusion of a recall word 
from a previous trial (2/11), and (d) in- 
trusion of a color word from a previous 
trial (1/11). No particular pattern of 
errors was noted across the 4 conditions. 
Since correct initial perception of the recall 
word could not be assumed for trials on 
which memory errors occurred, these trials 
were discarded and not used in subsequent 
analyses. 

Color-naming accuracy was also good 
with errors occurring on 1.7% of the trials 
given (20 out of 1,152). The following 
6 types of color-naming errors were noted: 
(a) naming the wrong color (14/20), (b) 
naming the color word itself (2/20), (c) 
prematurely tripping the voice key with 
a hesitation sound such as “uh” or "er" 
(1/20), (d) naming the color correctly but 
too quietly to trip the voice key (1/20), 
(e) calling out the recall word for that 
trial (1/20), and (f) producing no response 
before the onset of the next trial (1/20). 
The 2 Type b errors occurred on trials in 
the forward-test condition; the single Type 
e error occurred in the backward-test con- 
dition. Trials on which color-naming er- 
rors occurred were also discarded. 

Mean color-naming latencies in milli- 
seconds for each of the 4 conditions were 
determined for each .S. Overall means are 
shown in Table 4. A repeated measures 
analysis of variance was done on the data 
with subjects, condition (test, control), 
and associative direction (forward, back- 
ward) as factors. The analysis showed a 
significant main effect for condition, 
F (1, 23) = 8.48, p « .01, and a signifi- 
cant Condition X Associative Direction in- 
teraction, F (1, 23) = 5.66, p < .05. The 
main effect for associative direction was 
not significant, F (1, 23) < 1.0, p > .05. 


Winer's (1962) method for a priori com- 
parisons of treatment totals showed laten- 
cies in the forward-test condition to be 
greater than in the forward-control condi- 
tion (p < .001). The comparison of back- 
ward-test and  backward-control totals 
showed no significant difference (p » .05). 


Discussion 


The results of Experiment II appear to 
support the notion that the color-naming 
interference that occurs in this paradigm is 
due to activation of the color base word 
during the encoding of the recall word. The 
alternative hypothesis, that the encoding of 
the color word activates the recall word which 
then interferes with color naming, does not 
seem to be supported. Both the presence of 
the forward-test and forward-control latency 
difference and the absence of any difference 
between the backward-test and backward- 
control latencies contribute to this conclusion. 

The lack of any interference in the back- 
ward associative conditions does not mean 
that the logogen for the recall word is not 
activated after color-word ^ presentation. 
Rather, it indicates that the time course of 
activation is such that the color name be- 
comes available and is loaded into the re- 
sponse buffer before the logogen for the recall 
word passes its availability threshold. Inter- 
ference in the color-naming task can occur 
only when a motor program other than that 
for the color name is loaded in the buffer 
first. This indicates, as suggested earlier, 
that secondary or semantic activation does 
not match in degree or latency the activation 
caused by sensory input. 

Several questions remain to be explored in 
future research. It is likely that the different 
types of relations obtained in an association 
test result from quite different underlying 
semantic processes. The patterns of activa- 
tion which parallel these processes may vary 
both in extent and what might be termed 
rise time or latency. Experimental variation 
of semantic relation and of the interval be- 
tween word presentation and the test for 
activation may help tease out these encoding 
patterns. The problem of whether the second- 
ary activation function is set into motion by 
any amount of logogen activation or whether 
it only takes place once a logogen passes 
threshold is also unanswered. There are some 
indications from other paradigms (e.g., Lewis, 
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1970) that this latter step may indeed not be 
necessary. 
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PRONOUNCEABILITY AS AN EXPLANATION OF THE DIFFERENCE 


| BETWEEN WORD AND NONSENSE ANAGRAMS! 
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gram class. It was concluded that a: 


can be 


An anagram problem involves presenting S with 
a letter sequence and asking him to discover the 
word which can be made by rearranging the letters. 
‘The anagram presented is typically a nonsense 
| string (e.g., given SLACH, find CLASH) but can itself 
be a word (e.g., given SAUCE, find CAU: Previous 
research has demonstrated that word anagrams are 
more difficult to solve and that the difference be- 
iween word and nonsense anagrams can neither be 
explained in terms of differences in the frequencies 
of the bigrams constituting the anagrams nor be 
dismissed as a consequence of methodological biases 
(Ekstrand & Dominowski, 1968). 

In attempting to explain the difference between 
word and nonsense anagrams, the most obvious 
approach is to consider the fact that, by definition, 
word anagrams have denotative meaning and non- 
sense anagrams do not. However, an alternative, 
more general, explanation has been proposed. Two 
studies using only nonsense anagrams have demon- 
strated that the easier an anagram is to pronounce, 
the harder it is to solve (Dominowski, 1969; Hebert 
& Rogers, 1966). The proposal that, on the 
average, word anagrams will be easier to pronounce 
than nonsense anagrams and will be more difficult 
to solve for that reason. The present study pro- 
vided a test of this hypothesis. 

In order to use a large number of problems, no 
attempt was made to strictly equate word and non- 
sense anagrams in terms of pronounceability. The 
rationale was that, in order to test the hypothesi: 
all that was needed was reasonably wide varia 
in pronounceability both among word anagrams and 
among nonsense anagrams, and overlapping dis- 
tributions. If the difference between word and 
d nonsense anagrams is to be explained in terms of 
| pronounceability, then it must be true not only that 
I 


the difference between word and nonsense anagrams 


i This article was completed while R. L. Dominowski was a 
research fellow at the University of Aberdeen, Scotland. The 
authors are grateful to Sheldon Rosenberg for his help in the 
measurement of pronunciation times. 

? Requests for reprints should be sent to Roger L. Dominow- 
ski, Psychology Department, University of Illinois at Chicago 
Circle, Chicago, Illinois 60680. 


The present study involved a test of the hypothesis that differences in pro- 
nounceability can account for the finding that word anagrams are more 
difficult to solve than nonsense anagrams. 
of each type, indicated that word anagrams were more difficult when pro- 
nounceability was ignored but that the difference between anagram types was 
eliminated when pronounceability was statistically controlled. 
pronounceability was strongly related to problem difficulty 


The results, based on 48 anagrams 


In addition, 
hin each ana- 
ssing the ease with which an anagram 


verballv encoded eliminates the need to determine if it has meaning. 


is eliminated when pronounceability is (statistically) 
controlled but also that pronounceability affects 
problem difficulty within each anagram class, 

Method. Using the Olson and Schwartz (1967) 
tables, 48 word anagrams and 48 nonsense anagrams 
were constructed. The solutions to the nonsense 
anagrams were chosen to be similar in language fre- 
quency to the solutions of the word anagrams, By 
using the letter orders of the word anagrams to 
construct the nonsense anagrams, the number of 
letter moves required to reach solution (Dominowski, 
1966) was equated for the 2 types of anagrams. 
For each of the 96 anagrams, the mean bigram fre- 
quency was caleulated, using the Mayzner and 
"Tresselt. (1965) norms. 

For each problem, 3 additional measures were 
obtained, using a total of 103 Ss all of whom were 
introductory psychology students fulfilling a course 
requirement. To allow a check on the comparability 
of the solutions to the 2 types of problems, the 96 
solutions were presented, in random order, to 28 Ss 
who were asked to judge how often each word 
appears in print, using a magnitude estimation 
procedure devised by Underwood (1966). 

To measure anagram pronounceability, the 96 
anagrams were presented one at a time to each of 
23 Ss, with S instructed to pronounce each item as 
though it were an English word. lt was pointed 
out that some of the items would be common words 
and some would not, but that .5 was to pronounce 
every item as though it were a word. Tape record- 
ings of responses were made, and the tapes were 
subsequently run through a Grason-Stadler Model 
E7300A-1 voice operated relay which triggered 
Hunter Model A Series D Klockounters. The time 
taken for each pronunciation was measured in 
milliseconds. 

Problem difficulty was determined by presenting 
the 96 anagrams to 52 Ss, run in 3 small groups. 
The Ss were told what an anagram is, shown 
examples of word and nonsense anagrams, and told 
that for each problem their task was to find the 
word which could be made by rearranging the 5 
letters presented. The 96 problems were presented 
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on a screen in random order. For each problem Ss 
were given 30 sec. to solve and were instructed to 
mark an X in the answer space if they had not 
solved a problem at the end of the 30-sec. period. 
Results and discussion, The analyses utilized 
problems (N — 96) as the replicate. For each 
problem, the following measures were available: 
number of solutions (possible range 0-52), type of 
anagram (word vs. nonsense), anagram pronuncia- 
tion time, judged familiarity of the solution, and 
anagram bigram frequency. Using number of 
solutions as the criterion, a multiple-regression 


equation was determined with the other 4 variables - 


serving as predictors. In this analysis, significant 
contributions were made by only 2 predictors, ana- 
gram pronunciation time, 8 = .64, p < .001, and 
solution familiarity, 8 — .16, p < .05. These 2 
results confirm previous findings that anagrams that 
are more difficult to pronounce (= have longer 
pronunciation times) or that have more familiar 
solutions are easier to solve (e.g., Dominowski, 1967, 
1969; Hebert & Rogers, 1966). Anagram bigram 
frequency did not independently contribute to the 
prediction of problem difficulty, a finding consistent 
with the pattern of earlier results indicating that 
this variable reflects at best a minor characteristic 
of anagrams (e.g., Dominowski, 1967). 

The distribution of familiarity judgments for 
word anagrams (M = 68.6, SD = 26.6) was nearly 
identical to that for nonsense anagrams (M = 62.6, 
SD = 25,5); thus the effect of solution familiarity 
need not be considered in examining the difference 
between anagram types. However, as expected, 
word anagrams (M = .208 sec.) were easier to 
pronounce than nonsense anagrams (M = .273 
sec). The results presented above indicate that 
pronounceability is more important than anagram 
type (word vs. nonsense) in accounting for anagram 
difficulty. The following analysis demonstrated 
this more directly. 

1f pronounceability was ignored, the mean number 
of solutions for word anagrams (12.0) was signifi- 
cantly lower than that for nonsense anagrams (16.6), 

F (1,94) = 4.85, p < .05. If, however, pronounce- 
ability was used as a covariate, the mean difference 
was reduced to near zero (F <1). Furthermore, 
pronounceability was strongly related to problem 
difficulty both within the class of word anagrams 

(r = .61) and within the class of nonsense anagrams 

(r = .62). These results indicate that pronounce- 
ability meets the criteria of a satisfactory explané 
tion of the difference between word and nonsen: 
anagrams. For cach type of anagram considered 
separately, the longer it takes to pronounce an 
anagram, the easier the anagram is to solve, and 
the difference in difficulty between word and non- 
sense anagrams is eliminated when their difference 
in pronounceability is taken into account. 

The results support the conclusion that word 
anagrams can be considered simply as a set of 
anagrams which are on the average relatively easy 
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to pronounce, Stated differently, the finding 
gest that there is no need to separate an ms 
which happen to be words from other anag of 


equivalent pronounceability, Although the ent 
results do not support the notion that w« na- 
grams are harder to solve because they h ane 
ing, the emphasis on pronounceability is ent 
with the more general proposal that th rga- 


nized or cohesive an anagram is, the mor: 
will be to solve. 

The pronounceability of an anagram 
extent to which it conforms to the stri 
of the language. For 5-letter anagr 
sible range is from a string which n: 
be considered as 5 units to an anagra 
easily be encoded as a single unit (a 
accumulated evidence on anagram sol« Aim tes 
that Ss first try to think of words based c, m^ malf 
rearrangement of the anagram, attemp ing 


duce syllables when the first method fa’ eg 
Kaplan & Carvellas, 1968). Prese;ting <n asily 


pronounced anagram provides S with pr perly 
structured but incorrectly ordered s ing. It 18 
likely that, given such an anagram, ^ will : 
its plausible but incorrect or ^r 
find the solution. In contrast, & 
structured anagram (e.g, a string 
followed by a vowel), S is unlikely 
order but will rather quickly rearra 
letters into a plausible basis for a word 
possible that S will simply perseverate o: 
pronounced anagram; i.e., solution att 
be made at a slower rate, at least initially, 
it seems unlikely that this mechanism alo: 
sponsible for the effect of pronounceability. 
appropriate record were obtained, it would b> poss 
sible to determine if anagram pronounce: ilit; 
affects both the rate and the nature of solutio 
attempts. 
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WHEN IS RECALL SPECTACULARLY HIGHER THAN RECOGNITION? * 
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Tulving has described a paired-associate procedure in which the B terms were 
recalled better than they were recognized. The present experiment replicated 


and magnified this finding. 


It is considered that these results argue against. 


current models in which recall and recognition are characterized as qualitatively 


distinct processes 


There is currently some disagreement about 
whether recall and recognition are to be accounted 
for by the same or by different mechanisms. The 
predominant view postulates 2 qualitatively distinct 
processes (Kintsch, 1970; Underwood, 1972), Ac- 
cording to this theory, recall involv arch-cum- 
implicit-retrieval stage, followed by a decision stage, 
in which the retrieved information is either accepted 
as appropriate or rejected. In recognition, the re- 
trieval of an available item is inevitable (or “auto- 
matic"), and hence only the decision phase is func- 
tionally operative, Other students of memory 
reject the notion that retrieval is bypassed in recog- 
nition. Thus, Tulving and Thomson (1971) argue 
that while retrieval is usually comparatively more 
succes: ognition, the retrieval phase is none- 
theless inct and fallible component of the 
recognition process, just as it is in recall, 

One implication of the dual-process theory is that 
recognition should be higher than recall. The only 
exception is when there is no retrieval failure in 
recall, in which case the 2 measures should be 
identical. In a paper entitled “When is recall 
higher than recognition?" Tulving (1968) reported 
an experiment in which Ss learned a paired-associate 
list to a criterion of 2 successive error-free trials, and 
were then given a recognition test of the B terms. 
Approximately 10% of the B terms were not recog- 
nized, Tulving's results are clearly difficult to 
interpret in terms of the dual-process theory, That 
the recall advantage was obtained with a paired- 
associate procedure rather than with free recall does 
not detract from its theoretical importance. To be 
recalled, an item must be represented or “available” 
in memory, and an automatic ess theory of 
recognition predicts recognition of all available 
items, However, Tulving's recall superiority was 
rather small and it might be held that the experi- 
ment was not totally conclusive, in that (a) incorrect 
identification was less frequent than failure to 
identify a presented item, and hence the recall 
advantage would have been even less if Ss were 
forced to make more recognition responses; and 
(b) a few items may have become unavailable during 
the course of the recognition test, perhaps as a 
result of considering the distractor items. 


This research was supported by National Science Founda- 
tion Grant GB-24171X. 1 am indebted to my wife Olga for 
collecting and BORDER the e» and to Endel Tulving for 
encouraging the publication of the paper. ` 

‘4 Inquiries YU this article should be sent to Michael 
J. Watkins, Department of Psychology, 333 Cedar Street, Yale 
University, New Haven, Connecticut 06510. 


In the demonstration to be reported, the procedure 
was free from these potential sources of artifact, and 
yet yielded a greatly magnified recall advantage. 

Method. Twenty male and female undergraduate 
psychology students were tested individually or in 
all groups. In outline, all Ss were presented 
with and tested on a practice paired-associate 
list, and were then shown an experimental 
which certain pairs had been designated critical, 
Immediate testing was confined to noncritical pz 
After a filled interval all Ss were given free and 
forced choice recognition tests of the critical B terms. 
Finally the eritical A terms were presented in a 
conventional cued recall test of the corresponding B 
terms, 

For both the practice and experimental lists, the 
A terms contained 5 letters and the B terms 2 lette 
The majority of both the A and B terms were 
"nonsense words," insofar as they do not appear in 
the dictionary. However, if each B term was added 
to its A term, a common word was formed. This 
was true for all pairs of both practice and experi- 
mental lists, Neither A nor B terms were repeated 
within or between s. There were 20 pairs in 
the practice list, while the experimental list con- 
tained 39 pairs of which 24 were designated critical 
and 15 noncritical. The Ss were randomly assigned 
to 2 groups of 10, Both groups saw the same prac- 
tice list but dilferent forms of the experimental list, 
he 2 variants of the experimental list differed only 
in the B terms of the critical pairs. Examples of 
pairs presented to one group of Ss are 
EXPLO - RE, LIQUE - FY, HIDEO - US and 
the corresponding pairs for the other 
group would be SPANI - SH, EXPLO - DE, LIQUE - UR, 
HIDEO - UT, and AMNES-TY, The Ss were not 
explicitly informed of the nature of the relationship 
between A and B terms, although this was pre- 
sumably perfectly obvious from the first 1 or 2 
pairs of the practice list, The order of the A terms 
for both critical and noncritical pairs was identical 
for the 2 groups of Ss; the first and last 5 pairs were 
noncritical, while the remaining 5 noncritical pairs 
were randomly allocated among the 24 critical pairs, 

Presentation was by means of a closed-circuit 
television system, with successive pairs typed in 
uppercase, the A term on the left, the B term on the 
right, and a hyphen between the 2 terms, The pairs 
were presented at a 4-sec. rate. 

Immediately following the presentation of the 
practice list, Ss were given a list of all 20 A terms, 
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in a new random order. ‘The Ss wrote down, at 
r own rate, as many of the B terms as they could 


The experimental list was simply announced as 
“next list" and the Ss were led to believe that they 
would be tested in just the same way as for the 
practice list. In fact at this stage they were merely 
given a recognition test of the 15 noncritical pairs. 
The test consisted of the presentation of the 15 
tl pairs plus 15 new pairs of the same type. 
s' task was to circle 15 pairs which they 
thought most likely to have appeared in the preced- 
ing list. The purpose of this test was simply to 
reduce the probability of covert retrieval and re- 
hearsal of critical pairs, and the impression was 
given that this test terminated the experiment. 

The Ss next participated for 30-40 min. in a 
separate experiment, which involved the free recall 
of a series of word lists. Following this experiment, 
each S was unexpectedly given a list containing the 
24 critical B terms seen in the experimental list, 

randomly mixed with the 24 critical B terms which 
had been presented to the other group of Ss. In- 
structions were first given for a free recognition 
test; Ss were to check any item recognized as oc- 
curring in the second list. The Ss were then given 
a dillerent pen and asked to check further items 
which they thought most likely to have been pre- 
sented before, until a total of 24 had been checked 
in this forced recognition test. Finally the Ss were 
given a cued recall test of the 24 critical pairs. The 
24 A terms were presented in random order, and Ss 
were instructed to write down the B terms with 
which they had been paired. Thus for both groups 
of Ss the recognition and cued recall tests were 
identical, although the correct responses dillered. 
In both tests Ss were given as much time as they 
needed. 

'The confounding of type of test and order of 
test may have given a small advantage in favor of 
the recognition test. However, this effect is not 
critical in that it would merely serve to reduce the 
predicted recall advantage. The converse possi- 
bility that exposure to the recognition items may 
have enhanced recall seems unlikely, in that the 
recall targets and the recall lures were equally 
exposed in the recognition test, 

Results, Essentially identical patterns of results 
were obtained for the 2 groups of Ss. The 2 sets of 
data were therefore pooled. For all 20 Ss the num- 
ber of presented critical B terms successfully recalled 
was greater than the number freely recognized, and 
equaled or exceeded the number responded to in 


TABLE 1 
PERFORMANCE AS A FUNCTION OF MEMORY TEST 


Free | Forced 
Item recog- | recog- ued 
nition | nition | "eall 
Probability of correct response 4 252-3. 294 
Probability of incorrect response 07 AS ET 
Corrected probability of correct. -08 09 67 


response 
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* = 
the forced choice recognition test—despit: lact 
that half of the items in the latter condit uld 
have been identified by chance. Table 1 ~ the 
mean probability of correct response for | pre- 
sented and nonpresented (control) item the 
recognition and recall tests. The control | bili- 
ties were used to estimate the amount of sing 


in each of the test conditions. In detail 
ence between the presented and control pr: 
foreach test condition was divided by the 
of not responding to a control item. F« 
the corrected free recognition probabili: 
by .14 — .07, divided by 1! —.07, th 
These corrected probabilities are also 
Table 1. 


Discussion. The results clearly support ving's 
(1968) conclusion that in some circumstan: - infor- 
mation may be recalled better than it can recog- 
nized. Moreover, the present recall adva: ige was 
large. It is fun to speculate how the pre :t para- 
digm might be refined to further accen: te this 
advantage— perhaps to perfect recall d zero 
(chance) recognition. Recall performance. uld pre- 
sumably be increased by reducing the ret. tion in- 
terval, or else by improving the origina! arning. 
One way of enhancing the initial learning o! he A-B 
pairs would be to reduce the presentation r. =, while 
a second would be to instruct the Ss to for visual 
image comprising the A and B terms in i action 
(Bower & Winzenz, 1970)—a strategy to «ich the 
present material would lend itself particul« !y well. 
Recognition performance could probably be ;educed 
by further encouraging attention during presenta- 
tion to be focused on the A-B pairs as such, «nd less 
on the individual terms. One way of achieving this 
would be to present to one group of Ss A bB pairs 
such às PLUN*ER - «s«»Ges, REVEN#E - «essetis, CLC 


while the second group would see PLUNsER - «««sD** 
REVEN#E - »**»9Gs, etc, It is likely that the B 
terms would be readily retrieved on presentation of 
the A terms, yet poorly recognized. 

Of course the technique being exploited with such 
à procedure, as in the experiment reported, involves 
making only the A-B pairs meaningful, thereby 
encouraging Ss to pay little attention to the indi- 
vidual A and B elements. It is conceded that such 
contrived procedures are not representative of those 
used in most verbal memory studies, and that the 
finding of recall that is better than recognition is 
likewise unusual, But the finding is not without 
theoretical significance, in that it poses a difficulty 
for the dual-process theory. According to this 
theory, failure to recognize an item indicates that 
it is not available in memory; however, in the pre- 
sent experiment the availability of a substantial 
Proportion of nonrecognized items was demonstrat 
by their subsequent recall. Moreover, to interpret 
the obtained recall advantage in terms of the organ- 
ization of nominal or E-defined units into very 
different functional units is of little comfort to the 
dual-process theorist, for “If retrieval and recogni- 
tion processes can be separated as neatly as the 
dual process theory implies, organization of the 


^r VER 


learuing material can have no effect upon recogni- 
tion, since organization facilitates retrieval and 
retrieval in recognition is trivial [Kintsch, 1970, 
p. 2807." 

it might be thought that the present results 
simply demonstrate a tendency to encode material 
into whole words rather than partial (“nonsense”) 
words, While this may be largely true for the 
present experiment, it is argued here that hierarchi- 
cal structuring does not terminate at the level of 
the word; rather the general principle of organizing 
or chunking is emphasized. In fact, in the present 
experiment 20% of the B terms were words in their 
own right, yet were apparently not well encoded 
as separate elements, Moreover, in Tulving's (1968) 
experiment all the terms (both As and Bs) were 
meaningful words, yet at least some were encoded 
in a higher order unit. 

One way of conceptualizing the present findings 
is to assume that an event is remembered only when 
a retrieval cue makes contact with the stored repre- 
sentation of the event, With paired-associate and 
recognition procedures, the retrieval cue is the 
perception of the A term and nominal “target,” 
respectively, In free recall, the nature of the re- 
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trieval cue may not be obvious. In all cases, how- 
ever, successful retrieval follows only when the 
retrieval cue matches the item trace, as originally: 
encoded (Tulving & Watkins, in press). According 
to this view, when is recall higher than recognition? 
‘The answer is “. . . whenever retrieval cues present 
at the recall test are more effective in providing 
access to stored information than are retrieval cues 
present at the recognition test [Tulving, 1968, 
p. 54]" And when is recall spectacularly higher 
than recognition? Whenever the difference in effec- 
tiveness of the 2 cues is a spectacularly large one. 
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APPARENT SLOWING OF BIMANUALLY ALTERNATING PULSE TRAINS! 


SEYMOUR AXELROD? AND MICHAEL NAKAO 
Department of Psychiatry, State University of New York at Buffalo 


Auditory clicks presented in rapid alternation to the 2 ears are heard as slower 
than clicks presented at the same rate to a single ear. 
Thirty Ss compared the rates of bi- 
manually alternating taps with the rates of taps presented to a single hand, 
In terms of inter-5 consistency, of the dependence of degree 
of perceptual slowing on repetition rate, and of the quantitative aspects of the 
dependence, the somesthetic results were similar to those obtained in an earlier 
auditory experiment, suggesting that the apparent slowing in the 2 modalities 
of stimuli alternating between receptive surfaces on the 2 sides of the body has 
a common basis central to modality-specific afferent structures. 


similar phenomenon in somesthesis. 


or vice versa, 


When rapidly repeated clicks are presented alter- 
nately to the 2 ears, their apparent repetition rate is 
slower than that of clicks presented at the same ob- 
jective rate to one ear alone (Axelrod, Guzy, & 
Diamond, 1968). milarly, the numerosity of 
dichotically alternating stimuli is less accurately 
reported than that of monotic ones (Guzy & Axelrod, 
1972; Holden, 1973), 

These phenomena reveal a limitation in the pro- 
cessing of bilaterally alternating stimuli. If such 
limitation is unique to audition, then it would make 
sense to suppose that the underlying neural mecha- 
nisms are located in audition-specific structures, 
and that the limitation thus arises relatively early 
in the series of events from stimulus to perception. 
Explanations might then invoke the sort of contra- 
laterally inhibitory process found, e.g., in single 
units of the cat's inferior colliculus (Erulkar, 1959), 
But if a similar limitation is demonstrable in another 
modality, then it would be possible (though not 
necessary) to posit a common limiting mechanism, 
located central to modality-specific pathways, and 
operating later in the processing sequence, 

Method. Subjects were 30 students, 16-30 yr. old, 
whose participation fulfilled requirements of courses 
in introductory psychology. None admitted to any 
somesthetic deficit, 

Stimuli were impacts delivered. by headless 6/32 
screws inserted into the driving spindles of 2 Ling 
(V 47, Model 102) vibration generators. Each 
generator was covered by a cap with a central hole of 
8-mm, diameter; when S lightly rested the palmar 
surface of the middle fingertip on the hole, the finger 
made light contact with the screw.  Two-milli- 
second square pulses gated a transistor which im- 
pressed 6 v. across the generator and a series 
resistance. There were 2 such circuits, one for each 


1 This study was supported by Research Grant 06862 from 
the National Institute of Neurological Diseases and Stroke 
and by General Research Support Grant RR05400 from the 
General Research Support Branch, Division of Research Re- 
sources, National Institutes of Health. Michael Nakao was 
supported by a fellowship from the United Health Foundation 
of Western New York. Thanks are due to Donald Harmon for 
advice on instrumentation, to John Tank and Eric Zinnerstrom 
for technical assistance, and to Dewey Bayer and Janet Crock- 
well for assistance in procuring Ss. 

* Requests for reprints should be sent to Seymour Axelrod, 
Department of Psychiatry, SUNY, 2211 Main Street, Buffalo, 
New York 14214. 
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We examined here a 


hand, the series resistances being adjusted to make 
the impacts on the 2 hands subjectively equali 
intensity to Es. The distance between the sere) 
delivering the impacts was 15cm. The sounds madi 
by the generators were masked by a low level 
thermal noise, 
Half the Ss (9 male, 6 female) constituted t 
unimanual standard (US) group, and half (8 
7 female) the bimanual standard (BS) group. T 
Ss in the US group were told that they would 
2 4-sec. trains of taps, a standard train to the li 
hand, followed after a .5-sec. interval by a com 
son train in which the taps alternated between th 
hands, starting with the left. The task was to judi 
whether the comparison taps occurred at a i 
faster than, equal to, or slower than that of tht 
standard. A method of limits was used, the 
point of the interval of uncertainty being taken @ 
the point of subjective equality (PSE). The ta 
was demonstrated in a trial using a long (2,00 
msec.) standard interstimulus interval (ISI). 
Three onset-to-onset ISIs were used as standards 
1,000, 250, and 100 msec, At each ISI 
trials alternated with 5 descending tri 
trial, the comparison ISI was varied in equal ste 
each 10% of the standard. Sequencing of trials 
such that effects of practice, fatigue, etc., were d 
tributed over ISI, within and between Ss. 
conditions for the BS group were essentially 
same as for the US group, except that these Ss had 
judge a unimanual comparison train relative to 4 
bimanual standard. 
The experiment was thus similar to the auditory 
experiment of Axelrod et al. (1968), the maj 
differences being that in the earlier study, (a) 
wider range of ISI was explored (1,000-25 msec.)j 
and (b) the comparison ISIs in a trial varied im 
logarithmic series. The linear series used here pron 
vided a denser set of comparison ISIs around 
standard than that used in the auditory experiment 
Results. For each S for each ISI, the score 
[(Xs — standard) X 100/standard ] was computed; 
where Xs is the mean of S's 10 PSEs, The sc 
represents the percentage deviation of S’s me 
PSE from the standard ISI. The means of th 
Scores are given in Table 1. All 6 means are in 


- 
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directions indicating apparent slowing of bimanually 
alternating trains relative to unimanual ones, and 
each is significantly different from the corresponding 
standard, és (14) > 4.04, ps < .01. In both groups, 
mean percentage deviation increased monotonically 
as standard ISI decreased, i.e., relative slowing in- 
creased with repetition rate. With one exception in 
each group, both with 1,000-msec. standards, all 
Ss showed relative slowing of bimanual pulses at all 
standard ISIs; by binomial test, the proportion 
14/15 has p < .001. 

A separate Treatments X Ss analysis of variance 
was performed on the percentage deviations for each 
group. In both analyses, the effects of IST were 
significant, Fs (2, 28) > 11.50, ps < .001. For the 
BS group, the difference between the means at 1,000 
msec. and 250 msec. was significant by one-tailed 
test, ¢ (28) = 1,84, p < .05. For the differences 
between the 250-msec. and the 100-msec. conditions, 
and between the 1,000-msec. and the 100-msec. 
conditions, ts (28) > 2.92, ps < .01. For the US 
group, the difference between the means at 1,000 
msec, and 250 msec. was not significant; for the re- 
maining 2 differences, ts (28) > 4.48, ps < 01. 

The mean PSEs for each group were transformed 
from measures of period (ISI) to measures of rate 
(taps per second), and the resultant group data are 
plotted in Figure 1. The triangles directly represent 
the data of the BS group, with the bimanual 
dards on the abscissa and their corresponding E 
on the ordinate. For US group (circles), bimanual 
PSEs are on the abscissa, with the unimanual 
standards on the ordin This is the same pro- 
cedure as that used in lyzing the auditory data 
(Axelrod et al., 1968, pp. 52-53). The line of best 
fit to the 6 points has a y intercept of .38 and a slope 
constant of .68. The light solid line represents the 
function obtained with clicks by Axelrod et al. (1968), 
with intercept = .05 and slope = .62, 

Individual slope constants were calculated for the 
3 data points for each S, The means for the 2 
groups were not significantly different; the grand 
mean was .68, agreeing with the group data value, 
The difference between the grand mean and the 
hypothetical veridical slope (1.00) was significant, 
t (29) = 12.31, p < .001, reflecting the uniform 
direction and significance of the percentage devia- 
tions from zero in Table 1. Likewise significant was 
the difference between the obtained slope and the 
value (.50) which would have resulted had Ss 
systematically ignored or suppressed all the pulses 
from one hand during bimanual stimulation, £ (29) 
=6.92, p < .001. 

The group means of the i dividual intercepts were 
not significantly different. The grand mean was 
.42, approximating the group data value (.38), and 
was significantly different fron zero, the intercept 
of the veridical line, ¢ (29) = 6.42, p < .001. 

To permit comparison of these results with those 
of the auditory study of Axelrod et al. (1968), we 
calculated slopes and intercepts for each of the 40 Ss 
in that study, from the original data. The mean 
slope coincided with the group data value (.62). 


TABLE 1 


MEAN PERCENTAGE DEVIATIONS AND STANDARD ERRORS 
FOR ALL CONDITIONS 


Group 


Standard inter- 
stimulus interval 
(in msec.) 


Bimanual standard. | Unimanual standard. 


x S¥ Sk 

1,000 987 | 244 | 2.85 
250 24.30 5.98 pe 
100 47.07 7.30 3M 


Note. 


'h mean is based on 15 scores, 


The difference between the somesthetic mean and 
the auditory one was not significant, / (68) = 1.50, 
10 < p <.20. The mean intercept in the auditory 
study was .21 (group data value .05), and the 
difference between the means in the 2 modalities was 
n not significant, ¢ (68) = .86. Therefore, for 
neither of the 2 determining constants (slope and 
y intercept) is there reason to assume a difference 
between the modal ; he, the auditory and 
somesthetic lines derived from the individual data 
are not signi ntly different from coincident. 
Discussion. The present results were thus strik- 
audition. (a) Apparent 


Ss. (b) The degree of slowing (percentage de 
tion) increased monotonically with repetition rate, 
(c) The group data points on the plot of single- 
source PSE as a function of alternating rate were 
well fitted by a straight line, (d) whose slope 
approximated that obtained with clicks. While 
there was a difference of $ of a pulse per second 
between the intercepts of the auditory and som- 
esthetic group data lines, (e) there were no signi 
modality differences between the mean slopes or 
mean intercepts calculated from the individual data, 
Although it can be argued that analogous pro- 
cesses might occur in modality-specific portions of 
the auditory and the somesthetic pathways, the 
similarity of the findings in the 2 modalities is con- 
sonant with a common central limitation on the 
processing of bilaterally alternating stimuli. The 
limitation might arise from a relative inefficiency in 
shifting attention rapidly between spatially separated 
sources, In speculation on possible bases of such an 
inefficiency in audition, it has been suggested (Guzy 
& Axelrod, 1972; Kahneman, 1970) that breakdown 
in attention shifting may be due to inability of motor 
orienting responses to follow the stimulus-alterna- 
tion rate; such a mechanism would operate equiva- 
lently in audition or somesthesis, and generate 
parametrically similar results in the 2 modalities. 
An alternative approach comes from the demon- 
stration by Bregman and Campbell (1971) that Ss, 
listening to a repeated sequence of 6 tones (a subset 
of 3 high-pitch, interdigitated with a subset of 3 
low-pitch, tones) presented at 10 per second, split 
the sequence into 2 subjectively segregated, non- 
integrable streams—a high and a low. Auditory 
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and somesthetic slowing of bilaterally alternating 
trains may reflect an inability to integrate the results 
of similar segregations into left and right streams, a 
possibility which could be tested by using variants 
of the Bregman-Campbell paradigms with ear or 
body-side subsets, rather than pitch subsets. Stream 
segregation by spatial location appears to have been 
implied in von Békésy's (1959) early observations 
that when white-noise bursts are presented alter- 
nately between widely spaced speakers, or between 
widely spaced somesthetic stimulators on an arm, 
the observer can attend to one or the other source, 
but not simultaneously to both. 

So far as neural substrate is concerned, 2 types of 
account of the present results suggest themselves. 
One would involve a central mechanism which is 
less able to integrate bilaterally alternating stimuli 
than to process single-source stimuli, e.g., because 
of greater refractoriness to alternating than to uni- 
lateral inputs. A mechanism of this sort would pre- 
sumably be located in an area receiving both audi- 
tory and somesthetic information, such as association 
cortex (Thompson, Johnson, & Hoopes, 1963) or 
superior colliculus (Wickelgren, 1971). Another 
possibility is contained in Holden's (1973) suggestion 
that difficulty in integrating alternating stimuli is a 
function of the disparateness of the cortical projec- 


ive equality (PSE) as a function of alternating- 
(The heavy line represents unimanu: 
data], and is the line of best fit to the data points. : 
The light solid line is the line of best fit to the auditory- 
click data of Axelrod, Guzy, and Diamond, 1968.) 


20 
pulses / sec. 


30 


PSE as a function of bimanual rate [group 
Triangles: bimanual standard group. 


tions of the stimulus subsets. This hypothesis 
especially attractive in being testably applicable t 
many dimensions from which subsets can be draw! 
—e.g., ear, pitch, location in auditory or visual space 
body locus, and sensory modality. 
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LEARNER JUDGMENT IN INSTRUCTIONAL DECISIONS FOR LEARNING 


MEANINGFUL PAIRED ASSOCIATES * 


M. I. CHAS. E. WOODSON? 


University of California, Berkeley. 


The problem studied is the facilitation of the learning by college students of 
meaningful paired-a: ite lists consisting of pairs of romanized Japanese 
words and corresponding English words. Six instructional strategies differing 
in the method of determining the frequency of item occurrence were evaluated 
experimentally, The strategies differed in that during instruction, the next 
item was (a) randomly selected, (b) randomly selected with item frequencies 
balanced to give more training on difficult items, (c) randomly selected from 
among items that had not been correctly responded to twice cons cutively, (d) 
learner selected where learners received no instructions about selection, (e) 
learner selected where learners were told to select items they did not know, and 
(f) learner selected where leartiers were told to select items they knew. Learn- 
ing, as measured by a retention test 1 wk. later, was superior for the sequences 
which lead to high error rates during instruction, whether or not they were 
learner selected or computer selected, — Learner's judgment can be used to 
increase instructional effectiveness during instruction and can be made more 
effective by instructions to the learner. Implications for instructional theory 


are discussed. 


Atkinson (1972b) has reported an experiment in 
which a model for computer-selected instructional 
sequences was found to be more effective than 
learner-selected or randomly determined instruc- 
tional sequences. Atkinson found learner selection 
of instructional sequences considerably less effective 
than a computer model based on a theory of learning 
which postulates 3 states of learning and used transi- 
tion probability matrices, estimated for each item, 
for state changes when an item is responded to 
correctly and when another item is missed. One 
might take the position that his learners were handi- 
capped in their selection by not following the gen- 
cral instructional strategy used by the computer 
model, that is, that it is most effective to study those 
items which one does not know very well. The 
average number of errors during training show Ss 
drilled themselves on relatively learned material 
about midway between the most difficult and e; t 
computer-selected sequences, perhaps out of motiva- 
tion to get a higher score during instruction or 
perhaps out of the general habit of opting to re- 
spond to what one feels one knows, à habit which 
is probably taught indirectly in school. 

The instructional strategies cussed here may 
be considered schemes for the allocation of instruc- 
tional trials among items. "The basic strategy is to 
allocate more trials to unlearned items and fewer 
to learned items. Presumably if the learner is able 
to select those items which he does not know and 
those which he knows, his judgment would be very 
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valuable as a basis for the allocation of instructional 
trials among items, 

"The purpose of the present study is to examine the 
question of whether learner-selected instructional se- 
quences can be made more effective if the learners 
are given specific suggestions about a general strat- 
egy for selecting their instructional experiences. 

Method, ‘The Ss were 120 undergraduate students; 
20 Ss were assigned randomly to each of 6 instruc- 
tional conditions and were instructed individually. 
None of the Ss had prior course work in Japanese 
and none professed familiarity with the language. 

The instructional materials were 50 romanized 
Japanese words paired with English words, The 
pairs included 30 nouns, 11 verbs, and 9 adjectives, 
and were selected to have nearly equivalent mean- 
ings in the 2 languages. The words involved simple 
everyday terms (uchi, house; yoru, night), in most 
cases having 4 or 5 letters, The 50 Japane words 
were divided randomly into 5 subsets of 10. The 
order of words within these 5 subsets was randomly 
determined for each S and these lists were before S 
during the training ns, 

The Ss participated in 2 sessions: an instructional 
session which lasted approximately 2 hr. and a test 
session, 1 wk, later, which lasted about 20 min, The 
instruction was conducted by teletypewriters and 
an HP2000B computer (Hewett-Packard) time- 
sharing system. 

The instructional session involved a series of self- 
paced trials, each involving a response to a particu- 
lar item. Each S had before him, mounted for easy 
reading on the teletype terminal, 5 lists of Japanese 
words which remained in view throughout the train- 
ing period. The instructions to S explained that he 
must respond to one of the items from each of the 
lists in sequence from list to list. In the computer- 
controlled sequences this was automatically con- 
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TABLE 1 


PROPORTION OF CORRECT RESPONSES IN SUCCESSIVE 
TRIAL BLOCKS DURING THE INSTRUCTIONAL 
SESSION AND ON THE RETENTION- 
TRANSFER Test ADMINISTERED 
ONE WEEK LATER 


Successive trial block 
(instructional session) 


Condition Test 
1 2 3 4 5 
R 61 65 420 40 
RD 27 24 32 80 
RU 14 13 A9 79 
LS 21 .32 .36 52 
LSK 37 50 67 22 
LSU 12 10 AS 85 


trolled as the computer selected the next Japanese 
word. In the learner-selected conditions this was 
not controlled, but a spot check indicated Ss were 
highly reliable in following this procedure. 

In all instructional conditions S responded by 
typing the Japanese word, a blank, and the English 
word. If the response by S was correct, the system 
proceeded to the next item. If the response by 5 
was incorrect, the system crossed out his answer 
and typed the correct answer, thereby providing 
feedback regarding S’s responses. The 6 experi- 
mental groups were distinguished by the way the 
next item was selected for instruction: 

In the random order (R) condition, the next item 
was selected randomly (without replacement) from 
the appropriate sublist of Japanese words. 

In the random order balanced for difficulty (RD) 
condition, the next item was selected randomly with 
the item distribution balanced to make more diffi- 
cult items occur more frequently. Data from Con- 
dition R were used to estimate the mean trials to 
learning for each item, and the distribution was 
selected to attempt to equalize for all items the 
probability of learning during training. 

In the random order unlearned (RU) condition, 
items responded to correctly twice in a row were 
considered learned by S. The next item during 
instruction was selected from among the items which 
had not been answered correctly 2 consecutive times. 

In the learner-selected (LS) condition, S selected 
the next Japanese word to answer from the appro- 
priate sublist. 

The learner-selected known (LSK) condition was 
the same as LS except S was told that previous 
research had shown it was more effective to practice 
items he knew to "strengthen their retention” or 
"'try items he thought he knew rather than items of 
which he was completely unsure," and “to only try 
items he was unsure of when others were learned 
very well." 

The learner-selected unknown (LSU) condition 
was the same as LS except 5 was told that previous 
research had shown that it was more effective to 
study those items he did not know because “‘partly 
known items tended to be almost learned but com- 
pletely unknown items required quite a bit of 
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practice to get them to the stage wher * was 
any chance of remembering them.” 

In all conditions, before instruction 
typed all 50 Japan words and Eng 
from a list provided him. 

The retention test, conducted approxi: 
later, was the same for all Ss and consiste 
sheets with all 50 Japanese words studie 
were instructed to write the correspond: 
term for each. 

Results, The results of the experime: 


in Ss 
words 


l wk. 
littoed 
"he Ss 
nglish 


> sum- 


marized in Table 1. The first 5 colu! of the 
table present the proportions correct di learn- 
ing; the sixth column presents data fror reten- 
tion test. The data from the instructi ession 
are presented in 5 successive blocks of 50 t each; 
for the R condition this means that each em was 
presented once in each of these blocks. ve table 
gives point estimates of the parameters ivolved. 
All possible pair-wise comparisons amo <: group 
means were calculated with an overall or rate 
set at .05 within measures by means of t Bonfer- 
roni inequality procedure using the / statis: (Miller, 
1966). The pooled within-cell variances ^r Trial 
Blocks 1-5 and the retention test were . 5, .010, 
.029, .041, .047, and .032, respectively. follow- 
ing conclusions are based on those compa: «ons that 
indicated statistically significant differen 

In terms of average proportion corre: during 
training, the conditions may be ordered | LSK 
>LS = RD > RU =LSU. In the LS, i.:K, and 
LSU conditions, the error rates during ins:ruction 
follow the anticipated pattern, indicating {ost Ss 


followed their instructions. 

The order of the experimental groups on the 
retention test is nearly the reverse of that of errors 
during training. The pair-wise comparisons on the 
retention scores lead to the order LSU = RD = 
RU > LS > R > LSK in instructional effectiveness. 

The RD, RU, and LSU conditions involve a pro- 
cedure that attempts to identify and instruct on 
those items that have not yet been mastered an 
therefore should produce high error rates during the 
instructional session. 

The magnitudes of the effects observed on the 
retention test are large enough to be of practical 
significance. The LSU condition (when compared 
to the R condition) leads to a point estimate of 
relative gain of 112.5% in terms of items correct 0n 
the retention test. It is clear that S can be very 
effective in determining an efficient study sequence, 
as effective as the adaptive instructional system 
used here, and can be made more effective by appro- 
priate instructions. 

Discussion. The results show that learning of 
meaningful paired associates can be facilitated to 2 
considerable degree by some simple instructions to 
the learner. 

On the other hand, it may also be argued that 
this task does not test the learner-controlled notion 
very effectively. Learners were probably primarily 
following the instructions they were given. Factors 
which have been claimed to affect learner-controlled 
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sequences such as motivation, learner inquiry, and 
the use of learner judgment about his own under- 
standing have relatively little opportunity to be 
effective in the present task. 

As in Atkinson's (1972b) study, there is an inverse 
relationship between proportion correct during learn- 
ing and proportion correct during testing. This sug- 
gests some caution in the popular custom of using 
trials to criterion or number of errors during instruc- 
tion as measures of learning in paired-associate 
studies, particularly when the instructional pro- 
cedure is sensitive to responses by the learner. 

The LSU, RD, and RU strategies were not dis- 
tinguished on the retention score; the confidence 
intervals associated with the contrasts were .05 + 
.18, .06 + .18, and .01 + .18, indicating that what- 
ever differences there may be among these 3 are 
small compared with the differences between these 
strategies and the others tested. 

Atkinson’s (1972b) finding that automated pro- 
cedures can facilitate learning to a degree of con- 
siderable practical significance is supported by the 
present study. But this study also implies that 
appropriate instructions to learners can also facili- 
tate learning to a degree of considerable practical 
significance. Presumably we will be able to design 
automated instructional devices which are more 
effective than procedures which the learner himself 
can implement; but they will have to be more 
effective at inferring the degree of learning of indi- 
vidual items, and selecting appropriate instruc- 


tional experiences and sequences, than the learner 
himself when the learner has the benefit of as much 
knowledge about the task as that used to design 
the automated device. 

This study does not contradict Atkinson’s (1972a) 
conclusions that learner control may be one source 
of information in making instructional decisions, 
but complete dependence upon learner judgment is 
not warranted. Learner judgment may be an im- 
portant component in instructional models. Ob- 
viously it expends time and other resources, but the 
exercise of learner judgment may itself result in 
learning; and this study demonstrates that learner 
judgment can be used to obtain more efficient in- 
struction than random selection of instructional 
sequence. Instructional models will no doubt be 
developed which use learner judgment as a part of 
the information upon which instructional decisions 
are based, but it remains to be seen whether the 
costs associated with this procedure exceed the gains. 

This study shows that learners can be taught to 
more effectively use learner judgment for instruc- 
tional decisions. 
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In 2 experiments, half of the Ss were permitted to choose their responses from 
alternatives following the 1971 procedure of Perlmuter, Monty, and Kimble. 
The other Ss were forced to learn the responses chosen by their yoked masters. | 
Both groups of Ss were forced to learn an interposed A-C list following the 
choice-force procedure but prior to learning the A-B The performance of 
force Ss was reliably superior to that of choice Ss on A-C; however, when both 
groups learned A-B, the typical facilitative effect of choice was disturbed. 
In à second experiment, interposing a C-D list produced equivalent perform- 
ance in both groups on C-D. Similarly, the facilitative effect of choice was | 
not observed on the subsequent A-B trials. The degraded performance of 
choice Ss was discussed in terms of frustration as an additional source of 


motivation, 


Allowing Ss the opportunity to choose a portion 
of the response materials to be learned on a subse- 
quent paired. ciate list was examined with a 
paradigm that manipulated both the amount of 
choice allowed during the selection procedure and 
the locus or point at which choice took place (Monty, 
Rosenberger & Perlmuter, 1973). The amount of 
material chosen is not as important as the point at 
which choice is exercised, as indicated by the finding 
that even when only 3 out of 12 items were chosen, 
performance on the entire list was nearly as good 
as that when all 12 items were chosen, provided that 
choice was allowed early in the experiment. Such 
an observation ruled out the associative interpre- 
tation of the choice vs. force phenomenon as sug- 
gested by Perlmuter, Monty, and Kimble (1971) 
and seemed to favor a motivational explanation. 
Subsequently, Perlmuter and Monty (1973) found 
some preliminary evidence that motivational factors 
may even work in a detrimental fashion under 
certain conditions. 

A rather extensive literature exists on the relation- 
ship between motivational level and performance. 
It is generally asserted that performance increases 
as motivation increases. However, excessively high 
levels of motivation have also been implicated in a 
deterioration of performance (Brown, 1961). Thus, 
the purpose of the present experiments is to study 
further the role of choice in learning and to examine 
the possibility that choice may have an inhibitory 
as well as a facilitatory effect on performance. 


1 This research was supported in part by Grant MH-15315 
from the National Institute of Mental Health to the first 
author. Experiment I was presented at the meeting of the 
Southeastern Psychological Association, New Orleans, May 
1973. 
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EXPERIMENT I 


Employing the basic design of Perlm et al. 
(1971), half of the Ss chose their A-B sponses 
(choice group), while the other Ss were -rced to 
learn those responses chosen by their yoke. masters 
(force group). In contrast to earlier ex; niments, 
an A-C list was forced upon both groups prior to 
the learning of A-B, with the expectation that 
inhibitory effects would cause the choice group to be 
inferior to the force group on A-C tria while 
motivation produced by prior choosing would cause 


them to be superior to the force group on the sub 
quent A-B trials. Thus, the analysis of the A-B 
trials should serve to determine the durability of 
the beneficial effect of choice, 

Method. The Ss were 40 male and female Uni- 
versity of Delaware students, aged 19-25 yr., who 
were paid $2.00 each for participating in the experi- 
ment. They were alternately gned to 2 groups 
of 20 Ss each in order of their appearance. 

The apparatus and general procedures were 
identical to those employed in previous studies. 
Briefly, the last 84 5-letter words were taken from 
a list generated by Taylor and Kimble (1967) and 
were considered to be of relatively high meaning: 
These items were embled into 12 groups of 7 
each. Within each group, one item was designated 
arbitrarily as the stimulus word and one item 
as the A-C response word. The remaining 5 items 
served as the 5 potential response words on the 
A-Blist. Each S was first presented with 3 paired- 
associate (PA) practice trials utilizing common 
words. The choice group was then presented with 
12 slides, one at a time. Each slide contained a 
stimulus word centered on the left and 5 potential 
response words listed vertically on the right. The 
S was instructed to read the stimulus word and the 5 
potential response words aloud and to then select 
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one word to serve as the response word to the 
particular stimulus in a subsequent PA learning 
tas Each S went through the 12 slides at his 
own rate. 

Immediately upon completing the selection of the 
response items, Ss were informed that they would 
be required to learn an A-C list prior to the A-B 
list. The A-C trials were comprised of the stimuli 
viewed during the selection process, and these 
stimuli were paired with the E-selected response 
words indicated above. The stimulus and stimulus— 
response slides were shown for 2 sec. each. The 
interval between slides was approximately .9 sec, 
The S was instructed to say the appropriate word 
aloud upon presentation of each stimulus slide, and 
E recorded S's verbal responses. 

There were 10 presentations of the 12-word A-C 
list in 3 different random orders. Each order was 
presented on every third trial, and each S viewed 
all items in the same order. A brief period of about 
2 sec. followed every third presentation of the list 
to enable E to return the slide tray to its starting 
position. 

The A-C list was then followed by 10 PA learning 
trials, with pairs comprised of the stimulus words 
along with .S-selected responses, This A-B list 
(also presented in 3 different random orders) 
followed the A-C list after a delay of approximately 
50 sec., which was necessary to change slide trays 
on the projector. The Ss were informed about the 
change in the list just prior to the A-B trials. 

The force group was treated identically to the 
choice group except that, although they read the 
stimulus and potential response words aloud, they 
were not given the opportunity to choose their own 
respon: Rather, following their reading, they 
were assigned the response words chosen by the 
previously tested choice S while each slide was still 
visible to them. During the choice-force procedure, 
both groups viewed the slides for approximately 
15 sec. 

Results and discussion. The number of words 
correct per trial served as the measure of perform- 
ance on all analyses. The data for the first A-C 
trial were omitted from this analysis of variance 
as all scores were zero. Both trials, F (8, 304) 
= 149,00, p < .001, and the Trials X Groups inter- 
action, F (8, 304) = 2.35, p < .05, reached signifi- 
cance, while the groups effect did not. The data 
underlying the interaction are shown in Figure 1. 
It can be seen that the force group performed 
consistently better than the choice group, and that 
the magnitude of the difference tended to increase 
generally as a function of trials. This finding is in 
sharp contrast with the earlier findings (Monty & 
Perlmuter, 1972; Monty et al., 1973; Perlmuter 
et al., 1971), all of which indicated that when A-B 
learning immediately follows the choice procedure, 
the choice group performs significantly better than 
the force group, presumably as a result of enhanced 
motivation stemming from having had the oppor- 
tunity to choose. However, these results are 
supportive of Perlmuter et al. (1971), who found 


171 


12 
u 
p 
9 
Fa 2 
Es 
UP M 
x 
$ 6} E 
25 
AF A-C TRIALS 
z aa FORCE 
$5 e— CHOICE 
mco A-B TRIALS 
"ih £---* FORCE 
o— CHOICE 
isd Ae IP CHO TEN 
J "3553 4,785 6 r-- 8-29. 10 
TRIAL NUMBER 
FiGURE 1. Mean number of correct responses on the A-B and. 


A-C trials for the choice and force groups (Experiment I). 


that performance on a nonchosen A-C list that was 
learned following A-B was poorer for choice Ss 
than for force Ss. 

Theoretically, it may be possible to account for 
the degraded performance of the choice group on the 
A-C trials in terms of a nonoptimal increase in 
motivation caused by the interposition of the A-C 
trials. That is, the additional source of motivation 
might be thought of as frustration or thwarting 
(Brown, 1961) caused by the forced learning of the 
nonchosen A-C list following an opportunity to 
choose the A-B responses. For force Ss, neither 
the A-B nor the A-C lists are chosen; hence this 
frustrative source of motivation should not manifest 
to the degree that it does for choice Ss. Intrusion 
data provide further evidence for this interpretation, 
since increased motivation presumably increases the 
frequency of competing responses. The choice 
group made reliably more intrusions (311) than the 
force group (145), and 87% of the intrusions in each 
group were misplaced A-C responses. In addition, 
the finding that choice Ss tend to perform more 
poorly whether the nonchosen A-C list follows 
(Perlmuter et al., 1971) or precedes the chosen A-B 
list lends generality to the proposal that forced learn- 
ing induces a nonoptimal level of motivation and 
hence degrades performance. 

Finally, when both groups begin to learn the A-B 
list (following A-C), it would be expected that the 
incremented motivational level of the choice group 
would be reduced to a more favorable level. Pre- 
sumably, the beneficial effect of choice would 
manifest itself on A-B. In view of the fact that 
there were differences between groups on the A-C 
list, however, this analysis must be treated with 
caution, as discussed by Perlmuter et al. (1971). 
Only the main effect for trials reached significance, 
F (9, 342) = 142.24, p « .001, indicating a general 
improvement in performance across trials. As is 
evident from an examination of the upper 2 curves 
in Figure 1, the performance of the choice and force 
groups was highly similar across trials. Thus, it 
appears as if the beneficial effect of choice noted 
in previous investigations is destroyed when an 


intervening forced list is interposed prior to the 
learning of A-B. Whether the A-C list enhances the 
performance of the force group or diminishes the 
performance of the choice group cannot be ascer- 
tained from this experiment. 


EXPERIMENT [I 


We have just seen evidence which suggests that 
A-C trials interpolated between choosing A-B 
responses and learning the A-B list lead to a 
deterioration in performance. The nature of the 
interpolated materials to be learned may be im- 
portant in the production of this effect. That is, 
it is entirely possible that an interpolated C-D list 
may not produce the same degree of frustration that 
an A-C list might produce. This may be attribut- 
able to the degree of formal similarity between A-B 
and A-Cascontrasted with A-B and C-D. Whether 
the relationships just discussed also determine the 
performance of the choice and force groups on the 
subsequent A-B trials will have to await empirical 
test. Thus, the purpose of Experiment II was to 
repeat Experiment l, substituting a 10-item C-D 
list for the A-C list employed in the latter. 

Method. The Ss were 40 male and female students 
from Bowdoin College. The apparatus and pro- 
cedures were generally identical to those of the 
first experiment except that the A-B list was a 
" 12-item high-meaning list similar to that employed 
in Experiment I, while the C-D list was a 10-item 
low-meaning list, The A-B list was generated from 
72 of the last 75 stimuli on the Taylor and Kimble 
(1967) list, while the low-meaning C-D list was 
selected from the first 102 stimuli of the same list. 
As opposed to Experiment I, 12 rather than 10 trials 
of each list were presented. 

Two groups of 20 Ss each were run, A C-D 
choice group was treated identically to the choice 
group of Experiment I except that following the 
A-B choice procedure, choice Ss were required to 
learn the C-D list, after which they learned the 
original A-B list. The Ss in this group were in- 
formed of the fact that they would learn A-B (as 
in Experiment I) after learning C-D. The C-D 

- force group also learned the C-D list before learning 
- the A-B list and was forced to learn the A-B 
response chosen by the previous S in the C-D choice 
group. 

Results and discussion. Performance on the 
interposed C-D trials (2-12) by the C-D choice 
group and C-D force group was evaluated. "Trial 1 
was eliminated from the analysis because there were 
no correct responses. Only the trials effect reached 
significance, F (10, 380) = 69.96, p « .001, while 
groups (mean number of correct responses per trial 
= 2.50 for the C-D choice group vs. 3.00 for the 
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C-D force group) and the Groups X 1 
action were clearly not significant. 

"To test the effect of the interposition : C-D 
list on A-B learning, performance of the t hoice 


iter- 


group vs. the C-D force group was comp cross 
the 12 A-B trials. The C-D choice gro aged 
10.06 correct responses per trial, whil C-D 
force group averaged 10.08 correct re per 
trial. Again, only trials reached nee, 
F (11, 418) = 87.79, p < .001, and the 7 s for 
groups and the Groups X Trials interact were 


both less than 1.00. 
"Thus, the results of the present exper t are 


important in clarifying 2 major points | by 
Experiment l. First, when a nonchos: tis 
imposed upon choice Ss following their "tion 
of A-B responses, it appears that the sim ty of 
the chosen and nonchosen materials is itical 
factor in producing the frustrative effects p sumed 
to have been engendered by the forced |i This 
conclusion derives from the fact that an inte olated 
C-D list is learned equally well by C-D ch ce and 
C-D force Ss immediately following the ch force 


procedure, which is in marked contrast io the 


superiority of force Ss (Experiment I) A-C; 
Second, the interposition of a C-D list is ficient 
in disturbing the superiority of choice Ss A-B. 
This result is important in substantiating th similar 
effect observed in Experiment I, in which A-C 
list was interposed, but does so under co ditions 
in which first-list learning was not differen: in the 
groups subjected to the choice or force procedure, 
Of course, it is possible that the passage of time is 


responsible for the equalization of performance on 
the A-B trials rather than the effect resulting from 
the interposition of either the A-C or C-D list. 
This alternative cannot be ruled out completely; 
rather, it suggests the necessity of determining the 
temporal stability of the often observed beneficial 
effect of choice. 
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AND VERBAL CODING PROCESSES ! 
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t many such devices can be explained in 
transformations required to accomplish 


Learning of a categorized noun list under serial and total presentations 
s 4 types of mnemonic coding instructions: imagery, 
The imagery group had the highest total 


ering Ss' superior organization. Serial 
ning and was highly facilitative for the 
ported the contention that effects of 
€ a function of the type of coding trans- 


formation necessary to produce them. 


Studies of verbal coding processes have concen- 
trated on identifying the parameters of specific 
control procedures such as imaging, language 
mediators, prose chaining, and organizational 
strategies. Essentially, it appears that the majority 
of known coding mechanisms can be expressed in 
terms of some smaller set of functional transforma- 
tions necessary to produce them. Furthermore, one 
can argue that not only are these transformations 
necessary for encoding to occur, but they are, in 
fact, the primary vehicle for transfer of verbal 
input into long-term memory. 

There seem to be at least 2 types of transform: 
tional controls that operate when encoding tak 
place. The first of these we have labeled a unit 
transformation. Unit transformations occur when 
S defines the functional characteristics of verbal 
stimuli by direct elaboration. For example, if the 
stimulus is a specific word, S might elaborate by 
placing the word in a visual representation (an 
image), adding additional language components 
directly to the word (syntactic mediation), or 
embedding the word in some semantic context (a 
narrative). 

We will call the second process an order trans- 
formation. Order transformations accomplish en- 
coding by performing sequence and grouping changes 
on the input stimuli. In other words, functional 
characteristics become class dependent. Studies of 
category clustering are exemplars of this transfor- 
mation mode. 

Of the 2 transformation processes hypothesized, 
order transformations seem to be the more complex. 
With unit transformations, S simply selects the 
most available and salient coding vehicle, embeds the 
new materials within it, and stores the resulting 
fused compound in memory. In the case of order 
transformations, however, 2 types of coding informa- 
tion are required for efficient storage. First, S 
must select some primary storage classification 
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(category, superordinate, etc.), and second, he must 
rearrange the input to match the class chosen. 
This hypothesis is buttressed by research showing 
higher recall levels for Ss who receive category 
classes as a part of the input information (Tulving 
& Pearlstone, 1966), allowing a bypass of the first 
step in the process The more complex order 
transformations would, of course, entail treating 
entire sets of transformed elements as single com- 
ponents in some new ordering strategy. 

On the basis of our reasoning, we would predict 
that instructions to use less complex unit transforma- 
tions should lead to higher recall scores than instrue- 
tions to use more complex unit or order trans- 
formations—even when the words learned consist 
of well-defined category sets. If the complexity 
hypothesis is valid, measures of order transforming 
(i.e., clustering) should be significantly greater for 
order-instruction Ss, in spite of inferior total recall 
performance. 

Finaly, a test of the transformation notion 
would result if stimulus presentation mode were 
varied. When items are presented singly, unit 
transformations should be facilitated simply because 
each item can be processed and stored individually. 
However, when the list is given en masse, one would 
predict an advantage for order-transformation 
instructions, since ease of selecting appropriate 
primary coding classes would be increased. 

Method. Two factors, preexperimental instruc- 
tions (imagery, clustering, narrative, and learn 
only) and presentation mode (serial or total) were 
combined factorially to form 8 treatment groups. 
Trials were varied as a within-Ss variable across 
each factorial cell. The design was thus a 4 (In- 
structions) X 2 (Presentation Mode) X 3 (Trials) 
mixed analysis of variance, with repeated measures 
on the Trials variable. 

The Ss were 83 undergraduate volunteers, 
randomly assigned to the experimental conditions in 
order of their appearance for the experiment. Three 
Ss were dropped from the study for failure to 
follow procedural instructions. 

The stimuli were 20 nouns selected from the 


Battig and Montague (1969) category norms. Five 
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TABLE 1 


Instructions 
= £ : —— m - 
Trial Imagery Clustering Narrative | Lear 
Serial Total Serial Total Serial iv RES al 
presentation | presentation | presentatiom | presentation presentation | presentation | pre ation 
9.1 10,00 $20 | om puo v ph | ^ 843 158 
3 (2.79) (1.33) (2.09) | (2.18) (3.62) | (3.20) 2) 
15.61 15.23 13.73 13,35 1453. | | 1201 
£ (1.77) (2.04) (3.05) | (2.62) (3.58) | (3.65) ) 
P 18.10 17.82 1596 | 163 16.21 | 15,24 
(1.10) (1.87) (2.02) | (2.16) (2.74) | Q9» 9) 
| | 


Note. Each mean is based on the recall performance of 10 Ss. 


words were chosen from each of the following 4 
categories: insects, natural earth 
seasonings, and sciences. All words were selected 
from between the third and fourth associative- 
frequency quartiles for each category. 
words were assigned randomly to list position, and 
were typed on either 3 X 5 in. filing cards or on 
white foolscap as a triple-spaced list. 

During the experimental sessions, S sat across 
a table from E and was read verbal instructions 
appropriate to his treatment. Following instruc- 
tion, S was shown 3 sample items, either serially 
or together, as an example of the task required. 

Imagery Ss were told to form mental images of 
each word ; clustering Ss were asked to group the 20 
items into common categories; narrative Ss were 
directed to chain the words together sequentially 
in a "short story" that was meaningful for them; 
and learn-only Ss merely said each word aloud 
during presentation. 

For serial Ss, E exposed each of the 20 cards 
at a 3-sec. presentation rate. Speed of exposure was 
controlled by beeps from a tape recorder. In the 
total list conditions, S was presented with the entire 
list for 60 sec. Again, time was counted from the 
taped beeps. To eliminate direct recall from short- 
term memory, Ss were required to solve a page of 
complex arithmetic problems following presentation. 
The interpolated task was continued for exactly 1 
min. Finally, S was given a sheet of lined paper and 
a 2-min. period to free recall as many of the 20 
nouns as possible. This procedure was replicated 
exactly for each of the 3 trials. Following completion 
of Trial 3, S was given a questionnaire assessing 
the degree to which he had used the appropriate 
learning strategy. 

Recall protocols were scored for total recall, with 
minor spelling errors counted as correct. Degree 
of organization was calculated for each recall, using 
Bousfield’s (1953) ratio of repetition (RR). Perse- 
yeration errors were discounted in the calculation 
of RR. 

Results. Table 1 presents item-recall means and 
standard deviations for instructional conditions and 


presentation modes for each of 3 trials. 4 (. astruce 

formations, tions) X 2 (Presentation Mode) X 3 (Tria! analy- 
sis of variance yielded significance for main 

effects of instructions, F (3, 72) = 4,63, <01, 

he 20 and Trials, F (2, 144) = 580.88, p < .01 d the 


Instructions X Trials interaction, F (6, 144 2.53, 
p «.05. Comparisons among the in uction 
means using Scheffé's test showed that im. ory Ss 
recalled significantly more words than other 
treatment groups (p < .05). Recall performace of 
learn-only Ss was significantly below that the 3 
transformation conditions (p < .05). The !«struc- 
tions X Trials interaction occurred simply because 


the 3 transformation groups showed a itive 
acceleration in recall gain for Trials 2 and : 'reas 
performance of the learn-only controls was linear 
due to a lack of ceiling effects. Presentation mode 
failed to contribute significantly to recall per- 
formance. 

To assess the degree of order transformation, RR 
was calculated for each S on every trial. Table 2 
gives RR means and standard deviations for the 
instruction and presentation mode conditions across 
trials. A 4 (Instructions) X 2 (Presentation) X 3 
(Trials) analysis of variance was significant for the 
main effects of instructions, F (3, 72) = 11.57, 
p <.01, presentation mode, F (1, 72) ] 
p < .05, and Trials, F (2, 144) = 17.00, p < 0l, 
and the Instructions X Trials F (6, 144) — 29.00, 
b < .01, and the Instructions X Presentation Mode 
X Trials, F (1, 144) = 5.50, p < .01, interactions. 
A Scheffé test among the instruction means showed 
clustering 5s having a significantly higher RR than 
the remaining 3 groups (p < .05). No other post 
hoc comparison reached significance. The Instruc- 
tions X Trials interaction shows that neither the 
learn-only nor narrative group increased RR 
appreciably over trials. Imagery Ss demonstrated 
a slow rate of climb, and clustering Ss show 4 
negative acceleration in Trial 3, probably due to 
both ceiling effects and the restrictive parameters 
of RR itself (Shuell, 1969). 

Discussion. These data are essentially supportive 
of our model. The learning of a discrete noun list i5 
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TABLE 2 
MEANS AND STANDARD DEVIATIONS (IN PARENTHESES) FOR RATIO OF REPETITION 


Instructions 
Trial Imagery Clustering Narrative Learn only 
Serial. Total. Serial. Total. Serial Total Serial Total 
presentation | presentation | presentation | presentation presentation | presentation | presentation | presentation 
21 Ad 237 | AL 23 20 26 A0 
i (.02) (02) (.05) (.02) (.03) (.02) (.02) 
; AL 22 A8 «32 33 35 16 
x (.03) (02) (05) (.03) (.04) (.02) (.04) 
2 152 E 51 | 54 E 20 30 E 
(04) (04) (05 | = Cos) (.02) (04) (.02) (04) 


facilitated by instructions to transform the items 
in a specific manner prior to storage. Furthermore, 
the recall power of such instructions can be pre- 
dicted from the type of transformation required. 
The imaginal unit manipulation led to greater 
facilitation than did either the more complex 
narrative or order transformations. This finding 
occurred in spite of the superior organization scores 
of clustering S 

Differences in item recall as a function of presen- 
tation mode did not occur as predicted. It appears 
possible that input vari bles are not as important 
for transformation processes as we assumed. It may 
be that Ss tend to treat total list items as single 
units—transforming them sequentially rather than 
selectively. 

'The Instructions X Presentation Mode X Trials 
interaction provides some interesting results. The 
reversal of the clustering group performance across 
trials is clearly due to serial Ss’ lack of an efficient 
order strategy on the first trial. However, on later 
trials, organization surpassed that of even the total 
list group once categories were learned, A more 
critical point concerns the organization performance 
of imagery Ss who received the serial list. The high 
RR scores for these Ss on Trials 2 and 3 suggests 
a tendency to use both unit and order transformations 
to accomplish encoding. Data from the post- 
experimental questionnaire indicate that these Ss 
first transformed the words in a unit fashion, using 
images, and then began to link the units together, 
using an order transformation. For example, one 


S in the group responded "first tried to form pic- 
tures . . . then saw them together (like) a katydid 
fighting a mile, . . ." Here, the unit-transformed 
images are interactively combined under the cate- 
gory "insects." Several Ss reported attempts to 
link the words in some fashion once they had formed 
the images. Such multiple transformations should 
result in a high probability of images in the same 
category being recalled together—exactly the effect 
shown by our data. In this cas al presentation 
may have expedited the multiple-transformation 
process in an effort to code rapidly or to reduce the 
load on memory. 

The results of this preliminary study are highly 
suggestive that research on the transformation 
process will prove profitable, If we are able to 

yate the effects of transformations as they oceur 
singly and together in natural language, we will 
have gone far in explaining the processes underlying 
human verbal behavior. 


Barnc, W. E, & MoxrAGUE, W. E. Category norms for 
verbal items in 56 categories: A replication and extension of 
the Connecticut category norms, Journal of Experimental 
Psychology Monograph, 1969, 80 (3, Pt. 2). 

BousrrELp, W. A. The occurrence of clustering in the recall 
of randomly arranged associates. Journal of General Psy- 
chology, 1953, 49, 220-240. 

SuuELL, T. J. Clustering and organization in free-recall. 

Psychological Bulletin, 1969, 72, 353-374. 

TurvixG, E., & PEARISTONE, Z. Availability versus accessi- 

y of information in memory for words. Journal of 

Verbal Learning and Verbal Behavior, 1966, 5, 381—391. 


(Received January 22, 1973) 


a 


Experim: 
102, Nt 


-— Journal Psycholo; 
i on Val. sa aa ii 
TIME-OF-OCCURRENCE CUES FOR “ 


STUART T. KLAPP 


California State 


When Ss shadow (repeat aloud) an 


UNATTENDED” AUDITORY MATER 


! AND PATRICIA LEE? 


University, Hayward 


auditory message arriving in one ear, 


memory for material arriving in the other ear falls off over a few seconds of 


presentation-to-recall delay. Contrary to Glucksberg and Cowen, however, 


an item in the nonshadowed ear to th: 


In the shadowing experiment, S repeats aloud 

- (shadows) what is heard in one ear while addi- 
- tional material is presented in the other ear. Much 
interest has surrounded the fate of the material 

that is not shadowed—the so-called “unattended” 

material. Although there seems to be little if any 

long-term memory (LTM) for this material, as 

assessed by recall at the end of a period of shadow- 

ing, short-term memory (STM) may be present 

(Glucksberg & Cowen, 1970; Norman, 1969). The 

present investigation focused on one alleged prop- 
erty of STM for nonshadowed material—the pos- 
sible lack of time-of-occurrence information. 

Glucksberg and Cowen had Ss recall digits that 

were embedded in prose and presented to the 
nonshadowed ear. A signal light indicated that 

S was to stop shadowing and attempt to recall 

the digit. The presence of STM without appre- 

ciable LTM was shown by a rapid decrease in 
- digit recall as the digit-to-signal delay increased 
over a few seconds. The Ss were also asked to 
"estimate the distribution of delays between digit 
occurrences and onsets of the cue light [p. 1547" 
- and they “uniformly reported that there had been 
o delays; either a digit had occurred contiguously 
ith or just before a cue light, or the cue light 
ad flashed alone [p. 1541." This finding led the 
iuthors to conclude that memory for nonshadowed 
material does not contain "time tags." This is 
- indeed a remarkable property for STM and would 
| seem to justify further investigation. However, 
one must view this conclusion with caution, since 
Ss were asked to estimate these delays in terms 
of a distribution for the entire session, rather than 
individually for each trial. The purpose of the 
present experiment was to assess in a more sen- 
sitive manner whether Ss can judge the length of 
the digit-to-cue delay. 

Method. The present experiment attempted to 
replicate the Glucksberg and Cowen (1970) pro- 
. cedure in all essential features, except that Ss were 
asked to estimate the length of the delay from 
digit presentation to recall cue after each trial 
rather than to estimate the distribution of these 
delays at the end of the experiment. It was as- 
sumed that this change in procedure would yield 
. a more sensitive estimate of the extent to which 
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Ss can accurately estimate the length of delay from the time of presentation of 


e time at which the recall signal occurs, 


Ss were aware of the time at which th nory 
digit was presented. 
Six right-handed students who report: mal 
ing and English as their native langu rved 
as Ss in order to fulfill an introductory p logy i 
course requirement at California State Un — rsity, | 
Hayward. Before serving in the main exp ment, 
each S received 1 min, of practice in s! »wing 
prose in the presence of a distracting me- ge in 
the other ear. 1 
The message to be shadowed was a pas age of 
English fairy tales, while the nonshadowed : .vssage 
was taken from David Copperfield and c tained 
embedded test digits. Both 32-min. mess: were 
read in the same female monotone at 160 .ords/ 
min. For half of the Ss, the material to t- shad- { 
owed appeared in the right ear. The ming 
relationship between the embedded test di and 
the cue light was obtained by recording in::.dible 
signal pulses in one channel of a 2-track e re 
corder such that these pulses followed tic test 


digits in the other track by predetermined intervals. 
These signal pulses triggered a white cuc light 
which was S’s signal to stop shadowing and recall 
the digit. Test digits were randomly selected from 
the digits 1-9, eliminating the 2-syllable digit 7. 
All other digits and digit homonyms were eliminated 
from both prose passages. 

Trials were presented in 2-min. blocks, with each 
block of trials containing all 6 delays (0, 1, 2, 4, 8 
and 16 sec.) presented in a random order. The 
delay from the recall cue of the preceding trial to 
the introduction of the next test digit was approxi- 
mately 15 sec. Each S received 16 blocks of 6 
trials. The Ss were instructed to stop shadowing 
when the cue light appeared and to recall any 
digit recently heard. Although Ss were encouraged 
to guess what the digit was, they were not require! 
to respond on each trial They were also to 
estimate the delay time from digit to signal light 
by calling out an estimate of 0-16 sec. each time 
a digit was recalled, 

Results. In agreement with Glucksberg and 
Cowen (1970), digit recall was approximately 25% 
at minimum cue delay and decreased at longer 
delays, F (5, 25) = 2.61, p < .05. As is clear from 
Table 1, the asymptotic level of recall was higher 
in the present experiment, but this may be at- 
tributable to differences in chance levels in the 2 
experiments (S. Glucksberg, personal communica- 
tion, October 1972). In the present experiment, 
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smaller set of digits was used, and there were no 
"blank" trials in which a cue light appeared when 
there had been no digit. Unfortunately, it is 
difficult to assess the chance level in either experi- 
ment, since Ss were not required to respond on each 
trial. ch a procedure would have been useful for 
purposes of analysis, but would seem unnatural to 
Ss, who frequently reported that no digit at all had 
been heard, Regardless of these details, it seems 
reasonable to conclude that the decaying STM for 
unatten led digits has been replicated in the present 
experiment, so that it appears that the same basic 
phenomenon is being investigated. 

Considering only those trials on which digit recall 


was t, Ss' estimates of the cue delays were 
highly correlated with the actual delay from digit 
to cue light, r =.90, p «.001. Furthermore, 
76% of the estimates were not more than 1 sec. 


different from the actual delay, while 32% of the 
estimates were identical to the actual delay. 
Discussion. The present results provide no sup- 


port for the assertion of Glucksberg and Cowen 
(1970) that time cues are missing from STM for 
nonshadowed material, but rather demonstrate 
that time-of-occurrence estimates can be made 
with greataccuracy. Having eliminated the alleged 
f time cues as a property of STM for non- 
hacowed material, there appears to be no reason 
;ssume that STM for nonshadowed material is 
different from STM in the distractor paradigm of 
Peterson and Peterson (1959). In both cases, 
r ecall f ills off dramatically after only a few seconds 
ıl activity. The lower initial recall of non- 
badowed material compared to the initial recall 
distractor paradigm can be attributed to 
reduced perceptibility of the to-be-remembered 
items in the presence of the sound and the verbal 
X shadowing. However, forgetting of per- 
ceived items may be assumed to follow the same 
laws in both eases in the absence of evidence to 
contrary, 
shadowed material seems to be understood 
even though it is not remembered for long. Corteen 
and \Vood (1972) observed galvanic skin responses 
to names of cities in the nonshadowed channel 


n the 
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TABLE 1 


Percent Dicit Recatt As A Function OF Cue DELAY 
(IN SEC.) 


Delay of cue 


Source of data 


e 
[3 
ES 
æ 
a 


Glucksberg and Cowen 
(1970)—estimated 
from curve 


Present experiment 26 23 17 17 14 | it 


Y 
x 
s 
S 
= 
LI 
ow 
wo 


when other city names had previously been asso- 
ciated with shock. Lewis (1970) found that the 
reaction time to repeat (shadow) a word was a 
function of its semantic relationship to a simul- 
taneously presented word in the nonshadowed 
message. In both of these studies, there was 
essentially no recall of nonshadowed material at 
the end of the trials, indicating that the material 
had not entered LTM even though it was com- 
prehended at some level. These findings, as well 
as the rapidly decreasing memory fer nonshadowed 
material, may be handled by assuming that shadow- 
ing reduces rehearsal in a manner analogous to the 
role of counting in the Peterson and Peterson (1959) 
paradigm. It is commonly assumed that reducing 
rehearsal reduces LTM. On the other hand, 
rehearsal may not be needed for comprehension 
or STM—both of which appear to exist for non- 


shadowed material. 
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In the shadowing of connected discourse, Ss’ performance deteriorated whe: 
intersentence relations were altered or disrupted. The following 3 methods 


were used to disrupt intersentence relation: 


(a) insertion of extraneou: 


material, (b) permutation of sentence order, and (c) deletion of sentences 
The insertion of extraneous material had the greatest effect on shadowing 
performance. This performance was most disrupted in the passages with the 
greatest number of anaphoric relations. At the shadowing lags found in the 
present study, Ss used intersentence relations as cues in their speech processing. 


The present experiment investigated the effect of 
varying the contextual constraints between sen- 
tences on the shadowing of connected discourse. 
A shadowing task is of interest primarily because 
of the light it sheds on the mechanisms of immediate 
speech processing. Earlier researchers (Chistovitch, 
1960; Chistovitch, Aliakrinskii, & Abul'ian, 1960; 
Galunov & Chistovitch, 1966) found that it was 
possible to shadow as closely as 200 msec. Con- 
sequently it was possible, when pressed, to shadow 
on the phonemic level, or, at most, the level of 
the word. In this case, the shadower does not use 
any other information beyond that in the imme- 
diate word or phoneme being processed. More 
recent experiments indicate that syntactic and 
semantic information from the whole sentence can 
be operative in shadowing. Using a series of 
statistical approximations, Treisman (1965b) found 
that Ss shadowed with less error as the material 
approached normal English. Using mixed sets, 
Miller and Isard (1963) found that normal sen- 
tences were better shadowed than nonsense sen- 
tences, which in turn were more easily shadowed 
than scrambled nonsense sentences. In our turn, 
we have shown how the relations between sen- 
tences can affect shadowing ability. 

Olney and Londe (1966) and Olney (1969) in- 
vestigated the class of intersentence relations called 
anaphoric relations, and specified criteria for tracing 
these relations between sentences. An anaphoric 
expression is one which functions in part as a sub- 
stitute for a preceding expression in the same di. 
course. For example, in the following sentence, 
“Tt” is an anaphoric reference: “John hit the ball. 
It went over the fence," The number of anaphoric 
expressions in each test passage is a measure of 
the intersentence relations in that passage, although 
there are of course many other types as well. 
Two expressions were considered to be anaphori- 
cally related when they were coreferential. This 
class of intersentence relations is not exhaustive, but 
it does provide one quantitative measure of the 

contextual coherence of the 3 test passages, and is 
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indicative of the relative amounts of sentence 
relations in them. 


Method. Three test passages were c! sen which 


contained different amounts of interse ence con- 
nections. Passage I was a passage fre: an intro- 
ductory textbook in psychology, Pas» :e ll was 
a newspaper editorial, and Passage ||| was a de- 
scriptive passage from a novel. Pass | had 40 
anaphoric relations, Passage Il had and Pas- 
sage III had 17 such relations. 

To investigate the effects of the di-vuption of 
contextual coherence on shadowing abi! 8 levels 
of contextual connectivity (including the «-iginal un- 
altered passages) were established by systematt 
cally altering the 3 test passages. Fou of these 
8 levels are defined as follows: Level original 
versions; Level 2, random-order sentences, in which 
each passage had the order of its sentences ran- 


domized, disrupting order- and contiguity dependent 
intersentence relations; Level 3, reversed order of 
sentences, in which closeness of each 
the other was maintained so that, for example: 
word associations might still hold, but the sequent? 
of order-dependent relations was altered; an 
Level 4, random sentences from the sé > sources 
in which new passages were formed by choosing, 
in each case, 22 sentences (without duplicating 
earlier choices) at random from the same source that 
provided the original passage; this preserves yle, 
but little else. 

The following manipulations, unlike the preceding 
ones, preserve the sentence order, but introduce 
extraneous material: In Level 5, alternation © 
passage sentences with samples from same source 
a passage was chosen from the same source as the 
original and its sentences were alternated with the 
sentences from the original passage, in order, $0 
that the first sentence from the first passage was 
followed by the first sentence from the secon! 
passage; then the second sentence from the first 
passage, and so on This intersperses the test 
passage with a second coherent passage in the sam? 
style. In Level 6, alternation of passage sentences 
with samples from other sources, sentences from 
Passage | were alternated with sentences from 
Passage III, likewise the sentences from Passage? 
II and III, and from Passages I and 1l. This i 
similar to Level 5, except that now the intersperse 
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passages differ on such things as style and a more 
divergent vocabulary. Level 7, alternation of 
partial passages from the same source, was identical 
to Level 5, except that every second sentence in 
each passage was deleted. Hence the first sentence 
of Passage I was followed by the second sentence 
of Passage II, and then by the third sentence of 
Passage I, etc, This reduces the coherence of the 
passage, which might have a greater effect on a 
passage such as I, which developed an argument, 
than on Passage III, which was descriptive. 
Level 8, alternation of partial passages from dif- 
ferent sources, was identical to Level 6, except 
that, as in Level 7, every alternate sentence was 
deleted. Once again there are now style differences 
between the partially destroyed passages. 

Passages I, II, and III each contained 22 sen- 
tences, although the number of words varied. 
Note that in each of the 8 conditions the test 
passage sentences have not been altered, only 
their relation to each other has been rearranged. 
Two practice passages about 6. min. in length 
were administered. These were literary in nature, 
but bore no relation through content or style to 
the test material. 

The resulting 24 passages were recorded by a 
linguist at an average rate of 150 words/min, using 

ormal inflection. Each S shadowed only one test 
passage, lasting 3-6 min. Five Ss were tested on 
each passage, with the exception of tbe 3 original 
passages, where 11 Ss per passage were used. 
^ total of 138 s par! ated. These Ss were 

jlunteers who had taken at least one psychology 
course, making them familiar with the "technical" 
erms in Passage I. ‘They were tested in groups of 
approximately 12, in separate booths (with an 
empty booth between) of a language laboratory. 
All equipment was controlled by Æ, who also read 
the instructions aloud, The stimulus passage was 
given over headphones and also recorded on one 
track of a tape. The shadowed response was 
recorded on the other trai 

Scoring. The raw data were scored for the 
number of errors per sentence. These results were 
transformed into percentages of words correctly 
shadowed per passage, permitting comparisons 

ss passages. An error consisted of any re- 
se which differed, through omission, distortion, 
or substitution, from the original. 

To score the time lag in shadowing, each of 2 
scorers listened to a separate track containing 
either the test passage or the shadowed response. 
When the scorer listening to the stimulus track 
heard each fifteenth word, he pressed a relay to 
start an automatic timer. The second scorer also 
pressed a relay when hearing the same word on 
the track containing S’s response, stopping the 
timer. The timer gave an automatic printout of 
each time interval to the nearest millisecond, which 
was rounded off to the nearest tenth of a second. 
There were 30-60 time intervals measured per 
test passage. 

Both error and time scores were taken only on 
those sentences from the original test passages. 
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TABLE 1 


MEAN PERCENTAGE CORRECT (IN PARENTHESES) AND MEAN 
TIME Scores (IN SEC.) 


Passage 
Contextual connec- - Fh 
tivity level n " HI 
(Text) | (News) | (Novel) 
1. Original version (3.5) | (75.1) | (82.5). | (77.1) 
E 498| esl 1565 
2, Random order, within | (63.8) | (68.8) | (66.7) _ | (66.2) 
passage i 5 1068 | 
3. Reversed order, within | (78.4) | (82.5) | (68.9) | (76,6) 
passage 1690] 509| 1529] 5 
4. Random, same source | (68.3) | (56.7) | (72.3). | (65.9) 
5 141 1| .76 
5. Alternated, same (47.6) | (61.7) | (72.8) | (60.7) 
source (832| — .6 a544 | .67 
6. Alternated, different | (59.3) | (73,5). | (61.7) _ | (64.7) 
source 1.0 E M E 
lternated and omitted, (74,0) | (72:6) | (65.2) | (70.6) 
same source 462| | .5 .651| ` .663 
8, Alternated and omitted,| (71.4) | (70.0) | (68.5) _ | (69.9) 
different source ‘769|  .588|  .673|  .67 
M (66.2) | (72.1). | (68.7) 
J10|  .648|  .619 


In conditions where other material was used to 
intersperse the sentences of the original passage, 
this interspersed m s not scored. Con- 
sequently some pas 1 a larger number of 
sentences scored than other passages, For example, 
in the last 2 conditions, sentences were deleted 
from the original passages. As there might have 
been a "ceiling" effect in the error scores, which 
were percentages, they were reanalyzed using an 
arc sine transformation. (Winer, 1962, p. 221). 
These analyses did not differ significantly from 
those performed on the untransformed data. It 
is the analyses of the untransformed data that 
are reported below. ‘The time latencies, on the 
other hand, had a lower, but no upper, limit, 
hence geometric means of each S's scores were 
used in computing a mean time lag per S for a 
test passage. Nonhomogeneity of variance tests 
were not significant, 

Results and discussion. Table 1 gives the mean 
percentage correct and mean time lag for the 3 
original passages, and all manipulations of these 
passages. The 3 unaltered, original passages 
(Level 1) were tested for differences using a one- 
way analysis of variance. The analysis did not 
reveal significant differences in the ability of Ss 
to shadow these passages. 

Two separate analyses of variance (two-way) 
were performed on all 8 levels of the 3 test pas- 
sages, on the error and time lag scores, respectively. 
There were significant passage differences for time 
lag scores, F (2, 114) = 3.32, p <.05. To inves- 
tigate these passage differences, a one-way analysis 
of variance performed on each test passage 
over all 8 levels of contextual connectivity. Pas- 
sage I had significant variations in both error, 
F (7, 38) = 3.96, p < .01, and time scores, F (7, 38) 
= 3.07, p < .05, over all levels. Passage II had 
significant variations in time scores, F (7, 38) 
= 3.79, p <.01, over all manipulations, Pas- 
sage III did not produce any significant differences. 
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To determine which manipulations were causing 
this effect, each level was compared to Level 1, 
the original versions. The comparison of Level 1 
with Level 5 resulted in significant differences in 
error scores, / (47) = 2.22, p < .05. Similarly, the 
comparison of Level 1 with Level 6 resulted 
in significant differences in both error, ¢ (47) 
= 2.24, p < .05, and time scores, £ (47) = 3.05, 
P «.01. The results seem concentrated in the 
manipulations that preserve sentence order, but 
introduce an alternate context. To test this, a 
two-way analysis of variance was performed on 
Levels 1-3, which vary sentence order. No sig- 
nificant effects occurred. Similarly, Level 1 with 
Levels 5-8 were analyzed, and significant main 
effects for time and error differences between the 
levels resulted, Fs (4, 93) = 4.53 and 4.24, ps < .01, 
for the time and error scores, respectively. To test 
this, Level 1 was contrasted to Levels 5-8. This 
was significant for the error scores, F (1, 93) = 4.86, 
p <.05. 

Our results show that in the shadowing of con- 
nected discourse, Ss may use the relations between 
sentences. Along with earlier findings (Miller & 
Isard, 1963; Treisman, 1965a, 1965b) the authors 
have shown that Ss are able to use contextual 
cues in shadowing. We have extended these pre- 
vious findings that Ss use relations within the 
sentence by showing that they are able to use 
the intersentence relations found in connected 
discourse as well. 

Our manipulations of test passages reduced the 
ability of Ss to shadow the same sentences in the 
various conditions. The insertion of sentences 
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from an extraneous passage into the t 
caused most of the disruption in 
A larger effect was found in passage 
anaphoric relations. No differences w 
the ability to shadow the unaltere 
Either the anaphoric relations do no 
shadowing of normal passages, or the 
fered on some other measure which 
any effect due to different numbers 
relations. 


vassage 

mance, 
h more 
ound in 


ces out 
aphoric 
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CONDITIONS THAT DETERMINE EFFECTIVENESS OF PICTURE- 
MEDIATED PAIRED-ASSOCIATE LEARNING! 


KEITH A, WOLLEN? AND DOUGLAS H. LOWRY * 


Washington State University 


Previous research has demonstrated that paired-associate learning is facilitated 
when the pairs are accompanied by pictures showing the pair members in some 


sort of spatial or interactive relationship. 
conditions under which such facilitation occurs. 


"Two experiments extended the 
In Experiment I, facilitation 


was obtained even though all pictures were presented before any of the word 


pairs and no specific instructions were given to use the pictures. 


In Experi- 


ment II, interacting pictures produced facilitation relative to either a no- 


picture control or to controls having noninteracting pictures. F 


cilitation was 


obtained when the pictures were presented either before the word pairs or at 
the same time, but not when the pictures were presented after the pairs. 


It is well known that Ss can learn paired associates 
more rapidly when an interacting picture is shown. 
along with each word pair than when no such 
picture is presented (e.g., Epstein, Rock, & Zucker- 
man, 1960; Wollen, Weber, & Lowry, 1972). A 
common interpretation of such facilitation is that 
the picture is stored with the word pair during input, 
and at output, the stimulus term presumably elicits 
ın image of the interacting picture, which is then 
decoded into the response term (Paivio, 1971). In 
previous research on such matters, Ss have been 
given the pictures at the same time as the words, 
and have been instructed to use the pictures to help 
them learn the pairs, The questions raised in 


periment Fare: (a) just how effective are pictures 
when they are presented at a different time from the 
word pairs, and (b) whether pictures produce inter- 
ference if they show 2 objects, only one of which is 
contained in the to-be-recalled pair. 


EXPERIMENT | 


The learning materials consisted of the 
of concrete nouns: List 1 
HAMMER, KEG-TABLE, NAIL-FLOWER, 
PHANT, WIGWAM-UMBRELLA, CHAIR-HO! 
TREE, WHALE-BIRD; and List 2—BOT! 
KEG-APPLE, NAIL-LOBSTER, SNAKE-RE 
WAM-ARROW, CHAIR-FLAG, HORS! WHALE- 
DIAMOND. The 2 lists were counterbalanced such 
that half of the Ss learned List 1 and half List 2. 
The words ranged from 6.23-6.90 on the imagery 
scale of Paivio, Yuille, and Madigan (1968). For 
each pair of words, a silhouctte picture was made 
that depicted the 2 nouns in a spatial or interactive 
relationship (e.g., an arrow stuck in a wigwam, a 
hammer about to hit a bottle, et Each silhouette 
and each word was made into a 2 X 2 in. slide. 
"These materials were projected onto a 60-in. viewing 
screen. 


Method. 
following pairs 


1'This investigation was supported in part by U.S. Public 
Health Service Research Grant HD-02249 from the National 
Institute of Child Health and Human Development, Apprecia- 
tion is expressed to Marvin Brown, who ran Experiment I. 

2 Requests for reprints should be sent to Keith A Wollen, 
Department of Psychology, Washington State Uni 
Pullman, Washington 99163. F 

? Now at Michigan Technological University. 


The Ss were 100 introductory psychology students 
whose participation partially fulfilled a course re- 
quirement. The Ss were run in small squads of 5 
these squads were randomly assigned to 5 experi- 
mental groups. As indicated in Table 1, the groups 
ered with respect to which of 4 dillerent stages 
were used and which types of pictures were em- 
ployed. In Stage 1, designated A-P,, a word (the 
A term) was shown by itself for 1.5 sec, and was 
followed by a picture of that A term (P4) presented 
by itself for 1.5 sec. This procedure continued until 
all eight A-P4 combinations had been presented, 
The Ss were told to associate each word with the 
picture that followed it. ige 2 consisted of the 
presentation of 8 pictures without any words, In 
each picture, 2 objects were depicted in a spatial or 
interactive manner. Depending upon the condition, 
the drawings pictured the 2 terms of the to-be- 
recalled pair (Pa + n) of Stage 3, the response term 
of the Stage 3 pair with an irrelevant object (C) 
(Pe + n), or the stimulus term of the Stage 3 pair 
with an irrelevant object (Pa + c). Each of these 
pictures was shown for 1.5 sec.; Ss were told to 
pay close attention so that they could remember 
the pictures. Stage 3, designated A-B, involved 
the presentation of the stimulus word (A) of a pair 
for 1.5 sec., then the response word (B) for 1.5 sec., 
and finally a 1, blank interval which served to 
separate the pa The Ss were instructed to learn 
the pairs so that they could provide one member 
given the other; no mention was made of using the 
pictures from the previous stages, Finally, Stage 4 
(test) consisted of presenting each A word for 1.5 
sec. followed by a 3,5-sec. interval for S to write the 
missing B term. 

Performance was expected to be the highest in 
Groups 1 and 2 since the objects pictured in Stage 2 
were the same as those named in the A-B stage. In 
Groups 3 and 4, in contrast, the pictures might 
produce interference, as only 1 pair member (either 
A or B) was depicted in Stage 2. Group 5 represents 
à no-picture control. 

Results. "The mean numbers correct and numbers 
of overt errors are shown in Table 1. Since there 


were significant differences among the mean numbers 
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TABLE 1 
FREQUENCY DATA IN EXPERIMENT I 
Frequency 
Group PEERFRIPEWNSCURTENLA 
TM Errors 

4 A-Pa, Pays, A-B, test 6,40 465 
2 —, Pain, A-B, test 6.08 AS 
: A-Pa, Poss, A-B, test 3.70 1.90 
É A-Pa, Pasc, A-B, test 3.50 1.70 
B SK A-B; test 4.25 1.30 


= 


Note. Abbreviations: A * stimulus term, P = picture, 
= response term, C = irrelevent object. 


correct, F (4, 95) — 55.15, p. € .001, a Dunnett's 
test was conducted comparing the Group 5 mean 
with each of the other 4 means. A difference of 1.23 
was required at the .05 level. As expected, Groups 
1 and 2 were significantly higher than Group 5. 
However, the trends for Groups 3 and 4 to score 
below the control group were not significant. The 
error data, which show the same trends, were not 
analyzed due to the small means and skewed dis- 
tributions. In general, it appears that pictures can 
enhance performance even though they are presented 
at a different time from the word pairs. 


EXPERIMENT II 


In Experiment I, the superiority of Groups 1 and 2 
might have been attributable to the fact that these 
groups in effect had 2 trials (P4 , nand A-B) whereas 
Group 5 had only 1 (A-B). This possibility was 


_ explored in Experiment II by giving the standard 


control group 2 successive trials (i.e, A-B, A-B) 
rather than 1. In addition, for every interacting- 
picture condition, there was another control group, 
which received pictures with the objects in a side-by- 
side, noninteracting fashion. The use of noninter- 
acting pictures equated each interacting-picture 
group with its noninteracting control group in every 
respect except for the presence or absence of inter- 
action; hence, interaction per se can be examined. 
Finally, groups were added in which pictures were 


presented (a) along with th d pairs, 
after all word pairs, eo m 


Method. The material 


: s were the same as th 
used by Wollen et al, ae 


j (1972) in their interacting- 
nonbizarre „and noninteracting-nonbizarre condi- 
tions. Basically, there were 9 high-imagery noun- 
odds E of which appeard at the bottom 
of a slide. en pictures wer d 
above the word Eu ips aes CS 
„_ The design consisted of the 9 groups outlined in 
able 2. The first and second columns indicate the 
conditions present on Trials 1 and 2, respectively 
The symbols are the same as those used in eed 
ment I with the exception of P4 y which refers to 
noninteracting pictures. For example, Group 2 
received noninteracting pictures without words on 
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Trial 1 and the word pairs ak 
received interacting pictures 
pairs on both trials, and so 
tained 20 introductory psycho 
whose participation partially 
quirement. The Ss were r 
5 each. A randomized block 
in which one squad was run i 
tions before beginning the nex! 
The sequence of events wa 
interval, Study Trial 2, 50-sec 
70-sec. interval, Test Trial 2 
sented in the same order on the 
1 of 4 different orders on the 2 
between trials were used to pro 
to pick up and distribute res 
slide, whether words only, pict 
was presented for 1.5 sec, with 
on study trials. On test trial 
shown each stimulus word (or a v 
the pictured objects) for 2 sec., f 
blank period, during which S wr 
sponse word (or object). All m 
sented via back projection to a 20 
The instructions varied some 
condition. On Study 1, Group 
to study the pictures and to r 1 
about them as they could. The Study 2, thet 
Ss were told to try to rememb« 
Study 1 as an aid to Study 2 | 
also instructed to learn such that 
one member given the other. Groups 3, 
were all told on Studv ! to k 
provide one member given th 
received similar instructions for ^ 
Groups 3 and 4 were instructed to study the pico 
because the pictures would help them remember ^ 
previously presented word pairs. For both sty 
trials, Groups 6 and 7 were told to use the picture 
to help them learn the pairs. Finally, Groups 1 
and 9 were instructed to learn the pictures 59 tl v4 
they could name one object when given the name 
the other. The instructions for test trials Wë 
simply to write the missing pair member. 
Results and discussion. The mean number cure 
was determined for each test trial. Since there V 


rial 2; Group 6] 
»anied by word 
ich group con: 
idents (N9 180 
d a course re 
mall squads o 
dure was used 
of the 9 condi- 


Irial 1, 30-see, 
ul, Test Trial 1 
pairs were pre- 
ly trials and inj 

The intervals 
structions and 
sheets, Bach 


d naming one of 
wed bya +sec } 
the missing rè- f 


ther. 
Study 2, bU 


TABLE 2 
Mean Corract 


ORI A 


Z erm, 

Note. Abbreviations: P = picture, A = stimulus (Pn 

= response term; PA. indicates a noninteracting Pi 

Pan indicates an interacting picture. sais: - 
Means are for combined data for the 2 test trial 
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relatively little dillerence between the 2 trials, 
Table 2 presents only the means of the 2 test trial 
means. A simple randomized analysis of variance 
on all 9 groups was significant, F (8, 171) — 3.23, 
p < .01. Consequently, each group was compared 
to the standard control (Group 5) using Dunnett's 
test. The difference required for significance at 
the .05 level (1.51) was only reached by those groups 
having interacting pictures on Study 1 (Groups 1, 6, 
and 8). As expected, having interacting pictures 
only on the second study trial (Group 3) failed to 
produce a significant amount of facilitation. Simi- 
larly, none of the conditions with noninteracting 
pictures produced a significant amount of facilitation. 

In addition to the above analyses, 1-tailed / tests 
were run comparing each group having interacting 
pictures with its noninteracting-picture control. 
Group 1 was higher than Group 2, £ (38) = 1.86, 
Group 6 was above Group 7, / (38) = 2.43, and 
Group 8 was above Group 9, ¢ (38) = 2.37, all ps 
< .05. However, Group 3 was not above Group 
4; in fact, Group 3 was slightly inferior. Hence, 
the same results emerge as when each group was 
compared with Group 5; namely, interacting pic- 
tures facilitate when pictures are presented on 


Study 1 (Groups 1, 6, and 8) but not when they 
appear only on Study 2 (as in Group 3). 

Together, Experiments I and II indicate that in- 
teracting pictures produce a higher level of response 
than noninteracting pictures, or than no pictures at 
all. This facilitation occurs when (a) the pictures 
and word pairs are presented simultaneously or (b) 
all of the pictures are presented before any of the 
words, but not when (c) the pictures are given only. 
after the presentation of the word pairs. 
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University undergraduates studied and then had immediate and delayed recog- 
nition tests on either (a) photographs, (b) verbal descriptions of the main theme 
of the photographs, (c) unembellished line drawings of the main theme of the 
photographs, or (d) main-theme line drawings embellished with extra details 
from the photographs. Although the 3 pictorial conditions had higher recogni- 
tion than the verbal condition, they did not differ from each other on either 


the immediate or delayed recognition tests. 


These results imply that the 


recognition advantage for pictures over verbal descriptions is not due to the 


extra details which pictures contain. 


It is well known that immediate recognition is 
very high for ordinary pictures (Nickerson, 1965; 
Shepard, 1967), Furthermore, immediate recogni- 
tion is nearly always higher for pictures than for 
verbal descriptions (but see Goldstein, Locke, & 
Fehr, 1972). It has also been established that 
long-term recognition is quite high for pictures 
(Nickerson, 1968; Shepard, 1967). Although the 
long-term recognition of pictures has not been 
directly compared with that of verbal descriptions, 
the results of studies examining long-term recogni- 
tion of verbal descriptions only (e.g., Strong, 1913) 
suggest that long-term recognition might be higher 
for pictures than for verbal descriptions. Such a 
direct comparison in long-term recognition of pic- 


= tures vs. verbal descriptions was one focus of the 


present study. 

Because of the myriad of details contained in 
ordinary pictures, the verbal descriptions of most 
pictures are not entirely complete (cf. the aphorism 
* picture is worth a thousand words"), One might 
explain any recognition advantage for pictures over 
verbal descriptions in terms of the extra details 
contained in the pictures but not in the verbal 
descriptions; if 5 encodes additional pictorial details 
during study, then recognition of any one of those 
details at the time of test could serve as the basis 
for distinguishing the target picture from the dis- 
tractor. These extra details, collectively referred 
to here as “embellishment,” have been considered 
by previous researchers (Haber, 1970; Nickerson, 
1965) as a possible determinant of the high recogni- 
tion accuracy for pictures. The second (and major) 
focus of the present study was to directly manipulate 
the number of details contained in the to-be- 


1 This research was begun while T. O. Nelson was a N; 
Se lere dation Postdoctoral Fellow ab Stanford Universtiy: 
1970-71, and was completed with the support of U.S. Public 
Health Service Grant MH-21037 to T. O. Nelson. The authors 
thank David Gorchoff for his help in pilot research, Colin 
MacLeod and Karen Woodbury for assisting in the statistical 
analysis, and Geoffrey Loftus for his comments on the 
manuscript. 

? Requests for reprints should be sent to Thomas O, Nelson, 
Department of Psychology, University of Washington, Seattle, 
- Washington 98195. 


recognized pictures to determine if extra pictorial 
details facilitate recognition. 

Method. There were 4 sets of stimuli, each con- 
taining 120 items. One set (PH) consisted of black- 
and-white photographs used in a previous expe! 
ment (Nickerson, 1965) Two student judges inde- 
pendently examined the photographs and generated 
a one-sentence description of each photograph. The 
few differences that occurred in the 2 judges' descrip- 
tions were reconciled by a third student judge. The 
final 120 verbal descriptions (VD) comprised the 
second stimulus set. Line drawings of the photo- 
graphs were constructed on the basis of the PH and 
VD stimulus sets, so as to include only those aspects 
of the photograph that were mentioned in the 
verbal description. ‘The resulting line drawings 
comprised the third stimulus set, unembellished 
line drawings (ULD). Finally, extra details from the 
PH stimuli were added to ULD to form the fourth 
stimulus set, embellished line drawings (ELD). 
It should be noted that ELD was not as detailed 
as PH, since the latter contained shading and fine- 
grain details that could not be included in a line 
drawing. An example of an item appearing in each 
of the 4 conditions (VD, ULD, ELD, and PH) is 
shown in Figure 1. 

The Ss, run in groups of 4-5, were 136 under- 
graduates at the University of Washington, whose 
participation fulfilled a course requirement. One 
independent variable (between-Ss) concerned the 
type of to-be-recognized items. The Ss were as- 
signed to conditions according to order of appear- 
ance, beginning with 34 Ss per condition, and com- 
plete data were obtained from 33 Ss in Condition 
PH, 34 Ss in ELD, 30 Ss in ULD, and 31 Ss in 
VD (the loss of data from 8 Ss was due to failure 
to return for the second session). The second 
independent variable (within-Ss) was the retention 
interval for the recognition test, which was either 
immediate (7 min. after the items had been studied) 
or delayed (7 wk. after the items had been studied). 
Both tests were 2-alternative forced-choice recogni- 
tion tests. To avoid the problem of having once- 


3 Raymond S, Nickerson kindly provided these photographs. 
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tested items in the immediate test and twice-tested 
items in the delayed test, the following technique 
was used. Of the 120 items in a given condition, 60 
were studied and. 60 served as distractors. Thirty 
of the 60 studied items were tested only on the 
immediate test, whereas the remaining 30 studied 
items were tested only on the delayed test; similarly, 
30 of the 60 nonstudied items were the immediate- 
, test, distractors, whereas the remaining 30 non- 
studied items were the delayed-test distractors. 
Items were counterbalanced across Ss so that a 
given item served approximately equally often in 
each of the 4 studied-nonstudied immediate-delayed 
conditions. 

During study, a Kodak Carousel projector pre- 
sented the items at a rate of 10 sec. per item. 
Following presentation of the last study item, S 
solved arithmetic problems for 7 min. while E 
rearranged the slide trays for the immediate recogni- 
tion test. During the recognition test, pairs of test 
items were presented simultaneously from 2 Kodak 
Carousel projectors, synchronized by an external 
timing device to present each pair of test items for 
15 sec. The S made a written response to indicate 
whether the previously studied item was the left 
or right member of the test pair, guessing whenever 
necessary. Half of the previously studied items 

. were on the left and half were on the right, inter- 
mixed according to a Gellerman sequence (1933). 
Seven weeks later, S returned for the delayed recog- 
nition test. The 30 test pairs were presented at a 
rate of 20 sec. per pair, and S made a written re- 
sponse to indicate which member of the pair he had 
previously studied, guessing whenever necessary. 

Results. The mean percentage of correct recogni- 
tions for each condition of the immediate and de- 
layed tests is shown in Figure 2. These data were 
analyzed by a 4 X 2 (Conditions X Retention In- 
terval) mixed-model analysis of variance. Recogni- 
tion accuracy was significantly higher on the im- 
mediate test than on the delayed test, F (1, 124) 
= 284.5, p « .001. The main effect of the 4 condi- 
tions was also significant, F (3, 124) = 13.3, 
b.«.001, but only because PH, ELD, and ULD 

“were significantly higher than VD (PH, ELD, and 

ULD were not significantly different from each 
other, p > .05). The Conditions X Retention In- 
terval interaction was also significant, F (3, 124) 

1 p < .001; analyses of the simple interaction 
effects showed that recognition accuracy declined 
over the retention interval significantly more slowly 
‘for PH, ELD, and ULD than for VD (the rates of 
decline for the former 3 conditions did not differ 
significantly, p > .05). Parenthetically, it should 
be noted that, except for a significant simple inter- 
action effect of ELD-ULD X Retention Interval 
(025 < p < .05), transformation of the percentage 
correct data to d' did not alter any of the above 
findings. 

Discussion. Not only were immediate recognition 
and long-term recognition better for the pictorial 
conditions than for the VD condition, but the rate 
of recognition loss over the retention interval was 
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"A SMILING OLD MAN HOLDS 
A LITTLE GIRL” 


FIGURE 1. Example of an item in each of the 4 conditions 
(from top to bottom: verbal description, unembellished line 
drawing, embellished line drawing, and photograph), 
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FiGuRE 2. Mean percentage of correct recognitions as a 


function of retention interval. (Curve parameter is condition : 
photograph—PH, embellished line drawing—ELD, unembel- 
lished line drawing—ULD, and verbal description—VD.) 


greater for VD than for the pictorial conditions. 
"Theoretical interpretations of the latter finding in 
terms of forgetting should be made with caution 
because the high degree of original learning may or 
may not have been the same for the VD and pic- 
torial conditions; however, if one is interested in the 
practical consequences of presenting various in- 
formational representations (verbal, pictorial) for 
the same amount of study time, then long-term 
recognition is shown to be higher for pictorial repre- 
sentations than for verbal representations. 

The lack of effect of embellishment on pictorial 
recognition was quite unexpected. If the superior 
recognition of pictures is due to the extra details that 
pictures typically contain, then recognition accuracy 
should have occurred in the order PH > ELD > 
ULD. The absence of this ordering does not mean 
that unusual details never facilitate picture recog- 
nition, but rather that the commonplace details 
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found in ordinary pictures do not seem to be a 
determinant of the superior recognition for those 
pictures. The theoretical problem is accounting for 
the lack of a facilitative effect of details while at 
the same time accounting for the fact (Loftus, 1972, 
p. 537; Poetzl, 1917) that pictorial details are 
entered into the information-processing system. 
One hypothesis that could account for both sets of 
findings has to do with conservation of processing. 
Suppose that for a fixed amount of processing time, 
S stores a constant amount of information about à 
picture, regardless of the number of details contained 
in the picture. That is, if one picture contains rela- 
tively few details, then S will store a relatively large 
amount of information about each detail; if another 
picture contains relatively many details, then S will 
store a relatively small amount of information 
about each detail. Because the fotal amount of 
stored information is the same for the 2 pictures, 
subsequent recognition performance would not be 
affected by the number of extra details (even though 
the-extra details are stored in memory). Although 
this conservation hypothesis can account for the 
present findings about PH, ELD, and ULD, an 
explanation is Still needed to account for the recog- 
nition advantage of pictures over verbal descriptions. 
The present study serves only to disconfirm the 
notion that this recognition advantage is due to the 
extra details which pictures typically contain. 
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LOCUS OF CONTROL AND LEARNED HELPLESSNESS ' 


DONALD S. HIROTO* À 
University of Portland 


Failure to escape, the defining characteristic of learned helplessness, was 


investigated with perceived and instructed locus of control Ss 
Three groups, equally divided between internals and 


helplessness paradigm. 


in a learned- 


externals and counterbalanced for sex, received different treatments with 


an aversive tone prior to the testing for helplessness. 
neither escape nor avoid an aversive tone, 
the tone, and the third group was not 


escape-avoidance trials followed, u 


exposed to the treatment. 
g a human analogue to an animal 


The first group could 
the second group could escape 
Eighteen 


shuttle box, in which Ss received an instructional set describing the task as 
skill or chance determined. In addition to a complete replication of learned 
helplessness in man, externals were significantly more helpless than internals, 


and chance-set Ss were more help! 
lability of noise, externality, 
avoidance in parallel way: 


nce uncontrol- 


than skill-set Ss. 


id chance instructional set all impaired escape 
it was speculated that a common state may 


underlie all 3 dimensions—expectancy that responding and reinforcement 


are independent. 


Overmier and Seligman (1967) and 
Seligman and Maier (1967) demonstrated a 
profound interference with shuttle box 
escape-avoidance behavior of dogs given 
prior inescapable electric shock. They 
hypothesized that dogs given inescapable 
shock failed to escape later because they 
had learned that shock termination was 
independent of responding. This learn- 
ing was hypothesized to interfere with 
later acquisition of escape because the 


s based on a dissertation submitted 
sity of Portland in partial fulfillment 
irements for the PhD degree. The author 
!issertation chairman, William H. James, 
and the members of the committee, John R. Dono- 
ghue, George S. Larimer, B. Kent Parker, and Philip 
Van Bruggen. Special acknowledgments go to 
Martin E. P. Seligman for his comments and sug- 
gestions in the preparation of this report. 

2 Requests for reprints should be sent to Donald S. 
Hiroto, who is now at the Counseling Center, 
California Polytechnic State University, San Luis 
Obispo, California 93401. 
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incentive for initiating responses had been 
lowered, and the ability to associate re- 
sponding and shock had been proactively 
impaired. — "Learned helplessness” was 
chosen as the descriptive label for the 
phenomenon and also as the hypothesized 
process by which learning of independence 
between responding and reinforcement 
interferes with future responding. 

A social-learning construct, internal-ex- 
ternal control of reinforcement (Lefcourt, 
1966; Rotter, 1966), seems conceptually 
similar to the hypothesized learned-help- 
lessness process. The internal-external 
locus of control refers to the degree to 
which an individual perceives that rein- 
forcements are contingent on his actions. 
An "internal" tends to perceive reinforce- 
ment as a consequence of his responses and 
to attribute the reinforcement contingencies 
to his skills and abilities; an "external" 
tends to perceive reinforcements as un- 


187 
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related, i.e., independent of his behavior, 
and to attribute outcomes to luck, chance, 
or another person. 

Learned helplessness and the internal- 
external construct both view control of 
reinforcement as a crucial variable. Maier, 
Seligman, and Solomon (1969) and Selig- 
man, Maier, and Solomon (1971) em- 
phasized 2 consequences of uncontrollable 
events on later learning: dogs, cats, mice, 
goldfish, and probably rats are (a) slower 
to initiate responses to escape, and (b) 
retarded at learning that responding con- 
trols trauma. Reviews of the literature on 
the internal-external construct (Joe, 1971; 
Lefcourt, 1966, 1972; Rotter, 1966) report 
that external Ss are slower than internal Ss 
in learning a variety of tasks. Apparently, 
lack of control over reinforcement, whether 
real or perceived, impairs a variety of 
species on a variety of tasks. 

A major purpose of the present study 
was to investigate learned helplessness in 
internal and external Ss. It was predicted 
that Ss given inescapable/unavoidable 
pretreatments would show retarded acquisi- 
tion and performance measures relative to 
Ss without such pretreatments, and that 
such retardation might be a function of ex- 
ternality. Previous studies with dogs as 
well as humans (Racinskas, 1971; Thornton 
& Jacobs, 1971) used electric shock as the 
aversive stimulus. The present study used 
a loud tone rather than shock as the aver- 
sive stimulus in an attempt to extend the 
` generality of learned helplessness to a new 
noxious event (see also Braud, Wepmann, 

& Russo, 1969). 

The concept of control is central to both 
helplessness and internal-external studies 
but the definitions of the term differ. In 
helplessness, control refers to E actually 
arranging the events as independent of 
responding; while internal-external con- 
struct studies refer to perceptions of the 
actual events. In view of the differing 
reference bases of control, we introduced an 
instructional set to determine if perceptual 
Set interacted with helplessness. It was 
predicted that Ss pretreated with inescap- 
able noise and then given escape-avoidance 
trials under an instructional set of chance 
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would be more helpless than un kill 
instructions. This parallels our pr ion 
that externals would be more help! 
internals following inescapable nois 


han 


METHOD 


Subjects. All Ss were introductory | 
students at the University of Portland 
administered the Internal-External form 
Dekalb Survey Tests udent Opinior y at 
least 8 wk. prior to the study. The Ja: ie is 
based on the early work of Rotter (1954) ares 

and provides a measure of th: \t to 
S perceives reinforcements as being igent 
on his actions or as resulting from extern tors. 


logy 


The scale is a Likert-type instrument | re- 
sponse choices for each of the 30 critica d 30 
filler items. 

A total of 96 Ss with internal-external es at 
least 1 SD above or below the mean were r mly 
assigned to 1 of the 3 treatment groups anc f the 
2 instructional-set groups, with each cell iter- 
balanced for si of control, 

Apparatus. Two distinctively differe units 
were located on different tables. The ap ratus 
in the pretreatment was a red spring-loadec |. itton 
housed in a l-in.sq. wooden base. In the escape 
condition the button was connected to re con- 
trolling the termination of the aversive s: ulus, 
while in the inescapable condition the but n was 

tion, 


independent of the aversive-stimulus tern 

The apparatus in the phase testing for help! 
was a modified Manipulandum Type S task origin- 
ally designed by Turner and Solomon (1962) as a 
human analogue to the 2-way shuttle box used in 
animal learned-helplessness studies, The inip- 
ulandum was 24 X 5 X 6 in. with a 3-in. knob 
protruding from the top. The knob slid on à 
19 X 1 in. straight channel on the cover of the box. 
Attached to the knob on the underside of the channel 
was a 2}-in. wooden disk. The escape-avoidance 
response was sliding the knob to either side of the 
manipulandum so that the wooden disk made con- 
tact with a hidden microswitch. Only 1 of the 
microswitches on any trial would terminate the 
stimulus, so that on the next trial, the alternate 
switch would terminate the stimulus. 

The aversive stimulus was a 3,000-Hz. tone pre- 
taped on a Sony tape recorder and presented to 5 
through North American earphones. A Beltone 
15-C Audiometer recently calibrated to standards 
set by the International Organization for Standard- 
ization measured the stimulus at 110 db. All 
response variables were measured by standard 
(1/100-sec.) timers and counters located in an 
adjacent room separated by a 1-way mirror. 

Procedure. Each S was escorted into the experi- 
mental room and informed that the study involved 
listening to “some loud noise which has been 
judged to be somewhat unpleasant but not harmful 
or dangerous to you." A 3-sec. sample of the 3,000- 
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Hz. tone was first presented, then each S was as- 
igned to 1 of the 12 cells from the 3 X2 X 2 
(Treatments X Instructional Sets X Locus of Con- 
trol) factorial combination. 

The treatment groups included (a) escape (E) 
Ys, who received unavoidable /escapable pretreat- 
ments, (b) inescapable (E) Ss, who received un- 
avoidable/inescapable pretreatments, and (c) no- 
pretreatment (NP) Ss who received only the test 
trials with the manipulandum. 

Tape-recorded instructions for pretreated 
were an expanded and modified version which 
Turner and Solomon (1962) described as adequate: 


"s 


Listen to these instructions carefully. [am not 
allowed to give you additional information other 
than what is given to you now. So please 
listen and do not ask me any questions. From 
time to time a loud tone will appe When that 
tone comes on, there is something you can do to 
stop it. 


The pretreatments consisted of 30 unsignaled 
5-sec, trials with the 3,000-Hz. tone, The intertrial 
interval (ITI) ranged 15-25 sec., with a 20-sec. 
mean ITI. At the conclusion of the pretreatmen 
S rated the aversiveness of the auditory stimulus. 

The testing for helplessness was conducted with 
the shuttle box manipulandum at a different table 
but within the same experimental room. The NP 
Ss were given the preexperimental instructions and a 
3-sec. sample of the tone prior to being seated at the 
table. The manipulandum was covered until S 
received 1 of the 2 instructional sets describing the 
task. The first half of the taped instructions was 
identical and were presented to all Ss: 


You will be given some trials in which a rela- 
tively loud tone will be presented to you at 
different intervals. Now here is the important 
part, and I want you to listen carefully. When- 
ever you hear the tone come on there is some- 
thing you can do to stop it. 


The second half of the instructions varied ac- 
cording to the particular set. The “skill” instruc- 
tions emphasized direct control over the 3,000-Hz. 
tone: 


What you do is really up to you to figure out. 
There is a solution to the problem, and if you 
figure it out the tone will stop. Therefore, the 
amount of unpleasantness you receive is depen- 
dent on your skills and abilities to find the solu- 
tion to the problem. You are potentially in 
control of the situation. 


‘The “chance” instructions emphasized that S had no 
direct control over the stimulus and that chance 
factors predicted success: 


But I will be controlling the solution to the prob- 
lem. In other words, the way to stop the tone 
is really up to me. As far as you are con- 
cerned this is a guessing game. When you guess 
correctly, the tone will automatically stop. 
But if your guess is wrong the tone stays on. 


After the instructions, S uncovered the manipu- 
landum. The knob was always located at the 
midpoint of the manipulandum, so that S could slide 


the knob equidistant to either the left or right end © 


of the box. The test phase consisted of 18 signaled 
10-sec. trials. A 5-sec. red light, located at the mid- 
point of the manipulandum cover, preceded the 
onset of the 5-sec. auditory stimulus, with the offset 
of the light coinciding with the onset of the 3,000- 
Hz. tone. The ITI ranged 20-55 sec., with a mean 
ITI of 20 sec. 

The appropriate response was moving the knob 
to one side of the manipulandum on one trial and 


sliding the knob to the opposite side on the next — 
An avoidance response was terminating the — 


trial. 
red light prior to the onset of the auditory stimulus, 
ie. a response latency of 5 sec. or less, while an 
escape response was terminating the tone between 
Sand 10 sec. after trial onsct. If S did not terminate 
the light or noise, a latency of 10 sec. for that trial 
was recorded. At the completion of the test phase, 
S was asked to rate the unpleasantness of the tone, 
paid $2, and debriefed. 

Five response measures were used during the test: 
(a) trials to criterion for avoidance acquisition, 
defined as 3 consecutive avoidance responses; (b) 
trials to criterion for escape acquisition, defined as 
3 consecutive escapes; (c) number of avoidance 
responses for the 18 trials; (d) number of failures 
to escape, defined as number of trials with a la- 
tency of 10 sec.; and (e) the overall mean response 
latency for the 18 trials. These indices, particularly 
c, d, and e parallel the indices reported in the 
animal helplessness literature. 


RESULTS 


Animal learned helplessness was char- 
acterized by the similarity in escape- 
avoidance behavior between E and NP 
Ss, while E Ss revealed longer response 
latencies and more failures to escape than 
either of the other Ss. The results of this 
experiment disclosed remarkable similari- 
ties to the animal studies. The E group 
was retarded in escape-avoidance measures 
relative to the E and NP groups, with the 
latter groups not differing from each other. 

Maier et al. (1969) reported that ap- 
proximately 63% of E dogs and about 5% 
of naive dogs failed to escape in the shuttle 
box. Human Ss demonstrated similar 
but somewhat less dramatic findings. On 
the average, E Ss failed to escape the aver- 
sive stimulus on over 50% of the 18 trials, 
while E Ss failed on 13% and NP Ss failed 
on 11% of the trials. Approximately 34% 
of the E Ss failed to reach criterion, com- 
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pared to 8% of the E and NP Ss who also 
failed to reach criterion. Figure 1 presents 
the response latencies of the 3 treatment 
groups on the 6 blocks of 3 trials. The 
horizontal line at the 5-sec. mark on the 
ordinate represents the boundary between 
escape and avoidance: Points plotted above 
the 5-sec. line represent escapes, while 
points below the line denote avoidance 
responses. The E group shows consistently: 
longer latencies than either the E or NP 
groups, with the latter groups performing 
near equality. 

An analysis of variance (ANOVA) test 
on response latency found a main effect 
of treatment, F (2, 84) = 12.38, p < .01, 
and a Treatments X Trial Blocks inter- 
action, F (10, 420) = 2.29, p < .05. Two 
planned orthogonal comparisons between 

groups indicated that the main effect was 
due to the variability of E Ss. The first 
comparison between E vs. the average of E 
and NP Ss was significant (p < .01), 
while the second comparison found no 

- differences between E and NP Ss. A 
simple main effects analysis on the Treat- 
ments X Blocks interaction disclosed sig- 
nificant differences between groups at each 
of the 6 trial blocks (p < .02). Scheffé S 
tests revealed that E Ss had consistently 
longer latencies at each of the trial blocks 
than either E or NP Ss, while the latter Ss 
did not differ from each other. 


MEAN RESPONSE LATENCY 


BLOCKS OF 3 TRIALS. 


FrGURE 1. Mean response latencies of the 6 
escape-avoidance trial blocks for the 3 treatment 
groups. 


DONALD S. HIROTO 


MEAN RESPONSE LATENCY 


= 
6 
BLOCKS OF 3 TRIALS 

Ficure 2. Mean response latencies 6 
escape-avoidance trial blocks for internal exe 
ternal control Ss collapsed over the 2 exp: ital 
factors. 

A treatments main effect was found vith 
the failure to escape measure, F (2, 854) = 
22.57, p « .01, and trials to criteri or 
escape acquisition,’ F (2, 66) = 106.49, 
p €.01. A planned orthogonal compari- 
son indicated that these main effects e 
due to differences between E Ss anc the 
average of E and NP Ss (p < .01). This 
planned comparison also found a sig yif- 
icant effect on the trials to criteri ‘or 
avoidance acquisition: E Ss took longer to 
reach acquisition than the average of E 


and NP Ss,-F (1, 84) = 445, p < .05. 
The second orthogonal comparison found 
no differences between E and NP Ss on any 
of the 4 dependent measures. 

Other factors, independent of treatment, 
were also significant. External locus of 
control Ss, regardless of their pretreatments 
and instructional sets, were slower to 
escape or avoid than internal control Ss. 
Figure 2 presents the response latencies of 
the internal-external factor on the 6 
trial blocks. An ANOVA on response 
latencies disclosed an internal-external 
main effect, F (1, 84) = 6.58, p < .05, 
and a Trial Blocks X Internal-External 
interaction, F (5, 420) = 3.24, p < .05. 


* Analysis of the escape-acquisition measure w; 
based on unequal ms as some Ss (18) reached avo 
ance criterion without first reaching escape criterion. 
There was no statistical relationship between the 18 
Ss dropped from the analysis and any of the ex- 
perimental treatment factors. 
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GURE 3. Mean response latencies of the 6 
pe-avoidance trial blocks for the instructional- 
set Ss collapsed over the 2 experimental factors. 


A test for simple main effects on the inter- 
action revealed that external control Ss 
responded slower than internal control Ss 
on Trial Blocks 4, 5, and 6 (p < .025). 
External Ss also required more trials to 
reach avoidance criterion (M = 18.3) and 
made fewer avoidance responses during the 
18 trials (M = 3.1) than internal Ss 
(Ms = 15.0 and 6.2, respectively). The 
ANOVA tests on these measures sta- 
tistically verified the observed differences 
(p < .05), There were no internal-ex- 
ternal differences for the escape-acquisition 
or failures-to-escape measures. 

The chance-set group displayed retarded 
measures compared with the skill-set group. 
Figure 3 presents the response latencies of 
the instructional set factor collapsed over 
the 2 experimental factors. The ANOVA 
found a main effect of instructions, F (1, 
84) = 5.93, p < .05, and a Trial Blocks X 
Instructions interaction, F (5, 420) — 4.28, 
p «.05. A simple main effects analysis 
indicated that chance-set Ss had longer 
latencies than skill-set Ss on Trial Blocks 
4, 5, and 6 (p < .025). The effects of 
instructional set also interfered with avoid- 
ance responding. Compared to skill-set Ss, 
chance-set Ss required more trials to reach 
avoidance criterion (Ms = 18.2 and 15.1, 
respectively) and made fewer avoidance 
responses (Ms = 3.2 and 6.1, respectively). 
The ANOVAs computed on these measures 
were all significant (p < .05). There was 
no effect of instructions on either the 
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escape-acquisition or failures-to-escape 
measure. 
The results of internal vs. external and 
instructional set indicate that either factor 
can produce parallel results of inesca 
ability. That is, inescapability, chance - 
set, and externality retard escape-avoid- — 
ance behavior. 3 
Interestingly E internal Ss made more 
button presses i.e., escape attempts, i D 
pretreatment than E external Ss (Mdns = — 
$ 


21 and 5.5, respectively), x? = 3.97, p < 
.05, suggesting that persistence in trying 
to control an uncontrollable event may be — 
correlated with the absence of later help- | 
lessness. This is similar to Seligman and. — 
Maier’s (1967) concept of immunization — 
where dogs with prior escapable shock 
demonstrated enhanced panel pressing 
under inescapable shock and were later 
not helpless. 

There were no significant interactions 
between the experimental factors on any 
response measure. However, à priori 
comparisons permitted an analysis of pos- | 
sible interaction within the I group. 
Figure 4 presents the mean latencies of 
internal and external E Ss across the 6 
trial blocks. The F test for planned 
orthogonal comparisons revealed that the 
overall mean latency of externals was sta- 
tistically longer than that of internals, — 
F (1, 84) = 5.46, p < .05. No other de- - 
pendent measure approached significance. 
Inescapability and instructional set were - 
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FiGURE 4. Mean response latencies of internal 
and external control Ss in the inescapable treatment — 
group for the 6 escape-avoidance trial blocks. 
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analyzed next. -Dunn's test for planned 
nonorthogonal comparisons was not sig- 
nificant with any of the 5 response measures. 
Further analysis was conducted on a 
possible confounding variable. There was 
no yoking procedure in the pretreatment, 
so that duration of noise exposure was not 
equated. The E group received 5 sec. of 
aversive noise on each trial, whereas the 
E group received an average of 1.4 sec. of 
noise per trial. Since E Ss received longer 
periods of the loud tone, and perhaps 
greater stress, it could be argued that dif- 
ferential stress rather than uncontrolla- 
bility led to the retarded performance with 
the shuttle box manipulandum. However, 
— S ratings of the noise did not reveal signif- 
icant differences: the E group rated the 
= 110-db. tone at 3.55 on a 7-point scale, 
- while E group rated the same tone at 3.66. 
- This difference was not statistically dif- 
ferent and suggests that differential stress 
- did not produce the helplessness effects. 
s It should also be noted that the relatively 
— moderate ratings of the 110-db. stimulus 
indicates that learned helplessness can be 
A produced with moderately aversive events 
. as well as the more traumatic events used 
- án the animal studies. 


Discussion 


— Learned helplessness can be experimentally 
. produced in man. Both animals and man show 
- longer latencies and more failures to escape 
following inescapable aversive events than 

following escapable events or no pretreatment. 


acquisition when escape contingencies were 
E eliminated in the shuttle box. Considering the 
v divergent findings, perhaps it is only initial 
— learning—escape or avoidance, whichever oc- 
curs first—that is disrupted by inescapability. 

The first published account of learned help- 
- lessness in humans (Thornton & Jacobs, 1971) 
. had 2 methodological problems. First, an 
instructional set was confounded with in- 
. escapability in the pretreatment. The E Ss 
4 received instructions that described the non- 
contingency between shock and responding, 
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and E Ss received different i: that 
described the contingency betwee : and 
avoidance. "These different em- 
bedded within the different pret: may 
have predetermined Ss’ respo: se à 
relationship in the study. Thi prob- 
lem relates to the unusual pret pro- 
cedures. Previous animal help udies 
pretreated E Ss with unavoida able 
shock. Thornton and Jacobs, pre- 
treated their E Ss with avoidal able 
shock. Because of the differen ures, 
their study cannot be considered gue 
to the helplessness studies, 1 and 
Jacobs, rather than demonstrat rned 
helplessness, demonstrated the effe »rior 
avoidance training on a later escape ance 
task. 

The learned-helplessness hypoth: nti- 
fies control over reinforcement as t icial 
variable. The present study conf the 
hypothesis by demonstrating that pro- 
duced greater impairment in escape i ince 
than E or NP Ss. In addition, the rnal 
locus of control variable, also concer: with 
control of reinforcement, interacte: with 
inescapability to produce greater imp: ment 
than internal control Ss. 

In conclusion, this study demonst i that 
helplessness can be experimentally induced in 
man wholly parallel to helplessness in animals. 
The S variable of externality appears to func- 


tion like the pretreatment variable of in- 
escapability, as evidenced by the interaction 
between externality and the helplessness in- 
duction. Both locus of control and skill- 
chance instructional set factors produced an 
effect similar to inescapability. In view of the 
parallel effects between the 3 factors, | sug- 
gest that a single process may underlie learned 
helplessness, externality, and the perceptual 
set of chance—the expectancy that responding 
and reinforcement are independent. Seligman 
(in press) and Miller and Seligman (1973) 
have pointed to this cognition as the underlying 
state in reactive depression in man. 
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CONDITIONED ADAPTATION TO PRISMATIC DISPLACEM 


TRAINING TRIALS AND DECAY ! 


FRED L. YAFFE 


Howard University Eye Research Foundation, Bethe. 
Adaptation to prismatic displacement was conditioned to the wearing of a 1 
of goggles by employing Taylor's alternation technique. Each alternation co: 

sisted of 2 3-min, training conditions. In one condition, S was exposed to ti 

goggles and the prism; in the other, S was exposed to neither, Three group 
of 13 Ss were trained. They received either 1, 3, or 5 alternations. Pointing 
test measures designed to measure aftereffects and conditioned aftereffects 
were performed by each S at 0, 5, 10, and 20 min. after training. Otherwise, 5 
sat quietly with closed eyes after training. Training time and testing time both 
produced significant effects. Aftereffects of adaptation exhibited a reminiscence 
effect and then decay, whereas the conditioned aftereffects exhibited neither. 


When a prism displaces the viewer's visual 
field it initially causes errors in localization 
responses (effect). These errors decrease 
in magnitude (adaptation effect) after some 
appropriate practice by S. The adaptation 
effect measures are in turn taken to reflect 
an underlying psychological compensation 
(adaptation) to the optical disarrangement. 
Responses then made with the removal of 
the device exhibit an error in the direction 
opposite to the effect (aftereffect). For 
some time, aftereffects were assumed to be 
conservative estimates of adaptation effects 
and as an equally valid indicator of adapta- 
tion because, in the early studies which 
measured both, the magnitude of the 
former so nearly agreed with the magnitude 
of the latter (Rock, 1966). Conditioned 
adaptation studies, however, have indi- 
cated that this equality need not always 
hold (Kohler, 1964). 

Conditioned adaptation studies usually 
have 2 major parts: a training period and 
testing periods. The training period is de- 
signed to produce an association between 
the adaptation and a previously neutral 
stimulus. This is achieved (a) by em- 
ploying 2 alternating training conditions, 
one with normal vision and the other with 
vision modified by the optical device, and 
(b) by the systematic covariation with the 


! This study was supported by National Institute 
of Neurological Disease and Stroke Grant NB 
08696-01. 

* Requests for reprints should be sent to Jerome 
H. Kravitz, Department of Psychology, Howard 
University, Washington, D.C. 20001 


iffer- 


and 


training exposure conditions of ot! 
ent processes, specifically the abs: 
presence of the neutral stimulus. 

To date, 2 techniques have | de- 
veloped to produce this alternati: In 
one (Kohler, 1964), S is exposed to « split 
visual field by having an optical d e a 
prism, in the upper half of the fie and 
unobstructed normal viewing in the tower. 
Here the alternation produced is solely 
dependent upon a behavior of S, namely 
S's changing the direction of his gaze. In 
the other technique (Taylor, 1962), the 
alternation of the training conditions de- 
pends upon the intervention of Æ, who 
simultaneously exposes S to the optical 
device and the neutral stimulus and later 
removes them.  Conditioned adaptation 
effects are taken to be present when in the 
testing period the adaptation effects or 
aftereffects obtained with the neutral stimu- 
lus present are greater than those obtained 
with it absent. 

Taylor (1962) produced differential condi- 
tioned adaptations by systematically co- 
varying the training exposure conditions 
with the sensations produced by wearing 
spectacles that housed the optical device. 
With this procedure he conditioned the 
adaptation effect to right-left reversal and 
to the prismatically produced slant of 
horizontal surfaces. During testing he 
found that the adaptation effects obtained 
while wearing the spectacles were much 
larger than the aftereffects obtained with 
the spectacles absent. 


194 


CONDITIONED ADAPTATION TO PRISMATIC DISPLACEMENT 


Recently Kravitz (1972) refined the 
Taylor (1962) alternation training tech- 
nique and associated the adaptation to 
prismatic displacement to the wearing of 
goggles in 240 min. of training. This study 
replicated Taylor's findings, namely, that 
there were larger adaptation effects with 
the goggles in place than aftereffects with 
them removed. It also demonstrated that 
aftereffects obtained while wearing the 
goggles without a prism were larger than 
those obtained without goggles. The mag- 
nitudes of these differences were taken as 
a measure of the conditioned adaptation 
to the goggles. Aftereffects obtained in 
the absence of the goggles are accordingly 
viewed as measuring a component of the 
total adaptation not under the control of 
the goggles as a conditional stimulus (CS). 
One current theoretical issue regards how 
adaptation in general and this component 
in particular should be viewed. However, 
for reference purposes we propose to pro- 
visionally label this component as "'gen- 
eralized aftereffect." One position holds 
that adaptation does not depend upon or 
necessarily result in learning (Mikaelian, 
1963). The other holds that adaptation 
not only depends upon past experience 
(Held, 1961) but is due to learning (condi- 
tioning) processes (Taub, 1968). If the 
generalized aftereffect is a form of learning, 
then it could be expected to "decay" 
(Ebenholtz, 1969; Hamilton & Bossom, 
1964) in a manner similar to that of condi- 
tioned adaptations. The present study 
tests this hypothesis. 


METHOD 


Apparatus. The displacement of the visual field 
was produced by a 20-diopter wedge prism mounted, 
base left, in the right frame of a pair of weldei 
goggles. The left eye of S was occluded at all times 
by an eye patch. A second pair of welder's goggles 
had an optically plain piece of glass in its right frame. 

The testing apparatus was a vertically adjustable, 
horizontal platform, 152 cm. long and 66 cm. deep, 
which when raised to shoulder level allowed S to 
extend his arm easily underneath it. On the plat- 
form's near edge a biteboard apparatus was fixed. 
On the far side of the platform was a panel (102 cm. 
wide by 76 cm. high) perpendicular to 5's line of 
regard. The pointing test target was a vertically 
oriented black bar (6 X .6.cm.). It was presented 
in 2 positions: 6° to the right of S’s midline and 6° 
to the left, through a 76-cm.-long, horizontal, eye- 
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level slit in the rear panel. The rear panel and the 
remainder of the apparatus were painted flat white. 
The E used a meter stick to determine S’s pointing 
responses. 

Responses. A manual localization test was em- 
ployed to measure the effects of training. It involved 
having S stand biting the biteboard, extending his 
right arm under the platform, and pointing with his 
right index finger. At the testing apparatus, S had 
his eyes closed at all times, except when making a 
pointing response. The pointing test measure was 
the mean of 4 pointing responses: 2 responses to each. 
of 2 target positions. Each response required S to 
open his eyes and point at the apparent location of 
the target. The order of target position presentation 
was counterbalanced. 

Test conditions. The pointing test was performed 
in 2 testing situations: In the first (G/P), S wore the 
goggles (G) with plain glass (P); in the second 
(G/P), S did not wear the goggles (G) or the 
prism (P). 

Experimental design. ‘Thirty-nine undergraduates 
drawn from Howard University introductory psy- 
chology courses served as paid voluntary Ss. They 
were assigned to 3 equal groups. 

Each group was administered the same 6 main 
segments of the experiment, which were in order 
of occurrence: pretesting, training, and 4 posttest 
segments (A, B, C, and D). Figure 1 depicts the 
manner in which the 2 types of test situations were 
distributed among the first 3 segments of the ex- 


S went through the following sequence. First was 
the pretesting, which involved 2 test conditions: 
(G/P); and (G/P)s The S then received alterna- 
tion training followed by Posttesting A. All the 
posttest segments had the same test condition 
sequence. The Posttest A sequence had 3 test 
conditions: (G/P);, (G/P), and (G/P), Posttest 
B immediately followed Posttest A. After Posttest 
B there was a 5-min. rest period followed by Posttest 
C. Posttest C was followed by a 10-min. rest period 
and then Posttest D. Thus, Posttests A, B, C, and 
D occurred 0, 5, 10, and 20 min. after the completion 
of training. During each rest period S sat quietly 
with eyes closed. 

The effects of training and/or decay were deter- 
mined by obtaining the difference between the pre- 
and posttesting measures of a given type of test 
condition. 

Since all testing was under P (no prism) conditions, 
these differences (represented in Figure 1 by solid 
lines connecting the pretest conditions to the 
Posttest A conditions) resulted in aftereffect mea- 
sures represented in Figure 1 by the circled d symbol 
appended to each solid line. 

Thus, we obtained from each posttest segment 
3 pointing test measures of aftereffects of adaptation. 
It was expected that the G aftereffects would be 
greater than the G aftereffects to the extent that 
the adaptation had become conditionally associated 
to the wearing of the goggles. The difference be- 
tween these effects (indicated in Figure 1 as the 
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PRETESTING 
FIGURE 1. 


in the me 


adaptation effects [CAi, CA» ].) 


dashed lines labeled CA; and CAs connecting the 
circled d symbols, Posttest A) are considered 
measures of conditioned adaptation (CA). The 
mean of these 2 measures (CA,) constitutes a single 
estimate of the conditioned effects measured in the 
testing segment. The same calculations apply to all 
of the testing segments. 

Training procedure. Each group (Ei, Es, and E;) 
was given alternation training designed to produce 
differential conditioned adaptation to the wearing 
of goggles, Group E; received 1 alternation; Es, dn 
and E; 5. The total training times of Ei, Es, and 
E; were 6, 18, and 30 min., respectively. Each 
alternation consisted of 2 3-min. training exposures : 
In G/P, S wore the goggles (G) and was exposed to 
prismatically altered viewing (P); in G/P the goggles 
were absent (G) and the viewing was normal and 
unaltered (P). The alternation was accomplished 
by having S sit with his eyes closed while E placed 
or removed the goggles. During each training ex- 
posure, whether G/P or G/P, S performed the same 
training task. The task consisted of viewing his 
right arm resting upon a vibrating armrest. Twice 
during each 3-min. segment, S was instructed to 
move his arm and hand to the left, center, right, and 
center of the arm rest. Otherwise, the arm rested 
passively (Kravitz & Wallach, 1966). Each alterna- 
tion began with a G/P. exposure and ended with a 
G/P exposure. 


RESULTS 


The principal inquiries of this study were: 
(a) Will conditioned adaptation effects be 
produced by short forms of training; and 


The pretest, training, and posttest (Segment A) sequence. (The 
solid lines represent differences between pre- and posttest conditions that result 
ement of aftereffects [circled d] of adaptation. Dashed lines con- 
necting circled d symbols represent differences between G [goggles present ] 
and G [goggles absent] effects and result in the measurement of conditioned 


SEGMENT A | 


INING POSTTESTING 


(b) what is the relationship bet ^n Che 
decay of generalized aftereffects id the 
decay of conditioned adaptation effects ? 
Therefore, only G aftereffects and CX 
effects are presented in Table 1. S 
figure is the mean difference in degrees 
between the mean of the 2 G/P posttest 
conditions of a segment and the corme— 
sponding pretest condition. The CA. 
effects are a derived measure. They are the 
mean differences between the G and the 
G aftereffect measures. Each G measu re 
(not presented in the table) is the meam 
difference between the pre- and posttrainiza = 
measurements of the G/P test condition s 
Individual / tests were performed on eac}4 
of these mean effects to determine wheth ex4- 
they were significantly different from zero . 
Inspection of Table 1 indicates that evera 
with the short training times employe | 
significant conditioned adaptation effects 
were obtained. The grand mean CA, effec e ! 
1.5?, was significant, /(38) = 4.43, p < .0 1 ` 
A 3-way (3 X 4 X 2) analysis of variance 
with 2 repeated factors was performesc3 
and is presented in Table 2 (Winer, 1962) _ 
The first factor was training groups (6, 1 S = 
and 30 min.) ; the second factor was testina 
time (0, 5, 10, and 20 min.) ; and the thirey 


Each 
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TABLE 1 
MEAN G AFTEREFFECTS AND MEAN CA, EFFECTS (IN DEGREES) ON POINTING TEST 


Posttesting segment. 
Training time Fa) = 
$ med group) A (0 min.) B (5 min.) | C (10 min.) D (20 min.) M 
Total df — 38) — — E 
G CA, G CA; G CA; G CA« G CAc 
6 min | 
Be RU ted 0 1.0 —9 oF ct RU $1540 .8* 
SD 2.1 1.7 24 2.3 2.6 1.8 2.1 2f 2.1 1.7 
18 min. | 
x 1.0* 1,55 | 2.1%" | 1.0% 14* | .7 2 | 1.0* 12* | L1I** 
SD 2.4 DD 2.6 1.3 2:8: *]-.1.9 2.0 1.8 2.6 1.4 
30 min. | 
EN Let p DE v ped Gt Liat. JE: he A 8 Ope CI Se 
SD 2.1 | S a on 1.9 2.8 1149.1 4.6 1.7 2.3 1.5 
Total | | 
M {For sse LON Tote dish 5 L4 [5m.2 [2th VS 9e 
SD 2.3 1.6 | 2.9 1529. 29 2.1 3:2 2.1 2.5 1.7 
| I 
Nole, Each G is the mean difference between the mean of the 2 G/P (goggles and prism absent) posttest conditions of a segment. 


and the corresponding pretest conditions. (A minus signifies a G posttest deviation in the nonadaptive direction.) Each CA, is the 


mean difference between thi 
posttraining measures of the G/ 
* p <.05; devi gnii 
*# > <.01; deviations significantly different from 0; 


/P test conditions.) 


factor was the comparison of aftereffects 
and conditioned aftereffects (CA/G). The 
group effect was significant, F (2, 36) = 
8.05, p < .01. A Newman-Keuls com- 
parison of the group ordered means re- 
vealed that the 30-min. group produced 
significantly larger effects than both the 
18- and 6-min. groups (p < .05), and that 
the 18-min. group was correspondingly 
larger than the 6-min. group (p < .05). 
The time effect was also significant, 
F (3, 108) = 11.51, p < .01. A Newman- 
Keuls comparison of the time effects re- 
vealed that the immediate test was not 
significantly different than the 5-min. test. 
All other comparisons revealed significant 
decrements in effects with time (p < .01). 
However, these decrements are accounted 
for by the Time X CA/G interaction, which 
was the only other significant effect of the 
analysis, Æ (3, 108) 4.02, p < .01. 
Although the CA, trend is decreasing, a 
Newman-Keuls comparison of this com- 
ponent of the interaction revealed that these 
effects did not significantly change with 
time. The same comparison of the G 
effects (generalized aftereffects obtained 
with the CS absent) revealed significant 
changes as a function of time. Apparently 
the CA, decay rate is slower than that of 


G and the G aftereffect measures, (The G measure is the mean difference between the pre- and 


ficantly different from 0; 2-tailed test. 
tailed test. 


generalized aftereffects. The comparison 
of the immediate G effect to the 5-min. 
effect revealed a “reminiscence effect’’: 
The 5-min. effect was significantly larger 
(.7°) than the immediate effect (p < .05). 
The G effects then progressively decreased 
in magnitude with time. The 10- and 
20-min. effects were both significantly 
smaller (1.0? and 1.7°, respectively) than 
the 5-min. effect (p < .05). The 20-min. 
effect was also smaller (1.0?) than the im- 
mediate effect (p < .05). 


TABLE 2 


SUMMARY OF 3-FACTOR ANALYSIS 


Source df MS d 
Between 38 
Groups (A) 2 | 2,296.82 | 8.06* 
* Error 36| 285.08 
Within 273 
Time (B) 3| 420.55 | 11,51* 
AXB 6 58.85 | 1.61 
Aftereffects vs. condi- 
tioned aftereffects (C) 1 | 1,090,88 | 2.54 
AXC 2| 464.70, 1.08 
Error 36| 428.06 
Ho 3 192.10 | 4.02* 
AXBXxC 6 29.26 .61 
Error ~ 108 47,75 
*p <1. 
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Thus, an increase in the number of trials 
and training time does affect the magnitude 
of the conditioned adaptation effects and 
aftereffects obtained. Conditioned adapta- 
tion effects do not decrease in magnitude 
due to passive forgetting; generalized after- 
effects obtained in the absence of the CS 
reveal at first an increment and then a 
progressive decrement with time. 


DISCUSSION 


The CA, effects of the present study indicate 
that relatively short periods of training can 
produce conditioned adaptation effects. An 
argument against this conclusion might be 
that these effects are due to some form of 
conscious correction by S. Such a correction 
would involve implicit verbal self-instruction 
regarding the effects of the prism, its con- 
tingencies, and a mode of corrective com- 
pensatory responding. 

Evidence, however, does not support this 
contention. First, in postexperimental inter- 
views most Ss were unable to verbalize either 
what happened in the experiment or the dis- 
placement effects of the prism. All denied 
deliberate compensation during testing. 

Second, the magnitude of the overall CA, 
effects (1.5?) is so small relative to the pris- 
matic displacement (11.6?) that an explanation 
of these effects in terms of conscious correction 
would constitute a gross extension of the 
meaning of that construct. We therefore con- 
clude that the CA, effects are not due to con- 
scious correction and that they indicate that 
conditioning has been produced. 

Aftereffects typically increase in magnitude 
with increasing training time (Hamilton, 1964). 
The results of the present study indicate that 
this relationship holds even under conditions 
of alternation training. In addition, we find 
that conditioned adaptation effects and gen- 
eralized aftereffects both increase with in- 
creasing amounts of training and in this regard 
function similarly. Ebenholtz (1969) found 
that tilt aftereffects decay with time and 
that the rate of decrease was independent of 
initial magnitudes of effects. The lack of a 
Groups X Time interaction in the present 
study extends his finding to aftereffects ob- 
tained under alternation training. Ebenholtz 
also developed the comparator model of adap- 
tation (Held, 1961) so as to account for the 
obtained decay of tilt aftereffects. However, 
that model, as presently formulated, cannot 
account for the reminiscence effect obtained 
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with the G aftereffects in the 
Moreover, the present effect replic 
terms of time and direction that o 
monkeys (Taub, 1968). This efl 
that generalized aftereffects and 
aptation effects may be affected 
less transient inhibitory 


proces 


wise, the decay of G aftereffect is exa 
pected. The adaptation as a for: unin 
hypothesis would lead to the d: n that 
generalized aftereffects and condit wo 
ilts 


effects would decay similarly. Ou 
not confirm this deduction. Ka 
imply that the hypothesis must 

if it is to remain viable. The n 
would be such as to take into accou: 
that adaptation as a form of lea: 
conditioned adaptation as a form ol 
function dissimilarly in some repects 
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The size of the sound stimulus employed in the first stage of speech processing 
was investigated in an attempt to determine the perceptual unit of analysis 
in speech recognition. It is assumed that the perceptual unit is held in a pre- 
perceptual auditory image until its sound pattern is complete and recognition 
has occurred. Vowels and consonant-vowel syllables were employed as test 
items in a recognition-masking task. The results show that recognition per- 
formance improved up to 200-250 msec. after presentation of the speech sound. 
The results were interpreted as evidence that the preperceptual auditory 
storage and perceptual processing of a speech sound does not exceed 250 msec., 
implying that some transformation of the speech signal must occur about every 
sec, Since the stimulus within this time period must function as a perceptual 


unit, perceptual units appear to be of roughly syllabic length. 


The primary purpose of this study was 
to determine the size of the sound stimulus 
employed in the first stage of speech 
recognition. A listener has recognized 
(identified) a stimulus when he has deter- 
mined that one of a possible set of alterna- 
tives was presented. Recognition of a 
stimulus is possible only if the information 
in the stimulus is sufficient to distinguish 
that stimulus from other possible stimulus 
alternatives. Recognizing speech continu- 
ously implies that each small portion of the 
sound-wave pattern uniquely determines a 
stimulus alternative. However, small por- 
tions of the acoustic input are not unique; 
there is no one-to-one mapping of stimulus 
to percept. Since the sound pattern must 
contain enough information for a consistent 
stimulus-percept mapping, larger chunks 
of the acoustic input are necessary for 
recognition. 

Since the sound stimulus for speech recog- 
nition extends over time, the first part of 
the pattern must be held in some preper- 
ceptual form until the pattern is complete 
and recognition has occurred. Accordingly, 
it is assumed that the information in the 
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sound pattern is held in a preperceptual 
auditory image and*that the recognition 
process involves a readout of the informa- 
tion in that image. The duration of the 
preperceptual auditory image places an 
upper limit on the sound patterns that can 
be employed in the recognition process. 
Accordingly, to understand speech recogni- 
tion, it is necessary to determine the maxi- 
mum duration that information can be held 
in a preperceptual auditory image. 

The minimal sound patterns that are 
usually recognized in continuous speech 
are referred to as perceptual units. In 
terms of the present analysis, the perceptual 
unit cannot exceed the duration of the 
image and must uniquely determine the 
appropriate perceptual response. The per- 
ceptual unit of analysis gives the perceptual 
system acoustic information that can be 
reliably correlated with information in long- 
term memory. The information in the unit 
is defined by a set of features that corre- 
spond to a list of features in long-term 
memory. The features made available by 
the stimulus are acoustic, whereas the list 
of features in memory is abstract. The 
recognition process must find a match be- 
tween the acoustic features in the stimulus 
and a list of features in memory. The 
nature of perceptual units also provides 
information about the structure of long- 
term memory. A set of features in the 
acoustic signal indicates that the signal can 
be identified and can therefore be used to. 


199 


'200 


integrate with other units so that meaning 
can be derived from the message. 

The following studies were carried out to 
determine the size of perceptual units em- 
ployed in speech perception. It is neces- 
sary, therefore, to determine the temporal 
course of processing speech stimuli and to 
estimate the effective duration of the pre- 
perceptual image. Vowels and consonant- 
vowel syllables were employed as test items 
in an auditory recognition masking task 
(Massaro, 1970b). In this task, a test 
stimulus is presented, followed by a mask- 
ing stimulus that occurs after a variable 
silent interstimulus interval. In a typical 
experiment, the test signals are short tones 
differing in pitch, and the listener's task 
is to identify the higher tone as high and the 
lower tone as low. The masking tone is 
presented at another frequency and at the 
same loudness as the test tones. The re- 
sults indicate that recognition performance 
improves with increases in the silent inter- 
tone interval, up to about 250 msec. 
(Massaro, 1970b, 1971, 1972c). 

"These results provide information about 
the preperceptual auditory image of the 
test tone and the vulnerability of the audi- 
tory image to new inputs. Given that the 
test tone lasted only 20 msec., some pre- 
perceptual image must have remained for 
the perceptual processing necessary to im- 
prove recognition performance with in- 
creases in the silent intertone interval. 
This improvement in recognition also indi- 
cates that the masking tone terminated per- 
ceptual processing of the image. Since 
recognition performance leveled off at about 
250 msec., the image effectively decayed 
within this period. Using this paradigm 
with speech stimuli should make it possible 
to determine the duration of preperceptual 
images and the processing time required 
for speech stimuli. 


EXPERIMENT I 


The first experiment demonstrates that 
the recognition of a short speech stimulus 
is not immediate, but rather requires time 
for perceptual processing. Perceptual pro- 
cessing refers to the analysis of physical 
features in the sensory input in order to 
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recognize the stimulus. A mea 
ceptual-processing time might 

the durations of vowels in no 
which are in the range of 150 
1961 


ind in 
peech 
msec, 


(Fletcher, 1953; House, ierson 
& Lehiste, 1960). Since th« n of 
sound-pressure fluctuations in eady- 
state vowel repeats at the speak unda- 
mental frequency, the extende ation 
of the vowel might be necessa: pro- 
cessing the information availa! the 
vowel presentation. Vowels pre d for 
very short durations can be id: ed if 
followed by a- silent retroactis erval 
(Gray, 1942; Suen & Beddox 972). 
However, if processing is interfered th by 
following the short test-vowel pre ition 
with another speech sound, the t« owel 
should not be identified. This resu' vould 
provide evidence that the duratio the 
vowel in normal speech allows tin pro- 
cessing, since the extended duratio: the 


vowel protects it from later spee: 
recognition has been completed. 

The Ss were required to recogniz 
vowel. On each trial, 1 of 2 vowe 
be presented with equal probabi 
second vowel, referred to as the m 
vowel, followed the test vowel after : 
able silent intervowel interval. li 
assumed that the perceptual processing 
necessary for recognition of the test vowel 
could take place during the test-vowel 
presentation and the silent interval after- 
wards, but mot during the masking-vowel 
presentation. 


was 


Method 


Subjects. Three young adults from the University 
of Wisconsin community were employed as 

Procedure. The vowels of a male speaker were 
first recorded at the same fundamental frequency 
and amplitude. A steady-state segment of each 
vowel was stored digitally by a computer-con- 
trolled analog-to-digital converter. During this 
experiment, the vowel segment was played back, 
using a digital-to-analog converter. In the recogni- 
tion masking task, the vowels /i/ as in heat and /1/ 
as in hit were employed as test items. The duration 
of the test vowel was 20 msec. The silent intervowel 
interval lasted 0, 20, 40, 80, 160, 250, 350, or 
500 msec. The masking stimulus was a 270-msec. 
nonsense vowel made up of 2 alternating vowel 
segments. These 2 segments were taken from the 
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vowels /a/ as in hat and /U/as in put and lasted 
45 msec, each. 

The Ss were tested simultaneously in a sound- 
insulated chamber. All experimental events were 
controlled by a PDP-8 computer. The vowels were 
presented binaurally over matched headphones 
(Grason-Stadler Model TDH-39) at a normal listen- 
ing intensity (about 80-db. SPL). An 5 recorded 
his decision by pressing 1 of 2 push buttons, labeled 
iand I, respectively. Following the sec. response 
period, feedback was given by illumi ating a small 
light for 500 msec, above the correct response button. 
The intertrial interval was 2 sec. 

On every trial, S heard a test vowel, followed in 
turn by a variable silent interval and the masking 
vowel. He identified the test vowel as i or I and 


test vowels without a masking vowel and 2 days of 


ice with a masking vowel before the experiment 
If, which consisted of 2 days of 3 sessions each. 
There were 400 trials/session, and Ss did not respond 
to the first 5 trials of each session. Since all experi- 
mental conditions were completely random, memory 
and decision factors were reasonably constant. The 
results, then, indicate the temporal course of the 
perceptual or identification process. 


Results 


The results of the experiment, presented 
in Figure 1, show that for each S, recogni- 
tion improved from near chance to almost 
perfect performance with increases in the 
silent intervowel interval. The results indi- 
cate that an auditory image remained after 
the first vowel and was processed during the 
intervowel interval for recognition. The 
second vowel was effective in terminating 
the perceptual processing of the image. 
Figure 1 shows that the processing time 
necessary for asymptotic recognition dif- 
fered for the 3 Ss. Two Ss (KB and MF) 
required about 180 msec. for optimal recog- 
nition. In contrast, the third S (DC) was 
able to identify the test vowel with only 
80 msec. of additional processing time after 
its presentation. 

These results do not necessarily estimate 
the duration of the preperceptual image or 
the perceptual processing time in typical 
vowel identification. Since performance 
reached asymptote at essentially perfect 
recognition, the recognition process could 
have been complete before the image de- 
cayed. To estimate the duration of the 
image, recognition accuracy must reach 
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Figure 1, Percentage of correct test-vowel 


identifications for Ss KB, MF, and DC asa function 
of the duration of the silent intervowel interval: 


Experiment I. 


asymptote below perfect performance. In 
the present study, the test vowel v 1 of 
only 2 possible vowels. With only 2 alter- 
natives, S can probably make his decision 
very rapidly, since fewer acoustic features 
of the vowels need be processed for accurate 
recognition. With a larger number of 
alternatives, however, S would have to 
process more features before reaching a 
decision. Accordingly, increasing the num- 
ber of alternatives in this task should 
increase the effective perceptual-processing 
time up to the duration of the auditory 
image. Accordingly, Experiment I] was 
carried out with 4 test alternatives in the 
recognition masking task. 
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EXPERIMENT II 
Method 


Subjects, Twelve University of Wisconsin under- 
graduates fulfilling a course requirement were tested 
for 5 days. 

Procedure. In Experiment II, 4 vowels recorded 
by a female speaker were employed as test items. 
The fundamental frequency of the vowels was 
192 Hz, The test alternatives were the vowels /i/ 
as in heat, /1/ as in hit, /a/ as in hat, and /U/asin 
bul. The duration of the test vowel was 20 msec. 
The silent intervowel interval lasted 0, 20, 40, 80, 

140, 220, 340, or 480 msec. The masking stimulus 
was also chosen randomly from the 4 vowels, /i/, 
/V/, /a/, and /U/, and also lasted 20 msec. 

Four Ss were tested simultaneously in separate 
Sound-attenuated rooms, All experimental events 
were controlled by a PDP-8L computer. The 
vowels were presented binaurally over matched 
headphones (Grason-Stadler TDH-49) at between 
58- and 67-db. SPL. Each S recorded his decision 
by pressing 1 of 4 push buttons, labeled HEAT 1, 
HIT 2, HAT 3, and PUT 4, respectively. Following the 
2.5-sec. response period, feedback was given by 
presenting the digits 1, 2, 3, or 4 on a visual display 
(IEE readout) for 500 msec. The intertrial interval 
was 2 sec. There were 2 300-trial sessions on each of 
Sdays. On the first day, S identified the test vowel 
without the masking vowel present. On Days 2, 3, 
4, and 5 the experiment proper was carried out. 
Presentation ot the 4 test alternatives, the 4 masking 
alternatives, and the 8 intervowel intervals was 
completely random, and each condition was pro- 


Equation 1.) 


ited 
; the 


grammed to occur equally often. Day 2 wa 
as practice, so that the results are taken { 
last 3 experimental days. 


Results 


The results were analyzed in terms of a 
signal detectability framework to provide 
à measure of vowel discriminability, d', 
which would be relatively independent of 
decision biases. It is possible to derive à 
measure of recognition performance for 
each of the 4 test vowels. The probability 
of identifying a vowel correctly is desig- 
nated a hit, and responding with that vowel 
alternative to any other test alternative is 
designated a false alarm. For example, the 
probability of responding i given the test 
vowel /i/, (ili), would be a hit, whereas 
the probability of responding i given any 
other alternative, p (i|i), would be a false 
alarm. The d' measure derived from these 
probabilities would be the index of dis- 
criminability for the vowel /i/. The d' 
values were obtained from the hit and 
false alarm rates (averaged across Ss) in 
Elliot's (1964) tables. 

Figure 2 presents the d’ values for each 
test vowel as a function of the intervowel 
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interval. Identification of each vowel im- 
proved with increases in the silent interval 
between the test and masking vowels. 
These results can be used to estimate the 
duration of the preperceptual image and 
perceptual-processing time, since recogni- 
tion reached asymptote below perfect per- 


formance. Since performance did not im- 
prove beyond an intervowel interval of 
220 msec., the image could have decayed 


in this period. Figure 2 also shows that 
the test vowels were not recognized equally 
well; implying that the alternatives differed 
in discriminability. The rank ordering 
from least to most discriminable was /a/, 
/1/, /U/, and /i/. 

The continuous lines in Figure 2 are 
drawn through the predicted points given 
by a perceptual-processing model (Massaro, 
1970a), which assumes that readout of the 
information in the preperceptual image 
increases in a negatively accelerating man- 
ner with increases in processing time. More 
specifically, the perceptual strength of an 
item, as measured by its discriminability, 
d', follows an exponential growth function 
of time: 


[1] 


‘he measure d’ is the discriminability of 
the m after a presentation time of ¢ sec. 
In the model, presentation time includes 
both the duration of the test item and the 
silent interval before the onset of the mask- 
ing stimulus. Equation 1 indicates that 
the perception of a test item approaches an 
asymptote « at a rate of 8. 

Each test vowel is assumed to function 
as a perceptual unit and therefore has a 
number of distinctive features that corre- 
spond toa feature list in long-term memory. 
The rate at which these features are pro- 
| is reflected in the value of 6. A 
vowel with a large number of distinctive 
features would be expected to have a large 
a value. A noisy or unclear vowel would 
have few distinctive features and a small a 
value. Thus, the a value can be thought 
of as an index of the number of dis- 
criminable features of a vowel. 

According to the model, the overall dif- 
ferences found in identifying the test vowels 


d = a(t — e*). 
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should reflect differences in a, the measure 
of discriminability. However, 0, the rate 
of processing the information in the vowel, 
should be independent of a. In all cases, 
the number of features processed in any 
unit of time should be a constant propor- 
tion of the number of features that remain 
unprocessed. Equation 1 was applied to 
the observed results in Figure 2 by esti- 
mating a different parameter value of œ for 
each test vowel and a single value of 6. 
As can be seen in Figure 2, the model 
describes the results fairly accurately. 
Thirty data points are predicted with only 
5 parameter estimates, The parameter 
estimates of the a values for /i/, /I/, /a/; 
and /U/ were 2.29, 1.31, .88, and 1.77, re- 
spectively. The estimated value of 0 was 
15.17. 

Table 1 presents the average recognition 
performance for the different test stimuli 
as a function of the masking vowel. As 
can be seen in the table, all masking vowels 
were equally effective in terminating per- 
ceptual processing of the test vowel. Of 
special interest is the finding that a vowel 
can mask itself. This means that S did 
not discriminate trials on which the test 
and mask were the same vowel. Since 
identification of the test vowel requires 
perceptual-processing time, it cannot be 
accurately compared to the masking vowel 
until it is recognized. The masking vowel 
overwrites the test vowel effectively, leav- 
ing very little information about the test- 
vowel presentation. 

The results show that perception of a 
short vowel presentation improves with 


TABLE 1 
ORMANCE (MEASURED IN d' VALUES) 
AS A FUNCTION 
or MASKING VOWELS 


| Masking stimulus 


Ai A /al /U/ 

Jil 1.76 1.78 1.78 1.62 
/U .18 95 .96 .18 
laj 56 .66 23 .66 
/U/ 1.15 1.33 1.22 1.75 
Average 1.06 1.18 1.05 1.20 
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increases in the silent interval after its 
presentation. This result provides evi- 
dence for the assumption that the duration 
of vowels in normal speech provides time 
for perceptual processing. However, in- 
creasing the duration of a vowel could also 
increase the amount of information in the 
vowel presentation. Increasing the dura- 
tion of a vowel increases the information in 
the stimulus if one analyzes the frequency- 
amplitude spectrum of the steady-state 
vowel. With a short vowel stimulus, the 
distributions of energy around the respec- 
tive formants are fairly large, and increasing 
the duration of the vowel would decrease 
the variance of the distributions. Accord- 
ingly, to measure the relative contribution 
of processing time and stimulus informa- 
tion in vowel perception, perception of a 
short vowel followed by a silent interval 
was compared to perception of that same 
vowel left on for the processing interval 
before presentation of the masking vowel. 


EXPERIM 


Method 


Subjects. Eight undergraduates fulfilling a course 
requirement at the University of Wisconsin served 
as Ss for 5 days. 

Procedure. The 4 test alternatives used in Experi- 
ment II were also employed in this study. The 
vowels were first modified digitally to give exactly 
the same steady-state loudness (65-db. SPL). Since 
the vowels had a fundamental frequency of 192 Hz., 
5 fundamental segments gave a stimulus duration 
of 26 msec. Inthe continuous-processing conditions, 
vowels had to be presented at durations that 
exceeded our computer storage capacity. Accord- 
ingly, we stored 26 msec. of each vowel and repeated 
this segment until the desired duration was pre- 
sented, The masking vowel was a 208-msec. con- 
tinuous nonsense vowel made up of 2 repetitions of 
the 4 test vowels, /i/, /V/, /a/, and /U/, played for 
26 msec. each. 

On the first day of the experiment, Ss listened to 
the test vowels played at durations of. 208, 104, and 
52 msec., respectively, in 3 Successive 300-tria] 
sessions. Feedback was given simultaneously with 
the test-vowel presentation. The experiment proper 
was carried out on the following 4 days. In the 
silent-processing condition, the test vowel was pre- 
sented for 26 msec., followed in turn by a silent 
interval (0, 26, 52, 78, 130, 182, 260, or 390 msec.) 
and the masking vowel. In the continuous-process- 
ing condition, the test vowel was presented for 26, 
52, 78, 104, 156, 208, 286, or 416 msec., followed 
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immediately by the maskin experi- 
mental conditions (4 Test Vow« ocessing 
Conditions X 8 Processing In re com- 
pletely random in a given se re pro- 
grammed to occur equally ofte fore, on 
every trial, S heard a test vowel of * uration, 
followed in turn by a variable sil sland a 
masking vowel. Following a 2-se: period, 
feedback was presented. The inter val was 
1 sec. All other procedural deta nilar to 
Experiment II. Two 300-trial se given, 
and the first 5 trials on each day we l in the 
data analy 
Results 

The results shown in Figures and 6 
present discriminability values each 
vowel as a function of processin e and 
the silent and continuous condi: Vor 
every vowel, identification was bet when 
the vowel was left on for the 'ssing 
interval than when a 26-msec. vc. i was 
followed by a silent-processing rval. 
More specifically, for all test vov per- 
formance reaches asymptote at a nigher 
level of performance for the continuous- 
than for the silent-processing condition, 
suggesting that there was more stimulus 
information in the test vowel in the former 


than in the latter processing condition. 
Furthermore, the results also indicate that 
performance reaches asymptote at the same 
processing interval for the silent- and 
continuous-processing conditions. Accord- 
ingly, it appears that the rate of processing 
the information in the test vowel did not 
differ under the 2 processing conditions. 
This interpretation of the results can be 
described in the framework of the percep- 
tual-processing model. We first assume 
that a 52-msec. vowel produces more infor- 
mation than a 26-msec. vowel, but that 
the information in a vowel presentation 
does not increase with increases in duration 
beyond 52 msec, It follows that the con- 
tinuous condition should only have one 
value of o at test vowel durations of 52 
msec. or more. We assume that this a 
value is some constant K, times the a value 
for the corresponding 26-msec. vowel pre- 
sentation. Accordingly, to obtain the 
theoretical predictions, it is necessary to 
estimate a different a for each of the 4 
test vowels, the constant K, and a value 
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(The lines are drawn through the predicted points 
given by Equation 1.) 


for 0, the rate of perceptual processing, 
which should be constant under both the 
silent and continuous tone conditions. 

'The predicted curves are piven by the 
continuous lines in Figures 3, 4, 5, and 6. 
The model does a reasonably good job of 
describing the data, considering the fact 
that 64 points are predicted by estimating 
only 6 parameters. The parameter values 
for a were 2.57, 1.41, 1.24, and 1.91 for the 
vowels /i/, /I/, /a/, and /U/, respectively. 
The estimated value for K was 1.20, and 
the value for 0 was 23.77. 
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vowel /I/ as a function of processing time during 
the continuous vowel presentation or the silent 
interval after a 26-msec. vowel presentation: Experi- 
ment III, (The lines are drawn through the pre- 
dicted points given by Equation 1.) 
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vowel /a/ as a function of processing time during 
the continuous vowel presentation or the silent 
interval after a 26- msec. vowel presentation: Experi- 
ment III. (The lines are drawn through the pre- 
dicted points given by Equation 1.) 


In the silent-processing condition, recog- 
nition performance improved with increases 
in the silent interval, leveling off after 
roughly 200 msec. It might therefore be 
concluded that the preperceptual image of 
the test vowel decayed within this period. 
However, recognition performance reached 
asymptote at the same time in the con- 
tinuous vowel condition. In this case, 
the image should not have decayed, since 
the test vowel was still being presented. 
It appears that recognition leveled off be- 
cause S processed all the information avail- 
able in the stimulus in the first 200 msec. 
In terms of the perceptual-processing model 
presented here, the rate of processing was 
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vowel /U/ as a function of processing time during 
the continuous vowel presentation or the silent 
interval after a 26-msec. vowel presentation: Ex- 
periment III. (The lines are drawn through the 
predicted points given by Equation 1.) 
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Figure 7. Discriminability values for the 3 test 
Syllables as a function of the silent intersyllable 
interval: Experiment IV. (The lines are drawn 
through the predicted points given by Equation 1.) 


large enough that performance reached 
asympote within 200 msec. It is possible 
that further increases in the number of 
test alternatives will differentially enhance 
performance in the continuous- relative to 
the silent-processing condition. 

The results indicate that perceptual pro- 
cessing of a continuous vowel presentation 
continues for about 200 msec. This pro- 
cessing is terminated by a masking-vowel 
presentation. These results support the 
idea that the extended duration of the vowel 
in continuous speech allows time for per- 
ceptual processing, so that the vowel can 
be identified. The processing time during 
the steady-state vowels could also provide 
time for recognition of some consonants. 
The stop consonants are characterized by 
rapid transitions in the formants of the 
acoustic signal (Liberman, Cooper, Shank- 
weiler, & Studdert-Kennedy, 1967). The 
stop consonants /g/, /d/, and /b/ can be 
distinguished from each other by the second 
formant transition, which varies as a func- 
tion of place of articulation (Delattre, 
Liberman, & Cooper, 1955). It has been 
argued that the stop consonant could not 
be recognized while it is presented, but 
would require some silent time or steady- 
state period for recognition to take place 
(Massaro, 1972b). 

This analysis implies that the stop con- 
sonant-vowel (CV) transition would func- 
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tion as a perceptual unit in Spx 'ercep- 
tion. If the vowel is integrate th the 
CV transition, the extended d on of 
the vowel in normal speech wo rovide 
time for perceptual processing \ccord- 
ingly, the first part of the í 'attern 
probably could be identified pre- 
sented alone and followed by inter- 
val However, if the short ( llable 
were followed by a speech sound could 
not be integrated with it, percep hould 
be disrupted, and backward tion 
masking should occur. To tes , we 
took the CV transitions of 3 CV 
syllables and used these as tes: is in 
our backward-masking experimen 
EXPERIMENT IV 

Method. 

Subjects. Six University of Wisconsin under- 
graduates fulfilling a course requirement wc. tested 
for 5 days. 

Procedure. Three CVs were employed both 
test and masking stimuli. The CVs wer DA, 
and GA (A as in box). The 2 formant CV. were 
synthesized at Haskins Laboratory? and recorded 

igitally for playback during the experiment. 
Enough of the CV syllable was presented for it to 


sound speechlike. syllable was of 42-msec. 
duration (30 msec. of transition plus 12 msec. of 
steady-state vowel) and was presented at 76-db. 
SPL. On each trial, 1 of the 3 CV syllables was 
presented, followed in turn by a variable interval 
and 1 of the 3 syllables. The silent intersyllable 
interval lasted 0, 20, 40, 60, 100, 160, 240, or 320 
msec. Each S recorded his decision by pressing 1 of 
3 push buttons labeled Ga 1, DA 2, and BA 3, respec- 
tively. Following the 1.5-sec. response interval, 
feedback was provided by visually presenting the 
digits 1, 2, or 3 for 500 msec. The intertrial interval 
was 1.5 sec. Presentation of the 3 test alternatives, 
3 masking alternatives, and 8 intersyllable intervals 
was completely random. The alternative GA could 
occur 2 out of 8 times and Ba and pa each occurred 
3 out of 8 times. All other procedural details were 
the same as in Experiment II. 

On the first day, Ss identified the test syllable 
without the masking syllable present for 300 trials. 
The masking stimulus was presented in the second 
session. On Days 2, 3, 4, and 5, the experiment 
proper was carried out. There were 2 300-trial 
sessions per day. Only the results of Days 3, 4, and 
3 were included in the data analysis. 


* David B. Pisoni kindly provided a recording of 
the original unshortened syllables. 
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Results 


The results, presented as in Figure 7, 
show that identification performance im- 
proved with increases in the silent inter- 
syllable interval for all 3 CVs. The con- 
tinuous lines are drawn through the pre- 
dicted points given by Equation 1. Four 
parameter values were estimated: an a 
value for each CV test alternative, and a 
value of 6. The parameter estimates were 
3.18, 2.06, and 2.75 for the values for Ga, 
DA, and BA, respectively. The parameter 
estimate for the rate of perceptual pro- 
cessing, 0, was 17.28. Table 2 shows that 
the similarity between the test and masking 
stimuli did not systematically affect identi- 
fication performance. However, BA pro- 
duced less overall interference than GA or 
DA as a masking stimulus. This result 
might reflect the fact that the low fre- 
quencies of the syllable BA are less effective 
in terminating perceptual processing of an 
earlier CV presentation. 


DISCUSSION 


These studies were carried out to determine 
the size of perceptual units employed in speech 
perception. It was argued that the perceptual 
unit could not be larger than the temporal life 
of a preperceptual image of the speech stimulus. 
If the information in the image decays before 
it is synthesized, recognition would not be 
possible. The results indicate that processing 
the speech stimuli employed here occurs within 
200-250 msec. after the stimulus presentation. 
Accordingly, some transformation of the speech 
signal must occur about every ł sec. The 
stimulus within this time period must function 
as a perceptual unit. There must be sufficient 
information in this signal (and in the context 
in normal speech) for contact with information 
in long-term memory. To the extent the 
present studies simulate the processing of real 
speech, it might be concluded that the first 
stage of speech recognition must operate on 
acoustic segments that are of less than 250- 
msec, duration. If this is the case, perceptual 
units appear to be of roughly syllabic length, 
as suggested by Huggins (1964) and Massaro 
(1972b). 

There are at least 2 apparent difficulties with 
assuming that the first stage of recognition 
involves a transformation or synthesis within 
250 msec. after a sound pattern is presented. 


"TABLE 2 


AVERAGE RECOGNITION PERFORMANCE (MEASURED 
IN d' VALUES) FOR TEST SYLLABLES AS 
A FUNCTION OF MASKING SYLLABLES 


Masking syllable 


Test syllable 


GA DA BA 
GA 3.13 2.48 2-92 
DA 1.40 1.58 1.94 
BA 1.66 2,38 3.18 
Average 2.10 2.22 2.72 


The first is that the utilization of auditory 
information occurs across múch longer tem- 
poral periods. The memory for the quality of 
a speaker's voice requires an auditory memory 
that can last indefinitely. The interpretation 
of prosodic information such as stress requires 


an auditory memory that lasts for the length ~~ 


of a phrase or sentence. Most importantly, 
the recognition of sound patterns depends on 
an auditory memory for those sound patterns 
or even other sound patterns by the same 
speaker. For example, Ladefoged and Broad- 
bent (1957) showed that the acoustic charac- 
teristics of some vowels of a speaker influenced 
the recognition of other vowels by the same 
speaker. Accordingly, the listener must have 
remembered some of the auditory character- 
istics of the vowels for longer than 250 msec. 
The auditory memory illustrated by these ex- 
amples, however, is not preperceptual, but is 
auditory information that has been actively 
ed. 

ro (1972a, 1972b) has distinguished 
between two stages of auditory information 
processing which involve preperceptual and 
synthesized auditory storage, respectively. The 
first stage involves recognition of a sound 
pattern held in a preperceptual auditory image. 
The recognition of the sound pattern produces 
a synthesized auditory percept in synthesized 
auditory storage. Synthesized auditory storage 
has a much longer life span than preperceptual 
images and could be responsible for auditory 
memory effects that operate over longer time 
periods than are given by preperceptual audi- 
tory storage. Accordingly, the auditory storage 
responsible for memory for voice quality, the 
Ladefoged and Broadbent (1957) effect, and 
for interpreting prosodic features of a sentence, 
is not preperceptual, but is at the level of 
synthesized auditory storage. (For a more 
complete discussion of the differences in pre- 


perceptual and synthesized auditory storage, 
see Massaro, 1972a, 1972b.) 

The second problem is that it is well known 
that the acoustic sound pattern corresponding 
to a syllable changes with changes in the 
speaker, the rate of speaking, the stress pat- 
terns, and (most importantly) the influence of 
neighboring syllables (Fant, 1962; Ohman, 
1966). However, this problem is not unique 
to speech recognition, but is also the major 
deterrent to the development of a recognition 
device for handwritten text. This observation 
tells us that the representation of a perceptual 
unit in long-term memory must be a highly 
flexible one, with a number of normalization 
algorithms that operate on the sound-pattern 
input. Luckily, in normal speech, context or 
redundancy significantly reduces the number 
of valid alternatives for each sound pattern, 
The sound pattern can, therefore, be noisy, 
since redundancy reduces the number of acous- 
tic features that are necessary to recognize the 
pattern correctly. 
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Two experiments investigated whether Ss recognize that letters are physically 
identical prior to naming them, or whether the processes of naming and physical 
matching occur in parallel. Stimuli consisting of 3 letters were presented 
tachistoscopically at a duration such that Ss could identify all 3 letters about 
half the time. The S’s task was to name the letters. Some stimuli had all 
3 letters the same, some had all 3 letters different, and some had 2 the same 
and 1 different. There was no difference in recognition performance between 
the condition with all 3 letters the same and the condition with all 3 letters 
different, suggesting that the physical matching process and the naming process 


are not sequential. 
letters the same and the last letter 
were replicated with number stimuli. 


A given pattern can be categorized in 
numerous different ways. For example, 
the pattern A can be processed as a letter 
with a certain sound, as a vowel, as a part 
of a word, or simply as a physical pattern 
consisting of 2 oblique lines with a hori- 
zontal line connecting them. A series of 
experiments by Posner and Mitchell (1967) 
showed that it takes longer to perform some 
categorizations than it does to perform 
others. Posner and Mitchell asked Ss to 
decide whether 2 letters were the same or 
different, with the dependent variable being 
reaction time, and they found that physical 
matches took less time than did name 
matches, and name matches took less time 
than did matches based on the consonant- 
vowel distinction. A question which im- 
mediately comes to mind is how the various 
levels of categorization are related. There 
are 2 basic possibilities: they can be con- 
nected serially (sequentially), that is, 
physical processing is done first, and the 
information from physical processing is 
used to perform higher level categorizations 


1 This research is based on a doctoral dissertation 
submitted to the University of Connecticut. The 
author is indebted to his committee, Michael Turvey, 
David Zeaman, and Donald Dickerson, for their 
invaluable advice and assistance throughout. The 
equipment was purchased through University of 
Connecticut Research Foundation Grant 102, 

s for reprints should be sent to Robert 
, who is now at Russell Sage College, 
Troy, New York 12180. 


However, recognition of letter patterns with the first 2 


different was much lower, The results 


such as naming; or they can be connected 
in parallel, essentially independently of one 
another. 

The position that the 2 levels are se- 
quential was taken in a study by Beller 
(1970), who based his thinking on Neisser’s 
(1967) theory. Beller equates Posner and 
Mitchell’s (1967) Level 1 processing to 
Neisser's concept of preattentive mecha- 
nisms, while Level 2 corresponds to figural 
synthesis and pattern recognition. To 
verify his approach, Beller used the assump- 
tion that preattentive processes are carried 
out in parallel. If Level 1 (physical match) 
is done in parallel, then it should not matter 
how many items were presented; the time 
needed to classify them all as same or dif- 
ferent based on a physical match would 
be the same. His stimuli consisted of 2, 4 
or 8 letters which S had to classify as same 
or different according to their name, as in 
Posner and Mitchell's Level 2 instructions. 
There were 2 interesting conclusions to his 
study. The first was that the times re- 
quired to make physical matches were the 
same regardless of the number of letters 
in the stimulus; i.e., it took no longer to 
classify 8 letters as physically identical 
than it did to classify 2. These data pro- 
vide strong support for the assumption of 
spatiotemporal parallelism in Node 1 pro- 
cessing. The second conclusion was that 
when all the stimuli were physically iden- 
tical, sume response times were faster than 
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same response times when the stimuli 
differed physically but had a common 
name, even though the instructions in all 
cases were to judge on the basis of name 
similarity rather than physical similarity. 
This observation supports Beller's conten- 
tion that the 2 levels are operationally serial 
because, apparently, once S recognizes that 
the letters are physically identical, he does 
not have to name each one even though his 
instructional set is to do so. 
The purpose of the present experiments 
is to investigate the question of whether 
the physical match subsystem necessarily 
precedes the naming process. The general 
procedure was to present tachistoscopically 
a series of stimuli, each consisting of a 3- 
letter (or 3-number) array, followed by a 
masking stimulus. The time between the 
onset of the stimulus pattern and the onset 
of the mask was manipulated so that each 
S was able to report the 3 letters correctly 
about half the time. Previous research 
indicates that the amount of time that S 
has available to read out information from 
the stimulus is determined by the interval 
between the onset of the stimulus and the 
onset of the mask (see Kahneman, 1968, 
for a general review). Of course, this does 
not mean that no processing occurs after 
the mask comes on but rather that raw 
stimulus information is no longer available. 
Some of the stimuli had all 3 items the same, 
others had all 3 items different, and others 
had 2 the same and 1 different. The de- 
pendent variable was the percentage of 
letters correctly identified in each condition. 
If physical matching is an integral part of 
the preattentive processes as suggested by 
Beller (1970) and Neisser (1967), perfor- 
mance in the condition with all 3 letters 
the same should be better than performance 
in the case with all 3 letters different, with 
the other conditions in between these 2, 
because the preattentive mechanism would 
first detect the physical similarity of the 3 
letters. This information would facilitate 
recognition because S would only have to 
identify any 1 of the 3 letters in order to 
get all 3 correct. If, however, the physical 
matching level and the naming process 
operate in parallel, then there should be no. 
difference between the various conditions. 
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EXPERIMENT Í 


Method 

Design. The task was to iden eries of 
stimuli, each consisting of 3 letters, dona 
Gerbrands 3-channel tachistoscop vere 5 


conditions in the experiment. 
(AAA) all 3 letters were the same 
condition (AAB) the first 2 letter 

and the last letter was different. Ir 
dition (BAA) the last 2 letters werc 
the first letter different. In the fou 
(ABA) the first and third letters wer: 
the middle letter was different. In the 
(ABC) all 3 letters were different. Ea 
all stimuli in all conditions. The depend 
was the proportion of letters correctly 
each of the 5 conditions. 

bjecis. The Ss were 15 undergrad: 


ndition 
econd 

same 
d con- 

with 
;dition. 
ve and 
dition 
eived 
riable 
fied in 


it the 


University of Connecticut who particip n the 

experiment for approximately } hr. eac! ourse 
credit. 

Apparatus and procedure. The appar vas a 

Gerbrands T-3B-1 3-channel tachistosco Each 

art button. 

509 msec.; 

ted of 2 horizontal lines centered i e dis- 

play, which indicated where the letters w going 

to be. The luminance was .1 ftL. as mea l bya 

Spectra brightness spot meter. The second field 


t and 
iration 
ter- 


came on immediately at the offset of the fr 
contained the letters to be identified. The 
for the second field varied between Ss and 
mined in the pretraining as the duration which 
S was making 50% correct responses. The mean 
display time for the second field was 68 msec. with a 
standard deviation of 13.6. The third field was à 
mask made up of a series of interlocking circles and 
was in the same location as the letters. [t came on 
immediately at the offset of the second field and 
stayed on for 50 msec. The luminances of the second 
and third fields were identical at 1.5 ftL. 

Stimuli. The stimuli consisted of 50 4 X 6 in. 
cards, each of which contained 3 capital letters pro- 
duced by Letraset 48 pt. Futura Medium, Only 10 
of the 26 letters were used, C, D, H, L, M, N, P, R, 
S, and T. In the apparatus, each letter filled ap- 
proximately 46’ of visual angle, and a display of 3 
letters had a visual angle of approximately 2°51’. 
Each letter was used equally often in each condition 
to the extent that this was possible. There were 10 
stimuli in Condition AAA, 5 in Condition AAB, 5 
in Condition BAA, 10 in Condition ABA, and 20 in 
Condition ABC. The stimuli were divided randomly 
into blocks of 5 each, each block containing 1 of each 
condition to the extent that this was possible. The 
10 blocks were then ordered randomly and each 5 
was begun on a different block. In addition each S 
was given pretraining on stimuli identical to those 
used in the experiment, with the same proportion 
of each condition as in the actual experiment; i.¢- 
40% of the pretraining stimuli were of the ABC 
type, 20% were AAA pattern, 10% were BAA, etc. 
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The purpose of the pretraining was to stabilize the 
ince and to determine for each S that dura- 
which his performance was 50%. The num- 
of pretraining trials needed for this varied, with 
an of 48.9 pretraining trials per S (SD = 12.4), 
3 said his response aloud immediately at the 
ach trial. "The Ss were required to guess if 
y were not sure of the answer, and they had been 
old before the experiment which 10 letters were to 
be used, If S's response was wrong, he was told 
what the correct letters had been. The purpose of 
this feedback was to acquaint him with the possible 
combinations of letters and the approximate propor- 
tions of each, It had been determined in pilot work 
that Ss tended to guess 3 different letters each time 
without this feedback, and so it was necessary to 
indicate that some stimuli might have all 3 letters 
the same and that others might have 2 the same. 


Results 


Figure 1 shows the mean percentage cor- 
rect for each condition and position of the 
letter (first, second, or last). Averaging 
across positions, although performance in 
the AAA condition was slightly better than 
performance in the ABC condition as pre- 
dicted by the Beller (1970) — Neisser (1967) 
model, this difference was negligible. The 
only stimulus condition which appeared to 
differ widely from the others was the AAB 
condition, in which performance was in- 
ferior. An analysis of variance indicated 
that both main effects of position and con- 
dition and the interaction between them 
were significant : for condition, F (4, 196) = 
12.41, p < .001, for position, F (2, 196) = 
51.46, p < .001, and for the interaction, 
F (8, 196) = 2.93, p < .01. 

A Tukey test was done to determine 
which conditions differed significantly from 
each other. The only finding here was that 
the AAB condition differed reliably from 
the other 4 conditions at the .01 level. 
The difference between the AAA and ABC 
conditions did not even approach signifi- 
cance. Another Tukey test on paired com- 
parisons between positions averaged across 
condition indicated that performance in the 
first position differed from performance in 
the other 2 at the .01 level, while the differ- 
ence between Position 2 and Position 3 ap- 
proached significance at the .05 level. The 
significant interaction appears to be due 
primarily to the fact that the condition 
effect (performance in the AAB condition 
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AAA AAB BAA ABA ABC AV. 
STIMULUS CONDITION 
Ficure 1. Percentage of letters correctly identi- 
fied for each condition and position, Experiment I. 
(The A-B combinations represent the 5 same-difler- 
ent patterns, Av. = average.) 


being inferior to performance in the other 
conditions) was found only in the last 2 
positions. 

An analysis of the kinds of errors that 
were made was done to see if it would pro- 
vide further clues regarding the mental 
processes involved in this task. The results 
are shown in Table 1. If the Beller (1970) 
model is taken literally, Ss should be able 
to detect the same-different pattern of the 
stimulus (i.e., all letters the same; first 2 
letters same, last different; or whatever) 
before identifying the actual letters. Thus, 
there should be many cases where the re- 
sponse had the correct same-different pat- 
tern but missed one or more of the letters. 
The error analysis indicated that this was 
not the case. Many errors involved Ss 
getting all the letters correct but in the 
wrong order (1795 in the ABC condition, 
7% in the ABA condition, 20% in the AAB 
condition, and 2% in the BAA condition) ; 
while there were fewer cases in which the 
same-different pattern was correct but one 
or more of the letters was missed (13% in 
the ABA condition, 1195 in the AAB con- 
dition, and none in the BAA condition). 
In the AAA condition there was not a single 
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TABLE 1 
AERA No. DEUS NL sense, No. 
^ TER 6 m 
|12 6 5 | 
ace 1 10 | 
2 11 1 
18 1 | 1 
2 9 13 | 
9 10 9 
X! 8 X 5 Other | 2 | 
Other | 3 Other | 6 | 
| 42 (total) | 
57 (total) 55 (total) 


Note. 


case where S knew that all 3 letters were 
the same, but did not know what the letter 
was (an XXX error). 


PERCENT CORRECT 


Pausen cos PAIR 


AAA AAB BAA ABA ABC 


FiGURE 2 


ent patterns. 


AN. 


STIMULUS CONDITION 


Percentage of letters correctly identi- 
fied for each condition and position, Experiment II. 
(The A-B combinations represent the 5 same-differ- 


Av. — average.) 


Type of 
error 


Other 


t2 
ano Ugo 


60 (total) | 


BAC 
Other 


The A-B combinations he: Sing the columns represent. 5 same-different patterns determining the 5 conditions, 


EXPERIMENT II 


"There were 


suspicion. 


Was so 


replication. 


Method 


The design, equipment, and procedure of thi 


2 


major findings in Experi 
ment I that could be viewed with so! 
First, the absence of a signifi 
cant difference between the AAA and ABC 
conditions supports the null hypothes 
which is a questionable conclusion. 
the poor performance of the AAB condition 
unexpected 


that 


it 


second experiment were identical to thc 


ment I with the following differences: 
were numbers instead of letters; there were a tota 
of A stimuli instead of 50; Conditions AAA, ABAy 


and BAA ea had 5 KSR 


used instead of 15, 


Results 


The results of Experiment IH are shown 
The 2 tenuous conclusions 9 
Experiment I were firmly supported by 
First, the performance of the 
AAB condition was inferior to the perfor- 
mances in the other conditions; and second, 


in Figure 2. 


these data. 


and only 10 Ss were 


zZ 
E 


sod TOTO ISO S 


Second, 


warranted 


eof E 
the 
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there vas little difference between the per- 
fi es of the AAA and ABC conditions, 
this case the ABC condition was 
ior to the AAA condition. This dif- 
ce is in the opposite direction from that 
licted from the Beller (1970) — Neisser 
57) model. An analysis of variance in- 
icated that both main effects of condition 
wd position were significant at the .001 
level and the interaction between them was 
significant at the .05 level: for conditions, 
F (4,126) = 5.36, for position, Æ (2, 126) = 
15.63, and for the interaction, F (8, 126) = 
2.05. A Tukey test on the condition means 
averaged across positions indicated that the 
AAB condition differed from the ABC and 
ABA conditions at the .01 level and from 
the AAA condition at the .05 level. Another 
Tukey test on the position means averaged 
across conditions indicated that the first 
position performance differed from the 
other 2 at the ,01 level. 


DISCUSSION 


The most important conclusion of this re- 
is that the recognition of AAA stimuli, 
1 all 3 letters were the same, and the 
ABC stimuli in which all 3 letters were differ- 
ent, did not differ significantly, and this was 
true with either numbers or letters as stimuli. 
It can be concluded from this finding that 
either the same-different pattern of the stimuli 
was not discerned before the letters were 
identified or that it was discerned but not used. 
The latter seems unlikely. If the same-differ- 
ent pattern had been detected prior to the 
letters being identified, then there should 
have been many more cases in the AAA con- 
dition in which S's response had all 3 letters 
the same but used the wrong letter, This 
result calls into question Beller's (1970) asser- 
tion that physical sameness is detected by the 
preattentive mechanisms prior to identification. 

However, it is still necessary to resolve the 
difference between Beller's (1970) data and 
the present data. Beller found that when Ss 
were asked to make same-different judgments 
on the basis of name, response times were con- 
siderably faster when all stimuli were physi- 
cally identical, and it took no longer to classify 
8 physically identical stimuli as same than it 
did to classify 2. He concluded that physical 
match was done prior to the actual naming 
process and that it was done in parallel. The 
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present study concluded exactly the opposite. 
However, there are some important differences 
between the task of Beller's 5s and the task 
of the Ss in the present experiments. Beller's 
Ss were required to decide whether the letters 
presented had the same name or not. The 
actual task of naming was only implicit. In 
the above experiment, Ss were required to 
actually say the names of the letters. Ap- 
parently this difference can produce important 
differences in processing strategies in the sense 
that certain preattentive processes seem to be 
optional, When Ss are required to make a 
name match, i.e., say whether 2 or more letters 
have the same name or not, they apparently. 
employ a preattentive mechanism which makes 
physical matches prior to actually naming the 
letters. However, when their only gnment 
is to name the letters, then they can somehow 
forgo the use of the matching mechanism. In 
other words, which preattentive mechanisms 
are employed seems to depend on the specific 
task. 

A second finding of the present experiments 
is that recognition performance with AAB 
stimuli was considerably lower than perfor- 
mance in other conditions, This result is 
puzzling; perhaps some sort of symmetry 
notion can account for it; symmetry has been 
found to play a role in other perception experi- 
ments (e.g, Bryden, 1968), but this would 
require some stretching of the notion of sym- 
metry. At present, this phenomenon will have 
to be left unexplained. 

A third finding was that Ss were much 
more likely to get the first letter correct than 
the other 2 letters. This finding is not sur- 
prising; it has been reported in other studies 
(Bryden, 1960). What it indicates is that Ss 
are most probably reading from left to right, 
and it provides support for the notion that the 
letters are synthesized serially, one after the 
other, rather than in parallel (although the 
observation that Ss often knew what letters 
were present without knowing their order sug- 
gests the opposite). However, a parallel 
processing model could also account for the 
position effect by postulating that since the 
letters are spoken in a left-to-right order, 
there is a greater chance that S will forget 1 
of the last 2 letters rather than the first letter 
because he has to keep them in memory while 
he reports the first. 

Finally, these experiments may be taken as 
a demonstration of a case where one type 
of meaning information can be extracted 
directly from a stimulus without first having to 


attend to, recognize, or identify a lower and 
presumably more basic type of physical or 
sensory information. Other research suggests 
that Ss are capable of performing in a similar 
manner with other types of information. For 
example, Brand (1971) reported that some Ss 
were able to search through a list of letters 
more rapidly for "any number" than for a 
specified letter, suggesting that they are cap- 
able of determining whether a pattern is a 
number or a letter without first identifying it. 
This is not to say that basic sensory informa- 
tion is of no importance to perception, but 
rather that there is some sort of higher order 
invariant of "numberness" or ^letterness" 
which can be detected independently of 
identifying any particular alphanumeric char- 
acter. Similar notions have been expressed 
by Gibson (1966). In any event Brand's find- 
ing agrees with the present results in that it 
suggests that what is presumed to be higher 
order information can be extracted without 
first identifying lower order information. 
Thus, simple hierarchical models of pattern 
recognition in which lower level information is 
extracted first and then used to detect higher 
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order features must be called into 
question. 
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independent (I) interaxis control 
dependence, 


Two experiments on the acquisition and transfer of dependent (D) and 
movement 
defined as the correlation between the stimulus signal rate 


patterns are reported. Task 


" functions in the 2 axes, was varied in a transfer design. Twelve 5s in Experi- 
ment I and 24 Ss in Experiment II trained on either an I or a D task then 


were transferred to a test task with the same 
(Experiment II) intraaxis characteristic: 
Results showed. the following: 


(shift) interaxis dependence. 


(Experiment I) or different 
and the same (nonshift) or different 
(a) Ss ap- 


proached the dependence required by the task in training trials, (b) shift Ss 
had greater error increases in test than nonshift Ss, (c) Ss trained on an I task 
and tested on a D task (I-D Ss) showed greater relative negative transfer 


movement simultaneity. 


The principal objects of this research 
were (a) to ascertain the relationship be- 
tween the 2-hand movement dependence 
exhibited by Ss on a 2-dimensional tracking 
task and the fit of the multiplicative rule 
for time-on-target (TOT) scores, and (b) 
to determine whether the interlimb move- 
ment dependence required by the task is 
necessary to an adequate description of 
what Ss learn in tracking. The multiplica- 
tive rule for TOT scores is an empirical 
„relationship between the product of part- 
task scores and observed whole-task per- 
formance. Ellson (1947) first noted that 
the proportions of TOT in each of the di- 
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than D-I Ss, and (d) the fit of the 
scores to observed data was not related to movement dependence or to 


multiplicative rule for time-on-target 


mensions of a multidimensional task ap- 
peared to be statistically independent, so 
that 


P(TOT w) = P(TOT,) 
x P(TOT;) X P(TOT.), [1] 


where P(TOT we) is the proportion of time 
that S was on target in all 3 dimensions 
simultaneously, and P(TOT,), P(TOTy), 
and P(TOT.) are the proportions of TOT 
in each of the 3 part tasks measured without 
regard to each other. This rule has been 
shown to be very powerful. Ina variety 
of tracking tasks, both pursuit and com- 
pensatory, and for various stages of prac- 
tice, the rule predicts observed simultaneous 
P(TOT) within approximately 2% (Adams 
& Webber, 1961; Bilodeau, 1955, 1957; 
Bilodeau & Bilodeau, 1954; Dick, 1970). 
"The rule is robust over target sizes (Adams 
& Webber, 1961), at least under certain 
circumstances, and for up to 4-dimensional 
tasks (Bilodeau, 1957). As an empirical 
law relating part-task scores to whole-task 
performance in terms of system error, the 
multiplicative rule is unrivaled in its pre- 
cision and range. 

The question at issue is whether the 
operation of the multiplicative rule implies 
that the behavior giving rise to the TOT 
scores is somehow independent in the 
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several control dimensions. This implica- 
tion is based on the similarity of the form 
of the multiplicative rule to the equation 
defining the joint probability of indepen- 
dent events. Adams and Webber (1961) 
explicitly presumed this to be the case 
when they said, 


Therefore, at Present, we have no evidence to con- 
tradict the independence w [multiplicative rule] 
and must accept it as a first quantitative principle 
of complex tracking behavior with no known restric- 
tions on task, practice, or characteristics of the input 
signals [p. 200, italics added]. 


Other investigators have been more aware 
of the difficulty of inferring behavioral con- 
clusions from TOT error data. In their 
discussion of an experiment in which the 
data were well-predicted by the multiplica- 
tive rule, Bilodeau and Bilodeau (1954) 
said, 


To conclude from these data that the relation 
[between P(TOT) scores] is essentially zero, for 
example, is not necessarily to conclude that manipu- 
lative corrections between components are always 
independently performed [p. 42]. 


Nevertheless, they went on to say that 
"over a sizable time interval the events 
manipulative Corrections] can on the 
whole be Considered to be independent 
Lp. 42]," this conclusion being based on 
P (TOT) data. 

The principal difficulty with these kinds 
of inferences is the meaning to be assigned 
to "independence" as it refers to behavior 
and to response patterns. The term has 
taken several meanings in the literature. 
Perhaps the most commonly encountered 
meaning identifies the degree of indepen- 
dence (or dependence) with time-sharing 
patterns across control dimensions. |f AY 
moves 2 controls in an alternating manner, 
he is displaying little time-sharing ; if he 
tends to move the controls simultaneously, 
he is displaying increased time-sharing. 
This definition of ; 


they depend upon TOT error data for 
their premises, The term has taken other 
meanings. For example, Adams and Web- 
ber (1961), also quoted above, defined 
behavioral independence in terms of Cross- 
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correlation functions relating trackin ror 
signals in 2 dimensions of tracki A 
misconception underlying these defi ons 
is that tracking error is a valid of 


S's responding on t 
(1970) and Hoppe ( 
tracking error confo 


he task. As bot! ick 
1972) have point ut, 
unds the system { ng 


function and S's responses. As à -on- 


sequence, without an adequate descri; ion 
of the forcing function it is impossi!)' to 
separate their effects. Error indic re 
measures of system performance ot 
human responding, and without a beh r- 


ally relevant descri] 
S it is not possible 


"tion of the task: or 
to meaningfully ; E 


his responding based upon error dat 
Dick's (1970) major contribution wa to 


(correctly) observe 
human responding 


that conclusions ab. it 
in a tracking syst m 


must be based upon indices that are i 1- 
ambiguously descriptive of that respond 


Accordingly, Dick 


performed his m 


analyses on the recorded output of Ss’ cc- 


trol manipulanda, 
was notable, there is 
prerequisite to an 
ology adequate for 


While Dick's insig it 
one more indispensal je 
experimental metho.|- 
the study of human 


skilled behavior. In order to be interpre- 


table, any measure 


must have a referent. 


The referent for measures of tracking error 
is zero system error. The referent for 


measures of trackir 


ng behavior is not so 


obvious. It is what Hoppe (1972) has 
called the reference response pattern (RRP). 
The RRP is the precise spatiotemporal 


pattern of control 
to produce zero sys 


movements necessary 
tem error through the 


lengch of a trial, The RRP derives from a 


reconceptualization 
avior-reproduction 


of Ss' task as a be- 
task, at least for the 


purposes of measurement, and serves the 
Same purpose as the zero error referent 
for the error measures. (The RRP is 
developed in more detail in Hoppe, 1972). 


Zero error as the cri- 


terion system performance implies that E 
Should use indices of deviation from zero 
System error as dependent variables. The 
assumption of the RRP as the criterion 


behavior patterning 
Use indices that re 
that pattern as depe: 


implies that Æ should 
flect deviations from 
ndent variables, The 


ACQUI 
two are not n illy exclusive, but are 
complementa 

As a consexvonoce, a prerequisite for 
tracking r is a specification of the 
task requ in the same terms to be 
used to Ss’ behavior. Since the 
present deals with response de- 
penden c essary to define depen- 
dence in a way that renders it quantifiable 
and experi ily manipulable. In this 


research, dependence is defined as the rela- 
tionship of the rate and direction of con- 
trol movement in one axis to the rate and 
direction of movement in the other axis. 
More interaxis movement de- 
pendence in a 2-dimensional tracking task 
is indexed by the product-moment correla- 
tion of the rate functions in the 2 axes, the 
2 rate functions being synchronized in time. 
This correlation is the dependence coeffi- 
cient (DC) and is an index of what kind 
of movements of one control go with what 
kind of movements of the other control. 
consider a 2-dimensional 
ti controls Dimension A by 
lling a control rod with his 
| controls Dimension B by 
l-grip handle held in his 
the definition above, S is 
ispla ident responding when the 
DC deriv om his response movements 
departs i ero. This is equivalent to 
saying that S is moving in a dependent 
manner when, e.g., a clockwise twist of the 
left-hand control is statistically associated 
with pushing on the right-hand control. 
(Zero dependence does not mean that Sis 
not moving both controls continuously 
and/or simultaneously, but only that there 
is no statistical association of the rates and 
directions of movement of the 2 controls.) 
‘The sign of DC reflects the particular asso- 
ciation; in the research reported in this 
article, a negative DC indicated an associa- 
tion of counterclockwise twisting with 
pushing and clockwise twisting with pulling. 
One question addressed by the research 
reported here is whether task dependence 
so defined, when varied, produces concomi- 
tant variation in the fit of the multiplica- 


tive rule to Ss’ error data. 
The second and more general question 


precisely, 


pushing 
right hand, 
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addressed is whether dependence so defined 
is part of what Ss learn when practicing a 
tracking task. Textbook accounts of skills 
learning typically refer to the integration 
of part skills, or to the spatiotemporal 
patterning of movements (see, e.g., Fitts & 
Posner, 1967 ; Kay, 1969; O'Donnell, 1969), 
but there are few experimental data which 
give these assertions more than intuitively 
based meaning. In characterizing the task 
in the same terms to be used to describe 
Ss’ behavior in multidimensional tracking, 
the present research aims to provide the 
beginning of an empirical substrate for an 
account of learning of complex psychomotor 
skills. 


EXPERIMENTS I AND II 


Two experiments were performed to 
assess the effects of varying the DC task on 
(a) overall system performance, indexed 
by root mean square (rms) error, (b) the 
goodness of fit of the multiplicative rule to 
Ss' data, and (c) the degree of interaxis 
control movement dependence displayed 
by Ss. The experiments were similar in 
general form and will therefore be described 
together. Since the effort in the experi- 
ments was to determine the importance of 
task dependence to a description of what 
S has learned, a transfer design was used ; 
Ss were trained on one task dependence and 
tested on another. In this manner it was 
hoped to ascertain whether Ss were merely 
coming to display the interaxis control 
dependence required by the task, or were in 
fact learning something about that depen- 
dence that has effects on subsequent re- 
sponding on a task with a different DC. 
During training on a dependent (D) task, 
the DC of Ss' responding is expected to 
approximate, over trials of practice, the 
DC defined as criterion by the system 
forcing functions. This would demonstrate 
that Ss can impose a D structure on their 
responding if the task requires it. Transfer 
to an independent (I) task is expected to 
show, by comparison with an appropriate 
control, whether the D response patterning 
is a learning or performance variable. It is 
possible that Ss in a D task are learning to 
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respond appropriately in the 2 axes simul- 
taneously, but are not learning the depen- 
dence between responses in the 2 axes. 
If these Ss are shifted to an I task that is 
identical to the D task except for the de- 
pendence between axes, Ss’ DC should 
shift immediately to the test task value, 
and the system performance measure should 
show no change as a function of the change 
in tasks. The same prediction would hold 
for Ss trained on an I task and tested on a 
D task. If, on the other hand, there is an 
effect due to the shift in tasks, then one may 
conclude that Ss were learning the de- 
pendence of responding in the 2 axes. 

The experiments were also designed to 
assess the relationship between the multi- 
plicative rule for TOT scores and be- 
havioral independence; dependence. If the 
rule is in fact an index of behavioral inde- 
pendence as defined by DC, then it is to be 
expected that it will differentiate between 
Ss trained on tasks having different DCs. 
Specifically, the rule should predict the 
P(TOT) scores of Ss performing on an | 
task better than the scores of those per- 
forming on a D task. If there is no regular 
relationship between behavioral dependence 
and the rule's predictions, then one may 
conclude that the rule is not an indicant 
of independent responding on the 2 controls. 

The general approach in the experiments 
was to train Ss on a 2-dimensional com- 
pensatory tracking task with a known, 
E-determined DC. The Ss were then 
transferred to a task with either identical 
(Experiment 1) or different (Experiment 
II) intraaxis characteristics, and the same 
(nonshift groups) or different (shift groups) 
interaxis control movement dependence. 
The experiments were full transfer designs, 
with all 4 groups (I-I, D-D, I-D, and D-1) 
run in both, 


Method 


Subjects. The Ss were 36 right-handed male un- 
dergraduate students in introductory psychology 
classes at the University of Minnesota. Experiment 
I used 12 Ss, and Experiment II used 24 Ss; all Ss 
were volunteers who received extra class credit for 
their participation. 

A bparatus. A multidimensional compensat 
tracking system, the Minnesota Tacks Se 
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(MTS), was used. For both experiments, th: MTS 
was wired as a 2-dimensional system. Error infor- 
mation was displayed to Ss in the form o lot of 
light moving in 2 orthogonal dimensio n an 
8 X 10 em. oscilloscope screen. The S rolled 
the movement of the dot in the vertical nsion 
by pushing and pulling a control rod wit right. 
hand; pushing the rod moved the dot u rd on 
the display, pulling moved the dot d ward, 
Control in the horizontal dimension w come 
plished by twisting a pistol-grip handle ! n the 
left hand; twisting the control clockwise : 1 the 
dot to the right on the display, twisting nter- 
clockwise moved the dot to the left. Th: ge of 
travel of the right-hand push-pull control v cnt, 
corresponding to 11 em. of vertical displa: nt of 
the dot, 4-53 cm. from the center of the di i face, 
Control-display gain in the vertical dime: was 


twist 
hori- 
the 
jain 
s is 
the 


3.64:1.0. The range of travel of the left-h 
control was 160°, corresponding to 11 cm 
zontal placement of the dot, 3-53 cm 

center of the display face. Control dis 
in the horizontal dimension was 14.55:1.( 
obvious, both controls were capable of 


dot completely off the 8 X 10 cm. d I ice, 
The center of the display was indicated the 
intersection of 2 hairlines; the centers of th: nges 
of both controls corresponded to this « iair- 
defined center, The MTS is described in doil by 
Dick (1970). 

System forcing functions. Since the con tion 
of the system forcing functions is critic the 
experiments, it is necessary to describe m in 
some detail In Experiment I, 2 pairs of ‘orcing 
functions were used. Each pair consisted of ? com- 
plex signals, 1 for the vertical dimension axis) 
and 1 for the horizontal dimension (B axis AIL of 
the complex signals in both pairs were formed by 


combining a 5-epm function with a 1 spm function 
in the following manner, In the A axis, the complex 
signal was the resultant of the combination of à 
5-cpm sine function with a 12-cpm sine function. 
The peak-to-peak amplitudes of the 2 functions were 
equal. In the B axis, the complex signal was the 
resultant of the combination of a 5-cpm sine function 
with a 12-cpm negative cosine function, again with 
the peak-to-peak amplitudes of the component func- 
tions equal. The 2 D conditions used in Experiment 
I were obtained by varying the interaxis phase rela- 
tionship between the 2 complex signals to obtain 
the desired value of DC, A value of DC = 0.0 was 
the I stimulus, and DC = — -70 was the D stimulus. 
The result of these manipulations was one set of 
3- and 12-cpm stimuli for the D experimental condi- 
uon, and another set of 5- and 12-cpm stimuli for 
the I experimental condition, Since the period of 
the complex functions used in each axis is 60 sec., 58 
in both conditions saw 1 complete cycle of each 
stimulus signal within each trial. The only differ- 
ence between the I and D conditions was the point 
in the cycle where the B axis began. In the D 
condition, a clockwise twist of the left-hand control 
Was associated with pulling the right-hand control, 
and a counterclockwise twist was associated with 
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; the right-hand control. As the value of 
for DC in the D condition indicates, the asso- 


ci » was not perfect, but wasa partial dependence. 
In : | condition there was no systematic relation- 
shi > between the control movements required in the 
2 cAimensions, 


“Lhe stimuli for Experiment II were generated in 
the same manner as in Experiment I, except that in 
ad dition to 2 sets of 5- and 12-cpm combinations, 2 

s@ ts of complex functions composed of 6- and 10-cpm 
11e functions were generated. The latter had the 
Sa me dependence relationships as the 5- and 12-cpm 
sig nals, and the same peak-to-peak amplitudes. In 
x periment ||], the 2 sets (I and D) of 6- and 
1 O-cpm signals were used as training stimuli, and 
the 2 sets of 5- and 12-cpm signals were used as test 
Stimuli. In Experiment I, the same 2 sets of 5- and 

1 2.cpm signals were used in both the training and 
test phases. All trials within each set of stimulus 
conditions were identical. All the forcing functions 
had peak-to-peak amplitudes sufficient to drive 

the dot +4 cm. from the center of the display in 
both dimensions. 

Procedure. The basic design of the experiments is 

Shown in Table 1. Half of the Ss in each experiment 
vere randomly assigned to the D training condition 
zx nd half to the I training condition. At the end of 
the training phase, the training groups in Experi- 
rments I and II were randomly reassigned, half of 
<ach being tested on a task with the same (I or D) 
HC as in training and the other half on a dependence 
«lifferent from the training DC. 1 procedure 
yielded 4 groups in the test phases of Experiments I 
and II: I-I, D-D, I-D, and D-I. 

The Ss were trained for 24 1-min. trials on the 
appropriate task, and then were shifted to their 
respective test tasks for 16 1-min. trials. Trials 
were presented in blocks of 8, the intertrial interval 
being 30 sec., and the interblock interval, 5 min. 
The Ss left the experimental room during the 
interblock interval while E prepared the next block 
of trials. The shift at transfer was made during 
the normal interblock interval, between the third and 
fourth blocks, and Ss were not informed of the 
change. 

Instructions to Ss told them only that the experi- 
ment dealt with human motor skills, and that their 
task was to keep the dot centered on the screen by 
appropriate manipulations of the controls. The Ss 
were given a short pretraining familiarization with 
the action of the controls in the absence of a forcing 
function and then received their training and test 
trials in one experimental session. The Ss were 
aware that data were continuously recorded, but 
were not informed of the nature of the analyses to 
be performed. No feedback other than that present 
on the display during the course of a trial was given; 
nevertheless, Ss appeared to be well motivated and, 
without exception, were cooperative. 

Measurement. The 2 response signals generated 
by Ss' manipulations of the controls and the forcing 
functions driving the dot in the 2 dimensions were 
simultaneously tape recorded for all trials. In the 
computer analyses, the 4 signals were simultaneously 
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TABLE 1 
DESIGN or EXPERIMENTS I AND II 


Experiment. ; 
Training condition ] 
I I ! 
Dependence test condition Ss 
Dependence 3 6 
Independence 3 6 
Independence test condition Ss 
Dependence 3 6 
Independence 3 6 


sampled and converted to digital values at a 10-Hz. 
rate, and all analyses were performed on the digi- 
tized data. The sampling rate of 10 Hz. is suffi- 
ciently higher than the rates present in the sampled 
signals to assure good digital approximations to the 
original continuous signals. Using the digitized 
data, the following 4 measures of various aspects 
of Ss’ responding and of system performance were 
calculated: rms error, DC, simultaneity index (SI), 
and an index of the goodness of fit of the multiplica- 
tive rule's prediction to the observed data. A brief 
description of the measures follows. 


1. The rms error is the square root of the mean 
square tracking error over a trial. For each trial, 
each S has an rms error score for each dimension of 
the task. To derive error values from the recorded 
data required only that the computer calculate the 
difference between the amplitude of the forcing 
function in a given dimension and the amplitude of 
the response signal in that dimension at each sample 
point. These error values formed the basis for the 
rms error scores. The rms error is a sensitive, 
reliable measure of system performance (Bahrick, 
Fitts, & Briggs, 1957; Kelley, 1969), Decreasing 
rms error indicates increasingly good system per- 
formance. In addition, in a zero-order control 
system like the MTS, rms error is an index of the 
overall approximation of .Ss' control position to the 
criterion control position. 

2. The DC is the product-moment correlation of 
the rate-of-change in one axis with the rate-of-change 
in the other axis. The DC of the forcing functions 
is the criterion dependence required for perfect per- 
formance; the DC of S's responding in the 2 axes 
is the dependent variable. 

3. The SI is the ratio of simultaneous movements 
and nonmovements to the total number of sample 
periods, and is an index of output (response) time- 
sharing. Given a 4-fold table of movement (M) 
and nonmovement (NM) in Dimension A vs. M and 
NM in Dimension B, the M/A, M/B intersection 
is Cell A, the M/A, NM/B intersection is Cell B, 
the NM/A, NM/B intersection is Cell C, and the 
NM/A, M/B intersection is Cell D. For each 100- 
msec. sample interval during a trial, the computer 
decided whether 5 moved both controls, moved one 
control but not the other, or did not move either 
control. A movement was defined as a change in 


the amplitude of the response signal of greater 
than 10 mv., which corresponds to a movement rate 
of .51 cm/sec on the vertical control and 3.16 
degrees/sec on the horizontal control. For each 
sample interval the computer mak n entry in the 
appropriate cell of the table. At the end of a trial, 
SI is calculated as follo: 
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The SI of the forcing functions used in Experiments 
land I was approximately .80, which is the criterion 
value defined by the task. The SI of Ss’ responses is 
the dependent variable. It should be noted that 
because of the definition of very low rates as NMs, 
Cells B and D will be slightly inflated and as a 
consequence SI artifactually underestimates the 
actual simultaneity of the task and of Ss. How- 
ever, since both the task criterion SI and the SI of 
5s are underestimated to the same degree, the rela- 
tion of the two is meaningful. (Cell C is also slightly 
inflated for the same reason, but since it appears 
in both the numerator and denominator of Equation 
2 this does not affect the ratio.) 

4. The method of determining the P(TOT) scores 
from which to evaluate the goodness of fit of the 
multiplicative rule is somewhat out of the ordinary, 
and must be described in some detail. Bahrick 
et al. (1957) noted that the target size most sensitive 
to variations in system performance is one wh. 
encompasses approximately 68% of the error ampli- 
tude distribution. Since error in a tracking task 
varies as a function of practice, a given fixed target 
size will be more or less sensitive at various stages 
of practice. Therefore, for each S, the target size 
used was set equal to +1 rms in each axis for each 
trial, This variable target was used to compute 
P(TOT,), P(TOT;), and P(T Ta») for that trial 
in a post hoc analysis. This procedure results in 
fairly constant P(TOT) scores for each individual 
axis and for the product of the single-axis scores, 
which is the prediction of the multiplicative rule. 
This technique of using post hoc target sizes based 
upon the dispersion of S's own error amplitude dis- 
tribution is due to Dick (1970). The goodness of 
fit index is then: 


Goodness of fit 
= [P(TOT,) x P(TOT,) — P(TOT,4)] x 100, 


[3] 
where P(TOT,) and P(TOT,) are the proportions of 
TOT in each dimension, and P(TOT,) is the 
proportion of TOT in both dimensions simultane- 
ously. Assuming that mean tracking error is ap- 
proximately zero and that the error amplitude 
distribution over a trial is approximately normal 
(Bahrick et al., 1957), then 1.0 rms = 1.0 Sp of the 
error amplitude distribution. From this it follows 
that the approximate range of the index is from 
—22% (maximum possible underestimation) to 10% 
(maximum possible overestimation). 

In all the data analyses, only the middle 50 sec. 
of each trial were included. The first and last 5 sec, 
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of each trial were dropped to avoid target t 
artifacts and, at the end of a trial, the p ility of 
a synchronization error that would ir 
samples of posttrial noise from the tape tainil 


the raw data. As a consequence, the va! of DC: 
defining the I and D conditions in the « iments 
were not precisely 0.00 and —.70 as zinall 


planned, but varied slightly. In Experin 
D stimuli (both test and training) DX 
the J stimuli (test and training) DC 0. 
periment II, the D training stimuli D¢ 
the D test stimuli DC = —.71, the I traini 
DC = .02, and the I test stimuli DC 
differer between the | and the D D€ 
to be negligible. 


Results 


This section focuses primarily on thé 
results of Experiment II. With respect tø 
the task dependence variable, the 2 experi 
ments are replicates, so the result 
Experiment I will be introduced only a 
necessary to support the Experiment M 
data. In general, the Experiment | results 
parallel thóse of Experiment I1. 

System performance. The measure of sy 
tem performance, rms error, showed clea 
change as a function of practice and as a 
function of the change in dependence at the 
shift between Blocks 3 and 4. It decreased 
over trials of training, approaching asymps 
totic levels in the second and third blocks 
of training in both experiments. The E 
group tended to have slightly higher error? 
through the training trials, In periment 
Il, at the shift between Blocks 3 and 4, 
all 4 test groups showed an increase in 
system error. This in itself is not surprising, 
since the test-task forcing functions con- 
tained both higher and lower rates than 
the training forcing functions, However, 
the shift groups (I-D and D-I) showed - 
greater increases from Block 3 to Block 4 
than did the nonshift groups. Table 2 
shows the ratio of rms error in Block 4 
over rms error in Block 3 for the 4 test 
groups. Each entry in the table is the 
mean rms error in Block 4 of one of the test 
groups divided by the Block 3 mean rms 
error for the 6 Ss of that test group. All 
of the ratios are greater than 1.0, indicating 
that system error increased as a function 
of the task change in all 4 groups. Also 
apparent in the table is a pattern reflecting: 
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fects of the changed dependence for 
hift groups. They show greater in- 
cin rms error than the nonshift groups. 
re was an increase in system error due 
: change in task dependence over and 
hat due to the change in the intra- 

is characteristics of the task. The Ex- 
periment I data showed the same general 
ttern, With the exception of 1 S in the 
i-I group, the nonshift groups improved 
(ie., ratios « 1.0), which is reasonable 
since in Experiment Il their t ks were 
identical in training and test. The shift 
groups, on the other hand, showed in- 
creases in system error from training to 
test. These increases can only be due to 
the change in dependence, since the intra- 
axis characteristics of the training and test 
tasks were identical in Experiment I. This 
result was consistent at the individual S 
level. In Experiment I at this level there 
are 24 ratios (12 Ss X 2 variables), and of 
these 24 ratios only 3 depart from the 
pattern described. Of the shift Ss’ ratios, 
11 of 12 were greater than 1.0; of the non- 
shift ratios, 10 of 12 were less than 1.0. 

A transfer analysis comparing shift 
groups with nonshift controls, using Briggs' 
(1969) Equation 1, was performed on the 
Block 4 rms error data.’ This transfer 
index is: 


Control group — Experimental group 
Control group 


x 100. [4] 
For rms error, which decreases with im- 
proved system performance, the index 
varies from 100% to minus infinity. Table 
3 presents the transfer indices calculated 
from the Block 4 mean rms error data of 
Experiment II. All of the indices have 
negative signs, indicating greater rms error 
in the shift groups. In addition, the I-D 
group showed greater negative transfer than 
the D-I group. Again, the Experiment I 


3 The terms in the numerator of Equation 4 are 
reversed from Briggs (1969) so as to yield negative 
transfer indices in those cases where the experimental 
group error was greater than the control group error. 
Relative to a (hypothetical) control group receiving 
no training prior to the test trials, of course, all test 
groups showed positive transfer. 
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TABLE 2 


Ratio or Roor MEAN SQUARE (RMS) ERROR IN 
BLOCK 4 TO RMS ERROR IN BLOCK 3 FOR 4 TRAIN- 
ixG-TEsT CONDITIONS OF EXPERIMENT Il 


| Dimension 

Condition | — —— 
mee 
Independent-independent | 1.02 1.02 
Dependent-dependent | 1.03 1.01 
Independent-dependent 1.06 1.12 
Dependent-independent | 1.06 1.29 


data replicate the pattern of Table 3, show- 
ing general negative transfer (with the ex- 
ception of 1 aberrant S), and differentiating 
the D-1 from the I-D group, in spite of the 
aberrant S. The transfer indices, together 
with the ratios of Table 2, show that there 
was a detrimental effect on system perfor- 
mance due to the change in task dependence 
at transfer. 

Fit of the multiplicative rule. The multi- 
plicative rule generally predicted observed 
P(TOT) very well in both experiments. 
The range of the block mean goodness of fit 
indices varied over less than 10% of the 
total possible range of the index. The 
rule generally underestimated observed 
P(TOT), and at the block mean level of 
analysis appeared to segregate groups by 
task dependence, the D groups being un- 
derestimated more than the | groups in 
both training and test blocks. Both experi- 
ments produced the same patterns of results 
in these respects. 

However, the goodness of fit data were 
not as regular as the block means imply. 
Figure 1 shows the goodness of fit index 
over the last 8 training trials and the first 


TABLE 3 


"TRANSFER INDICES COMPARING Root MEAN SQUARE 
(RMS) ERROR OF SHIFT GROUPS WITH RMS ERROR 
or Nonsuirr CONTROL GROUPS: EXPERIMENT 
II, BLOCK 4 MEAN DATA 


Dimension 
Condition |a ee ne 
| A B 
Independent-dependent —18.7% - 16.4% 
Dependent-independent | — TAY — 6.3% 
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FIGURE 1. Goodness of fit of the multiplicative 
rule over the last 8 training trials (Block 3) and first 
8 test trials. (Block 4) of Experiment II. (Abbrevia- 
tions: I = independent, D = dependent.) 


8 test trials of Experiment II. As is 
obvious, the fit of the rule to the data was 
. neithersregular nor clearly related to task 
dependence. (Block 3 of Experiment II 
was the only place where the D and I Ss 
Were not separated at the block mean level 
of analysis.) There is considerable over- 
lap of the trial mean goodness of fit in the 
various test groups, and it is not the case 
that the fit of the multiplicative rule is un- 
equivocally determined by the task de- 
pendence, where dependence is defined in 
terms of DC. Again, the Experiment JI 
data agree. 

Behavioral dependence, The interaxis re- 
sponse dependence displayed by Ss, in- 
dexed by DC, is shown over the 40 trials 
of Experiment II in Figure 2. (The DC 
data are in the form of logarithmic trans- 
formations of r. This transformation was 
performed for purposes of analyses not 
germane to the present discussion, and does 
not affect the results reported here.) As 

~ the figure shows, Ss’ interaxis control move- 
ment dependence changed as a function of 
practice, and the change is in the direction 
of the task-required dependence, The 
training-trial means approach the criterion 
dependence in both conditions, the D Ss 
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moving in a more dependent ner anle 
the I Ss moving more indep tly, bu 
in neither condition did th: ich. th 
criterion in training. 

The change in DC at the shi! cars to 
be quite regular. The D-1 p in- 
creases DC slightly, corresp to the | 
slight increase in task depend id the 1 
I-I Ss change DC slightly sitive” 
direction, again corresponding smell 
change in the task dependenc: > shirt 
groups (I-D and D-I) also « ze de- 
pendence rapidly. The I-D gr while 
quickly changing dependence at v.e shift, 
does not change to the value oí D-D 
control group, but falls short b: small 
margin. In Block 3, the D-D gro: > block 


mean was at 79.7% of the criterio. value, 
while in Block 4 it was at 79.5° f the 
criterion. "That is, there was no substantial 
change in the D-D Ss’ relationshi;. to the 
task criterion DC as a result of the unge 
from a dependent 6- and 10-cpm ta 
equally dependent 5- and 12-cpm t 
The I-D group, however, reached onl 
70.5% of the criterion dependence 

first test block. This pattern of resu! 
replicated in Experiment I, where the D-D 

Ss were at 94.0% of criterion in Block 3 and 
95.2% in Block 4, while the I-D group d 
reached only 73.5% of criterion in Block 4.)°™= 
The I-D trial means are consistently below 
those of the D-D Ss through Block 4 in 
both experiments, A 2-way analysis of 
variance with repeated measures on one 
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FIGURE 2. Mean dependence coefficient (DC) 
over the 40 training trials of Experiment II. (The 
horizontal 


bars over the data points indicate the 
DC for the dependent [D] group; the 


d 1] group criteria were 02 in training 
and .04 in test.) 
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ictor (trials) was used to assess the 
tatistical significance of the effect of train- 
ing dependence (I or D) on the DC of Ss’ 
responding on a D task in Block 4 of 
Experiment H. -Neither the training de- 
pendence main effect, F (1, 10) = 2.18, 
the trials main effect, F (7, 70) = .60, nor 
the Training Dependence X Trials inter- 
action, F (7, 70) = .67, was statistically 
significant. However, the replicability of 
.he phenomenon renders the nonsignificant 
outcome of a statistical inference based on 
one set of data less than compelling. 

The shift in dependence from D to I 
showed no effects like those just described. 
The D-I Ss shifted directly to the new 
dependence at transfer, and were indistin- 
guishable from the I-I controls at both the 
block and trial mean levels of analysis. 
This result also replicated in Experiment I 
and in the partial replication mentioned in 
Footnote 4. In general, the results of the 
DC analyses show that while training on a 
D task has no discernible effects on the 
dependence displayed on a subsequent I 
task, the reverse is not true. Training on 
an I task is not equivalent to training on a 
D task with respect to subsequent respond- 
ing on a D test task. This asymmetry is 
reflected. in the rms error transfer indices 
in which the I-D groups show greater nega- 
tive transfer than the D-I groups in both 
experiments. The greater negative transfer 
in the I-D groups’ system error may be 
due to the failure to move in as dependent 
a manner as the D-D control groups in 
Block 4. This is not to say that there is 
no commonality of what is learned in the 
2 tasks; it is obvious that Ss in both training 
conditions are learning very similar skills, 
but they are also learning something differ- 
ent: a relationship in the D condition be- 
tween particular movements of one control 


1 It is worth noting that a third partial replication, 
in which 2 groups of 3 5s each were trained on 6- and 
10-cpm I and D tasks and then shifted to the other 
(D or I) dependence, yielded results (compared 
to extrapolations of training data) showing the same 
patterns of transfer asymmetry às the 2 full experi- 
ments reported here, The fact that the patterns 
replicated 3 times is sufficiently strong evidence to 
show that they did not occur by chance. 
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Ficurr 3. Simultaneity index (SI) over blocks 
of 8 training and test trials of, Experiment IH. 
(Horizontal bars over data points indicate training 
and test SI criteria. Abbreviations: I = indepen- 
dent, D — dependent.) 


and the other control'which did not charac- 
terize the I condition. 1] 

The SI is shown over blocks of trials of 
Experiment I 
panded scale*on the vertical axis of the 
figure. The total possible range of the 
index is 0.0-1.0; the vertical axis includes 
only 20% of this range. In terms of the 
total range of the index, SI changed very 
little as a function of practice. Although 
the task SI was not a manipulated variable 
in the experiments, it did change slightly 
in Experiment II, increasing from .77 to .79 
with the shift in tasks at transfer. The Ss’ 
SI changed abruptly at the shift, paralleling 
the change in criterion SI. In training. 
there appeared to be a segregation of the 
groups by task dependence, but it did not 
remain consistent in the test blocks, even 
for the nonshift groups. The Experiment 
I data replicated the Experiment II results 
in that in training, D Ss exhibited a 
slightly higher SI than I Ss, while in test 
the groups were not differentiated accord- 
ing to task dependence. Both training 
conditions within each experiment had the 
same criterion SI, and the author has no 


in Figure 3. Note the ex- @ 
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satisfactory explanation for the separation 
in training. 

The SI does not appear to be: clearly 
related to the fit of the multiplicative rule 
to Ss’ data. While the SI data indicated a 
consistent differentiation between depen- 
dence groups in training, the multiplicative 
| rule's fit to their P(TOT) data varied con- 
1 siderably. Changes in the goodness of fit 
of the rule did not reflect changes in re- 
1 sponse time-sharing patterns, 


Discussion 


Tt was noted in the Results section that the 
— fit of the multiplicative rule was not deter- 
—. mined by task dependence, but there Was no 
particular reason to suppose that it would be. 
The rule has been taken to be an indicant of 
response independence, an index of the degree 
component of 
se in the other 
ar by now that the rule 
is not directly related to response dependence 
— Whether dependence is defined as movement 
2 simultaneity (SI) or as the correlation between 
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the D and I group means becoming almost 
identical in this respect, both DC and SI 
clearly differentiated Ss in the 2 conditions. 

* à consequence, it must be concluded that 
the multiplicative rule for TOT scores, while 
a powerful empirical relationship between 


this out. 

The data indicate that Ss can and do impose 
a dependent Structure on their responding 
when the task situation requires it, On the 
other hand, if the task requires independent 
movements, Ss can supply them, Given trials 
of practice, Ss approach the dependence defined 
by the task as criterion. Over and above this, 
it appears that task dependence may be part 
of what Ss learn in the practice trials. The 
relationship of the rms error transfer indices 
to the asymmetry of the DC shifts Suggests 


224 RICHARD B. HOPPE 


that the I- D group had not learned everyting 


there was to know about responding on 

D test task despite the intraaxis similari: 
the 2 conditions in Experiment I. The rep 
tion of this pattern in Experiment II in 
face of a concomitant change in the intra 
Characteristics of the task implies that 
I-D Ss must learn the dependent res; 
patterning required by the new task 

D-I Ss changed dependence immediately 
transfer, implying that they had acquired 
skill necessary to pattern their movem 
independently prior to the test trials, tho 
one should note that their rms error scores 
creased from training to test. It js not cert 
that this skill was acquired in the context 
the experimental training; it is possible that 
was acquired preexperimentally. There ma 
be population stereotypes regarding the inte: 
limb patterning of movements in multidimen 
sional tasks. Research to explore this questior 
is currently in progress, 

The present research is the first in w hich ; 
question of human tracking behavior has bee: 
explicitlysstudied in the light of the behavior; 
requirements of the task. The view taken ha 
been that tracking is a behavior-reproductio: 
task. The S's job, from £'s point of view, i 
to reproduce the RRP defined by the Systen 
forcing functions and the electromechanica 
characteristics of the control system, In ord 
to study behavioral dependence, it is necessary 
to specify the dependence required by the 
task and to evaluate Ss' responding with 


This approach 
would have led Dick (1970) to specify the phi 


for SI and DC in the present research, so that 
n time-sharing he observed Would 
have been interpretable. Similarly, Adams 
npting to ascertain 
havioral indepen- 
dence and the multiplicative Tule, cross-corre- 
lated error records, They found time-lagged 
correlations that were reliably nonzero, and 
concluded that there was underlying beh, 
dependence in spite of the fit of the multi 
tive rule's predictions, However, Adam 
Webber give no information regarding the 
task requirements jn this respect beyond 
Saying that they used “input signals that were 
-. ünSystematic betw. 
Lp. 203]." Whether “ 


e 


f 


ACQUISITION AND TRANSFER OF 


reported is unavailable, and one cannot know 
just what the Ss were doing relative to the task 
requirements. 

The research reported in this article supports 
the utility of the view of tracking, for measure- 
ment purposes, as à behavior-reproduction 
task. This view assumes that one's task as an 
experimental psychologist is to produce re- 
search answers to theoretical questions. Given 
that theories in psychology make predictions 
about behavior, it is logically necessary to 
have a research and measurement methodology 
capable of generating answers phrased in 
terms of behavior. It follows that the experi- 
mental task used must be described in terms 
of behavior, and the measures used must 
refer unambiguously to behavior. Thus the 
view of tracking as a behavior-reproduction 
task is not a theory of motor behavior, but is 
the basis for a research methodology appro- 
priate to the study of that behavior. 
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The Ss rated total magnitude of 5 kinds of stimulus pairs: length-length, 


area-arca, length-area, length-weight, and area-weight. 
providing a within- 
tional measurement scale values for each dimension were lir 
across tasks, providing a between-task test of consistency. 
of consistency tests validate the rating 


fit the data in each c; 


An adding model 
test of consi: tency, Func- 
ly related 
y. These 2 kinds 
response as an interval scale of sub- 


jective magnitude, lending further support to previous conclusions that 
magnitude estimation is biased and invalid. 


A perennial idea over the last 4 decades 
of work on psychophysical scaling has 
been the use of one or another algebraic 


— model as a basis for measurement (see 
T 


x 
v 


Anderson, in press-a, in press-b). Ratio 


. models for fractionation and adding models 


are examples, 
Such models impose a criterion of internal 
Consistency on the data thereby constitut- 
i base for the scales, 
Psychophysical data, however, have not 


Recent developments show more prom- 
ise. Functional measurement methodology 
(Anderson, 1970b), and related techniques 
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paper. 
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H. Anderson, Department of Psychology, Uni- 

versity of California, La Jolla, California 92037. 


developed by other investigators, 
begun to yield results that do show 
ternal consistency. ‘This approach re 
heavily on the use of algebraic models 
psychophysical integration. The integ 
tion model serves as a scaling frame 
which to construct interval scales for stin 
lus and for response. Scaling then’ becon 
an organic part of substantive inquiry, no s 
à separate methodological preliminary. 

An important outcome of this work i= 
the support for ordinary rating methods 

th modest experimental care, ratings 
can yield responses that are on an interval 
scale. Magnitude estimation, in contrast, 
Seems to be biased and nonlinear (e.g., 
Anderson, 1972a; Weiss, 1972), a conclu- 
Sion that is also Suggested by the work 
of the Oregonians (eg, Beck & Shaw, 
1968; Curtis, Attneave, & Harrington, 
ame Curtis & Fox, 1969; Fagot & Stewart, 

9). 


: This report takes up an interesting task 
introduced by Feldman and Baird (1971) 
who asked Ss to judge “ 
of a sound plus a light. 
stimulus integration seems to call for 
adding model. 


stitutes a criterion of internal consistency, 


functional measurement, 
The total Magnitude task also Provides 
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(Weiss & Anderson, 1969) requires that 
the scales for any one dimension should 
be the same across the different pairs in 


hich it occurs. The present article gives 
n initial application of functional mea- 
surement methodology to this total mag- 
nitude task. 


METHOD 


Stimuli 


The 3 sets of stimuli were length, area, and 
weight, with 5 stimulus levels in each set. The 
lengths ranged 4-20 cm. in 4-cm. steps. They were 
constructed of black drafting tape, 2-mm. wide, 
stuck to a white tagboard approximately 8 X 28 cm. 
The areas were circles with diameters ranging 
2-6 cm. in 1-cm, steps, They were constructed of 

‘black paper, pasted on white tagboard approxi- 

mately 10 X 14 cm. The weights ranged 200-600 

gm. in 100-gm. steps. They were not visible, and 

were lifted by pulling down on a handle-pulley 
arrangement used in previous work (Anderson, 
1972a), 

t The Ss judged stimulus pairs, of which there 

+ were 5 types. Three were heteromodal (length-area, 
length-weight, and area-weight) and 2 were homo- 
modal (length-length and area-area). The weight- 
weight condition was omitted as it would have 
involved serial rather than simultaneous presenta- 
tion, The 3 heteromodal pair types were run in 
complete 5 X 5 factorial designs. Owing to an 
error, the design for the homomodal pairs was 

triangular, not a complete factorial, but that did 

not affect the results. n 

A small and a large stimulis end anchor, con- 
structed from corresponding single stimuli, were 
used for each type of pair. The small.and large 
lengths were 2 and 25cm., the small and large 
circles were 1 and 8 cm., and the small and large 
weights were 100 and 800 gm. These end anchors 
had their usual purposes, including that of reducing 
tend effects in the rating scale (e.g., Anderson, in 

press-a, in press-b; Eriksen & Hake, 1955, 1957). 


Procedure 
^ 


The S was instructed to judge the “total in- 
tensity’ of each pair of stimuli. Ten practice 
trials covering the stimulus range were given for 
each type of pair. The first 2 were the small and 
large anchors, presented with instructions that 
they should be called 1 and 20, respectively, with 
the other pairs to be given intermediate numbers 
according to their total magnitude. 

Pairs involving length and area were presented 
by placing the 2 stimuli in front of S. Pairs in- 
volving weight were presented by having S lift 
the weight while looking at the other stimulus. 

One replication of the 5 stimulus designs was 
run in each of 3 sessions, Practice was given only 
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4 Q X 4 RORI Wiss 4 € 
LENGTH (cm) LENGTH (em) DIAMETER lem) 
Ficure 1. Parallelism test of integration model. 

(Vertical axis represents mean judgment of total 

magnitude. Lower panels for heteromodal stimulus 


pairs; upper panels for homomodal stimulus pairs.) 


in the initial session, but the 2 anchors were pre- 
sented anew before each later replication. In each 
replication, the length-area pairs were presented 
first, followed in order by length-weight, area- 
weight, length-length, and area-area, Responses 
for each stimulus pair were recorded on'an index 
card. The deck of cards for each stimulus type 
was shuffled for each replication for each’ S to 
balance order effects. The Ss were 20 students 
or similar members of the university community who 
were paid $5.64 for the 3 sessions. 


RESULTS 
Integration Model 


The mean judgments of the stimulus 
pairs are presented in Figure 1. Each 
lower panel shows one heteromodal type, 
plotted in the format of the 5 X 5 design. 
The values of one design factor are given 
on the horizontal, and the values of the 
other design factor are listed by each 
curve. The 2 upper panels show the 
homomodal data plotted analogously. For 
comparison, the upper panels are plotted 
as though a complete factorial rather than 
a triangular design had been used; each 
point connected by a dotted line is iden- 
tical with a corresponding lateral point 
connected by a solid line. 

The key property of these data is par- 
allelism. The simple adding model pre- 
dicts that each set of curves should be 
parallel, and that is essentially true. 
A rigorous test of parallelism is available 
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FIGURE 2. Linearity test of cross-task con- 
sistency, (Each curve represents the functional 
scale from one type of stimulus pair plotted against 
the functional scale from another type of stimulus 
pair.) 


from analysis of variance; the ordinary 
Row X Column interaction tests for sig- 
nificant discrepancies from parallelism. 
This interaction did not approach sig- 
nificance in any test, F (16, 304) = 1.49, 
79, and .78, in the 3 respective panels. 
The 3-way interactions with replications 
were also nonsignificant. Similar tests are 
possible for the 2 triangular designs (Ander- 
son, 1972b), but were not made in view 
of the evident parallelism in the graphs. 
The simplicity of this analysis is notable, 
Simply plotting the raw ratings gives an 
immediate graphical test of the integration 
model. The parallelism property jointly 
validates the integration model and the 
response scale since in general both must 
hold for parallelism to obtain. Functional 
scales of the subjective stimulus values are 
also available. These are considered next. 


Functional Scales 


Since the model has passed the paral- 
lelism test, subjective scales of the stimuli 
can readily be obtained. According to 
functional measurement theory, these are 
just the marginal means of the factorial 
design. : 

Scaling can be illustrated for the Length 
X Area data in the lower left panel of 
Figure 1. The vertical spacing of the 
curves constitutes an interval scale of the 
subjective values of the areas. The curves 
for circles of 2- and 3-cm. diameter appear 
to be closer together than the curves for 
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circles of 5- and 6-cm. diamete 

subjective circle value appears to 
small. positive acceleration as a 
of physical diameter. In this sa 
the mean curve yields a scale of fi 
length. It is almost linear in 

physical length. 

Functional scales from the oth: 
torial designs are obtained in 
way. Scale values from the : 
designs can be obtained analogou 


Cross-Tusk Validation 


Three functional scales of su tive 
length are available since the length tim- 
uli were used in 3 types of pairs. the 


same subjective values function in each 
of the 3 integration tasks, each |-ngth 
scale should be a linear function of each ? 
other length scale. " 

This test of cross-task consistes.v is 
shown in Figure 2. In the left panc:, the 
functional scale of length from the | « ngth 
X Length design is plotted on the hori- 
zontal axis. The other 2 functional -cales 
of length, from the Length X Area and 


Length X Weight designs, are plotted on 
the vertical. Both curves are essentially 
linear. Thus, the subjective values of 
length are indeed the same, up to a linear 
transformation, across all 3 integration 
tasks. 

The center panel plots subjective weight 
from the Area X Weight design against 
subjective weight from the Length X 
Weight design. The right panel shows 
subjective area from the Length X Area 
and Area X Weight designs as a function 
of subjective area from the Area X Area 
design. All these curves appear linear. 
In general, therefore, the same subjective 
values appear to function in each task. 


Psychophysical Law 


Once the subjective stimulus values have 
been obtained, the psychophysical law can 
be obtained very simply. It corresponds 
to the graph of these subjective values as 
a function of the objective physical values. 
These graphs are shown in Figure 3. 

The 3 psychophysical functions for 
length (left panel of Figure 3) are all 
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essentially linear. This result agrees with 
most previous work on length. 
[he center panel plots the 2 
ohysical functions for heaviness. Both 
chow a slight downward convexity. This 
shape, which would correspond to an ex- 
ponent slightly less than 1 in a power 
function, agrees with previous work in 
: (his program (Anderson, 1970a, 1972a). 
In contrast, magnitude estimation yields 
an exponent around 1.45 (Stevens & Ga- 
lanter, pea This disagreement is con- 
sidered further in the Discussion section. 
The 3 curves in the right panel of 
Figure 3 show the psychophysical func- 
tion for area. These appear to have some 
upward convexity, as might be expected 
since physical area would itself be convex 
upwards as a function of circle diameter. 
It should be noted, however, that a linear 
function would give a reasonable fit to 
these curves for area, suggesting a tend- 
ency to rely on the diameter, perimeter, 
or other linear aspect as a measure of the 
magnitude of the circle. Since such a 
tendency might be potentiated by the 
present array of tasks, these scales for 
area may be task specific. More appro- 
priate scales of area might be obtained by 
restricting to designs with areas only and 
by emphasizing area judgments in the 
instructions. Thus, circles could be paired 
with triangles, crosses, or disks. This 
procedure could also provide subjective 
scales for irregular figures by applying 
a 2-stage model (Anderson, in pressa; 
Anderson & Weiss, 1971). 


psycho- 


Related Work on Weight Adding 


Curtis and Fox (1969) applied an adding 
model to judgments of sums of 2 lifted 
weights. They reported a good fit for 
their rating data, in agreement with tests 
of an averaging model for lifted weights 
(Anderson, 1967; Anderson & Jacobson, 
1968). 

However, these 2 integration tasks, 
summing and averaging, have produced 
a seeming disagreement about the psycho- 
physical function for lifted weight. The 
relevant reports all agree that the psycho- 
physical function is convex downwards, 
which would correspond to an exponent 
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Psychophysical functions: Func- 
plotted against the 


FiGURE 3. 
tional stimulus scales are 
physical values. 


less than 1 in a power function. However, 
Curtis and Fox (1969) reported an ex- 
ponent around .65, whereas Anderson 
(19722) claimed a near linear function by 
visual inspection. This seeming disagree- 
ment prompted further analysis of the 
previous data, and 3 main points need 
mention. 

Data amalysis. First, S. J. Rule (per- 
sonal communication, March 16, 1973) fit 
a power function to Anderson's (1972a, 
Figure 3) data, obtaining an exponent 
of .69. This value is close to that of 
Curtis and Fox (1969). With a fixed 
response range, of course, the difference 
in stimulus ranges (10-218 gm. in Curtis 
and Fox, 200-600 gm. in Anderson) would 
itself produce some difference in the ex- 
ponent of a power function fit (Poulton, 
1968). It may be worth adding that a 
logarithmic function remains invariant, 
changing only in its base to adapt different 
stimulus ranges to à fixed response range. 

Second, the present author reanalyzed 
the category rating data of Curtis and 
Fox (1969) using functional measurement 
to test the adding model. Analysis of 
variance for triangular designs (Anderson, 
1972b) was applied separately to each 
single subject and to the group as a whole. 
These analyses supported the adding model 
fairly well. For the group data, deviations 
from additivity were not significant tested 
against the Ss X Pairs interaction, F (20, 
243) — .97. Of the 10 single Ss, 2 or 3 
did show significant discrepancies, tested 
against individual within-cell variability, 
but these did not seem to be overly serious. 


n 
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For category ratings, therefore, the data of 
Curtis and Fox (1969) agree well with re- 
sults on stimulus integration obtained by 


"the author in the present and previous 


reports. 

Third, the present author also reanalyzed 
the magnitude estimation data obtained 
by Curtis and Fox (1969). In the indi- 
vidual analyses, 7 of the 11 Ss showed 
marked discrepancies from the adding 
model, with 6 of the 7 F ratios significant 
beyond the .001 level. If the adding 
model used by Curtis and Fox is correct, 
magnitude estimation cannot be valid. 

In the group magnitude estimation data, 
these individual discrepancies from the 
adding model averaged out. Indeed, the 
adding model fit the group means quite 
well, F (19, 260) = .34, for deviations 
from additivity. The group geometric 
means were not much different from the 
arithmetic means, in part perhaps because 
of the choice of standard and modulus in 
this particular experiment. This example 
illustrates the risk in the standard pro- 
cedure of averaging magnitude estimation 
data across Ss. 

Statistical analysis. The statistical anal- 
ysis used by Curtis and Fox (1969) has 
certain limitations that need to be pointed 
out. The basic limitation is that it does 
not provide an adequate test of the adding 
model. In slightly changed notation, their 
2-stage model would read 


R = a(S* + S)" + b, 


where Sı and Sz are the physical values 
of the 2 stimuli, R is the observed response, 
k and m are power function exponents, 
and a and b are constants. 

They fitted this equation using the 
standard, nonlinear least squares analysis 
to estimate the exponent parameters in 
the above equation. Such analyses also 
purport to give certain statistical reliability 
information, including standard errors of 
the parameters, together with an overall 
F test of deviations from the model. 
This reliability information, unfortunately, 
has little value because it rests on the 
assumption that the fitted model is cor- 
In this analysis, deviations from 
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additivity, from the power funct 1s, or 
both, go into the error term. Incor: -tness 
in the model thus tends to obscure If be- 
cause it inflates the error term. To best 
of the present author's understandi_, this 
kind of analysis needs to be restri ed to 
the single S, and requires an e! term 
based on within-cell variability fo h S 

The functional measurement p dure 
may be useful in further stud the 
2-stage model. It is more gen than 
the procedure used by Curtis Fox 
(1969) because it requires no ass tion 
about the psychophysical stimulu — 'unc- 
tion. Indeed, no physical metric is eded 
for the stimuli which may have rely 
nominal values. General response rans- 
formation is also allowable (Ander: n, in 
press-a, Section II, A, 6; Bogartz & `vack- 
witz, 1971; Weiss, 1973). Thus, a r: orous 
test of the basic model is feasible while 
allowing for general input and tput 
functions. 

It should be emphasized that the present 


nclu- 


analyses have reached the same © 
sions as Curtis and Fox (1969), exccpting 
only the present finding of substantial 
discrepancies in the individual data on 
magnitude estimation. The limitations in 
their statistical technique are not of present 
importance, therefore, but they need men- 
tion because similar techniques have been 
employed in related reports. These statis- 
tical tests of goodness of fit are important 
because the theoretical interpretation de- 
pends on whether or not the basic algebraic 
model is correct. 


Other Related Work 


The heteromodal integration task was 
introduced by Feldman and Baird (1971) 
who tested an adding model (arithmetic 
mean) and a multiplying model (geometric 
mean) for judgments of total magnitude 
of stimulus pairs that consisted of a sound 
and a light. Their graphs suggest that 
the adding model did better, but they do 
not provide a test of goodness of fit. The | 
functional measurement procedure would 
give a much simpler and more powerful 
analysis. The adding model predicts par- 
allelism, while the multiplying model pre- 
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dicts a bilinear form (Anderson, in press-a, 
Section II, B, 6; in press-b, Section 7.12). 

Dawson (1971) studied sum and dif- 
ference judgments of circular area and of 
loudness, and Marks and Cain (1972) 
studied difference judgments of circular 
area, heaviness, and roughness. Both 
reports concluded that there were system- 
atic discrepancies from the sum and dif- 
ference models. However, these discrep- 
ancies may merely result from bias in the 
magnitude estimation response. 

One other report on sum judgments 
should also be noted. Five experiments 
on length adding in a thoughtful article 
by Kreuger (1970) all found systematic 
overestimation of the combined length of 
2 or 4 lines, relative to judgments^of single 
line controls. Although Kreuger concluded 
that the integration procedure itself in- 
troduced a bias into the length production 
response, a simple adding model may be 
able to account for his data. To a large 
extent, though perhaps not completely, 
the overestimation could reflect a constant 
error introduced by the combining process. 
That would be consistent with the rough 
tendency toward constant overestimation 
across different lengths, and with the 
greater overestimation for 4 lines than 
for 2 lines, 


DISCUSSION 


Integration Theory 


The present data support the adding model 
for psychophysical integration in all 5 cases. 
The simple parallelism test of Figure 1 ac- 
complishes 3 goals simultaneously (Anderson, 
1970a, Figure 1): (a) It validates the inte- 
gration model (psychological law), (b) it vali- 
dates the response scale, and (c) it yields 
interval scales of subjective stimulus value. 

Numerical response measures, such as 
ratings, may well fail to be interval scales. 
But if the ratings were not an interval scale, 
a correct model would fail the parallelism 
test. That test, therefore, provides a joint 
validation of the model and of the response 
scale. The stimulus scales derive from the 
model as already shown. 

The basic element in this functional logic 
is the integration model. It is the structural 
frame on which response and stimuli are 
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scaled. The simple adding task may seem 
trivial in itself, but it has an important 
theoretical role to play (Weiss & Anderson, 
1969, 1972). 


Magnitude Estimation Seems to be Invalid 


The present results imply that magnitude 
estimation is biased and invalid. It has been 
repeatedly emphasized (e.g., Stevens, 1957, 
1966, 1971) that ordinary ratings and mag- 
nitude estimations are nonlinearly related for 
prothetic dimensions such as were used here. 
This point is well illustrated by the present 
psychophysical function for lifted weight 
which would have an exponent less than 1 in 
a power function fit. In contrast, magnitude 
estimation yields exponents around 1.45 
(Stevens & Galanter, 1957; see also Stevens 
& Rubin, 1970). Being nonlinearly related, 
ratings and magnitude estimates cannot both 
be interval scales of the same thing. At least 
one is invalid. 

The present rating data have satisfied 2 
validity criteria. The parallelism test of 
Figure 1 demonstrates within-task consistency, 
providing one basis for concluding that the 
rating response is an interval scale, The 
linearity test of Figure 2 demonstrates cross- 
task consistency, thus giving generality to the 
concept of stimulus scale value. Since these 
results support the validity of the rating 
response, they imply that magnitude estima- 
tion is invalid. 

This experiment does not stand alone. 
Three other lines of evidence support the 
same conclusion. The first is a program of 
work in the author's laboratory that has 
provided substantial support for simple alge- 
braic models of psychophysical integration. 
In particular, Anderson (1972a) showed 
within-task and cross-task consistency of 
heaviness scales obtained from weight aver- 
aging and from the size-weight illusion. Also, 
Weiss (1972) made a direct comparison in 
grayness averaging and found that ratings 
satisfied the parallelism prediction and mag- 
nitude estimation did not. 

The second line of evidence comes from 
other work on psychophysical integration. 
Studies by Curtis and his colleagues have 
generally found that magnitude estimations 
do not fit sum and difference models, unless 
perhaps they are nonlinearly transformed. 
These same studies have also generally ap- 
peared favorable to ratings though correct 
tests of fit have been lacking as noted above. 
Fundamental work by Birnbaum and Veit 
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(in press) can also be most simply interpreted 
to support ratings as opposed to magnitude 
estimations. 

The third and final line of evidence comes, 
curiously enough, from social psychology. 
Using social stimuli, Stevens (1966) has found 
the same nonlinear relation between ratings 
and magnitude estimations that is obtained 
with prothetic psychophysical dimensions. He 
concluded that ratings show the same general 
bias in both cases. However, a long-extended 
program of work by Anderson (1962, in 
press-b) has found that ratings also satisfy 
a class of algebraic integration models in the 
social field, Basic work in decision making 
by Shanteau (e.g., 1970, 1972) has also sup- 
ported the functional measurement approach. 

The social data have particular interest, 
Unlike most psychophysical dimensions, many 
social dimensions have both positive and 
negative values, Such data make it plausible 
to rule out geometric averaging models and 
analogous models that might seem compatible 
with magnitude estimation data (Anderson, 
1972a; Birnbaum & Veit, in press; Weiss, 
1972). 


Integration and. Valuation 


Heteromodal integration raises an interest- 
ing question about process, Since length and 
weight are qualitatively different sensations, 
how can they be added together? To handle 
this conceptually, it would seem reasonable, 
if indeed not necessary, to suppose that the 
different stimuli are changed into some com- 
mon currency for the demands of the im- 
mediate task. Sensations are not themselves 
added, but rather their processed values. 

Integration theory handles this matter in 
terms of a valuation operation. Any task 
defines a judgment dimension, implicitly or 
explicitly. The valuation operation produces 
the scale value of the stimulus, and its weight, 
in relation to this judgment dimension. 

But this view requires that the scale value 
depend on the integration task because that 
controls the valuation operation. It would 
be premature, therefore, to claim that scale 
values from any one task will measure the 
magnitude of a basic sensation. Each task 
imposes its own valuation operation, and 
these may be nonlinearly related across tasks. 
That is certainly true for verbal stimuli. 
Even for a simple tone, scale value will 
depend on whether S is set to judge loudness 
or pitch. 
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In this view, the subjective magnitude 
that are studied in psychophysical judgment 
may be rather distant derivatives from any 
basic "sensation." The commonsense yi 
that there is a direct relation, may well bt 
correct, but it requires justification. Com 
fidence in any of sensation would 
thus seem to rest on a concept of cross-tas 
scale invariance. 

Need for such cross-task validation agree 
with the emphasis on converging operation 
by Garner, Hake, and Eriksen (1956), and 
in an important article by Birnbaum and 
Veit (in press), Similarly, Weiss and Andersot 
(1969) commented on scale invariance 
psychophysics that “Phe validity of func 
tional measurement will depend on gettim 
the same scales with different integration tasks 
Cp. 63]. The present and previous reports 
(Anderson, 1972a) have demonstrated such 
cross-task scale invariance. 


measure 


The Psychological Law 


The disagreement between magnitude esti 
mation and rating methods is only a surface 
reflection of a deeper difference. Indeed, 
since functional measurement explicitly allo 
for monotone transformation, it could make 
direct use of magnitude estimation data by 
transforming out the bias. 

The deeper difference appears in the validity 
criterion. Functional measurement rests om 
the use of some algebraic model; that provides 
the scaling frame and the validity criterion. 
But in the magnitude estimation approac 
these integration models have been largely 
disavowed (Stevens, 1971; see also Anderson, 
in press-a, Section IV, B, 4). 

This difference in methodology comes from 
a shift in conceptual orientation, away froi 
the psychophysical law and toward the psy- 
chological law. The prime concern and em- 
phasis in work with magnitude estimation has 
been with the psychophysical law, that is, 
with the relation between subjective mag- 
nitude and the physical values of the stimulus: 
The functional measurement approach i$ 
primarily concerned with the psychological 
law, in particular, with models of stimulus; 
integration. Accordingly, the basic problems: 
are substantive problems of stimulus integra- 
tion. Measurement in general, and the psy- 
chophysical law in particular, are derivative 
from the Psychological law. In this respect 
(Anderson, 1970b, in press-a), functional mea- 
surement is a reaffirmation of the spirit of 
à long tradition in psychophysical judgment, 
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STIMULUS FAMILIARIZATION 


AND CHANGES IN 


DISTRIBUTION OF STIMULUS ENCODINGS ! 


ALLEN R. DOBBS? 
University of Alberta, Edmonton, Alberta, Canada 


Independent groups were given 1 or 10 familiarization trials on integrated 
(consonant-vowel-consonant word) or nonintegrated (consonant-consonant— 


consonant) trigrams. 


The 4 groups (n = 48) were then given a stimulus 


recognition task containing the following 4 types of items: (a) the original 
stimuli, and distractor items having their (b) first, (c) second, or (d) third 


letters in common with the original stimuli, 


The probability function for 


encodings isomorphic with letters Írom the 3 letter positions was inferred from 


identifications, S 


een Ss from each group then learned 1 of 3 types of 


paired-associate (PA) list wherein the stimuli were the familiarized items. 


Encoding during PA learning was inferred from a. stimulated-recall ta 


k gi 


after PA learning. Contrary to the 1968 conclusions of Martin, the result 
indicated that stimulus-only familiarization resulted in selective. encoding for 
nonintegrated stimuli. Although those changes did not affect the rate of PA 
acquisition, familiarization did affect stimulus encoding during PA learning. 


The stimulus encoding variability hy- 
pothesis advanced by Martin (1968) pro- 
poses that the perceptual encoding response 
to a verbal unit may vary on successive 
exposures, and that the amount of this 
variability is negatively correlated with the 
degree to which the unit is integrated. 
Highly integrated units such as consonant- 
vowel-consonant (CVC) words are as- 
sumed to be consistently encoded on suc- 
cessive paired-associate (PA) trials, and it 
is this encoding stability which is postu- 
lated to underlie the relatively rapid 
acquisition of lists having integrated-stimu- 
lus items. Lists which contain uninte- 
grated (low meaningfulness, m) stimulus 
units, such as low m CCCs, are acquired 
more slowly, and it is postulated that this 
decreased learning rate is due to the lack 
of initial stable encodings for unintegrated 
units. However, stability of encodings for 
unintegrated stimuli is assumed to vary 
across trials. Specifically, it is assumed 
that there are initial probabilities for the 
elicitation of the various possible encodings 
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of unintegrated stimuli, that this proba- 
bility density function (PDF) shifts toward 
degeneracy for a single encoding early in 
learning, and then becomes more platy- 
kurtic with overlearning. That is, the 
initial PDF distribution is assumed to shift 
such that a single selective encoding be- 
comes dominant early in learnin z, but all 
encodings become equally probable during 
overlearning training. Although it is left 
unspecified as to why or by what mecha- 
nism the PDF shifts to degeneracy, it seems 
clear that Martin considers the cued-recall 
situation to be a necessary condition. This 
appears intuitively reasonable, as the func- 
tional value of stable selective stimulus en- 
coding during PA learning is oby ious, 
whereas it is difficult to see why individuals 
would selectively encode at all in a stimulus- 
only familiarization task. There are also 
data available which are, at least, not in- 
compatible with this notion. For example, 
Schulz and Martin (1964) have reported 
that stimulus familiarization, consisting of 
aural (spelled) presentation and requiring 
recall, was found not to have a differential 
effect on the subsequent rate of PA learning 
for several levels of stimulus m. 

On the basis of these data and the Martin 
(1966) results, Martin (1968) has concluded: 
that increases in stimulus recallability, such 
as that produced by stimulus-only familiar- 
ization, does not affect the acquisition of an 
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association between an encoding(s) and 
the response term differentially over levels 
of m (or integration). Further, he stated 
that this 


therefore implies that independent familiarization 
with a nominal stimulus does not alter the distribu- 
tion of probability density over [ri—sı] [the en- 
codings], at least not in a way that changes the 
availability of one alternative encoding relative to 
another for purposes of incorporation into an overt- 
response-producing association [p. 423]. 


It is not at all clear that this conclusion 
is necessitated by the data from the Schulz 
and Martin (1964) study, for it is possible 
that the PDF for encodings is altered 
by stimulus-only familiarization, and this 
change may carry over to encoding during 
PA learning in such a way as to system- 
atically alter the availability of possible 
encodings for incorporation into overt- 
response-producing associations, but still 
not differentially change the rate of associa- 
tion formation. This interpretation is also 
consistent with the Schulz and Martin data. 

In order to differentiate between these 2 
possible interpretations, a more direct test 
is needed of the effects of stimulus-only 
familiarization on the PDF for various 
encodings, and of the type of encoding used 
during PA learning as a function of the 
amount of stimulus-only familiarization, 
Support for Martin's (1968) interpretation 
would be obtained if the data indicated no 
change in the PDF for various encodings 
with increased stimulus-only familiariza- 
tion, and if there was no difference in 
stimulus encoding during PA learning às à 
function of the amount of stimulus-only 
familiarization. 

The present experiment was designed to 
provide this information. First, independ- 
ent groups were familiarized for 1 or 10 
trials on materials differing widely in the 
degree to which they are integrated. High 
m CVC words were selected to represent 
the highly integrated units, and low m 
CCCs were employed as the unintegrated 
units. The PDF for encodings isomorphic 
with letters from Positions 1, 2, or 3 of the 
trigram stimuli was then inferred from the 
false identifications made during a stimulus 
recognition task given immediately. after 
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familiarization. The effect of stimulus 
familiarization on the rate of PA learning 
was then assessed for 3 types of lists. It 
was assumed that if any shift in the PDF 
occurred, it would be toward greater selec- 
tive encoding of the initial position, at least 
for the unintegrated stimuli, On the basis 
of this assumption, 3 types of PA lists 
were constructed and administered to 
independent groups: For one type of list, 
selective encoding of the first-position letter 
would facilitate list acquisition, for another 
it would lead to interference, and for the 
third there should be no effect. These 
conditions were manipulated by varying 
the relationship of the initial letters of the 
stimulus and response terms, inally, à 
stimulated-recall test was administered in 
order to assess whether the type of encoding 
employed during PA learning was a func- 
tion of stimulus-only familiarization. 

Nonrecall familiari n was employed 
in the present study because it does not 
impose task demands with regard to 
stimulus terms which may be in opposition 
to those of the PA task, That is, familiar- 
ization requiring recall would seem to 
demand that individuals not stimulus 
(letter) select, whereas in a. PA task they 
tend to do so with nonintegrated stimulus 
materials (Postman & Greenbloom, 1967). 
The nonrecall procedure would appear to 
introduce no biases since the essence of 
Martin's (1968) conclusions concerning 
familiarization have to do with stimulus 
repetition outside the context of a cued- 
recall situation, and stimulus availability 
in the sense of recallability is undoubtedly 
also correlated with the amount of non- 
recall familiarization. 


METHOD 


Design and subjects. The overall experiment con- 
sisted of a 2X 2X3 factorial design, The respective 
sources of classification were unit integration (high 
vs. low), number of familiari n trials (1 or 10), 
and appropriateness of familiarization encoding 
(same, same re-paired, and different). The appro- 
priateness conditions were formulated primarily for 
the low-integration (low m CCC) stimuli on the 
basis of 2 assumptions, First, it was assumed that 
Ss do selectively encode low m CCCs during non- 
recall stimulus familiarization, and second, that this 
encoding would be in accordance with information 
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theory analyses (Aborn & Rubenstein, 1958; 
Carson, 1961; Garner, 1962). That is, that there 
would be a shift in the encoding PDF away from the 
middle letter, with the predominance of the shift 
being directed to a selective encoding of the initial 
letter. On this basis, selective encoding (first letter) 
was "appropriate" for the same (S) condition, in 
that paired stimulusand response terms had identical 
first letters. It was inappropriate for the same re- 
paired (S,) condition as the items from the S list 
were re-paired. [n the different (D) condition, all 
stimuli and responses began with different letters 
and provided a baseline for evaluating the effects of 
the $ and S, list constructions, 

The Ss were 144 University of Alberta under- 
graduates whose experimental participation was in 
partial fulfillment of an option for a course require- 
ment, and 48 paid volunteers from an introductory 
Psychology summer course. These latter Ss were 
equally divided among all conditions, Since no 
differences were detected between the 2 “sets” of Ss, 
they were grouped together in all analyses presented 
herein, Eight males and 8 females were assigned 
randomly to each of the 12 groups, with the restric- 
tion that each group must have » — 1 Ss prior to the 
assignment of the nth .S to any group. 

Procedure and materials. "The experiment was 
conducted in 4 phases, Phase 1 consisted of 1 or 10 
stimulus familiarization trials on the integrated 
(CVC) or unintegrated (CCC) materials, A trial 
consisted of a 2-sec, Presentation of each of the 5 
stimulus terms. The Ss were instructed to read 
silently each set of 3 letters, as it appeared, but 
were not asked to memorize the list. During Phase 

2, the 4 Phase 1 groups received a stimulus recogni- 
tion task. The recognition list consisted of 4 kinds 
of items: (a) original, and distractors which had 
(b) first, (c) second, or (d) third letters in common 
with the original items. The original items each 
appeared 3 times, and there were 2 examples of each 


Each of the 4 groups was divided 
Sr, and D conditions of Phase 3 PA 
study-test method was employed 
rate of presentation with a 2-sec. 

The Phase 4 stimulated-recall 
to the recognition task employed in Phase 2 except 
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recall task, and to compare these < 
changes assessed in Phase 2 which ;; t 
tained as a result of differential stimul 

tion. Since the form of encoding PDi 

by Martin (1968) to be, in part, a functi 

of learning, and since list criteria re 

widely varying levels of individuali: 

an apparatus was constructed to cont 
variable as carefully as possible, ‘1 
materials were presented on ries 8 

display units programmed by means oí 

reader. A series of switches allowed for 

diate removal and replacement of any 

items at any time. When an S correctly r 

an item on 2 successive tr; , the item w 

from the list and replaced by a pair z 
siderably different classes of materials 
interference, The replacement items werc 
form-2-digit number pairs, Phase 3 in 
noted the possibility of changing item: 
phasized attending to both types of pairs, 
indicated that the instructions were succ. 
that Ss did learn the replacement items, az. - 1 
able to successfully recall them at the conc 
the experimental session. 

Materials. The unintegrated stimulus 
consisted of CCC trigrams having a mea: 
(1935) A value of .33. The integrated sti; 
all 1.00-a-value CVC words (Noble, 1961) 
dike and Lorge AA nouns served as respon 
distractor stimuli had a mean Witmer A 
-38 and a Noble a value of -98 for the low 


a 
stimulus sets, respectively, Since, excluding Y > the 
are only 5 vowels, the vowels were repeated = 11 
CVC distractors. For comparability, the 5 x 11 «1 «dle! 
letters of the original-stimulus CCC terms Aere 
treated in the same manner. Although this n eces 


tated a confounding in the sense that all ty 
distractor items had their second letters in co 
with an original item, no distractor item wz 
to have 2 letters in common with an original ș 
Further, this confounding should have no differe x1tiail 
effect on the assessment of first and third POS ez Ri 
nor should any effect be differentia 1 eon 
Second Position between groups, at least within in cai 
tegration levels, The 5 CCCs take 15 conson a J 
leaving only 4 consonants. ER 


Only these 4 cons xr 
í i ( s SOMA = ad 
served as different letters in the distractors. "cs 


equate for this, only 4 different consonants were 4 
in the CVC distractors, E 


REsuLTS 
Stimulus recognition (Phase 2), X ^R 
primary objective of Phase 2 was to assess 
familiarization-induced changes in the PIDDE. 
or encodings isomorphic with letters resa 
» and 3 of the trigram stima J; 
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Mean proportion of the total number of false recognition errors in Phase 2 attribut- 


able to distractors having letters in Positions 1, 2, or 3 identical to original stimuli for low-integra- 


tion (Figure 1A) and high-integration (Figure 1B) materials. 


frequencies of each type of error.) 


in each of the 3 positions. The numbers 
above the bar graphs in both figures indicate 
the total number of errors given to each 
type of distractor item for each of the 
conditions. 

The total number of errors (false identifi- 
cations) were analyzed using a 2 X 2X 3 
analysis of variance with the factors being 
integration level (high or low), 1 or 10 
familiarization trials, and a within-Ss vari- 
able of first, second-, or third-letter 
identity position. The trend shown by 
the numbers in Figure 1 for a greater overall 
level of number of errors to be made with 
low-integration stimuli was reliable, F (1, 
188) = 86.05, p < .01, as was the trend 
for there to be fewer errors after 10 familiar- 
ization trials, F (1, 188) = 76.52, p < .01. 
The interaction of these 2 variables was 
also reliable, F (1, 188) = 36.68, p < .01. 
The interpretation of the overall false identi- 
fication rates are somewhat ambiguous, as 
it is unclear whether an individual false 
identification represents a genuine recogni- 
tion failure or merely a change in bias for 
emitting same responses as a function of the 
familiarization conditions. However, any 
such change in bias cannot differentially 
affect the within-Ss distribution of errors 
across the 3 positions, and these are the 


(The numbers represent the 


data of primary concern in evaluating the 
PDF for the various letter encodings. Simi- 
larily, interactions involving the within-Ss 
position variable are also unaffected by 
such a potential change in bias. 

The trend for Ss to differentially dis- 
tribute their false recognitions across dis- 
tractor items indicating different types of 
selective encodings was statistically reliable, 
F (2, 376) = 26.82, p < .01. Most im- 
portantly, the distribution of errors de- 
pended upon the level of familiarization, as 
indicated by a significant Familiarization 
Trials X Position interaction, F (1, 188) 
= 36.67, p < .01. Contrary to Martin's 
(1968) conclusion, this interaction indicates 
that there is a change in the PDF for en- 
codings as a function of stimulus-only 
familiarization. However, the analysis also 
indicated a statistically reliable Position X 
Integration Level interaction, / (2, 376) 
— 9.12, p « .01, and because of this inter- 
action the data for high- and low-integra- 
tion stimuli were further analyzed sepa- 
rately. Further, since the primary interest 
in evaluating the false identifications was 
not in the number of errors per se, but 
rather in terms of, given that an error was 
made, what kind of encoding did it repre- 
sent, the remaining analyses are in terms of 
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the proportions of total errors per S$ which 
could be attributed to the 3 position 
encodings. 

As can be seen from the Figure 1A, the 
proportions of false recognitions for the 
unintegrated stimuli for Ss given 1 familiar- 
ization trial are nearly rectilinearily dis- 
tributed over the 3 positions. However, 10 
familiarization trials resulted in à shift in 
the distribution of recognition errors away 
from Position 2 towards Position 1, indi- 
cating a trend for selective encoding of the 
initial letters with increased familiarization. 
The data from those Ss who made at least 
one error were analyzed in terms of a 2 X 3 
analysis of variance, where number of 
familiarization trials and a within-Ss posi- 
tion variable were the factors. In this 
analysis, the overall proportion of errors 
summed across position for each S is of 
necessity 1.00. Consequently, no con- 
sideration could be made of the between-Ss 
aspects of the analysis. On the other hand, 
the position factor and the Position. X 
Number of Familiarization Trials interac- 
tion were of direct concern. Since these 
are both within-Ss comparisons, the lack 
of between-Ss variance can have no effect 
on this portion of the analysis. The 
analysis confirmed the trend for the propor- 
tions of errors to be differentially dis- 
tributed across the 3 positions, F (2, 144) 
= 47.61, 5 < .01. Further, the reliable 
interaction, F (2, 144) = 12.97, P < .01, 
indicated a shift in the encoding of uninte- 
grated stimuli with familiarization training. 
Internal analyses showed that the differ- 
ences between 1 and 10 familiarization trials 
for both Positions 1 and 2 were reliable, 
ts (144) = 3.66 and 3.17, respectively, 
ps < .01. 5 

Contrarily, proportions of recognition 
errors for integrated stimuli indicate an 
initial encoding distribution which is in 
accordance with previous investigations of 
positional information of words (e.g., 
Horowitz, Chilian, & Dunnigan, 1969; 
Horowitz, White, & Atwood, 1968). That 
is, the majority of errors were obtained 
with first- and last-position identity, with 
identity in the middle position resulting 
in but few errors. Interestingly, a shift of 
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the error distribution to first, seco 
in order of decreasing percentages 
resulted when 10 familiarization i: 
given. This indicates a possible i: 
a shift to serial processing of C 
materials after several familiarizati 
However, an analysis identical to 
performed on the low-integration « 
not indicate a reliable interaction 
sequently, it must be concluded a: 
that familiarization training does nc 
the PDF for the positions of high m 
It may also be of some interest to c: 


rials. 
' one 
' did 
Con- 
sent 
{feet 
VCS 
nine 


the number of original-item correct "og- 
nitions. For the high-integration s: uli, 
the means were 12.77 and 14.46 fo: the 
groups given 1 and 10 familiarization i ials, 
respectively. These means for the lcw-1- 
and low-10-trial groups were 10.15 and 
13.52. For all groups SDs ranged :.47- 
2.37. The trend for high-integration stimuli 
to be better recognized was reliable, (1, 
188) = 38.78, p < .01. In addition, the 
trend for the 10-trial task to facilitate 
recognition of low-integrated stimuli io a 
greater extent was also statistically reliable, 
F (1, 188) = 8.70, p <.01. However, this 


interaction was probably the result of a 
ceiling effect for the high group receiving 
10 familiarization trials (38 of 48 Ss showed 
perfect recognition). An analy of the 
total number of same responses emitted 
regardless of whether they were correctly 
given indicates that it is doubtful that 
these findings can be attributed to differ- 
ential bias levels for the 4 groups. A total 
of 678, 718, 876, and 740 same responses 
were given by the high-1, high-10, low-1, 
and low-10 groups, respectively. These 
totals show a trend for the overall number 
of same responses to increase with greater 
familiarization for the high groups and to 
decrease with familiarization for the low 
groups. The interaction was Statistically 
reliable, F (1, 188) = 13.64, p < .01. 
Since. the overall trends for the tendency 
(bias?) to emit same responses are opposite 
to the trends for the same responses which 
indicated correct recognitions, it does not 
appear that the findings regarding the 
correct recognition data need be qualified. 

In Summary, the present data indicate 
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that nonrecall familiarization trials have 
several effects: (u) False recognitions indi- 
cate that after only one familiarization 
trial the dominance of letter encodings are 
in the order of first, last, second for inte- 
grated (CVC word) items, whereas there is 
a nearly rectilinear distribution of codings 
for unintegrated (CCC) stimuli. (b) Higher 
levels of familiarization result in a shift of 
the probability distribution of letter en- 
codings for unintegrated items to first, last, 
second in order of decreasing dominance, 
but have little effect on the probability dis- 
tribution of letter codings for integrated 
items. (c) Higher levels of familiarization 
result in more correct identifications of the 
familiarized items, and this facilitation may 
be more pronounced regarding uninte- 
grated items. 

Paired-associate learning (Phase 3). h 
will be recalled that during PA learning 
each item was only presented until it was 
correctly responded to on 2 successive test 
trials. Consequently, there was a trials- 
to-criterion score for each item. These item 
trials-to-criterion scores were averaged for 
each S yielding a mean trials-to-criterion 
score for each S. The data presented in 
Table 1 and used in all Phase 3 analyses 
were in terms of group averages of 5 mean 
trials to reach criterion. These data were 
analyzed by means of a 2 X 2 X 3 factorial 
design, where the factors were integration 
level, familiarization, and appropriateness, 
respectively. The analysis revealed statis- 
tical reliability for the integration level, 
F (1, 180) = 1244, p < 01, and appro- 
priateness, F (2, 180) = 32.64, p < 01, 
variables, as well as for the Integration 
Level X Appropriateness interaction, F (2, 
180) = 4.86, p < .01.. None of the remain- 
ing tests approached significance (largest 
F21.004). Since the dropout procedure 
was employed, a similar analysis was con- 
ducted which excluded the contribution of 
the last-learned item. This analysis re- 
vealed the same pattern of results except 
that the potential Integration Level X 
Familiarization X Appropriateness inter- 
action was more exaggerated though not 
statistically reliable, F (2, 180) = 2.09. 
Although there is a trend for both the 
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TABLE 1 


PHASE 3 PArRED-ASSOCIATE PERFORMANCE (GROUP 
AVERAGES OF 5 MEAN TRIALS TO CRITERION) ASA 
FUNCTION OF THE RELATIONSHIP BETWEEN THI 


StmuLus AND Resronse Term First 
Lerrers, AMOUNT OF PRIOR STIMULUS 
FAMILIARIZATION, AND STIMULUS INTR- 

GRATION 


Low integration | High integration 


— 


Paired-asociate Amount of Amount of : 
list familiarization familiarization — 
1 10 
Same 244 | 2.65 
Same re-paired | 5,76 5,29 
Different 470 | 4.95 


high- and low-integration-S, conditions to 
be affected by familiarization, neither of | 
these trends was statistically reliable in 
either analyses including all 5 items or ones — 
excluding the last-learned item. Conse- 
quently, it was concluded that the integra- — 
tion level and appropriateness conditions - 
affected learning, but that familiarization 
had no effect on the rate of PA learnings 
These findings are in accordance with the 
results reported by Schulz and Martin - 
(1964) and Martin (1966). Consequently, 
the data from Phases 2 and 3 taken to- 
gether indicate that stimulus-only familiar- 
ization of unintegrated units does affect 
the PDF for letter encodings, and that this 
change in encoding dominance does not 
necessarily lead to an alteration in the ralë 
of PA list acquisition. ^ 
Stimulated recall (Phase 4). The primary — 
objective in analyzing the stimulated-recall — 
data was to assess false recognitions of the * 
distractor stimuli which were followed by 3 
a PA-list response associated with the - 
critical letter of the distractor during ^ 
Phase 3 learning. Within the context of ~ 
Martin's (1968) stimulus encoding vari- : 
ability hypothesis, the critical data are any 
evidence of differences in the distribution. 
of such responses across the various types - 
of distractors as a function of familiariza- — 
tion. Should such differences be realized, 
they would imply that stimulus-only famil- 
iarization does result in a change in the 
PDF for the encodings (as the Phase 2 . 
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PHASE 4 ny THE Low 
Groups wHicH WE 


IN COMMON WITH THE 
DistRACTOR 


1 familiarization trial | 10 familiarization trials 


| 38 


| 
Crit- | | 
Qe | ae — 
letter `, 
Boats Same | pie | | Same | pir 
tion | sens | died ferent | Same | re. | erent 
1 6.44 | 1.19 1.62 1.69 | 56 NE 
2 | .06|..19 | .19.| .12| :06 | .06 
3090 25 | 06 | 25 


| 56 | 


data indicated), and familiarization does 
in fact do so in a way which alters the 
availability of alternative encodings for 
purposes of incorporation into overt-re- 
sponse-producing associations. A finding 
of this type would indicate an effect of 
familiarization on PA learning (presumably 
the stimulus encoding phase) even though 
the overall rate of list acquisition was 
unaffected. 

Those Ss receiving the familiarized inte- 
grated items (CVC words) as stimulus 
terms during the PA task made very few 
recognition errors to the distractor stimuli 
which were followed by a "correct re- 
sponse." Only a total of 20 such errors 
were made by 92 Ss, and these were not 
differentially distributed across conditions. 
It appears then, that stimulus-only famil- 
iarization of the integrated stimuli had 
little effect on the PDF for the various letter 
encodings (Phase 2 data), and there was 
no effect of familiarization on encoding 
during PA learning. This finding is not 
surprising as it would be anticipated that 
high m CVC words are well integrated and 
this integration would be little affected by 
familiarization training. y 

Contrarily, those Ss receiving the unin- 
tegrated stimuli (low m CCCs) made a 
total of 234 false identifications which were 
followed by a PA-list response previously 
associated with an original stimulus item 
having a letter in common with the dis- 
tractor. Further, these responses were 
differentially distributed with respect to 
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the 3 letter positions and the appropriate: 
ness and familiarization conditions. Table 
2 shows the means for these comparisons, 
As is apparent from Table 2, there were 
very few false identifications followed by 
correct responses to any distractor items, 
except those having their first letters im 
common with an original stimulus. Be- 
cause of the exceedingly small number of 
identification errors representing encoding. 
on Positions 2 or 3, only data from the 
Position 1 items were analyzed. A 2 x3 
(Familiarization Level X A; propriateness) 
analysis of variance indicated that prior 
stimulus-only familiariza fected the = 
number of Position 1 false identifications 
followed by a correct response which were 
made after PA learning, F (1, 90) = 41.03,” 
b < .01. an be sec m Table 2 
each of the 10-famili rial groups 
made fewer such e han did their 
respective 1-familiarization-trial 
Since very few Position 2 and 3 items 


rizati 


were falsely identified, this finding indicates 1 


that the 10-familiarization-trial groups 
knew more about the stimulus items than 
did the 1-trial groups, whose knowledge was 


more restricted to the initial letters, Since _ 


all groups had reached similar levels of 
learning during the PA task, this finding 
may reflect a more platykurtic PDF for the 


groups receiving a greater degree of stimu- 


lus-only familiarization, 

The type of PA list also influenced the 
number of Position 1 false identifications 
which were followed by 
F (2,90) = : 
of familiarization differentially affected Ss 

: : FQ 
-01. These findings 
amount of stimu- 
tion did affect the en- 
coding during PA learning. 

A consideration of the responses to the 
original stimuli provides for an assessment 
of recall as a function of recognition per- 
formance. The number of correct identi- 
fications (CIs) regardless of whether they 
were accompanied by the correct PA- 
response terms was compared with the 
number of CIs that were followed by 
elicitations of correct response terms from 


groups. — 
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the PA list (CECR): This within-Ss 
factor was combined with 3 between-Ss 
comparisons of the integration level, famil- 
iarization, and appropriateness variables. 
Overall, the design was a 2% 2 X 3 fac- 
torial arrangement. Table 3 shows these 
conditions and corresponding performance 
levels. Of the between-Ss sources of vari- 
ance, the integration level and appropriate- 
ness factors yielded statistically reliable 
differences, F (1, 180) = 8.02 and F (2, 180) 
=9,03, respectively, ps < .01. The famili- 
arity variable and Integration Level X 
Appropriateness interaction approached re- 
liability, £ (t, 180) = 3.82 and 3.28, 
respectively, ps < 10. The reliable 
within-Ss variable of CI vs. CI-CR, F (1, 
180) = 120.00, p < .01, and CI vs. CI-CR 
X Appropriateness interaction, F (2, 180) 
=26.96, p < -01, indicate that given 
stimulus recognition, Ss still had trouble 
recalling the responses, and the recall 
difficulty depended upon the PA condition. 
No other factors or interactions produced 
Fs greater than unity. 


DISCUSSION 


Taking all phases into account, the following 
conclusions emerge: (a) The error data indicate 
that nonrecall stimulus familiarization of 
unintegrated stimuli does lead to changes in 
the selective encoding of the familiarized items 
from rectilinearity to an ordering consistent 
with information analyses of words (first, last, 
second, in order of decreasing dominance). 
(b) There seems to be little evidence of a shift 
in encoding integrated stimuli as a result of 
familiarization, although any shift would 
appear to be away from first, last, middle, and 
in the direction of serial processing. — (c) 
Familiarization results in an overall increase 
in the number of correct recognitions and à 
decrease in the number of false recognitions 
for both integrated and unintegrated stimuli. 
(d) Although familiarization-induced encoding 
changes do occur, they appear to have little 
effect on the rale of PA learning. (e) A stimu- 
lated-recall task given after PA learning indi- 
cated that although 1 and 10 trials of familiar- 
ization lead to the same rate of obtaining 
correct responses in PA, the type of encoding 
may be quite different for unintegrated stimuli. 
The implications of the present findings for the 


TABLE 3 


N NUMBER OF CORRECT IDENTIFICATIONS (CU 
) MEAN NuMBER OF CIs FOLLOWED BY A Cor- 


N 
PRIOR FAMILIARIZATION TRIALS, AND 
PAIRED-AssOCIATE CONDITION 


'amiliarization ion High integration 
trials and 
paired-associate aE PY 
condition Ct CI-CR cL CI-CR 
H 
Same 12.69 | 12.31 12.88 | 12.00 
Same re-paired | 13.62 812 | 11.19 | 10,62 
Different 13.31 | 10.00 | 11.69 12.00 
10 
Same 13,50 | 13.50 | 13.94 13,38 
Same re-paired | 13.44 9,31 | 12.81 9,69 
Different 13.94 | 10,32 | 12:31 12.81 


stimulus encoding variability hypothesis will 
be considered below. 

The data from Phase 2 indicated that for 
unintegrated stimuli a shift in the encoding 
PDF occurs during nonrecall familiarization, 
even though there would appear to be no 
"reasons" for this to happen. In fact, given 
the simple instructions to read the items, it is 
unlikely that Ss did not suspect à second 
task such as recalling the items they had read. 
Since this type of 5 hypothesis should have 
led to rectilinear PDFs, the reliable shift to 
first-position dominant occurred in spite of 
this potential problem, ‘The finding of an 
altered PDF as a consequence of stimulus-only 
familiarization is clearly contrary to conclu- 
sions reached by Martin (1968). Although it is 
important to note that he was explicitly talking 
about familiarization procedures which altered 
stimulus availability in the sense of recall- 
ability, there can be little doubt that in the 
present study the familiarization variable 
would have produced differences in stimulus 
recall had such a measure been taken. How- 
ever, in order to assess this assumption 
directly, 4 groups of 16 Ss each were admin- 
istered the familiarization task, and then 
asked to recall the items. The obtained recall 
means were 1,50, 3.50, 4.31, and 5.00, for the 
low-1-trial, low-10-trial, high-1-trial, and 
high-10-trial groups, respectively. As can be 
seen, the familiarization task employed herein 
was effective in altering stimulus availability 
in the sense of recallability. Furthermore, the 
means for the recall task directly parallel those 
obtained with the recognition procedure, 
Thus, although it appears counterintuitive, 
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the present findings suggest that mere repeti- 
tion of nonintegrated verbal items results in a 
shift in the PDF for letter encodings toward 
first-position dominance. 
The obtained focusing on the first-position- 
letter encoding for the low-10-trial groups was 
expected to facilitate PA learning for the S 
groups as compared to the respective low-1- 
trial group. That the facilitation was not 
obtained for the low-integrated S comparison 
was probably due to a ceiling effect as further 
evidenced by the lack of difference between 
any of the low- and high-integration S groups 
on the PA learning task. However, equiva- 
lent rates of PA acquisition after varying levels 
of stimulus familiarization should not be 
taken to indicate that familiarization does not 
affect stimulus processing during PA learning, 
as has previously been concluded by Martin 
(1968). The Phase 4 stimulated-recall data 
from the present experiment indicated that the 
low-1-trial S group poorly discriminated be- 
tween original stimuli and distractors which 
had first letters in common with these stimuli. 
Contrarily, the analogous 10-trial group rarely 
made this type of error, but rather performed 
very much like the high-integrated S groups 
` jn that these Ss were not only able to identify 
the original stimuli, but also most often 
correctly identified the distractors and gave a 
response which would have been correct on 
the basis of the first letter. The percentages 
for these contingencies were 23.12, 64.38, 
50.00, and 43.75 for the low-1- and low-10-, 
and high-1- and high-10-S groups, respec- 
tively. As can be seen, the low-1-trial group 
is substantially different from the remaining 
3 groups. This indicates the low-10-trial S 
group knew the relationship between the 
stimuli and responses (first letters identical), 
and also knew something about the entire 
stimulus items. Contrarily, the low-1-trial S 
groups apparently knew little about the 
stimulus items except the first letters, In 
terms of the hypothesis advanced by Martin 
(1968), these data imply that for low-inte- 
grated items the PDF degenerated toward 
a single encoding with increased familiariza- 
lion, and may have become platykurtic more 
rapidly during PA learning than is the case 
with little or no familiarization. Hence, it is 
Suggested that the effect of nonrecall stimulus 
familiarization may be either to speed up the 
encoding phase during PA learning, or to allow 
a considerable portion of this phase to occur 
prior (and transfer) to the PA task. Of these, 


- the latter seems more appealing at present. 
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Although greater fan tion of te- 
grated items was expect ead d 
interference within th« l 
in the opposite direction 
lack of increase in intert: 
occurred as a result of a domin 
encoding leading to a more ra 
of the interference and a compen 
in encoding. However, it seem: 
this type of shift occurs. Williams 
wood (1970) have reported that 
spontaneously shift encodings away fr 
position to avoid interference, at least 
2-list situation. However, the trend fo 
creased interference in the low-integrated 
condition with increased familiarization wo: 
be consistent with the hypothesis that wit 
increased familiarization, the encoding PDI 
may become platykurtic more rapidly during 
a PA task. This would also tend to reduce 
interference due to re-paired stimulus and re- 
sponse first letters. The trend for increased 
familiarization of high-integrated stimuli to 
lead to increased interference is a puzzling one. 

Throughout this report, the emphasis has 
been on the distribution of encodings iso- 
morphic with the letter components. This is, 
of course, an exceedingly limited view. Other 
plausible encodings would include 2-letter 
combinations as well as an encoding of a 
unitized version of the item. Within this 
context, one could consider all of the items 
presented on the stimulus recognition trials 
to constitute tests of encodings. The 3 types 
of distractor items would be assumed to test for 
individual letter selection, or at least letter 
dominance, and the original items as tests of 
stimulus integration encodings. Unfortunately, 
the latter is confounded, as a correct recogni- 
tion of an original item could be based on 
single-letter encodings, 2-letter combination 
encodings, or encodings based on all 3 letters. 
Nevertheless, this confounding can be made 
somewhat less ambiguous by the following 
analysis. If familiarization results in only a 
shift to single-letter encodings based on the 
first position for low-integration materials, 
then one might expect not only an overall in- 
crease in the number of correct recognitions 
of original stimuli (as was obtained) but also an 
increase in the number of false recognitions to 
distractors having first-position letters identical 
to the original stimuli (the opposite was ob- 
tained). Familiarization resulted in an in- 
crease in the proportion of false identifications 
of Position 1 distractors but a decrease in the 
overall number of correct identifications. 


A 


ao 


This finding would be anticipated if the PDF 
for the various encodings was assumedsto shift 
from some initial distribution to a dominance 
of a single encoding, and finally to an inte- 
grated version of the nominal stimulus as 
learning processed during familiarization train- 
ing. If the rate of these encoding changes was 
less than uniform for all items, the higher level 
of familiarization would be expected to result 
in mose correct identifications and fewer 
errors; but given an error, they would be 
primarily those indicating first-posttion ew 
codings, as was obtained. Although the 
Phase 2 data indicated changes in the dis- 
tributions of encodings, the rate of PA acquisi- 
tion obtained in Phase 3 was not a function 
of the amount of prior stimulus-only familiari- 
zation. Nevertheless, the Phase 4 recognition 
data clearly indicate that the shift in the PDF 
for the various encodings resulting from 


* stimulus-only familiarization did transfer to 


the PA task, and in such a way as to alter the 
availability with which the alternative en- 
codings could enter into overt-response-pro- 


ducing associations. 
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PERIPHERAL VISION LOCATION AND KINDS 
OF COMPLEX PROCESSING 
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Processing capacity in the visual periphery was tested with 4 kinds of tasks: 
(a) detection of light onset, (5) recognition of the position of a U-shaped 
figure, (c) identification of a letter, and (d) categorization of 3-letter words. 
These tasks were presented at 2 luminances, 3 sizes, and 3 stimulus durations 
at distances from the fovea (10°, 15°, 25°, and 58°) at which retinal structures 
differ. In the near periphery, 10°, performance under optimal conditions was 
good for all tasks. At 15° information concerning detection, recognition, and 
identification was handled with a high degree of accuracy ; however, the classifi- 
cation task was performed at a level just above chance. The processing 
capacity in the far periphery, 25°, was good only for detection judgments. 
At 58°, extreme periphery, the detection task was the only task performed 
above chance. Of the state variables, size had an important effect at all 
locations, but brightness and stimulus duration had minimal effects. Response 


time measures gave supporting evider 


Conceptions of peripheral vision have 
come from studiés of acuity (e.g., Kerr, 
1971; Low, 1943, 1947; Mandelbaum & 
Sloan, 1947; Wertheim, 1894), visual search 
(e.g., Mackworth & Morandi, 1967; Wil- 
liams, 1966; Yarbus, 1967), and descrip- 
tions of the kinds of processing limits upon 
peripherally received stimulation (Engel, 
1971; Leibowitz, Johnson, & Isabelle, 1972; 
Mackworth, 1965; Menzer & Thurmond, 
1970; Newsome, 1972; Payne, 1967; San- 
ders, 1970). 

The salient fact of peripheral vision has 
been the marked effect on measures of 
acuity. The dominance of the acuity effect 
has inhibited study of other kinds of capa- 
bilities and limits of peripheral vision. 
There have been only a few writers who 
have accepted the acuity facts and con- 
tinued to ask questions of the role of periph- 
eral vision. According to Gibson (1966), 
the periphery functions to guide the fovea : 
“Interesting structures in the array, and 
interesting bits of structure, particularly 
motion, draw the fovea toward them, 
[p. 360]." In this context, stimulation 
reaching the periphery is presumed to be 
processed as a locus of concentration of 
information within a visual display, but the 


1 Requests for reprints should be sent to David C. 
Edwards, Department of Psychology, Old Botany 
Hall, Iowa State University, Ames, Iowa 50010. 


nce for the accuracy of response data. 


information itself is not fully analyzed. 
Mackworth and Morandi (1967) presented 
data that supplement Gibson's notion. 
They found a greater number of fixations in 
the regions of the pictures that contain 
unpredictable contours or unusual informa- 
tion. Thus the amount of information in 
each locus determined the adjustments of 
the eye. Peripheral vision in their account 
processes predictable and redundant infor- 
mation. They further relied on Hubel and 
Wiesel (1965) for physiological evidence to 
suggest that there is a rapid processing in 
the periphery of straight lines and slightly 
curved boundaries. Mackworth and Mor- 
andi summarized the nature of the visual 
mechanism by writing * . . . the periph- 
eral retina therefore quickly screens off the 
predictable features and leaves the fovea 
to process the unpredictable and unusual 
stimuli [1967, p. 551].” 

These notions suggests a processing di- 
chotomy in'the visual system : fovea vs. pe- 
riphery. Yet attempts at empirically map- 
ping the processing ability of the periphery 
do not support that notion. The classical 
acuity functions based on Landolt ring 
discrimination have shown performance to 
decrease rapidly from the fovea to 5? with 
a slower decline in performance over the 
remaining area (e.g, Wertheim, 1894). 
Sanders (1963) offered specifications for 
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PROCESSING CAPACITY IN THE VISUAL PERIPHERY 


functionally different areas within the visual 
feld based on a light discrimination task. 
Following a performance curve? that devi- 
ated from a linear function of visual angle 
at 2 points, at the region of 209-25? from 
the fovea and at 80?, he formulated a tri- 
partite system of the visual field. In the 
stationary field, particularly the area ex- 
tending from the fovea to 20°-25°, incoming 
signals are grouped and processed by foveal 
and peripheral vision. The eye field uses 
eye movements to handle information in 
the area out to 80°. The eye field processes 
in some form between grouping and succes- 
sive handling of signals. The head field 
includes the area beyond 80° and treats 
incoming signals successively using both 
head and eye movements. The importance 
of Sanders' distinction lies not in its delinea- 
tion of specific peripheral fields but rather 
in its theoretical movement away from 
describing the visual system as strictly 
peripheral vs. foveal and in its investigation 
of processing types within the periphery 
itself. Polyak's (1941) careful summary of 
the change in retinal structure through the 
periphery provides another consideration 
for assuming a continuous or at least 
multiple-staged retina. 

The studies described below are an ex- 
ploratory attempt at comparing different 
processing tasks at several locations in the 
periphery. Variables of luminance, size, 
and duration make up a matrix of state 
conditions to attempt to locate effective 
variables for peripheral processing. There 
are 4 studies and each compares 4 tasks at 
2 luminances, 3 stimulus sizes, and 3 ex- 
posure durations. Since the experiments 
differ only in. peripheral location and Ss, 
they are reported together. The results 
can be examined for evidence of the maxi- 
mum processing capacity at each location 
and also of the extent to which state varia- 
bles affect performance. 


METHOD 


A within-Ss factorial design was applied to sepa- 
rate groups of 10 Ss at each of 4 peripheral angles: 


" 2 These measures were taken on a binocular visual 
field whereas most of the Landolt ring studies were 
performed using a monocular field. 
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10? (the foveal edge of the near periphery), 15? (the 
foveal edge of the middle periphery), 25? (the foveal 
edge of the far periphery), and 58? (the foveal edge 
of the extreme periphery). It was assumed that 
these locations would best represent 4 peripheral 
regions distinguished neurologically by Polyak 
(1941), The 4 stimulus tasks used were presumed 
to vary in processing complexity. These were (a) 
detection of a lighted line segment, (b) recognition 
of the position of a U-shaped figure, (c) identifica- 
tion of a letter, and (d) categorization of 3-letter 
words. Three exposure durations, 3 stimulus sizes 
(distances), and 2 stimulus luminances completed 
the design, The tasks were given on successive 
days in the above order to all Ss. 

Apparatus. The visual tasks were presented by 
alphanumeric electroluminescent displays in which 
elements were separately programmed. The light 
detection task consisted of one lighted segment of 
the display measuring 4.75 X 34.90 mm. The 
stimuli of the recognition task were U-shaped figures 
measuring 25.40 X 34.90 mm. and appearing in one 
of 4 positions: upright, inverted, opening to the 
right, and opening to the left. The stimuli of the 
identification task were the letters H, L, O, and X, 
50.8 mm. high and 34.9 mm. wide. The categorizing 
task used 3 adjacent displays to present a 3-letter 
word, 50.8 mm. high and 162.4 mm. wide. The 
words belonged to one of 4 categories; animals, 
part of body, proper names, and utensils. 

The 3 exposure durations were 200, 600, and 1,000 
msec, Stimulus size was varied by placing the dis- 
play at 1.83, 2.74, or 3.66 m. from eye, The 
luminance of the lighted elements was 3.9 cd/m? or 
043 cd/m? upon a black background of .0017 cd/m?, 
measured by Spectra brightness spot meter, model 
UB 13°. 

The fixation point was an illuminated (.48 cd/m?) 
ball, 3.81 mm. in diameter, 150 mm, above the center 
of the display, displaced to one side so as to make 
the experimental angles of 10°, 15°, 25°, or 58°. The 
Ss were seated in an adjustable chair enabling easy 
control over the distance between S's eye, the fixa- 
tion point, and the displays. The chair also provided 
a brace against which S could restrict head move- 
ments. A black eye patch was always placed on the 
right eye, and the displays were presented to the 
temporal visual field of the left eye. 

The S, holding a response box containing 4 but- 
tons, indicated the response by depressing the appro- 
priate button. Th o provided a response time 
measure. An association between the stimulus 
alternatives and the position of the buttons was 
formed during the practice trials for each task. 

Electrooculograms were recorded from Beckman 
Biopotential electrodes 4 mm. in diameter affixed 
to the skin about 5 mm. to the temporal side of each 
eye. The Beckman Dynograph recording provided 
a record assuring that S made no saccades greater 
than 5° at the time of each stimulus presentation. 

Subjects. The Ss were 40 men and women stu- 
dent volunteers who were given credit towards their 
general psychology course grade. They all had 
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TABLE 1 
MEAN NUMBER OF CORRECT RESPONSES 


Visual angles 
Conditions 
10° 15° 25° 58° 

Task 

1 7.99 | 7.33 | 8.00 | 7.98 

2 6.17 | 4.19 | 3.16 | 2.07 

3 6.31 | 4.54 | 4.27 | 2.39 

4 3.32 | 2.15 | 1.93 | 1.63 
Stimulus size 

Large 7.05 | 5.89 | 5.07 | 3.73 

Medium 5.76 | 4.22 | 423 | 3.37 

Small 5.03 | 3.55 | 3.72 | 3.45 
Stimulus bright- 

ness 

.043 cd/m? 5.63 | 4.31 | 4.16 | 3.46 

3.9 cd/m? 6.27 | 4.80 | 4.52 | 3.58 
Stimulus dura- 

tion 

12: Sec. 5.82 | 449 | 423 | 3.46 

.6 sec. 5.98 | 4.50 | 4.35 | 3.59 

1.0 sec. 6.05 | 4.67 | 4.35 | 3.51 


normal vision in their left eye without glasses or 
contact lenses. The Ss were randomly assigned to 
each of the 4 peripherallocations. Apparatus design 
dictated that the 5 Ss run on any 1 day belong to 
the same group. 

: Procedure. Each S was given one of the 4 tasks 
in daily test sessions of 1 hr. Within each 144-trial 
test session, the stimuli were presented a total of 8 
trials at each factorial combination of size, bright- 
ness, and exposure duration. The presentation order 
was counterbalanced across Ss for the size of the 
stimulus (half of the Ss received the sequence at 
distances of 1.83, 2.74, and 3.66 m. and the others 
in the reverse order), and the presentation order of 
each brightness and exposure duration was random 
within each size level, 

"The Ss were visually familiarized with the stimulus 
forms and given at least 15 practice trials with 
feedback. However, in the word categorization 
task, they were given verbal examples of possible 
members of the 4 categories and then given 15 
practice trials with feedback. 

The Ss were instructed to fixate their eye on the 
illuminated ball, focus their attention to the periph- 
ery, and press the appropriate button as soon as 
they detected the stimulus. Any S found by the 
electrooculograph to have moved his eye fixation 
during stimulus exposure was removed from the 
experiment. Feedback as to the correctness of the 
response was not given during the experimental 
session, 


RESULTS 


A separate analysis was made for each 
angle and for each dependent measure: 
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number of correct responses and response ^ 
time. A factorial design was used having 
Ss as the only random factor (3 levels of = 
size X 2 levels of brightness X 3 exposure 
durations X 4 tasks X 10 Ss). The magni- 


tude of the experimental effects was ex- 
pressed as w*, indicating the ratio of that 
variance component to the sum o! the 
variance components of that factor aid its 
error term (see Hays, 1963; and Vaughan & 
Corballis, 1969). The main effects of all 4 
experiments (angles) are presented in Table 
1 (number correct) and Table 2 (response 
time). 

10° visual angle. Performance in the 


near periphery at 10? was high for the 4 
tasks. For the detection task (Task 1), the 


Ss made 100% correct judgment. They 
were 77% correct on the recognition task 
(Task 2), 79% correct on the identification 
task (Task 3), and 4196 correct on the 


classification task (Task 4). The strongest 
of the factors was due to tasks, accounting ~ 
for 48% of the variance, F (3, 27) = 82.54, — 
p «.005, œ? = .48. Performance levels 
among the tasks were in the expected direc- 
tion; however, Tasks 2 and 3 did not differ . — 
significantly, F >1. Tasks differences 
interacted significantly with size, F (6, 54) 

= 14.63, p < .005, w* = .04, and bright- 
ness, F (3, 27) = 19.66, p < .005, w? = .01, 
but these interactions are probably due to 
ceiling effects in Task 1 and possible floor 
effects on the measure in Task 4. 

Of the 3 presentation variables, stimulus 
size (distance) appeared to have the greatest 
effect, F (2, 18) = 70.17, p < .005, «? = .12. 
Brightness was not a strong factor, F (1, 9) 
= 62.28, p < .005, œ? = .02, but it did 
interact with stimulus size, F (2, 18) 
= 16.63, p < .005, w* = .01, so that bright- 
ness had a greater effect at smaller display 
sizes. Finally, although performance in- 
creased with longer exposure durations, 

F (2, 18) = 445, p < .05, wè = 0.00, it 
accounted for a negligible amount of ~ 
variance. E 

The analysis of response times supported 
the findings with number of correct re- 
sponses. The mean response time increased 
with the complexity, judged by Æ, of the — 
processing task: .63, 1.29, 1.63, and 2.36 - 
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sec., F (3, 27) = 40.58, p < .005, w? = .59. 
Again there was no significant difference 
between Tasks 2 and 3, F > 1. Exposure 
duration was positively related to response 
time, F (2, 18) = 16.77, p < .005, w* = .01. 
Levels of brightness, F (1, 9) = 9.23, 
p < .025, e? =.01, and size, F (2, 18) 
= 25.22, p < .005, «è = .04, were inversely 
related to response time, These latter find- 
ings were consistent across the first 3 tasks. 
The fact that response time did not increase 
with a decrease in size or a decrease in inten- 
sity of the display in the classification task 
indicates that the Ss had perhaps lost con- 
fidence in their ability to perform the task. 
That explanation perhaps also accounts for 
the significant Task X Exposure interac- 
tion, F (6, 54) = 7.68, p < .005, «è = 0.00, 
Task X Brightness interaction, F (3, 27) 
= 8.50, p < .005, «* = 0.00, and Task 
X Size interaction, F (6, 54) = 8.51, 
p < .005, w = .03. 

15° visual angle. In the middle periph- 
ery, 15°, accuracy of processing was high 
for the first 3 tasks, 91%, 52%, and 57%, 
but low for the classification task, 27%. 
Some decrement in performance at 15° may 
be attributed to the blind spot. The 
strongest factor was task differences. Per- 
formance was inversely related to task 
complexity, F (3, 27) = 79.79, p < 005, 
w? = .48, although Tasks 2 and 3 did not 
differ, F>1. Size, F (2; 18) = 88.47, 
p < .005, œ = .14, accounted for a greater 
portion of the variance than did brightness, 
F (1,9) = 23.18, p < .005, œ = .01, and 
exposure duration was not a significant 
effect, F (2, 18) = 1.69, Size and task 
again interacted, F (6, 54) = 10.41, 
p < .005, w = .06, as did task and bright- 
ness, F (3, 27) = 3.79, p < .025, w = 0.00. 
Both of those interactions are probably an 
artifact of a ceiling effect on the response 
measure in Task 1 and a floor effect in 
Tasks 3 and 4. The response time data 
were consistent with those from 10°. Mean 
response time increased with task com- 
plexity: .76, 1.56, 1.76, and 2.37 sec., F 
(3, 27) = 23.76, p < .005, a? = .39. Re- 
sponse time was positively related to ex- 
duration, F (2, 18) = 13.85 
.0t; and was inversely re- 


posure 
.p «.005, e? 
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TABLE 2 
Mean RESPONSE TIME 
Visual angles 
Conditions 
10° 15° 25° 58^ 
Task 
1 -63 76 -68 61 
2 1.29 | 1.56 | 1.59 | 1.81 
3 1.63 | 1.76 | 1.75 | 1.96 
4 2.36 | 2.37 | 2.33 | 2.25 
Stimulus size 
Large 1.2 | 1.48 | 1.52 | 1.69 
Medium 145 | 1:67 | 1.61 | 1.67 
Small 1.70 | 1.60 | 1.63 | 1.61 
Stimulus bright- 
ness 
043 cd/m? 1.55 | 1.58 | 1.57 | 1.63 
3.9 cd/m? 141 1.60 | 1,69 
Stimulus dura- 
tion 
;2 sec. 1.38 | 1.49 | 144 | 1.51 
.6 sec. 1.46 | 1.61 1.60 | 1.68 
1.0 sec. 1.60 | 1.75 | 1.72 | 1.78 


lated to size, F (2, 18) = 5.84, p < .025, 
w? = .01. A lack of consistency in effects 
was noted in the response times for Task 4 
as compared to those from the other tasks. 
Again, this was probably due to S's confi- 
dence that he could perform the task. 
That also explains the Size X Task inter- 
action, F (6, 54) = 7.21, p < .005, mi 
= 103, 

25° visual angle. In the far periphery, 
25°, the accuracy of performance levels for 
the 4 tasks was 91%, 40%, 53%, and 24%. 
The strongest factor was again task differ- 
ences, F (3, 27) = 191.52, p < .005, w? 
= .98, although Tasks 2 and 3 did not 
differ, F» 1. There was again a size 
effect, F (2, 18) = 60.6, p < .005, o? = .06, 
a brightness effect, F (1, 9) = 17.66, 
p < .005, «è = .01, a Task X Size inter- 
action, F (6, 54) = 19.26, p < .005, w? 
= .05, and a Task X Brightness interac- 
tion, F (3, 27) = 11.23, p < .005, w? = .01, 
but each of these effects was relatively weak. 

The response time data at 25° were con- 
sistent with those of the other peripheral 
regions. Mean response time increased 
with task complexity: .67, 1.59, 1.74, and 
2.33 sec., F (3, 27) = 61.47, p < .005, o* 
.64. Again response time was found to 


NT eae TR 


MEAN CORRECT RESPONSE 


TASKS 


FiGuRE 1, Selected mean performa 


e to reflect 
the optimal (best) performance on each task at each 
angle; the points are comparable only in that sense. 


be positively related to exposure duration, 
F (2, 18) = 30.07, p < .005, w? = .02, and 
negatively related to the size of the stimulus, 
F (2, 18) = 4.73, p < .025, w* = 0.00. Vari- 
ability was again evident within response 
time in the fourth task, and as before this 
is likely accounted for by chance perfor- 
mance levels of the Ss under all experi- 
mental conditions. As in the more nearly 
foveal groups it explains the interactions of 
task and size, F (6, 54) = 4.95, p < .005, 
o = 01, task and exposure duration, 
F (6, 54) = 6.50, p < .005, w? = 0.00, and 
task and brightness, F (3, 27) = 5.99, p < 
.005, w? = .01. 

58? visual angle. Performance in the 
extreme periphery, 58?, even under optimal 
conditions was low except for simple light 
detection in Task 1. The best perfor- 
mances were 100%, 26%, 31%, and 19%. 
Task, F (3, 27) = 1,222.09, p < .005, w? 
= .99, and size, F (2, 18) = 7.60, p < .005, 
w? = 0.00, were the only significant main 
effects. The Brightness X Exposure Dura- 
tion, F (2, 18) = 5.88, p < .025, «? = 0.00, 
and Size X Task, F (6, 54) = 3.66, 
p < .005, «œ = 0.00, interactions were also 
significant but with very small factor 
strength. These interactions are probably 
influenced by the Task 1 ceiling effect and 
Task 4 floor effect on the measure. Re- 
sponse time showed the usual increase with 
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task complexity: .61, 1.51, 1.96, 2,25 see, 
F (3, 27) = 49.40, p < .005, w? = .53. Ree 
sponse time also increased with exposure 
duration, F (2, 18) = 21.11, p < .005, e 
= .02, and there were minor effects of size, 
F (2, 18) = 4.64, p< .025, e? = 0,00) 
Size X Brightness interaction, F (2, 18) 
= 6.62, p < .01, w = 0.00, Task X Dura- 
tion interaction, F (6 = 4.77, p < .005, 
w = 0.00, Task X E ess interaction, 
F (3, 27) = 6.44, p «! = 0.00, and 
Size X Task interac! (6, 54) = 2.92; 
p < 025, wt = 01. 


Disci 


The ability of the ; to process in- 
formation depends upor sion stimulated, 
the complexity of the ta | the size of the 
stimulus. Figure 1 is z of the pro- 
cessing accuracy at c } peripheral 
regions under the mo ıl values of the 
significant factors in : iments. In the 
near periphery all 4 ty; Í information were d 
processed, given a large, bright, and long- 
duration stimulus. At the middle periphery — 
information concerning detection, recognition, 
and identification were handled at a high de- - 
gree of accuracy, but the more complex task, 
classification of words, was performed only at 
about chance. The processing capacity in the 
far periphery was good only for detection 
judgments. It dealt with information about 
recognition and identification but only at a 
limited degree of accuracy. At 58° in the 
extreme periphery the only task that was 
performed was light detection 

Across the periphery the factor which had 
the strongest effect on peripheral vision, in 
addition to task complexity, was size of the 
stimulus. The effect of size is in accord with 
the known facts of neural anatomy, especially 
the distribution of rods in the periphery and 
their manner of connection to the bipolar and 
ganglionic cells. Unlike the almost one-to-one 
connection with the foveal cones, the bipolar 
cells in the periphery connect with many rods, 
and in turn information from a number of 
bipolar cells is carried to each ganglionic cell. 
Assuming that this simple model of structure 
also applies to function, a given stimulus 
"edge" of adjacent light and dark extents 
would be less likely to be processed in periph- 
eral regions where both components strike 
cells that stimulate the same ganglion. In 
this framework one would expect size to be an 
increasingly important variable as angular 
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deviation increases. That did not occur in 
this experiment, probably because of floor 
effects upon performance at 25° and 58°. 

Brightness had a significant effect on correct 
response performance in all regions except the 
extreme periphery. It was not a strong effect, 
however. That supports Mandelbaum and 
Sloan's (1947) conclusion that "light intensity 
is not as critical a factor in peripheral acuity 
as it is in central acuity [p. 587]." That was 
especially true with the response time measure 
where brightness did not have an effect after 
10°. 

Exposure duration increased the accuracy of 
performance only in the near periphery, but it 
did have a significant effect upon response time. 
As is commonly found, the longer that the Ss 
had to view the stimulus, the longer they took 
to respond. Aside from the near periphery, 
however, exposure duration did not affect the 
level of performance. 

One of the most valuable contributions of 
this study is its demonstration that parts of 
the periphery, specifically the near and middle 
periphery, are capable of performing more 
complex types of processing than the visual 
information theories of Gibson (1966), Mack- 
worth (Mackworth & Morandi, 1967), and 
others usually ascribe to them. Perhaps in- 
stead of talking of the visual system as di- 
chotomous in nature, our notion should be 
extended to 3 or more areas based upon pro- 
cessing capacities with no sharp dividing line 
between them. The first area would be the 
fovea, extending to perhaps 2}°. The second 
area would extend from just outside the fovea 
up to and perhaps through the near periphery 
at 15°. With practice in attending in the 
periphery, an S could here handle information 
leading to judgments of identification or classi- 
fication. The third area would include the 
middle periphery from about 15° to about 25^, 
and this region appears to have a limited 
ability at processing complex information. 
The fourther is the remaining peripheral re- 
gion, which may have only the crude and primi- 
tive processing capacity of which Gibson and 
Mackworth write. 
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Two experiments using the same .5s in one sitting tested the hypothesis that 
the temporal redundancy inherent in rhythmically structured sound sequences 
is used during listening by the perceptual mechanism, Reaction times (RTs) 
were recorded for Ss who responded to the target phoneme /b/ in running 
speech. In the main experiment sentences containing the target phoneme as 
the word initial phoneme in either accented or unaccented syllables were 
presented to listeners. In the control experiment the words containing the 
target phoneme were spliced out of the sentence context and embedded in a 
string of nonsense words and presented to the same Ss, The RTs to the target 
phoneme were faster to accented than to unaccented targets in the sentence 
context, whereas RTs did not differ significantly in the nonsense sequence con- 
text. These results indicate that RT cannot be explained by simple differences 
in articulatory and other characteristics between accented and unaccented 
targets, and that temporal redundancy is a determining factor in RT to targets 
in connected speech. Accented targets are temporally predictable whereas un- 


accented targets are not. 


Consideration of the rhythmic movement 
involved in serial behaviors led Lashley 
(1951) to propose a general timing mecha- 
nism underlying all natural movement 
sequences, Rhythmic action and hier- 
archical motor organization were suggested 
to be related concepts which might provide 
the natural link between the perception and 
production of all integrated movements, 
including speech. The details of such a 
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relationship were not made clear at that 
time, although the need for methods to 
analyze such systems was emphasized 
Recently, Martin (1972) has offered 2 rules 
for the formal description of simple natural 
rhythmic patterns, and has shown how 
rhythmic action has consequences for 
theories of motor and perceptual b< t 
in real time. The general thesis develop: 
by Martin suggests that the organization 
of speech sequences as well as all natural 
movement sequences are constrained by 
relative timing (i.e., rhythm). 

Martin (1972) hypothesizes that speech 
elements are hierarchically organized in a 
coherent, internal temporal structure at the 
sound level. That is, the locus of each 
sound element along the time dimension is 
determined relative to the locus of all other 
elements in the sequence, adjacent and 
nonadjacent. The sequence is naturally 
constrained by the underlying temporal 
organization. An alternative to this view- 
point suggests that speech elements are 
simply ordered in a successive (concate- 
nated) sequence. According to the con- 
catenation view the relationship between 
elements at the sound level is that of 
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ordering, without regard to their structural 
relationships or organization at some other 
level. These alternatives, rhythmic pat- 
terning and concatenation, are related to 
distinct theoretical points of view concern- 
ing real-time behavior, and they have 
distinct implications for the perception 
and production of speech. The production 
of rhythmic speech patterns requires tem- 
poral patterning between adjacent and 
nonadjacent speech sounds, whereas the 
concatenative alternative requires only 
correct temporal sequence. These alter- 
native viewpoints make distinctly different 
predictions about the degree of organization 
in speech sounds and in their perception. 
The rhythmic viewpoint suggests that 
certain elements in a speech sequence 
should be more predictable in time than 
others, whereas the concatenative viewpoint 
would appear to suggest that each speech 
element is equally predictable (or better, 
equally unpredictable) in time. The hy- 
pothesis generated by the rhythmic theory 
will be developed in the following. 
Rhythmic patterns are element sequences 
in which some elements are accented, 
standing out to some degree, whereas other 
elements are not. The main implications 
of the rhythm rules stated by Martin 
(1972) are (a) accent level covaries with 
timing, and (b) main accents remain equi- 
distant. If speech sequences are rhythmic- 
ally patterned, then some of the speech 
elements will be accented relative to the 
other elements. The hierarchical organi- 
zation of these temporal sequences would 
suggest that main accents would fall on the 
potentially most important elements of the 
speech sequence. Therefore, a logically 
consistent hypothesis would suggest that an 
efficient perceptual mechanism would make 
use of the information furnished by the 
prosodic (rhythmic) contours of the speech 
pattern. That is, a rhythmic pattern is 
temporally redundant so that once early 
accents are heard the location of later 
accents (and perhaps the end of the pattern 
as well) can be predicted in time. Such a 
perceptual mechanism would allow for the 
focusing of attention on the elements in the 
speech sequence receiving accent. The 


hypothesis tested in this experiment is that 
the temporal redundancy in the speech 
sequence is used by the perceptual 
mechanism. 

Martin's (1972) theory predicts that the 
listener is capable of anticipating the 
occurrence of accented speech elements on 
the basis of the prosodic contours produced 
by the speaker. This implies that the 
listener's attention will be focused anticipa- 
torily on the accented syllables (i.e., 
accented syllables are targets during per- 
ception), The relationship between ac- 
cented syllables and relative timing is such 
that certain aspects of the spoken signal 
become temporally predictable to the 
listener once he has “locked in" to the 
pattern. Presumably the hearer's targets 
in listening will be the same as the targets 
of the speaker—namely, the accented 
syllables—as these are likely to be the 
most informative segments of the speech 
sequence. That is, the listener's perceptual 
strategy is such that during ongoing speech, 
it is possible for him to anticipate in real 
time the onset of accented targets and also 
the end of the speech sequence. Therefore, 
if he is asked to respond to a given aspect 
of the speech signal, for example, the initial 
phoneme in a syllable, his reaction time 
(RT) should be faster to phoneme targets 
in accented syllables than to phoneme 
targets in unaccented syllables, Since 
unaccented syllables are not temporally 
redundant with respect to the remainder 
of the pattern, they are not expected at any 
particular time. 

Of course, faster RTs might also be pre- 
dicted on the basis that a target phoneme 
in an accented syllable is not only louder, 
etc., but is also more clearly articulated 
than one in an unaccented syllable and 
hence should be perceived more easily. 
Experimental support of this point of view 
is provided by a study by Lieberman (1963). 
In that study experimental words were 
gated out of running speech. Lieberman 
found that words spoken in the context 
of sentences where they were informative 
were more easily recognized than the same 
words spoken in a context where they were 
redundant. If quality of articulation and 
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other characteristics of accented syllables 
themselves are the determining factor in 
RT, faster RTs would be expected to the 
accented target. Therefore, an experi- 
mental test of Martin's (1972) theory must 
demonstrate that the results cannot be 
explained by simple differences in articula- 
tory and other characteristics between 
accented and unaccented targets. 

Two separate experiments were con- 
ducted using the same set of Ss. That is, 
each .S participated first in the main 
experiment and then in the control experi- 
mentinonesitting. In the main experiment 
sentences containing the target phoneme 
/ b/ as the word initial phoneme in accented 
and unaccented syllables were presented 
to listeners. The S was asked to listen for 
the phoneme and respond by pressing a 
button as rapidly as possible. In the 
control experiment the words containing 
the target phonemes had been spliced out 
of the running speech and embedded in a 
string of nonsense words, and S was asked 
to perform the same task. If temporal 
redundancy is a determining factor in RT, 
RTs to the accented targets should be 
faster than RTs to the unaccented targets 
in the sentence context, and to a greater 
extent than in the control experiment. 


METHOD 
Design 


Each S served both in the main and control 
experiments, First he listened to a tape, which 
consisted of 30 sentences, His task was to press a 
button as quickly as possible whenever he heard the 
phoneme /b/ and to write down the /b/ word after 
the sentence ended, For the main (sentence context) 
experiment the design wasa 2 X 3 repeated measure- 
ments design (accented or unaccented target and 
3 phrase locations), For the main vs. control experi 
ment comparison, § listened to 30 sequences, each 
consisting of 8 2-syllable nonsense words, one of 
which was the /b/ word, and performed the same 
task as for the sentences, The design was a 2 x 2 
repeated measurements design (accented or un- 
accented target and nonsense or sentence context), 
There were 6 different tapes, each using the same 
sentence frames and nonsense sequences. All condi- 
tions were represented on each tape, but the particu- 
lar target conditions and words occurred in different 
sentences and nonsense contexts on the different 

tapes. 
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Materials 


Main experiment (sentence context), The experi- 
mental list consisted of 30 sentence frames and 30 
2-syllable nonsense words which were used in the 
sentences in a context where a noun would normally: 
occur, The nonsense word occurred at the beginning, 
middle, or end of the last phrase of each sentence. 
‘These phrases followed a n: ] clause break in the 
sentence and were approx ely 7-9 syllables long. 
Across the 6 tapes the accocnse word occurred 
equally often at the beginn:, ddle, or end of the 
phrase, An example of c ition follows. 

Beginning. They had long led that they were 
totally lost in the maze of side streets, for BENkik 
was a city of alleys. 

Middle. You will hav. 
sightseeing plans, as the plane to 
noon. 

End. First the group hiked along the side of the 
stream, then they forded the BENkik. 


ul any morning 
benKIK leaves at 


Twenty-four of the sentences had nonsense 
words that began with phoneme /b/ (/b/ 
words). The nonsense the remaining 6 
were distractors, The ) were designed so 
that the phoneme /b/ | occur'only in the 
target words, In the « he experiment, it 
was discovered that 2 ices each had 
another word with /b/ ir by," in the clause 
prior to the experimenta! Only a few Ss 
responded to the accide; These responses 
could be easily identific inated from 
the data, Because of t! h nonsense 
words and syllable ac need across 
tapes, the accidental /i v lanced across 
conditions. 

The 6 tapes were œ using the 30 


sentence frames and tar nce each on any 
given tape, Balanced s, each nonsense 
word occurred in all 6 nBons, Across the 6 
experimental tapes each containing a target 
was placed in 3 different sentence frames so that 
each word could occur in each phrase location — 
beginning, middle, and end. In each phrase location 
and sentence the target syllable was either acc ented 
or unaccented, For the exampl 
one tape the word Benkik wou 
the target syllable (i.e, PI ind on another 
tape it would occur in th« context but with the 
target syllable unaccented (i.e, benKIK). Within 
each tape each of the 6 possible conditions (3 phrase 
locations and accented ys. unaccented) occurred 4 
times with /b/ words and once each with distractors. 

An experienced female reader recorded each of the 
6 experimental sentence lists on 1 channel of a 
2-channel tape. The sentences were read with 
natural emphasis and intonation, The appropriate 
accent for the nonsense word was marked on the 
typed material from which she read. The record- 


ven above, on 
* accented on 


* This procedure, by drawing attention to the 
syllable to be accented, led the speaker to give the 
syllable primary accent. Thus it enhances the 
differences between accented and unaccented 


ings were checked for pronunciation and accent by 
Es and those found inadequate were rerecorded, 
The reader was naive as to the theoretical basis for 
the experiment. 

A tone was recorded on the second channel of the 
tapes, starting at the beginning of each phoneme 
target (the tone could not be heard by 5). The tone 
was used to close a relay which was in series with 4 
electronic timers and 4 buttons. ‘The timers started 
when the tone operated the relay. Each timer was 
separately connected to a button and when S 
b: pressed the button the timer was stopped. For the 
purpose of recording the tone on the second channel, 
the beginning of the phoneme target was located by 
moving the tape manually back and forth across 
the playback head. There were probably slight 
variations in the alignment of tone onset and the 
beginning of the nonsense word, but such variations 
were probably randomly distributed. Two copies 
were made of each of the original sentence tapes 
(with tones). One of these copies served as the 
experimental tape in the main experiment, 

Control experiment (nonsense word sequence), ‘The 
nonsense words used as phoneme targets and the 


7 distractor nonsense words were cut out of the second 
copy of the sentence tapes. Thirty sequences of 7 
- 2-syllable nonsense words were recorded by the 


same female reader, These words were then cut 
out, scrambled, and respliced so that there was no 
natural pattern to the sequences. Using a pre- 
determined nonsystematic order, the 24 /b/ words 
and 6 distractors from the sentences were spliced 
into the nonsense sequences, occurring equally often 
as the third, fifth, or seventh item in the sequence, 
which now had 8 items, The tone on the second 
channel was included, so for a given word the timer 
would be started at exactly the same time in both 
. the sentence and the nonsense context, These 
nonsense word sequences were used for the control 
experiment, 


Subjects 


The Ss were 48 students from introductory 
psychology classes at the University of Maryland 
who volunteered to participate for extra credit, 
Eight Ss were assigned to each of the 6 experimental 
tapes. 


Procedure 


The Ss were tested in groups of 1-4, with Zand S 
occupying adjoining rooms. Each S was seated 
out of direct view of the others and so that his arm 
and hand could rest comfortably with the preferred 
hand resting 3 in. in front of the response button. 

General instructions outlining S's task were re- 
corded on the experimental tape and played at the 
beginning of the experimental session. The Ss were 


B a a 
targets, since if the second syllable is strongly 
accented, then (everything else being equal) the 
first (target) syllable naturally receives minimal 
accent according to the rhythm rule (Martin, 1972). 


45 IT UTI PE 
IN CONTINUOUS SPEECH 


told to listen for the sound “b asin boy" and to press 

the button as quickly as possible when they heard it, 
In addition, they were told to write down the word 
after the sentence ended. This instruction insured — 
that S was attending to the sentences, and at the 
same time it provided a check on what he had 
responded to, 

At the end of the taped instructions, E returned. 
to the S room to answer questions, check hand | 
positions, and clarify the instructions, The 5 was 
familiarized with the RT task through a series of 4 
practice sentences, 3 of which contained a /bf 
word. Each S listened first to the 30 sentences and 
afterward to the 30 sequences of nonsense words 
played over earphones 1 comfortable listening — 
level, The E recorded individual RTs for each 
sentence on prepared recording sheets, 1 


ResuLTs 
Main Experiment (Sentence Context) 


Six median RTs were obtained for each 
S, one for each Accent X Phrase Location 
condition. Table 1 shows the mean scores 
for all Ss for these 6 conditions, A Phrase | 
Location X Accent analysis of variance 
revealed that Ss responded faster to 
accented targets than to unaccented targets 
at all phrase locations, (1, 47) = 49.60, 
p € 001. At the 3 locations, Ss responded 
more quickly the later the target appeared 
in the phrase, F (2, 94) = 19.92, p S .001. 
The significant main effects must be 
interpreted with reference to the significant. 
interaction of phrase location and accent, 
F (2, 94) = 8.00, p < .001. Differences 
between pairs of means were tested by the 
Tukey procedure (Winer, 1971). The RTs 
to accented targets were significantly 
faster than RTs to unaccented targets in the 
early and middle phrase locations. For 
accented targets, RTs were significantly 
faster to targets in the middle and late 


TABLE 1 


MkaN REACTION Timm Scores (IN MSEC.) FOR 
Accent X PHRASE Location CONDITIONS 


Phrase location 


Accent — ———— 


Beginning| Middle | End 3 

Accented so | s37 | $4; | 559 

Unaccented 657 640 562 | 619 
| 

x 625 589 854 589 
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TABLE 2 


MEAN Reaction Time Scores (IN MSEC.) FOR 
^ ACCENT X CONTEXT CONDITIONS 


| Context 
Sentence word 
sequence 
Accented | 549 671 610 
Unaeccented | 612 687 650 
£ | 581 679 630 


> 

phrase locations than to targets in carly 

phrase locations. For unaccented targets, 
RTs were significantly faster to targets in 

the late phrase location than to the early 

or middle phrase locations. 


l Control Experiment (Nonsense Sequence) 


_ Four separate median RTs were calcu- 
- Jated for each S for the accented targets 
and unaccented targets in the sentence 
— context and the nonsense sequence context. 
The mean scores for all Ss for these 4 
conditions are shown in Table 2, An 
- analysis of variance showed 5s responded 
- faster to accented targets than to un- 
accented targets, F (1, 47) = 2128, p 
S .001, and faster to targets in the sentence 
context, F (1, 47) = 34.10, p < .001, than 
in the nonsense sequence context. In addi- 
- tion, there was a significant Context and 
- Accent. Conditions interaction, F (1, 47) 
= $,80, p  .025. Tests of the individual 
mean comparisons by the Tukey procedure 
(Winer, 1971) indicated that the sentence 
. context RTs were shorter when the targets 
- were accented than when the targets were 
| unaccented. But when these same targets 
. were spliced out of the sentence context 
and spliced into the nonsense word se- 
quences, the differences in RTs to accented 
and unaccented target phonemes were not 
significant. 


Discussion 


The results obtained in the present study 
Suggest that the listener uses the rhythmic 
ttern of speech to anticipate accented 
in the speech sequence. The most 
nt comparison in support of such an 
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interpretation concerns the target locations dn 
the sentence context. In the sentence context 
the sharp drop in RT to accented targets in the 
middle location as shown in Table 1 is to be 
expected on the theory that some minimal 
temporal context (e.g., 2 beats") must occur 
to establish the rhythm of a sequence. Only 
then do the anticipatory consequences of 
temporal redundancy become useful to the 
listener, That is, the listener's attention is 
focused on the primary accents in the sequence 
and therefore RTs to targets occurring co- 
incident with the primary accent would be 
expected to be faster than to those targets 
which do not receive primary accent. For the 
unaccented syllables, the listener's attention 
will focus more strongly on the accented 
syllable which follows and it should take him 
longer to recognize the target in the un- 
accented syllable. Note that this does not 
imply that the listener is aware of the tempor! 
redundancy or is capable of making deliberat 
use of it, a» the reader can check by listening 
momentarily to someone speaking in the 
hallway. Rather, the argument here i» that 
since syntactic and semantic context (re 
dundancy) is ruled out, the results of the 
experiment can be taken as evidence that che 
postulated temporal redundancy exists in 
natural running speech. 

However, in the light of the above argument, 
it would be predicted that RTs to accented 
targets should improve toward the end of the 
phrase, but why should RTs to unaccented 
targets also improve? On the present th: 
rhythmic patterns, here phrases, are ho! 
internally organized units so that the listener 
can anticipate the end of the anit without a 
terminal cue, Hence, the faster RTs at the 
terminal phrase locations may be due, in part, 
to anticipatory guessing based on rhythmic 
cues, Redundancy due to temporal as well as 
other factors makes the targets more pre- 
dictable toward the end of the sentence. 
Support for this interpretation comes from 
other published work. Several phoneme- 
monitoring experiments (Cairns & Foss, 1971; 
Foss, 1969; Foss & Lynch, 1969; Hakes, 1972; 
Hakes & Cairns, 1970; Hakes & Foss, 1970) 
were run to show that latency differences 
depended upon differential difficulty levels in 
the processing of linguistic inputs, However, 
with the exception of one early experiment 
(Foss, 1969), in all studies where early and late 
phoneme positions were contrasted, RTs to 
the phoneme in the late position were found 
to be faster than to the phoneme in the earlier 
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position, which js in agreement with the results 
obtained in the present study, and at variance 
with the hypothesis that the burden on short- 
term memory increases as processing proceeds 
and thus phoneme RTs should also enc 
In addition, the differences observed in the 
present study cannot be explained by theories 
of linguistie complexity, since the sentence 
contexts were the same for accented and un- 
accented syllables, Therefore, differences in 
monitoring must be due to differences at the 

tic level. These differences must be 
more than differences in articulation of the 
target phoneme since the differences between 
accented and unaccented syllables were non- 
significant in the nonsense sequence context. 
It is consistent with the present results to 
hypothesize that much that has been left 
unexplained by theories of linguistic complexity 
may be clarified on the grounds of temporal 
patterning. 

Concerning the comparison between son- 
tences and nonsense sequence contexts, when 
a word beginning with an accented target 
phoneme was spoken in the rhythmical 
context of the sentence the listener's RT» were 
significantly faster than when the same word 
occurred in a nonsense word sequence, In the 
rhythmic context of the sentence, Sx were able 
to respond more quickly to the accented 
targets than to the unaccented targets. Yet 
when exactly the same acoustic signal was 
spliced out of the sentence context and placed 
among other isolated nonsense words, accented 
and unaccented target phonemes did not pro- 
duce significant differences in RT. It would 
appear that the rhythmie context of the 
sentence increased the predictability of the 
accented target phonemes. The differences 
in RTs when the targets were in the sentence 
context ve the list of nonsense words cannot 
be mo by acoustic differences in. the 
word itself, since the words were spliced out 
ol the sentence and embedded in the nonsense 
word context. While quality of articulation 
may be a factor in KT, the acoustic signal was 


In the nonsense sequence there ix no rhythi 
context the listener can use to anticipa 

accented targets, »o attention cannot be 
focused in advance on the accented t 7 
Hence, the advantage of the accented over 
unaccented targets diminishes and the wi 
predictability results in generally longer R 
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EFFECTS OF BASELINE SELF-REINFORCEMENT BEHAVIOR 
AND TRAINING LEVEL ON POSTTRAINING 
SELF-REINFORCEMENT BEHAVIOR 
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St, John's, Newfoundland, Canada 
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The present investigation evaluated the elfects of baseline self-reinforcement 
scores and level of training on posttraining self-reinforcement behavior. The 
Ss were classified as high, medium, or low self-reinforcers on a pretraining task. 
An equal number of Ss from each baseline grouping were assigned to one of 3 
training levels and trained to 40%, 60%, or 80% criteria on a 4-choice dis- 
crimination learning task. The training phase was followed by a posttraining 
phase during which rates of self-reinforcement behavior were assessed. Results 
showed that training ed baseline self-reinforcement behavior only if level 
of training exceeded baseline self-reinforcement rates. Moreover, the 3 base- 
line groupings differed in the number of correct and incorrect application of 


self-reinforcement scores, 


In recent years, considerable effort has 
been exerted in an attempt to isolate the 
variables affecting self-reinforcement (SR) 
behavior (Bandura & Whalen, 1966; Hay- 
nes & Kanfer, 1971; Skinner, 1953). In- 
vestigators using the directed learning 
paradigm to assess SR behavior have 
frequently concluded that Ss tend to 
match SR rates at which external feed- 
back had been provided during training 
Kanfer & Duerfeldt, 1967; Kanfer & Mars- 
ton, 1963). A closer inspection of the litera- 
ture strongly indicates that such a con- 
clusion is an oversimplification. In studies 
in which the level of training was low (where 
the criterion level was less than 5 correct 
responses in 10 trials on the last block of 
training. trials, or where noncontingent 
reinforcement was administered on fewer 
than 50% of the training trials), SR scores 
during the test phase exceeded training levels 
(Bartol & Duerfeldt, 1970; Kanfer & 
Marston, 1963; Marston, 1969). 

"These data suggest that increases in SR 
behavior will occur only if training rates 
exceed baseline SR scores. One of the 
aims of the current investigation was to 
test the preceding conclusion. 

A second aim of the current investiga- 
tion was to determine whether a sufficiently 


‘Requests for reprints should be sent to A, 
Kozma, Department of Psychology, Memorial 
University of Newfoundland, St. John's, New- 
foundland, Canada. 


low training level would significantly lower 
the number of SR responses of high base- 
line (HB) Ss. In the one relevant study 
reported in the literature (Bartol & Duer 
feldt, 1970), a low training level had no 
effect on the somewhat higher SR base- 
line rate. " 

A third concern of the present investiga- 
tion was to determine whether an SR 
change following training is indiscriminate, 
or whether it involves an increase in the 
accuracy of SR/CSR  (self-reinforcement 
on correct choices) matching. If SR re- 
sponses become highly correlated with cor- 
rect choices as a result of training, SR 
rates may merely reflect the degree of 
certainty S has in the accuracy of his 
choice, rather than reflecting some nebulous 
internal motivating state, 


METHOD 


, Subjects. The Ss were 45 male and 45 female 
introductory psychology students attending Me- 


morial University of Newfoundland. ch S was 
paid $1.00 per half hour of experimental 
participation. 


Apparatus and materials. Sixty 24 X 36 mm. 
slides, each showing a consonant-vowel-consonant 
nonsense syllable in every quadrant, were used as 
task stimuli. The 60 slides constituted 6 copies 
of 10 basic slides. The association values of the 
nonsense syllables were 499%-51% (Archer, 1960). 
The type of syllable, its position on the slide, 
and the order of slide presentation were randomly 
determined. 
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Stimuli were presented with a Kodak Carousel 
800 slide projector and covered an area of 17.8 
X 27.9 cm. on the screen, The 43.2 X 254 X 7.6 
cm. response panel had 4 toggle switches mounted 
along its base on S's side. The switches, num- 
bered 1, 2, 3, and 4, represented the 4 different 
quadrants. of the projected image, A diagram 
above each switch denoted the relevant quadrant 
‘A red SR button was centered above the switches, 
and a white reinforcement light (lamp size 1892) 
activated by the SR button was mounted above it. 
The E's side of the panel was identical to S's 
side, except that 4 differently colored indicator 
lights replaced the 4 toggle switches. ‘To control 
the 4-sec, exposure time, the 4-sec. response in- 
terval, and the 10-sec. interlock interval, BRS 
logic units were used. 

‘Design and procedure, An equal number of HB, 
medium baseline (MB), and low baseline (LB) 
self-reinforcers of both sexes received 40%, 60%, 
or 80% task contingent training, This procedure 
resulted in 18 treatment cells of 5 Ss per cell. 

‘The experiment involved 3 distinct phases— 
baseline, training, and posttraining or test phases. 
Immediately preceding the baseline phase, Ss were 
shown their side of the response panel, They were 
presented with the A-choice discrimination task, 
and instructed to select the correct syllable by 
pressing the switch which corresponded to the 
quadrant in which their choice was located, The 
task was presented as a forced-choice task, and Ss 
were required to give their best choice on every 
trial, even if that choice was only a guess In 
addition, Ss were asked to press the SR button 
whenever they thought that their selection of a 
nonsense syllable had been correct, All Ss received 
3 demonstration trials to familiarize themselves 
with the equipment and the task, 

The 3 demonstration trials were followed by the 
20 trials of the baseline phase, Feedback was 
entirely absent during this phase, and the proba- 
bility of making a correct choice at chance 
level (p = .25). The Ss with SR in the 
upper third of the distribution were labeled HB, 
those with scores in the middle third were labeled 
MB, and those with scores in the lower third were 
labeled LB. The Ss with similar SR scores during 
the baseline phase were randomly assigned to the 
training conditions. Separate distributions of base- 
line SK scores were calculated for males and 
females. 

Immediately following the baseline phase, Ss 
were instructed not to push the SR button, since 
E would turn on the reinforcer light every time 
they made a correct choice. The training phase, 
with feedback for correct choices, was continued 
until Ss reached their criterion level of 4, 6, or 8 
correct choices on a block of 10 trials. 

The training phase was followed by a 20-trial 
test phase, Before this final phase, Ss were once 
more asked to take over the reinforcer role and 
to provide their own reinforcement for correct 
choices by pushing the SR button. 

The Ss' task responses were recorded for each 
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Ficurn 1, Representation of the Baseline Group- 
ing X Training Level interaction, (SR = self- 
reinforcement; HB = high baseline; MB = medium 
baseline; LB = low baseline.) 


phase of the study while, in addition, SR. responses 
were taken during baseline and test phases, 


RESULTS 


The mean number of blocks of trials 
to criterion. during training was 3.9, 64, 
and 8.9, respectively, for 40%, 60%, and 
80% criterion groups. Neither baseline 
grouping nor sex had a differential effect on 
training rate. 

A preliminary analysis of baseline SR 
scores indicated that base rates for males 
in the LB condition were significantly 
higher than for females (p < 01). Within 
each sex group, baseline SR scores did not, 
however, differ for 40%, 60%, and 80% 
training groups. 

SR change. To assess the effectiveness 
of the different training levels in modifying 
SR behavior, SR change from pre- to post- 
training was evaluated. ‘The SR difference 
scores. were analyzed by a 3X3X2 
(Baseline Grouping X Training Level 
X Sex) analysis of variance design. Sig- 


nificant effects of baseline, F (2, 72) 
= 7814, p< .01, training, F (2, 72) 
= 13.03, p «.01, and the Baseline X 
Training interaction, F (4, 72) = 3.98, 


p < .01, were obtained. 
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TABLE 1 


MEAN BASELINE RATIO or SELF-REINFORCEM 


NT (SR) RESPONSES FOLLOWING ( 


orxecr Cuorces (CSR) 


TO Correct Responses (CR) AND or SR RESPON: OWING INCORRECT CHOICES 
(ISR) ro Incorrect Cnoicks (IR), A) XPECTED AND OBTAINE! 
PosrrRAINING SR Scores 
| Training lev 
Baseline | ——— -—— ————— —— ——— 
| 80% | 60% 10% 
<= a E iE =— = : NE 
| E NER SE: [5 SK | c : SR SR 
: SR SR/ | ISR bes o» |CSR/|isR/| SR. e |CSR/| ISR vie D. 
Grouping | score | IR. | dieted | EEA | ME | m | dicted- | tainea | CR. | IR | afad. | tained, 
jd ee Sidera nd pe. = 
High | 98 | .99 | 90 | 98 | 96 | 97 | 91 | 94 | 95 | 597 | 95 | 96 | 98 
| | | | 
Medium | 63 .93 | .32 | 80 | 80* | .90 | 25 | 64 | 66 -67 AD 56 | 58 
| | | | 
Low 13 | iis o2 | 75* | 83 | 40 | 65 | 6i* | st | 42 | |-42* 
1 lu 5 


* p < .05 between SR baseline and posttraining scores, 

An inspection of the treatment means 
contributing to the significant effects 
(Figure 1) suggests that there was little 
change in SR behavior of HB Ss, while 
a substantial increase in SR behavior of 
LB Ss appeared to have occurred at all 
training levels. Other relationships, espe- 
cially the effects of training on MB Ss, 
were less clear. Accordingly, a multiple 
comparison of the relevant treatment 
groups was undertaken by means of the 
Newman-Keuls procedure (Winer, 1962). 

Results of the comparison showed that 
LB Ss increased their SR behavior signifi- 
cantly more than MB and HB Ss after 
40% and 60% training (p < .05). After 
80% training both LB and MB Ss showed 
a significant increase over HB Ss (p < .05), 
with significantly greater changes occur- 
ring in the former than in the latter group 
(b < .05). Since SR difference scores for 
HB Ss do not reflect a significant pre- 
training-posttraining change (b > .10), it 
may be concluded that an increase in SR 
behavior occurred for LB Ss at all levels 
of training and for MB Ss only after 80% 
training. Training did not appear to 
modify SR behavior of HB Ss at any level. 
An inspection of "SR score" and "SR ob- 
tained" columns of Table 1 shows that 
training led to an SR change only when 
it exceeded SR baseline performance. 

SR accuracy. A second concern of the 
current investigation was to determine 
whether increases in SR behavior were 
indiscriminate, or whether they reflected 


an increased accuracy in SR — correct re- 
sponse (CR) matching. To resolve the 
issue, postraining SR responses were di- 
vided into those which followed correct 
choices (CSR) and those which followed 
incorrect choices (ISR). The CSR/CR 
ratio is a good*indicator of omission errors 
(i.e, the failure to provide seli-reinforce- 
ment after making a correct response). 
A second type of error, the tendency to 
provide SR responses after incorrect choices 
(IR) may be represented by the ISR/IR 
ratio. For close SR/CR matching, 
CSR/CR approaches 1, and ISR/IR ap- 
proaches 0. 

Since HB Ss failed to show a significant 
SR change and provided self-reinforcement 
after virtually every response, there ap- 
peared to be little value in including them 
in the analysis. Both CSR/CR and 
ISR/IR would approach 1 (actual values: 
CSR/CR = .98, and  ISR/IR = .92.) 
Training did not seem to modify SR be- 
havior of these Ss. 

Posttraining CSR/CR values for base- 
line groupings that did show an SR change 
with training were analyzed by a 2 X 3 X 2 
(Baseline Grouping X Training Level 
X Sex) analysis of variance design. ‘The 
only significant effects were baseline, 
F (1, 48) = 5.63, p < .01, and training, 
F (2, 48) = 15.53, p.01. The baseline . 
effect indicates that MB Ss had signifi- 
cantly fewer omission errors than LB Ss 
(CSR/CR = .83 and .72, respectively). 

A comparison of training means showed 
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that the 80% and 60% groups did not 
differ from each other, while the 40% 
group differed significantly from both, 
p«.05 (CSR/CR = .87, .86, and .69, 
respectively). There were, thus, signifi- 
cantly more omission errors after 40% 
training than after 60% and 80% training. 

Posttraining ISR/IR values were also 
analyzed by a 2 X 3 X 2 (Baseline Group- 
ing X Training Level X Sex) analysis of 
variance design. None of the effects of 
this analysis were significant (p > .05). 
Although commission errors did occur for 
MB and LB Ss, they occurred considerably 
less frequently than for HB Ss, p < .001 
(ISR/IR = .32, .34, and .92, respectively). 

The CSR/CR and ISR/IR values for 
baseline groupings and training levels, 
together with the number of SR responses 
predicted by these values, and the number 
of SR responses actually obtained, are all 
recorded in Table 1. The table shows that 
CSR/CR and ISR/IR ratios together are 
good predictors of SR behavior following 
training. 


DISCUSSION 


The data on SR changes were consistent 
with the prediction that training modifies SR 
behavior whenever training level exceeds base- 
line SR performance. They clearly illustrate 
that posttraining SR rate is not solely a func- 
tion of training, but is affected by S's initial 
operant SR rate. Results strongly suggest 
that posttraining SR responses reflect the 
amount of confidence Ss have in the accuracy 
of their response. The HB Ss appeared to 
be extremely confident, and they did not 
modify their SR behavior as information 
about accuracy was increased by training. 
The approach lead to very few omissions but 
a large number of commission errors. On the 
other hand, LB Ss, who appeared to be less 
confident, significantly increased their SR 
responses as training provided information 
about the accuracy of their responses. The 
commission errors were low during baseline 
perfermance and did not increase with train- 
ing. The MB group appears to fall some- 
where between the 2 extremes. Like LB Ss, 
they increased the accuracy of their SR re- 
sponses with training, but were not quite as 
cautious and had fewer omission errors. The 
MB Ss were, therefore, the most accurate of 
the 3 baseline groupings. 
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The interpretation of SR responses as an 
expression of confidence in the accuracy of 
one's choice would also appear to account for 
sex differences in baseline SR scores. In a 
male-dominated society such as ours, it may 
be more difficult for males than for females 
to appear indecisive, 

A comparison of current findings with those 
reported by Dorsey, Kanfer, and Duerfeldt 
(1971) points to a major difference in SR. 
behavior if a contingent as opposed to a non- 
contingent feedback procedure is employed 
during training. While both procedures led to 
increases in CSRs with increases in external 
feedback, increases in ISRs with increases in 
external feedback occurred only when the 
noncontingent feedback technique was used 
It would, therefore, appear that Ss will 
provide self-reinforcement even if uncertain 
about the accuracy of a specific response, 
provided that there is a reasonable chance of 
being correct by guessing (under high levels 
of noncontingent feedback). 
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INDUCED CHUNKING: 
TEMPORAL ASPECTS OF STORAGE AND RETRIEVAL? 


CHARLEY McCAULEY anb GEORGE KELLAS 
University of Kansas 

Instructions to 3 treatment groups were to form groupings of ' 
words via imagery, sentence formation, or repetition of unit member \ 
control group was given only typical free-recall instructions. Depe 
measures were employed that allowed the temporal flow of information to be 
monitored during both the storage and retrieval of list items. 1 esults 
indicated that functional higher order units were formed in all instructed con 

ditions and that the use of specific relational organizers differentially 

the amount of encoding time required for chunk processing. A tempor 
sis of retrieval revealed the presence of higher and lower order units in the 
recall sequences. Response times between induced chunk boundaries in- 
creased linearly across output positions in all conditions and were strongly in- 
fluenced by the nature of the encoding strategy. "The results are discussed in 


analy- 


terms of several retrieval models. 


An established tenet of modern theories 
of memory organization is that Ss form 
higher order units to overcome limitations 
in the information-processing system. The 
formation of these units can be based on 
perceptual factors (Bower & Winzenz, 
1969), semantic and/or syntactic relation- 
ships (Mandler, 1967), or other more 
elaborate relational organizers, such as in- 
teractive imagery (Paivio, 1971). How- 
ever, there is little if any evidence concern- 
ing the differential effectiveness of these 
relational devices. 

One question of primary interest con- 
cerns the ease with which units may be 
encoded and their subsequent stability 
during output. By employing measures 
that monitor the temporal flow of informa- 
tion through the processing system, it 
should be possible to examine the processes 
involved in unit encoding and subsequent 
utilization during recall. Recent evidence 
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by Laughery and Schmidt (1970) and 
Kellas and Butterfield (1971) has indicated 
that the profiles of idy times across 
input positions man listinct patterns 


that are related to rehears activity. 
These patternings ari to manipu- 
lations involving rc: s and the 
formation 6f higher 'emory units 
(Kellas, Ashcraft, Johns v Needham, 


in press). It was hvpoihesized in the 
present study that if Ss could be induced 
to use particular relational organizers for 
the construction of h rder units, the 
differential ease of unit encoding should be 
reflected in input exposure times. Using 
similar logic, the patterning of interresponse 
times (IRTs) during output should allow 
an assessment of the extent to which the 
higher order units served as functional 
memory units. Patterson, Meltzer, and 
Mandler (1971), for example, have shown 
that longer pauses occur at unit boundaries 
than between items within a unit during 
recall. Between-group IRTs have been 
interpreted as representative of a search 
strategy employed by Ss to retrieve higher 
order units. Patterson et al. (1971) postu- 
lated that the between-group times are a 
function of 3 components: (a) group exit 
time, (b) group access time, and (c) word 
access time. They suggest that group exit 
time and word access time remain constant 
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Over successive units recalled, and that in- 
creases in between-group times across out- 
put positions are due only to changes in 
group access time. The group access times 
are in turn assumed to reflect an underlying 
search process for previously unretrieved 
units. 

In particular, the exponential growth of 
between-group IRTs across successive out- 
put positions has been related to a proba- 
bilistic search model (Patterson et al., 1971; 
Pollio, Richards, & Lucas, 1969) in which 
S is assumed to hold representations of all 
higher order units in memory. During 
recall, higher order units are randomly 
sampled with replacement and subjected 
to a recognition test to determine if the 
components of a group have previously 
been recalled. The exponential growth 
function actually obtained agrees well with 
expectations derived from the probabilistic 
search model. An alternative fixed-search 
model has also been proposed by Pollio 
et al. Here, it is assumed that S stores a 
fixed sequence of higher order units which 
are systematically retrieved in serial order 
during recall. This model also assumes a 
test phase, in which S checks to determine 
if the retrieved unit has previously been 

‘produced. The major difference in predic- 
tions between the models involves the 
latency function describing between-group 
IRTs across output positions. The proba- 
bilistic model would predict an exponential 
increase, whereas the fixed model is best 
accommodated by a linear increase in 
between-group IRTs resulting from the 
serial component of a fixed search. Exist- 
ing research does not allow a clear choice 
of any single model, since Kellas et al. (in 
press) have recently reported a linear in- 
crease in between-group IRTs. 

In the present study, an attempt was 
made to determine whether temporal pat- 
terning and search processes comparable 
to those obtained with categorized ma- 
terials are involved in the recall of higher 
order units formed intraexperimentally. 
'To the extent that induced chunking does 
lead to the formation of functional higher 
order units, one would expect an interac- 
tion of type of IRT (within vs. between 


group) with output positions, consistent 
with previous research. The interaction 
should result from an increase in between- 
group IRTs acrossoutput positions, coupled 
with little if any increase in within-group 
times. It should then be possible, by ex- 
amining the within-group IRTs, to evalu- 
ate the cohesiveness of the higher order 
units, that is, the shorter the latency be- 
tween responses within a group, the more 
cohesive the memory unit. 

In the present experiment, Ss were in- 
structed to form higher order units via 
imagery, sentences, or rote repetition. An 
uninstructed control group was included to 
provide a baseline condition against which 
the effects of induced chunking could be 
compared. Dependent measures included 
input and output times as well as response 
accuracy in order to obtain a more compre- 
hensive view of organizational processes. 


METHOD 


Subjects, The Ss were 60 undergraduate students 
enrolled in an introductory psychology course who 
were given class credit for participating in the 
experiment. 

Apparatus, A Sawyer Rotomatic slide projector 
was used to present the stimuli on a daylight screen, 
A response button was programmed to the projector 
so that S$ could control the duration of slide pres- 
entation. The latency between the button re- 
sponse and visual presentation was .4 + .05 sec. 
The duration of each stimulus presentation was 
automatically recorded by means of a pulse-stream 
generator, a parallel-entry control panel, and a 
Sodeco printout counter. ‘Times were recorded to 
the nearest .05 sec. The stimuli were typed onto 
transparency paper and fitted into Easymount 35- 
mm. frames. A Lafayette Model 810 tape recorder 
and a lavaliere microphone were used to record Ss! 
oral recall. These taped recalls were later chan- 
neled into a Mechanics for Electronics Model AHA- 
20 heat-writing graphic recorder to measure inter- 
word response times (IRTs). The Ss’ recall acti- 
vated a pen that indicated word onset and offset on 
paper moving at a rate of 25 mm/sec. All IRTs 
were calculated to include only time between words, 
and word-production time was excluded from this 
measure (cf. Pollio et al., 1969). 

Lists. The stimulus materials were 144 nouns 
with high concreteness, meaningfulness, and imagi- 
nal values in the Paivio, Yuille, and Madigan (1968) 
word norms, In addition, all words were rated A 
or AA in frequency (Thorndike & Lorge, 1944). 
From this pool, 12 lists of 12 items each were ran- 
domly constructed with the restriction that no 
obvious associations existed within any list. Lists 
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presentation position for all treatment groups. 


were equated for concreteness (M = 6.75, SD = 
.099), meaningfulness (M = 6.94, SD = .146), and 
imagery value (M. = 6,42, SD 39). 

Design and procedure. The Ss were randomly 
assigned to 1 of 4 instruction conditions, yielding a 
total of 15 Ss in each condition. The Ss in the no 
instruction (NI) condition were treated as a control 
group, and received only standard free-recall in- 
structions. In addition to free Il instructions, 
Ss in the imagery (I), sentence (S), and repetition 
(R) conditions were told to learn the lists by group- 
ing items into 3s and forming an interactive image, 
forming a sentence, or repeating the grouped items 
contiguously, depending upon their treatment con- 
dition. The Ss in the I, S, and R conditions were 
told to expose the first 2 words in each successive 
group of 3 items only long enough to read them and 
to then spend as much time as they wished on the 
third word of each group. All Ss were told that 
item presentation would be self-paced, and that a 
red field would signal the end of cach list and the 
beginning of the recall period, 

The Ss were tested individually and received one 
study and one test trial on each of the 12 lists. 
The first 2 lists were practice lists and were not 
included in subsequent analy The recall period 
was unlimited, and .5s' oral recall was recorded on 
magnetic tape for further analysis, Following recall 
of the final experimental list, all Ss were given a 
2-page questionnaire, On the first page, they were 
asked to describe in their own words the method or 
methods used to learn the items. Page 2 contained 
a list of possible encoding strategies, including 
imagery, sentence formation, repetition, and cumu- 
lative rehearsal, together with appropriate examples. 
The 5s were to indicate on 9-point rating scales the 
extent to which they employed each of the sug- 
gested strategies. 
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The major focus of the pre 
was on the temporal correlates 
tion processing during storage 
trieval. To the extent that per 
approaches error-free respondin 
data should provide a relatively 


of storage and retrieval acti 5 aly- 
ses pertaining to input processi? call 
accuracy, and output processing re- 
sented successively. 

Input time. Figure 1 shows the exp- ure- 
time profiles across presentation pos. ons 
for all treatment conditions. Exp. ure 
times for the first 2 input positions of «ch 


triad are nearly invariant for the instruc ‘ed 
conditions, with a mean of about 2 soc. 
This indicates that Ss in these conditio:s 
were in fact following the instructions given 
them to spend only enough time at these 
positions to read the items. The processing 
time for each triad, as well as any tim: 
spent rehearsing previous triads, is + 
flected by the sharp peaks at the termi: il 
item of each group. 

The mean exposure times for each group 
of 3 items were computed for all conditions. 
Whereas these means reflect differences ob- 
tained only for the terminal items of each 
triad for the instructed conditions, they 
allow comparisons to be made between 
these groups and the NI condition in 
which Ss were free to spend as much time 
as they wished at all presentation positions. 
A 4 X 4 mixed analysis of variance with 
factors corresponding to instructions (NI, 
I, S, and R) and presentation position 
(1-4) was then performed on the mean 
exposure times for the triads. The main 
effects of instructions, (3, 56) — 4.81, 
p < .01, and presentation position, F (3, 
168) = 13.26, p < .01, were reliable, as 
was their interaction, F (9, 168) = 3.35, 
p <.01. Mean total input times for each 
list in the NI, R, I, and S conditions were 
121, 104, 75, and 65 sec, respectively. 
Tukey's (b) multiple comparisons indicated 
that both the NI and R groups spent sig- 
nificantly more time during input than 
either I or S groups (p < .05). The In- 
structions X Presentation Position inter- 
action resulted from the relatively large 


increase in processing time across positions 
for the R and NI groups, as compared 
with relatively constant times in both the 
I and S conditions. The slight decrease 
in processing time on the last triad for all 
groups appeared to be related to Ss’ ten- 
dency to output terminal items first. 

Correct responding. A 4 X 10. mixed 
analysis of variance was conducted on the 
total correct responses per list in order to 
examine instruction effects and to ensure 
that no List X Instruction interaction was 
present. 

The factors corresponded to instructions 
and experimental lists (1-10). The only 
significant finding was a lists main effect, 
F (9, 504) = 5.92, p < .001, which resulted 
from improved recall across lists in all con- 
ditions. Mean percent correct responses 
in the NI, R, S, and I conditions were 84, 
80, 76, and 82, respectively. 

In previously reported research under 
E-paced conditions, imagery and sentence 
instructions have typically led to superior 
performance relative to repetition and un- 
instructed control groups. It has been 
suggested (Paivio, 1971) that this recall 
facilitation may be related to the ease of 
unit encoding. The results of the present 
investigation confirm this hypothesis. It 
is clear that encoding materials in imaginal 
or sentential form greatly decreased the 
amount of processing time required by the 
I and S groups to reach a level of correct 
responding equal to that of the R and NI 
groups. 

A further analysis of the recall data was 
conducted to determine the extent to which 
items grouped during input were recalled 
contiguously. Since the nominal items 
which comprise each higher order unit or 
chunk are known in the instructed condi- 
tions, the mean chunk size was computed 
across output positions for each successive 
chunk recalled. The NI group was not 
included, since for thís condition the precise 
nature of the recall units was unknown. 
Only the first entry into successive chunks 
for instructed conditions was considered. 
Reentries into previously accessed chunks 
were rare and accounted for only 2% of 
the items recalled. A 3 X 4 mixed analysis 
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of variance was conducted on the chunk- 
size data, with factors representing instruc- 
tions (Il, S, and R) and output position 
(1-4). No effect of instructions was ob- 
tained, since a high level of clustering ac- 
cording to triad membership occurred for 
all conditions. (The mean chunk size was 
2.34 items across conditions, with a stan- 
dard deviation of .44.) The analysis 
yielded only a significant main effect of 
output position, / (3, 126) = 89.10, p < 
.001, which resulted from a consistent de- 
crease in chunk size across output positions 
in all treatment groups. This decrease 
suggests that the recall of items within a 
unitized grouping may not be solely de- 
pendent upon the accessibility of the higher 
order unit, but rather is subject to quali- 
fications based on position in output. It 
is conceivable that interference resulting 
from output processes may be more detri- 
mental to the individual components within 
a higher order unit than to the units 
themselves. 

Output time, In order to examine the 
cohesiveness and search rates for higher 
order units during output, IRTs were ob- 
tained from tape recordings of oral recall. 
The IRTs from Ss in the NI condition were 
not included as they were not relevant to 
these issues. Interresponse times involving 
reentries into previously recalled higher 
order units and extralist intrusions were 
excluded from the data and represented 
only 11.2% of the total responses. 

A 3 X 2 3 mixed analysis of variance 
was conducted on the mean IRTs for the 
instructed groups with factors correspond- 
ing to instructions (I, S, and R), type of 
IRT (within or between chunk), and out- 
put position (2-4). Only the last 3 output 
positions were included in this analysis, 
since there is no satisfactory way to mea- 
sure access time to the first word of the first 
unit recalled. The IRT analysis yielded 
reliable main effects for instructions, F 
(2, 39) = 5.67, p < .001, type of IRT, F 
(1, 39) = 91.36, p < .001, and output posi- 
tion, F (2, 27) = 22.57, p < .001. The 
highly significant difference for between- 
and within-chunk IRTs indicates that 
stable higher order units were established 
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F16, 2. Mean within- and between-group inter- 
word response times (IR Ts) as a function of output 
position for the instructed conditions, 


within 


in all instructed groups, The predicted 
Type of IRT X Output Position interac- 
tion was also highly significant, F (2, 78) = 
18.97, p < ,001.. The between-chunk times 
can therefore be viewed as representative 
of the search and retrieval processes for 
specific higher order units, while the within- 
chunk times reflect the output of nominal 
items within higher order units, It is ap- 
parent in Figure 2 that the within-chunk 
IRTs are unaffected by either instructions 
of output position, suggesting that all 
higher order units were equally cohesive. 
The effects of instructions are restricted to 
the between-group IRTs, as evidenced by 
the Instructions X Type of IRT interac- 
tion, F (2, 39) = 5,36, p < .001. While 
the between-chunk times all increased 
linearly across output positions, instruc- 
tions had substantial effects on. the slopes 
of these functions and, presumably, the 
search times for higher order units. 


Discussion 
The patternings of input and output times 


Strongly indicate that higher order units were 
formed during encoding by all instructed 
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groups and that these units re: 
tional during retrieval. Wheres 
correct responding did not differ any 
ment groups, the input-time data ind 
5s who organized list materials 
and sentence structures required less | 
time to perform the encoding oper 
follows that when Ss are limited | 
time constraints, experimental condit 
essarily favor encoding strategies 
image and sentence formation. 

One goal of the present research 
examine the nature of retrieval proce 
higher order units formed intraexperim: 
In general, the IRTs observed were con 
with those reported by Pollio et al. (1969 
Patterson et al, (1971) with categorized |. ts, 
However, a major difference is represented ov 
the shape of the between-group functio is 
across output positions. Data from the e 
previous investigations, in which E-paced lis | 
presentation was employed, were described by | 
an exponential function derived from a proba- 
bilistic retrieval model, The present result: 
are more consistent with a linear increasi: 
function similar to that reported by Kel! « | 


et al. (in press). In both the Kellas et |. 
study and the present investigation, item | 
sentation was S-paced, and the high level. f 
correct responding indicated that all uis 
were highly accessible during the search p! se ; 
of retrieval, A linear function across output | 
positions could be predicted by at lea 2 | 
alternative models of retrieval processes, One 
is the fixed-search model described, but not 
empirically supported, by Pollio et al. A 
linear function could also be predicted by a 
model based on the trace strength of memory 
units, as determined by systematic patternings — 
of rehearsal during input (Kellas et al, in — 
pres). Rundus (1971) has presented data 
which indicate that items afforded the most 
rehearsal during input are typically recalled 
early in the output sequence. Thus, while 
the trace-strength model would also assume | 
a fixed retrieval sequence for higher order 
units, the increase in retrieval times for units 
at later positions would not result from a 
reaccess of the memory set at a fixed position, 
but from an increase in group access times for 
later units, based on their decreasing trace 
strengths, 

Both linear and exponential functions could 
be accommodated by a model that includes a 
parameter related to the accessibility of higher 
order units. The between-group IRT func- 
tion would be linear to the extent that units 


3 


ved late in the output sequence are ac- 

ic and a consistent retrieval plan remains 

» onal. However, if recall breaks down at 

ositions due to inaccessibility of higher 

nits, § may perform an extended search 

‘eve additional units, This search may 

. for example, a repetitive generation 

iously units in an attempt to 

an associative link to unrecalled items, 

ved by a recognition check for list mem- 

‘ip, Consequently, the recall times for 

ater units could be expected to increase 

tially. 

e the between-group IRT functions in 

went study are all linear, differences are 

it in the slopes of the functions, The 

^n repetition instructions were able to 

ve higher order units significantly faster 

- positions than either I or S S». The 

presented by Patterson et al. (1971) 

© that the observed retrieval facilitation 

K condition might best be localized in 

ihe group access component, An examination 

of the input processing times for the instructed 

om suggests an explanation for this facilita- 

Little if any increase in processing time 

voted across input triads in either the 

5 or | conditions, while time spent per triad 

increases nearly linearly in the R condition, 

‘This increase provides suggestive evidence that 

< rehearsing the earlier presented item» 

positions, thereby establishing associa- 

(u among higher order units via cumula- 

hearsal of previously formed triads. 

liu- particular encoding process would lead to 

a decrease in the time required to access addi- 
tional unrecalled units during recall, 

Evidence for the establishment of sequen- 

tial dependencies in the R condition was ob 


- tained from the questionnaires given to all S» 


following the experimental session. The data 
from these questionnaires were not previously 
reported since, for the most part, they simply 
verily a conclusion drawn from the input-time 
were indeed following instruc: 
tions. However, when an analysis was yer 
the degree to which Ss reported use of 
cumulative rehearsal in addition to the par- 
ticular strategy given them, significantly more 
cumulative rehearsal was reported by S» in 
the condition than in either the D or S 
groups. While such a sequential strategy is 
Er Miet MEE ma to. iha en. 
coding. time required, it apparently facilitates 
transitions between units during retrieval. 
The high similarity between the IRT data 
from the present study and research with 


press), suggests a communality in i 
operations, Pollio et al, (1969) have proposed 
that retrieval of categorized material i» best 
conceived as a “reconstruction” process, based. 
on the attribute overlap of nominal items to 
their superordinate label, However, it is dif: 
cult to envision such a process operating on 
higher order units formed intraexperimentally, 
It seems more parsimonious in both cases to — 
view the retrieval of higher and lower order 
units as a function of trace strength, deter- — 
mined by the systematic execution of rehearsal 
processing. 
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STIMULUS SUFFIX EFFECTS IN DICHOTIC MEMORY 
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A very-short-term “preperceptual” or "precategorical" auditory buffer has 
been proposed by Crowder and Morton as the storage system mediating 
the retention of items in the terminal serial positions of long lists presented 
sequentially, and by Broadbent as the storage system mediating retention 
on the ear reported second or “unattended” ear in dichotic tasks. Previ 
research has shown that terminal serial position performance in long lists 
presented monaurally or binaurally is selectively impaired when an extra 
locution (stimulus suffix) is placed at the end of a list, In the present dichotic 
memory research, the addition of either a monaural stimulus suffix on the 


unattended ear or a binaural suffix w 
tended-ear performance. 
these ta: 
memory 


of long lists of sequentially presented items. 


Crowder and Morton (1969) have re- 
cently proposed and extensively investi- 
gated (e.g., Morton, Crowder, & Prussin, 
1971) a model of auditory—verbal informa- 
tion processing. The major emphasis in 
their empirical work has been to demon- 
strate the existence of a very-short-term 
auditory buffer in which information is 
thought to reside prior to the process of 
categorization. The stimuli employed in 
their research have consisted of long lists 
of verbal items presented monaurally or 
binaurally, and the dependent variable 
has been the retention function measured 
immediately following stimulus presenta- 
tion. 

Their model is based on the assumption 
that the primacy and recency effects, ob- 
tained in serial position analyses of im- 
mediate recall functions, are mediated by 
two distinct storage systems. They main- 
tain that the first few items presented are 
categorized and are entered into a short- 
term or active store while the terminal 
items are held in an ephemeral (1-2 sec.) 
precategorical acoustic store (PAS). 


1 This research was supported in part by the U.S. 
Air Force Human Resources Laboratory, monitored 
by the Flying Training Division under Contract 
F41609-72-C-0037. The authors wish to thank 
Mark Knight and Alan Perey for their assistance in 
running Ss in these experiments. 

* Requests for reprints should be sent to Stanley 
R. Parkinson, Department of Psychology, Arizona 
State University, Tempe, Arizona 85281. 


The acoustic quality of PAS is inferred 
from data showing that (a) the recency 
effect in serial position functions is generally 
greater with auditory than with visual 
presentation (e.g., Corballis, 1966; Crow 
der & Morton, 1969) ; (b) recency effects « 
the magnitude of auditory presentati: 
are obtained with visual presentation if 
vocalize the stimuli during presentati 
(e.g; Routh, 1970); (c) an additio: 
acoustic element placed at the end oí a 
stimulus list (stimulus suffix) selectively 
impairs performance over the terminal 
serial positions (Crowder & Morton, 1969; 
Morton et al., 1971); and (d) the stimulus 
suffix is effective only if both stimulus list 
and suffix are presented aurally (Morton 
& Holloway, 1970). 

The precategorical nature of PAS is 
inferred from findings indicating that the 
magnitude of the stimulus suffix effect is 
unaflected by either the frequency of 
occurrence or emotionality of words em- 
ployed in the suffix position, or the se- 
mantic similarity between stimulus list 
and suffix (Morton et al., 1971). 

The model and theoretical constructs 
proposed by Crowder and Morton (1969) 
are very similar to those of Broadbent 
(1971). Working with a dichotic paradigm, 
Broadbent found that when Ss were given 
2 lists of digits simultaneously, one to each 
ear, they were more accurate if they first 
recalled the items from one ear and then 
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shown to selectively impair unat- 
The comparability of stimulus suffix effects in 
s suggests that the storage systems mediating retention in dichotic 
ire identical with the storage systems operating in the retention 


——— pr 


STIMULUS SUFFIX EFFECTS IN DICHOTIC MEMORY 267 


the items from the other (ear-by-ear report) 
than if they reported items by pair (pair- 
by-pair report) according to their order of 
arrival. Broadbent took this finding to 
imply the existence of a stage of parallel 
processing (S or storage system) followed 
by a stage of sequential processing (P or 
perceptual system). According to the 
model, both messages of a dichotic pair 
enter the S system in parallel. A selective 
filter, operating in time between the S and 
P systems, selects one message for entry 
into the P system and attenuates processing 
of the other message. The P system 
processes the items in the selected message 
and that message is reported first during 
recall. The initially unselected message 
is held in the S system and that message 
does not gain access to the perceptual 
system until the completion of first-ear 
processing. Like PAS, the level of in- 
formation processing in the S system is 
preperceptual or precategorical. 

Comparison of the Broadbent (1971) and 
Crowder and Morton (1969) models reveals 
that both distinguish between storage sys- 
tems in which information is relatively un- 
analyzed, preperceptual or precategorical 
(S system and PAS, respectively), and 
storage systems in which information is 
processed or categorized (P system and 
short-term or active store). Although the 
theoretical constructs S system and PAS 
are derived from different experimental 
paradigms and hence remain operationally 
distinct, much theoretical power could be 
gained by establishing a “working equiva- 
lence” between them. 

If the storage systems mediating re- 
tention in dichotic tasks are the same as 
those mediating retention with a single 
auditory channel, we should be able to ac- 
curately predict dichotic performance given 
the single-channel performance. Parkinson 
(1974) assessed conventional single-channel 
digit spans and grouped Ss according to 
that measure in a dichotic memory task. 
He found a correlation of +.891 between 
digit span and overall dichotic perform- 
ance. This relationship lends consider- 
able support to the merit of establishing a 
working equivalence between constructs 


derived from the dichotic task and those 
derived from a task in which long lists of 
items are presented over a single auditory 
channel. 

Another way of evaluating a working 
equivalence between constructs would be 
to show an effect of an independent variable 
in one task that is known to be effective 
in the other task. Precategorical acoustic 
storage is thought to mediate retention of 
items occurring in the terminal serial 
positions of long lists presented monaurally 
or binaurally. Crowder and Morton (1969) 
and Morton et al. (1971) have shown that 
retention of terminal serial position items 
is selectively impaired by an extra acoustic 
locution (e.g., *uh") placed at the end of 
a stimulus list (stimulus suffix). The S 
system of Broadbent (1971) is thought to 
mediate the retention of the “unattended” 
ear or the ear reported second in dichotic 
tasks. If the S system were equivalent to 
PAS, a stimulus suffix placed at the end 
of a dichotic list should selectively im- 
pair unattended-ear performance. It is 
the purpose of the present experiments to 
explore stimulus suffix effects in dichotic 
memory. 

EXPERIMENT | 
Method. 


Subjects and design. Sixteen introductory psychol- 
ogy students served as Ss in the present experiment. 
The Ss were tested individually in a sound reduction 
chamber and they were paid for their participation. 
Morton et al. (1971) found that the magnitude of 
the stimulus suffix effect varied as a function of the 
similarity of “apparent spatial location” between 
suffix and stimulus list. Therefore, suffix location 
was manipulated in the present experiment as a 
between-Ss factor. Individual single-channel digit 
spans were ed prior to the experimental 
sessions (Parkinson, 1974), and each S was then 
assigned to one of 4 groups that determined the 
location of the stimulus suffix to be received: con- 
trol—no suffix; monaural suffix—attended ear; 
monaural suffix—unattended ear; and binaural 
suffix. The method for assigning Ss to groups 
roughly equated all 4 groups for mean digit span. 

Materials and apparatus. On each trial Ss re- 
ceived 2 lists of digits simultaneously, one to each 
ear. All lists were recorded by the same male 
speaker at the rate of 2 pairs per second. Stimulus 
lists were recorded on a. TEAC model TCA-42 
4-channel tape deck and they were played to Ss 
via stereo headphones (Superex Pro-BV). The 
stimulus lists were constructed from the set of 
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numbers 0-10 and no number appeared twice on a 
given trial. 

The stimuli were recorded on 5 audiotapes, one 
practice and 4 test, The practic tape was com- 
prised of 42 dichotic lists, each list consisting of 4 
digits per ear (8 digits per trial). The 4 test 
tapes each contained 72 dichotic lists with 5 
digits per ear (10 digits per trial). One of these 
test tapes was used in the first experimental session 
to establish baseline performance for all 16 Ss in 
the dichotic task. In the remaining 3 test tapes a 
stimulus suffix was placed at the end of each stimulus 
list. A monaural stimulus suffix was placed at the 
end of each list in 2 of these tapes, on the right 
channel for Tape 3 and on the left channel for Tape 
4. On Tape 5, a binaural suffix was placed at the 
end of each stimulus list. In all cases (whether 
monaural or binaural) the stimulus suffix was the 
locution uh." The suffix was recorded in the same 
voice and in the same tempo as the digit list. 

Procedure. According to the Broadbent (1971) 

model, Ss attend to one ear during stimulus 
presentation and report that ear first during recall. 
In previous research (Bryden, 1971; Parkinson, 
1974), the ear of attention has been controlled by 
instructing each S to "attend" to a particular ear 
during the course of an experiment. A payoff 
schedule was introduced in the present experiments 
to meet the same purpose. Each S was told that he 
would receive 1 cent for each digit recalled in the 
correct order for one ear, hereafter referred to as 
attended ear, and $ cent for each digit on the other 
ear, hereafter called unattended ear. The ear 
receiving maximum payment was balanced within 
each group (2 left ear and 2 right ear) and, for each S, 
remained constant during the course of the experi- 
ment. The Ss were told to earn as much money as 
they could, and feedback regarding the amount 
earned was given following S’s response on each 
trial. 

The Ss were instructed to report ear by ear, 
Following a procedure developed by Bryden (1971), 
order of report was manipulated, i.e., on one half 
of the trials Ss were instructed to report attended ear 
first and unattended ear second (A-U report), and 
on the other half of the trials unattended ear first 
and attended ear second (U-A report). Unlike 
the Bryden procedure, in which order of report 
was blocked, report instructions were delayed until 
the end of each stimulus list and the order of report 
was randomized. The ear to be recalled first (left 
or right; for S, attended or unattended) was indi- 
cated by the onset of one of 2 lights at the end of 
each stimulus list. The lights, one red and one green, 
were oriented horizontally on a panel in front of Ss 
to coincide with right and left ears. Tone bursts 
recorded on a third channel of the tape triggered 
the lights to come on immediately following list 
presentation. In the suffix conditions, the indicator 
light was presented immediately following the suffix, 
It was thought that the delayed order-of-report 
instructions coupled with the payoff schedule would 
encourage Ss to differentially process attended- and 
unattended-ear messages, 
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At no time were 5s req It ) items. 
The 5s were instructed to items 
they actually remembered ed ear. 
In order to receive maxim his re- 
port on each trial, S had t report 
instructions as indicated by th report 
digits in correct serial order for « he Ss? 
verbal responses were written dow com- 
pared with mimeographed copie hotic 
lists actually recorded on each of Fol- 
lowing 5's report on each trial, E ass: rrors 
made by S and told him how much ! rned 
for that trial. Although serial recall was 
used in determining payment, free re: ring 
was used in data analysis. 

The Ss from all 3 suffix groups were t« fore 
the second test session that an extra locutio 1h") 
would occur following each list, and the vere 
informed which ear would be receiving the Mix, 
They were told to continue to do what they had 


been doing in the previous session, to ignore «he 
suffix, and not to report it during recall. In he 
second test session the control group repeated iic 
session with the first test tape with headphon:s 
reversed so that attended and unattended messages 
were reversed. 


Results 


Raw scores were converted to perce t- 
age correct recall and separate 2>. 3 
(Between X Within) analyses of varii:ce 
(ANOVAs) were conducted on data ob- 
tained in the first and second test sessions. 
Between-groups factors in these analvses 
were ear of attention and suffix group. 
Within-groups factors were order of report 
(A-U vs. U-A), attention (A vs. U), and 
serial position. Baseline and suffix perform- 
ance is shown in Figures 1-4 for the con- 
trol group, attended-ear suffix group, un- 
attended-ear suffix group, and binaural 
suffix group, respectively, As there was 
no interaction between order of report and 
type of suffix, performance in both sessions 
is shown collapsed over order of report. 

Baseline test. Comparison of the base- 
line performance shown in Figures 1-4 
indicates that prior to experimental treat- 
ment (administration of a stimulus suffix) 
the 4 groups were not different. This was 
confirmed in statistical analysis, as neither 
the main suffix group factor nor any of 
the suffix group interaction factors were 
reliable, all ps > .05. 

As in previous research in which order of 
report was manipulated (Bryden, 1971; 
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Parkinson, 1974), performance on the at- 
tended ear was superior to performance 
on the unattended ear, F (1, 8) — 191.31, 
p < .001, and A-U reports were superior 
to U-A reports, F (1, 8) = 17.64, p < 
.003. Serial position functions for the 
attended ear were relatively flat while 
positively increasing serial position func- 
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tions were obtained for the unattended ear, 
F (4, 32) = 101.52, p < .001. 

Suffix test. Based on the Broadbent 
(1971) model and on the findings of Mor- 
ton et al. (1971), we would predict that 
stimulus suffix effects would be limited to 
the terminal serial positions of the unat- 
tended ear in conditions in which the un- 
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FIGURE 3. Unattended-ear suffix group, Experiment |: Percentage correct 
recall as a function of test (suffix vs. no suffix), ear of attention, and serial 


position. 


attended ear received input subsequent to 
the stimulus list. In the present experi- 
ment this occurred in only the unattended 
suffix and binaural suffix groups. No effect 
should be observed in the attended suffix 
group. 

Inspection of Figures 1-4 reveals that 
stimulus suffix effects were indeed limited 


to the terminal serial position of the un 
attended ear in the unattended suffix an 
binaural suffixk groups. Results of t! 
ANOVA indicated a reliable Suffix Gro: 
X Serial Position interaction, F (12, 
2.41, p < .02, and a Suffix Group 
tention X Serial Position interac d 
(12, 32) = 2.64, p < .01. Neither the 
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main effect of suffix group nor any of the 
other suffix group interactions were re- 
liable. 

Difference scores between baseline and 
suffix conditions were obtained for all Ss 
on the terminal serial position of the un- 
attended ear. As a test of the reliability 
of the suffix effects, £ tests were conducted 
on the difference scores. The £ tests showed 
reliable decrements in the unattended 
suffix group, ¢ (6) = 3.15, p < .02, and the 
binaural suffix group, £ (6) = 3.44, p < .02, 
relative to the control group, but no dif- 
ference between the control and the at- 
tended suffix groups, ¢ (6) < 1. 

The comparability of suffix effects in 
the present experiment with those in the 
Morton et al. (1971) study suggests that 
the storage systems mediating retention in 
dichotic memory are identical with the 
storage systems operating in the retention 
of long lists of sequentially presented 
items. 

The binaural suffix group had the largest 
stimulus suffix effect (mean decrement on 


the terminal serial position = 33.25%); 
however, there was considerable variability 
between Ss in this group with regard to 
the magnitude of the suffix effect. Con- 
sidering only the terminal serial position, 
the suffix effects of the 4 Ss in the binaural 
suffix group were 55%, 50 28%, and 0 
(see Figure 5). Of considerable int 
is the fact that there was an orderly re 
tion between magnitude of suffix effect and 
digit span. The Ss showing the largest 
suffix effects had digit spans of 5 and 7, 
the S with an intermediate suffix effect 
had a digit span of 9, and the S with no 
demonstrable suffix effect had a digit span 
of 11. Experiment Il was conducted in 
order to further delineate the relation be- 
tween magnitude of suffix effect and digit 
spar. 


a- 


EXPERIMENT lI 
Method 
Digit spans were assessed and only individuals 


with a high or low digit span were selected for 
participation as Ss. Sixteen Ss were assigned on 
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the basis of digit span to one of 2 groups: high span 
ot low span. The Ss in the high-span group bad 
-. individual digit spans of 10, 10, 9, 10, 9, 10, 10, 
—— and 10, with a mean of 9.75; Ss in the low-span 
i group had individual spans of 6, 7, 7, 6, 6, 6, 6, and 
— 6, with a mean of 6,25. The Se received the same 
teat sequence (one practice session followed by 2 
_ test sessions) and same instructions as those em- 
" in the first experiment. One half of the Ss 
. m each group received maximum payment for 

right-ear performance and one half for left-ear 

performance, 
In the present experiment, the stimulus suffix 
("ah") was presented binaurally for both groups. 


Results 


As the primary concern in Experiment I 
was the relation between digit span and 
‘stimulus sufix effect magnitude, the data 
were collapsed over order of report 
and separate 2 X 2 (Between X Within) 
ANOVAs were conducted for attended-ear 
and for unattended-ear reports. 
In these ANOVAs, span and ear of atten- 
tion were between-groups factors. while 
test (baseline v». suffix) and serial posi- 
tion were within-groups factors. The 
results for the low-span group and for the 
bighepan group are shown in Figures 6 and 
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SERIAL POSITION 


1 group, Experiment 11: Percentage 
a function of test feft vu. so sufix), ear of attention, and serial position. 


*3 fm * ` r 
AND LORA L. HUBBARD 


low-span group to be a positively increas 
function of serial position. The « 
responding function of the high-span g 
(Figure 7) shows level of accuracy to 
relatively flat over the first 4 serial | 
tions, with somewhat higher perform 

on the terminal item. Statistical ana 
revealed significant main effects of 

F (1, 12) = 157.25, p < .001, and n 
position, F (4, 48) = 22,84, p < .001, ind 
à reliable Span X Serial Position i: er- 
action, F (4, 48) = 301, p <.03. T se 
findings provide confirmation of ear! . 
sults (Parkinson, 1974) showing an 
relation between unattended-ear rec 
curacy and digit span, 

Inspection of Figures 6 and 7 shows | 
the effect of a binaural stimulus suffi 
limited to the terminal serial position, : ! 
statistical analysis revealed the Test (b 
line vs. suffix) X Serial Position in 
action to be reliable, F (4, 48) = 26 
p.001. This result replicates + 
obtained with the Dinaural suffix group. 
Experiment I and provides a clear den: 
stration of a stimulus suffix effect in dich: 
memory, Of further interest is the may 
tude of the stimulus suffix effect, The mou 

ent in the terminal serial posit: i 
(baseline-suffix) for the low-span group 
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aT while that for the h n 
16.04%. The Span X Test X 
Sero 1 voition interaction was reliable, F 
(4,48) = 5.01, p < 08. 
Attented-car recall. The effect of span 
limited to the unattended ear. 
pan group also performed more 
ac than the low-span group on the 
ait car, F (1, 12) = 1.14, > < 008, 
With the greatest disparity between groups 
on the vorminal serial positions, F (4, 48) 
= ANS, p < (004 
here was also an effect of the stimulus 
On the attended car, as indicated by 
X Serial Position interaction, P 
TEES P6 1. Det tb fort was 
restricta to span group, F (1, 12) 
^64, p < 03. The thind-onder inter 
. eon between span, test, amd serial posi- 
ton was not relialde. 
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UNATTENDED EAR 


SERIAL POSITION 


FIGURE 8. 


Group data, Experiment III: Percentage correct recall as 


a function of test (suffix vs. no suffix), ear of attention, and serial position, 


Results 


As in Experiment II, the data were col- 
lapsed over order of report and separate 
ANOVAs were conducted on attended-ear 
and unattended-ear performance. These 
results are shown in Figure 8. 

Altended-ear recall. The only effect 
reliable in attended-ear analyses was the 
main effect of serial position, with primary 
serial positions better recalled than ter- 
minal serial positions, F (5, 30) = 8.81, 
b < .001. 

Unattended-ear recall. Baseline perform- 
ance was superior to that observed when 
a stimulus suffix was administered, F (1, 6) 
= 10.01, p < .02, and terminal serial posi- 
tions were recalled with greater accuracy 
than primary serial positions, F (5, 30) = 
13.49, p < .001. Inspection of Figure 8 
reveals the stimulus suffix effect to be 
restricted to the terminal serial positions, 
and statistical analysis showed the Test x 
Serial Position interaction to be reliable, 
F (5, 30) = 4.90, p < .002. 

Of considerable interest is the resem- 
blance of the unattended-ear serial posi- 
tion function for the high-span Ss in the 
present experiment (with dichotic lists 
containing 6 digits per ear) to the unat- 
tended-ear serial position function of the 
low-span Ss in Experiment II (with dichotic 


lists containing 3 digits per ear). 1 
these show unattended-ear accuracy ‘o be 
a positively increasing function of 
position. There is also considerabl« 
larity in terms of the magnitude : 
stimulus suffix effect in these fur 
The mean decrement on the termin 
position (baseline-suffix) was 28.759; i 
present experiment. This can be compared 
with the 29.16% and 16.04% decrements 
shown by the low-span and high-span 
groups, respectively, in Experiment Il. 
A difference score (baseline-suffix) was 
computed for the terminal serial position 
performance of high-span Ss in Experiment 
II and Experiment III. A correlated / test 
of these differences indicated a reliably 
larger decrement in Experiment III, / (7) 
=J51 po 05. 


GENERAL Discussion 


According to the Broadbent (1971) model, 
attention is directed to one message in the 
dichotic task. That message enters the P or 
perceptual system while the initially unat- 
tended message is held in a short-term audi- 
tory buffer, the S system. In this model 
information is held in different storage systems 
and these systems can be characterized by 
different levels of processing. Information 
held in the S system is relatively unanalyze 


While information in the P system is cate 
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gorized. Given this type of model we would 
expect (a) differences in the recall functions of 
attended and unattended messages, and (b) 
differential effects on attended and unattended 
messages from an independent variable (stim- 
ulus suffix) which is thought to selectively inter- 
fere with relatively unanalyzed information. 
Both of these expectations were realized in the 
present experiments. 

Another finding of considerable interest in 
the present experiments was that the magni- 
tude of the suffix effect was directly pro- 
portional to the amount of stimulus overload, 
ie, the degree of disparity between an in- 
dividual's digit span and the number of to-be- 
remembered items. In Experiment II, given 
messages containing 5 digits per ear, Ss with 
low digit spans showed larger stimulus suffix 
effects than did Ss with high digitspans. That 
it was the disparity between digit span and 
number of items, and not digit span per se, 
which was important, was revealed in Experi- 
ment III, in which Ss with high digit spans 
showed larger stimulus suffix effects when they 
were given dichotic lists containing 6 digits 
per ear. 

How can we interpret the relationshi 
between digit span and suffix effect? First 
consider results obtained by Parkinson (1974). 
He manipulated order of report in dichotic 
memory and found correlations of +.881 and 
+.844 between digit span and unattended- 
ear performance in attended-unattended and 
unattended-attended reports, respectively. 
Parkinson offered two interpretations of his 
results one based on S system storage capacity 
and one based on “central processing capacity." 
Attempts to delineate the relationship between 
digit span and suffix effect magnitude will be 
oriented along these interpretations. 


Storage Capacity 


One interpretation of the relation between 
digit span and unattended-ear performance is 
that S system capacity varies as a function 
of digit span. Accordingly, Ss with high digit 
spans have more "storage bins" in their S 
system than do Ss with lower digit spans. The 
stimulus suffix in the present experiments was 
presented in the same voice and tempo as the 
stimulus list; hence, the relation was one of 
high similarity. Morton et al. (1971) manip- 
ulated similarity between stimulus list and 
suffix along dimensions of intensity, pitch, 
and apparent spatial location. They found 
the magnitude of the stimulus suffix effect to 
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be directly proportional to the degree of simi- 
larity between suffix and stimulus list. Re- 
ports of Ss in the present experiments indicated 
that the degree of similarity between the 
suffix and the stimulus list made it very 
difficult for them to ignore the suffix during 
presentation. One way of viewing this effect 
is that as the similarity between stimulus 
list and suffix increases, Ss find it more dif- 
ficult to separate out or isolate the suffix from 
the stimulus list and the suffix comes to “act” 
like an additional item in the list. Given a 
system of storage bins and a displacement 
principle, the suffix would simply displace 
the last item in the list as the last stimulus 
item displaces the penultimate item. Given 
sufficient storage bins (higher-span Ss), the 
displacement would not affect recall to any 
great extent; but given more items than bins 
(low-span Ss in Experiment II and high-span 
Ss with more items as in Experiment III), 
the effect of a suffix would be to even further 
overload the system and the result would be a 
significant impairment of recall performance. 


Central Processing Capacity 


In some models of information processing, 
attention is used synonymously with central 
processing capacity (CPC) or “mental effort" 
(e.g., Kahneman, 1973; Posner & Boies, 1971). 
Models of this type posit a finite amount of 
CPC that is available for information proces- 
sing. Mental operations requiring CPC share 
a common source; thus, if a large amount of 
CPC is invested in one operation, the amount 
remaining for others is reduced. 

Consider that the present task consists of 2 
components, recall and separation or isolation 
of the suffix from the stimulus list, both of 
which require CPC. The effects on recall 
observed following manipulation of similarity: 
between stimulus list and suffix (Morton et al., 
1971) can be viewed in terms of the amount of 
CPC required by the isolation component. 
In the high-similarity conditions, more CPC 
was required by the isolation process than in 
the low-similarity conditions; therefore the 
amount remaining for recall was insufficient 
to maintain accurate performance. 

In Experiment II of the present research, 
both groups, high and low span, were given 
binaural suffix presentation in the same voice 
and tempo as the stimulus list. We can 
assume from the findings of Morton et al. 
(1971) that some amount of CPC was de- 
manded in the present experiment to separate 
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the stimulus list and suffix. Parkinson (1974) 
hypothesized that the CPC required for recall 
varied in direct proportion to the difference 
between an individual's digit span and the 
number of to-be-remembered items. Assum- 
ing Parkinson’s hypothesis to be correct, 
more CPC was demanded in the recall task 
of Experiment II for the low-span group than 
for the high-span group. Given a condition in 
which substantial CPC is needed for recall, 
any CPC diverted to the separation of stim- 
ulus list and suffix should be sufficient to im- 
pair recall performance. We would expect on 
this basis a greater discrepancy between base- 
line and suffix test performance for the low- 
span group than for the high-span group. We 
would also expect that as CPC requirements 
for recall were increased for the high-span group 
by increasing the number of items (Experi- 
ment III) a greater difference would exist 
between baseline and suffix performance. Both 
of these results were obtained in the present 
experiments. 

The purpose of the present experiments was 
to evaluate the merit of establishing a working 
equivalence between theoretical constructs de- 
rived from dichotic memory and from pro- 
cedures in which long lists of items are pre- 
sented sequentially. The comparability of 
stimulus suffix effects in the present experi- 
ments to those of Morton et al, (1971) sug- 
gests that the storage systems mediating reten- 
tion in dichotic memory are identical with the 
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storage systems operating in the 
sequentially presented items 
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RETROACTIVE INHIBITION OF RHYME CATEGORIES 
IN FREE RECALL: 


INACCESSIBILITY AND UNAVAILABILITY OF INFORMATION ! 


DOUGLAS L. NELSON ? anb DAVID H. BROOKS 
University of South Florida 


This experiment was designed to investigate retroactive inhibition in free recall 
of categorized word lists with categories defined as sets of rhymes, Each S 
learned 1-6 lists prior to being tested for total free recall of all words from all 
lists. Following this test, half of the Ss were given a second total frec-recall 


test and half of the Ss were given a cued-recall test. 


The cues consisted of 


sequences of letters which, when pronounced, rhymed with 4 of the words from 
the list. The results indicated that retroaction effects were evident for the recall 
of words, categories, and for words within recalled categories, Providing re- 


trieval cues eliminated retroaction effects but, 
number of words recalled per category. 


multaneously, depressed the 
list intrusions 


An analysis of exti 


suggested that the loss of instances within recalled categories arose because of 
the difficulty of discriminating between intralist and extralist instances. 


Tulving and Psotka (1971) have shown 
that retroactive inhibition in the free recall 
of categorized word lists is apparently 
produced by the loss of higher order 
memory units rather than instances falling 
within those units. Furthermore, since 
providing category names as retrieval cues 
essentially eliminated retroaction, they 
concluded that this loss reflected the in- 
accessibility rather than the unavailability 
of the higher order units. These findings 
were interpreted as indicating that words 
of a conceptual category are "stored 
together" as a functional unit. Providing 
the name for an apparently forgotten 
memory unit ostensibly provided retrieval 
ss to most of the instances coded 
within it. 

This experiment was concerned with 
whether or not rhyme categories could 
serve as functional units of memory and 
with determining if rhymes could serve as 
effective retrieval cues for recalling in- 
stances coded within these categories. To 
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of South Florida, Tampa, Florida 33620. 


implement these aims, the basic procedures 
of the Tulving and Psotka (1971) study 
were replicated using rhyme-defined cate- 
gories. Findings were expected to be 
relevant to recent conceptualizations of 
word-coding proc Despite the grow- 
ing consensus that words are represented 
in memory as sets of features, attributes 
or dimensions (e.g., Gibson, 1971; Under- 
wood, 1969; Wickens, 1970), very little 
research on organizational processes in free 
recall on other than semantic attributes 
has been reported. Thus, one aim of this 
experiment was to show that the forgetting 
of rhyme categories could be described as 
an analogue of the forgetting of conceptual 
categories, or alternatively, to show that 
memory processes involved in phonetic 
representation are different from those in- 
volved in semantic representation. 

The experiment was designed to provide 
evidence on 2 specific questions. First, 
does the acquisition of other lists reduce 
the recall of higher order units, instances 
within those units, or both? If rhymes 
are organized into functional units during 
the study phase of the task, i.e., if they 
are in some sense stored together in a 
manner analogous to conceptual catego- 
ries, then retrieval of the rhyme category 
ought to mediate the recall of most. of its 
instances. If this supposition is correct, 
retroaction ought to be reflected in the 
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recall of categories, not in the recall of 
instances within categories. | However, 
there are inherent differences between the 
2 types of categories, differences which 
may affect the manner of the processing 
of instances. If rhyme categories are not 
represented in memory as units, then ret- 
roaction may be apparent only within the 
number of words recalled from each list. 
Alternatively, rhyme categories may be 
represented in memory as subjectively 
organized units, but because of their 
unique qualities, recalling the unit or cate- 
gory may be insufficient for mediating 
recall of its instances. If so, then retro- 
action effects would be reflected in cate- 
gory recall and in the recall of words 
within recalled categories. 

The second question of interest con- 
cerned the status of information loss 
produced by the learning of other struc- 
tured lists. Does retroactive inhibition 
reflect the unavailability of information, 
its inaccessibility, or both? Evidence re- 
lating to this question will be provided by 
presentation of rhyme categories as re- 
trieval cues following free recall. If retro- 
action is produced by a loss of information 
from storage within the context of the task, 
rhyme cues should be ineffective as retrieval 
aids. If retroaction is produced only by the 
inaccessibility of information, these re- 
trieval cues might provide access to the 
information so that forgotten items might 
be recovered. Of course, it is possible that 
these cues may be differentially effective at 
the levels of “units” and "instances." Pro- 
viding rhymes as retrieval cues may provide 
access to forgotten categories, but these 
categories, in turn, may be insufficient for 
generating most of the list words occurring 
within these categories. In this cas F 
retroaction would reflect the inaccessibility 
of higher order units and the unavailability 
of instances within those units. 


METHOD 


Design. There were 8 independent groups of Ss 
differing in terms of number of lists learned, and 
whether second total recall of all words from all 
lists was either free or cued. Thus, Ss learned 1, 3. 
or 6 different lists; following the first total free- 
recall test, half of the Ss were given a second total 
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free-recall test and half were given a < 
test. Controls acquired a single list. 

a delay appropriate for the first list lear: 
6 list condi ions and following the first 
recall, half of the controls were given 
total free-recall test and half were give: 
recall test. The controls will be referred 
ie. a one-list delay condition. Thus, t! 
mental design conformed to a 4 X 2 X 
model factorial, the between-Ss [acto 
number of lists learned (ID, 1, 3, or 6 
of recall (free or cued), and the within 
being order of recall (first total recall 
total recall). Since the first total reca 
always free, comparisons of type of re 
point provide information only on the 
bility of the free and cued groups, There! 
this design, meaningful comparisons will b 
only in interactions between type of 1 
order of recall. That is, for a given m 
performance the effects of cued recall 
apparent on the second test and not on 
test. Type of recall was made a betwee 
able because of the significant improv 
performance from the first “total” free- 
to the second “total” free-recall test ol 
Tulving and Psotka (1971). To the « 
performance improves with successive recoil 
some of the improvement obtained in « 
might have been reasonably attributed 
factor. However, because cued recall was 
by free recall, it was also possible with th:s 
to evaluate the effects of cuing as a 
variable. 

Verbal material, Six different 24-word lists we 
constructed with each list containing 6 categori 
of 4 words, Items within each category, all re 
tively common words, consisted of monosyllab 
rhymes taken from Wood's (1971) unabridgé 
New World Rhyming Dictionary. Thus, with 6 li 
and 6 categories/list, there a total of 36 d 
ferent rhyme categories. Within a given categol 
of 4 words, all items were both phonetically à 
visually similar, sharing 2-4 letters in the termim 
positions. According to a tabulation and com 
of words selected from Wood's dictionary thé 
was an average of 15.00 (SD = 4.88) words sharin 
each ending for each of the 36 categories used M 
this experiment. The smallest :ontained 8 item 
and the largest contained 24 iten e 
within a given rhyme category were dilferell 
and were consonants. In addition to these col 
straints, no homophones were used in any of thi 
lists, and an attempt was made to minimize OB 
vious associative relationships among items in d 
lists. Furthermore, an attempt was made Ú 
maximize the number of unique word beginning 
There was a total of 42 different beginnings t% 
the 144 words, counting B . . . as different from 
BRE Nete 

In each list all words of a given rhyme categos 
were presented in immediate succession, im i 
"block," e.g., DITCH, WITCH, STITCH, PITCH, BOOÍ 
LOOT, ROOT, SHOOT, etc. Latin squares were use 
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" to counterbalance the order of appearance of the 
categories within a given list as well as the ordinal 
position of lists for groups acquiring more than 
a single list. The order of the items within a given 
rhyme category was unsystematically randomized 
for each S. 

Procedure. AN Ss participated in individual 
sessions. In immediate recall, the words were 
shown, one at a time, by a Kodak Carousel slide 
projector driven by an auxiliary timer so that each 
word was shown for 1 sec. Every list was shown 

-3 times in succession, each time in a different 
order, before its recall was tested. There was a 
10-sec. interval separating successive inputs. Fol- 
lowing the third presentation, Ss were allowed 
90 sec. to write down as many words as could be 
recalled in any order. Guessing was encouraged 
but not required. The recall sheet contained 24 
unnumbered, horizontal lines arranged into 6 groups 
of 4 lines. Upon completion of the recall phase 
S handed his protocol to E who provided a new 
sheet if there was another list. The next list was 
presented as soon as the recall sheets were exchanged. 

The immediate recall test of the last list was 
followed by 5 min. of interpolated activity con- 
sisting of solving multiplication and long division 
problems, Speed and accuracy were emphasized. 
Since the 6-list condition required 25 min. for the 
immediate recall series, Ss in the ID condition 
engaged in this task for 30 min. so that this in- 
terval was equivalent to 5 lists plus 5 min. of 
interpolation. 

Following the interpolated task, all Ss were 
given a total free-recall test. Each S provided 
with new recall sheets and was instructed to write 
down as many words from all previous lists as he 
could, in any order. Recall time was equal to 
90 sec. multiplied by the number of lists. After 
the protocols were collected, half of the Ss in ach 
condition were asked to provide a second total 
free recall and the other half of the Ss in each 
condition were given instructions for cued recall. 
The latter indicated that all words from all lists 
were to be recalled but that cues would be pro- 
vided on the recall sheets to aid them. They were in- 
formed that these cues would consist of groups 
of letters which, when pronounced, would rhyme 
with 4 of the words presented, e.g., "oor" for 
BOOT, LOOT, etc, Each cued-recall sheet corre- 
sponded to one list arranged so that each rhyme 
cue was followed by 4 lines for recording the re- 
sponses. The order of the category names for a 
given list was randomized and was unrelated to 
order of appearance during input, but the order 
of the lists corresponded to that used at input. 

Subjects, Eighteen Ss were assigned to each of 
the 8 between-Ss treatments, making a total cf 
144 Ss. These Ss were assigned in blocks of 8, 
with 1 S from each condition per block. Assign- 
ment within blocks was determined by a table of 
random numbers. All Ss were selected from in- 
troductory psychology courses and received points 
toward their grades for participation. Each of 3 
female És ran one-third of the Ss in each treat- 


ment so that E was balanced with respect to all 
conditions, 


RESULTS 


Separate analyses of variance were per- 
formed on immediate recall within each 
list condition, including a separate break- 
down for type of recall. All of these 
analyses showed that type of recall was 
insignificant, attesting to the compara- 
bility of the free- and cued-recall groups 
on immediate recall. The mean number 
of words recalled per list, with the data 
pooled across all lists in the 8 groups, 
was 13.86. All deviations from this value, 
for all lists and conditions of recall, were 
less than one word. Similar analyses of 
variance were performed on categories and 
on words/category. For both of these 
measures, all analyses indicated that type 
of recall was insignificant and that recall 
per list was constant. The mean number 
of categories per list represented in im- 
mediate recall was 4.95, and the mean 
number of words/category was 2.80. 

The results of a clustering analysis of 
the immediate recall data were consistent 
with the supposition that rhyme categories 
can serve as functional units of memory. 
Ratios of number of items recalled in 
clusters to total recall were calculated 
with a cluster defined as at least 2- ad- 
jacently recalled rhyming words (Marshall, 
1967). These values indicated that nearly 
all of the words were recalled in rhyme 
groupings, and that this tendency was not 
affected by amount of practice. For 
example, for Ss learning 6 lists, 89%, of 
the words recalled from the first list were 
recalled in categories and 86% of the 
words recalled from the sixth list were 
recalled in categories. To the extent that 
Ss attempted to recall the list in terms 
of order of input, these values would tend 
to overestimate the degree of clustering. 
Nevertheless, each list was presented in 3 
different input orders prior to immediate 
recall, a procedure that should have dis- 
couraged the utilization of a seriation 
strategy. 

One of the principal findings of this 
experiment was that the words/category 
measure, in addition to the word and 
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TABLE 1 

Mean Finsr-Lisr RECALL or Worps, CATEGORIES, 
AND Worps/CATEGORY FOR THE Frrst TOTAL 

FREE-RECALL TEST 
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was increased from 0 to 2 to 5. 

more, performance on ID fell betw 
obtained for the 1- and 3-list cc; 
However, even though the passag 


| Number of lists resulted in some apparent forget! the 
Measure -— magnitude of this loss was smal! iive 
Das-1 nent 6 to that produced by retroaction. | 
Words 11.00 | 12.58 | 6.36 | 450 Table 2 presents the mean nu: oer of 
Categories 4.11 4.56 | 2.53 | 1.80 words recalled per list on the first id on; 
Words/category 2.56 | 2.13 | 255 | 2.1 the second total recall tests. irate 
EEUU e pae iar diay condition possel analyses were performed for each cong 
to control Ss, dition with type of recall as a bc nss 
factor and with order of recall ano when 
category recall measures, varied as a func- appropriate, with list position as: cated 
tion of the number of interpolated lists. measurements variables. Since ncu 
Table 1 displays mean recall of words, sions reached by within-Ss com; isons 
categories, and words/category from the were identical to those reached bed 


first list learned as a function of number 
of acquired lists. These data were ob- 
tained from the first total free-recall test. 


tween-Ss comparisons, only the la M 
be discussed. These analyses all 1 vale 


significant Type of Recall X Orde: Re- 
Separate analyses of variance indicated call interactions: ope ID, 1-, and ^ 
that number of lists learned was a reliable — 6-list conditions, the obtained Fs! this 
source of variance for words, F (3, 140) interaction were, respectively, F 34). 
= 34.86, p < .01, categories, F (3, 140) = 12.83, 8.23, 29.56, and 63.10, ps 01. 
= 40.37, p « .01, and words/category, Each of these interactions indica that 
F (3, 140) = 3.03, p < .05. Fishersleast the facilitating effects of cuing wee obs. 


significant difference (LSD) values for each 
of these respective measures were 1.79, .57, 
and ,38. Thus, the results of these anal. 
yses confirm the obvious trends displayed 
in the table, Significant and increasing 
amounts of retroaction on the initial list 
were obtained, for every performance mea- 
sure, as the number of interpolated lists 


TABLE 
MEAN Nuwnk& Or Wonps RECALLED ON THE F 


tained for only the second total reca! test, 
and hence, these analyses attested io the! 
comparability of the free- and cued recall! 
groups on the first total recall test. The 
sizable amounts of retroactive inhibition w 
obtained in first total recall for both free 
and cued groups and in second total recall” 
for the free group were completely climi- 


rM "IRST AND SECOND Toran RECALL TksTs 
Number | First total recall-test ordered lists | Second total recall-test ordered lists 2 
TA. | [S ERES PS REN: ET 
ghe i pill A ko | ex 1 2 3 4 5 6 ~ 
1D | | | i EC 
Free | 11.22 
4 oe | 11.22 | 11.50 | 50 
į Ced | 1078 | | 10.78 | 13:40 | 140 
Free | 12,50] | 1 | 
ree | | 2.50 | 13.00 3,00 - 
Cued {12.67 | Í | 12.67 15.56 | 1556 
| 
Free | 722| 9.17 | 12.33| 9.57 5 
] 7 | 57 | 844 | 10.11 | 13.06 10.54 
$ Cued | 5.50 | 8.17 Ina | 8.00 | 11.94 | 12.39 | 12.61 1231 | 
Free | 4.601| 622 | 7.50| 8.67 | 8.56 
"ree. | 67| 8.56 | 10.83] 7.73| 4,99| 6.98] 7.11 58 
en Mesa p 391 i ! 2 i $.00| 8.61| 898| 7. 
u pes | 6.61 | 8.28) 6:72 | 8.33 10.11 | 7.40 | 12.44 | 10.61 | 11,61 | 12.39 | 11.89 | 10.50 11.57 
* The abbreviation ID = one-list r ES 


delay condition presented to control Ss, 
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nated within a given list condition by 
providing higher order rhyme cues. In 
the 6-list condition the facilitating effects 
of cuing were greater on earlier lists. The 
Ss in the free-recall group recalled an 
average of 4.89 items from the first list 
and those in the cued-recall group recalled 
an average of 12.44 items from this list, 
a difference of 7.55 items. This difference 
was gradually reduced, because of de- 
sing amounts of retroaction in the 
free-recall groups, until eliminated on the 
sixth list. The effect was reflected in a 
reliable Type of Recall X Order of Recall 
X List Position interaction, Æ (5, 170) 
= 3.31, p <.01. Fisher's LSD value, 
based on the pooled between-within error 
term, was 2,60, It should be noted that 
although cuing eliminated — retroaction 
within a given list condition, comparison 
of the column means for cued recall in- 
Heated. that retroactive effects were still 

it Whereas Tulving and Psotka 

found that cuing restored recall to 
its inal levels, the levels attained in 
this periment were still below those of 
immediate recall, 

Since the categories measure showed es- 
sentially the same pattern of results as 
the words measure, a table parallel to 
Table 2 was not presented, Instead, for 
all 3 measures, performance was pooled 
across list position, and the data were 
analyzed with number of lists and type 
of recall as between-Ss factors and with 
order of recall as a within-Ss factor, The 
results of these separate analyses indicated 
that the Type of Recall X Order of Recall 
interactions were significant for words, 
F (1, 136) = 95.80, categories, / (1, 136) 
= 199.58, and words/category, F (1, 136) 
= 28. 31, ps < .01. These interactions are 
displayed in Table 3. Fisher's LSD values 
for between-Ss comparisons for each of 
these respective measures were 1.20, .29, 
and .18; for within-Ss comparisons, these 
values were .46, .18, and .08. The pat- 
terns of mean errors for words and for 
categories indicated that, for both within-Ss 
and between-Ss comparisons, significantly 
more items were recalled on the second 
recall test when recall was cued. These 


TABLE 3 


_ Mean Recan or WORDS, CATEGORIES, 
Wonps/CATEGORY, AND INTRUSIONS/CATEGORY ON 
THE First AND SECOND Toran RECALL 

Tests 


r 
t 


Order of recall tests 


Measures First Second. 
| Free. | Cued | Free | Ced 
Words 10.27 | 9.70 | 10,53 | 13.21 
Categories 3.89 | 4,58 3.89 | 541 
Words/category 2.53 | 2,62 2,68 | 2.442 
Intrusions/category «31 KSS 42 | .64 


facilitating effects were somewhat greater 
in the 3- and 6-list conditions than in the 
ID and 1-list conditions. The Fs for the 
3-way interactions were /^ (3, 136) = 2.50, 
b «.08, and F (3, 136) = 3.16, p < .05, 
respectively, for the words and categories 
measures. For the words/category mea- 
sure, the means presented in Table 3 show 
that the directions of these effects were 
reversed, Significantly fewer words/cate- 
gory were recalled on the second test when 
recall was cued. Unlike conceptual cate- 
gory cues, rhyme cues failed to generate 
the majority of correct instances stored 
within particular categories, These results 
suggested that the retroaction displayed in 
Table 2 was the result of the loss of higher 
order units as well as the apparent loss 
of instances within those units, The in- 
crement in words recalled under cued 
recall, and the consequent elimination of 
retroactive inhibition within a given list 
condition, apparently occurred because Ss 
were recalling words from more categories, 
not because the rhyme cue provided access 
to the majority of correct instances within 
a given category. 

This failure of the cue may have resulted 
because of difficulty encountered in dis- 
criminating intralist and extralist instances 


within rhyme categories (eg., Bower, 
Clark, Lesgold, & Winzenz, 1969), 1f this 


supposition is correct, then extralist in- 
trusions per category ought to be greater 
under conditions of cued as compared with 
free recall. To check on this possibility 
extralist intrusions/category were tabu- 
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lated for recalled categories (i.e., at least 
one correct instance had to be recalled 
from the category). Nearly all Ss ex- 
hibited these intrusions. The results of 
the statistical analysis indicated that the 
Type of Recall X Order of Recall inter- 
action was significant, / (1, 136) = 9.96, 
p «.01. This interaction effect, which is 
displayed in the bottom row of Table 3, 
indicated that significantly more intru- 
sions/category were obtained under cued- 
recall conditions. Sixteen percent of the 
items recalled per category in the cued- 
recall conditions were intrusions. Fisher's 
LSD values for within-Ss and between-Ss 
comparisons were, respectively, .08 and .13. 
These findings suggested that the rhyme 
cues provided access to the category as a 
unit, and that the loss of instances within 
the unit may have been produced by the 
difficulty of discriminating between intra- 
list and extralist instances within that unit. 
In fact, pooling the results for the words/ 
category and intrusions/category measures 
indicated that items recalled per category 
was a constant of approximately 75% for 
all conditions, a value similar to that 
reported by Tulving and Psotka (1971). 
An analysis of variance of these pooled 
measures indicated that none of the Sources 
of variance was statistically reliable, in- 
dicating that items/category was relatively 
invariant. 


DISCUSSION 


The results of this experiment can be de- 
scribed by summarizing 3 major findings. 
First, the physical properties of words, in 
this case their visual-phonetic characteristics, 
can be used to subjectively organize words 
already represented in memory. That is, 
rhymes, like conceptual categories, apparently 
can function as higher order memory units. 
Second, the retroactive inhibition in free recall 
of lists organized into rhyme categories rep- 
resents a decrement in the recall of these 
higher order units as well as instances within 
recalled units. Retroaction effects on the 
first list were evident within all measures of 
recall performance: words, categories, and 
words within recalled categories. Finally, 
these retroaction effects reflect both inacces. 
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sibility of higher order units and unavailability 
of instances within those units. Providing) 
retrieval cues in the form of rhyme endings 
provided efficient access to stored informat 
tion, essentially eliminating retroactive effects. 
However, retention was not restored to the 
level attained in immediate recall, and the 
recall of correct instances within recalled 
categories actually decreased when relevan 
cues were provided. 

The findings associated with the loss off 
rhyme categories and the subsequent elimina: 
tion of retroaction when these higher order 
units were provided at recall parallel findings 
associated with conceptual categories as res 
ported by Tulving and Psotka (1971). The 
losses obtained for instances within recalled 
categories contrast with the invariance ob- 
tained for this measure when instances belong 
to conceptual categories. In the case of) 
rhymes, instances appear to have been lost 
from the store since they are "unavailable" 
even when a highly relevant cue has 
provided to aid their retrieval. This loss 
may have occurred because of the difficult 
of making discriminations between prescribed) 
and extralist instances. The rhyme cues are 
presumed to be effective higher order retrieva 
aids providing access to the category of rele 
vant words. This effectiveness may (Tulving 
& Thompson, 1973) or may not (Bahrick, 
1970) be determined by what information is 
encoded during the study trial. However, it 
is assumed that the cue initiates a search 
through relevant instances. For each re 
trieved instance a decision is made as t0 
whether the item was or was not from the 
list (Anderson & Bower, 1972). The SS 
reportedly experienced little difficulty in gen 
erating instances from the category cue ant 
many indicated that they simply matche 
word beginnings to the rhyme endings, often 
by going systematically through the alphabet. 
Accordingly, what is apparently lost from 
storage is the ability to remember what items 
of the category appeared in the list and what 
items did not. 


trusion findings associated with the experi- 
ment. To account for the invariance im 
words/category obtained with conceptual cat 
egories and for its absence with phonetic 
categories, it must be assumed that greater 
relative difficulty is encountered in differen- 
tiating instances defined by rhyme than in 
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dillerentiating instances defined by conceptual 
relations. A logical analysis of the 2 types 
of "categories" attests to the reasonableness 
f this assumption. Each instance of a rhyme 
ory literally recapitulates the category 
‘Hf, e.g, “OOT” is contained in BOOT, 
vr, etc. Hence, all instances are directly 
rt of the category and, for that matter, all 
tances are directly related to each other 
ice, by definition, to be a member of the 
category requires the sharing of identical 
physical features. Each exemplar of a con- 
ceptual category is associatively related to 
the category name but recapitulates it only 
virtue of the strength of this associative 
tionship and, furthermore, the instances 
n the category may or may not be 
iatively related. 
ne implication of this interpretation is 
iat similar mechanisms may be involved in 
acquisition and retention of lists con- 
taining conceptually defined categories and 
lists containing rhyme-defined categories. 
Whether the results indicate that words/cate- 
gory is invariant may simply be dependent 
upon the degree of relatedness of the instances 
falling within the category. If this inter- 
pretation is correct, then invariance of words/ 
category may not be obtained if only some 
portion of the items of an exhaustive con- 
ceptual category are used since items within 
these types of categories tend to be highly 
related (e.g., Cohen, 1963). 
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Two experiments examined the effects of constant vs. varied input of letter 


strings on recall and recognition memory. 


Letter strings constructed from 


pairs of trigrams were fractionated or spatially grouped and presented as either 
a constant or varied grouping on successive presentations. Varied input pro- 
duced much greater recall of the entire letter sequence than did constant input. 
Presumably, Ss in the varied condition were more likely to adopt a strategy 
of looking for and encoding a larger unit or structure than Ss in the constant 


condition, a strategy which facilitates recall. 


In contrast, recognition was 


enhanced by constant input of the letter sequences, a finding predictable from 


simple frequency considerations. 


The following experiments concern the 
role of nominal-stimulus variation vs. con- 
stant or fixed input on recall and recogni- 
tion memory for groupings of letter se- 
quences. A classical assumption in human 
memory has been that repetition of stimu- 
lus events, in the sense that nominal 
stimuli are presented in the same manner 
or form on successive study trials, results 
in superior memory compared to condi- 
tions in which nominal stimuli are yaried 
in some fashion. Nominal-stimulus vari- 
ation has generally referred to situations 
in which contextual and/or structural vari- 
ations of stimuli are introduced in the 
course of learning. 

This assumption has been recently chal- 
lenged by Bevan and his colleagues in 
several experiments which have shown that 
variation among instances of stimulus cate- 
gories could increase the likelihood of 
recall. In one study (Dukes & Bevan, 
1967), stimuli of the class ax, bx, cx, dx, 
etc., were presented, where x was the 
common feature and a, b, c, and d were 
the modifying elements. For example, 
x represented a photograph of a particular 
person, and a, b, c, or d represented his 
appearance in different poses or dress. 
To each stimulus compound of a given 
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class Ss learned a particular name : ise 
and then were tested for recall hat 
response when shown either one the 
training stimuli (e.g., bx), or new ins:¢nces 
of the category (e.g. jx). They “ound 
that the superiority of varied trsining 
depended upon whether the test stilus 
was a truly old one vs. a “new” item ‘rom 
the category employed. Specifically, when 


the stimuli were varied during training, 
varied training produced superior cued 
recall (using either old or new stimulus 
items) compared with repetition training 
(using only new stimulus items). In con- 
trast, repetition training always produced 
superior recall compared with all other 
conditions when the test stimuli were old. 

Although these results place constraints 
on any simple generalization about the 
superiority of varied training, a similar 
study (Bevan, Dukes, & Avant, 1966) 
using free-recall procedures indicated that 
the free recall of generic stimuli (e.g., 
"pine. tree," “oak tree”) increased as a 
result of varied context with both pictorial 
and word stimuli. Nevertheless, the su- 
periority of varied training was quite small, 
although reliable, and attempts by Goggin 
to replicate this finding with word stimuli 
have not been successful (J. Goggin, per- 
sonal communication, March 1971). 

An alternative approach (e.g., Bower & 
Winzenz, 1969) is to vary directly the 
Structural properties of the entire stimulus 
to be placed in memory as distinct from 
varying the modifying elements. The 
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tucies by Bevan (Bevan et al., 1966; 
Dukes & Bevan, 1967) have varied only 
modifying features, making them in 

i\e sense concept identification studies 

cause a common relevant cue is always 

ent in the stimulus compound; the 
ulus context is varied, whereas the 
oe-recalled response term is kept con- 
int. This distinction differentiates be- 
en 2 kinds of variation, which may 

e different effects on memory : variation 

posed on the entire event or sequence 

be remembered vs. variations in only 
contextual stimuli present during 
taining. 

[he purpose of the present experiment 
was to exploit the second approach by 
varying the grouping structure of pairs of 
trigram doublets (e.g., DENGIT) and to 
determine the effect of constant vs. varied 
presentation of grouping structures on 
recall and recognition. Bower and Winzenz 
(1969) have reported that variation of the 
grouping structure of digit series presented 
auditorily leads to severe degradation of 
recall and. recognition performance. Only 
when the digit series were grouped in a 
constant fashion on successive trials did 
improvement in memory result. The digit 
series lacked, however, any evident "in- 
herent" overall structure that might be 
detected by S, since they were random 
permutations of digits. This is simply to 
say that a string such as 391, 87, 625 is 
no better a 3-chunk set than is 39, 1876, 25, 
or vice versa; there is no inherently better 
combination of the numbers for S to seek 
In this instance, stability rather than vari- 
ability of the Z-imposed grouping structure 
was essential for memory improvement, 

It is reasonable to examine the effect. of 
constant vs. varied input under conditions 
in which an existing structure is present 
in the series. Structure is used here simply 
to refer to natural groupings of letters, 
pronounceable groupings, or easily inte- 
grated units. If an inherent (preexperi- 
mentally developed) structure is present, 
as well as detectable by S, then it is con- 
ceivable that varied input, in the sense of 
changes in grouping structure on successive 
presentation, may facilitate recall of the 
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letter series. This finding could result 
from the modification of S's search strategy 
in looking for groupable letter sequences. 
This is simply to suggest that varied in- 
put may lead .S to achieve S-imposed en- 
codings, based upon the existing overall 
structure, more rapidly than would occur 
with constant input. In contrast, Ss re- 
ceiving constant input of letter sequences 
may be more likely to process them as 
presented by Æ, and in so doing, they 
would be less likely to detect codings that 
would enable a high level of recall. Tri- 
gram doublets such as CUPNET or CIMBUX 
contain an easily recallable structure, if 
detected, and thus the recall of a series 
of letters, even when “fractionated” ac- 
cording to different grouping rules on 
successive presentations, may be enhanced 
by varied grouping of the sequence. Con- 
stant input in this experiment refers to à 
given doublet such as cUPNET being pre- 
sented, for example, as € UPN ET consist- 
ently on successive presentations, whereas 
varied presentation refers to changes 
the grouping structure on successive pres 
entations, such as C UPNE T on one and 
cu PN ET on another. 

Experiment I examined the effects of 
constant vs. varied input on recall of letter 
groupings and indicated that varied input 
produced superior recall. Experiment Il 
examined the effects of input structure on 
recognition memory and revealed that con- 
stant input produced superior recognition, 
a finding predictable from simple frequency 
considerations. In both experiments, 3 lev- 
els of trigram meaningfulness (m) were 
employed in order to examine the effects 
of varied-constant input as a function of m. 


EXPERIM 


Method. 


Design. The Ss were 30 undergraduate student 
volunteers from the University of New Mexico. 
Each S received 8 study-test trials to learn and 
recall a mixed list of 12 trigram doublets. A con- 
stant and a varied condition of stimulus presenta- 
tion were employed. In the constant condition, 
the doublets were presented consistently on each 
study trial. In the varied condition, Ss saw dil- 
ferent perceptual groupings of each doublet on 
successive lists presented within a trial. Inter- 
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spersed between each study trial was a test trial 
in which S was required to recall! the entire se- 
quence of 6 letters in the order and sequence in 
which they were presented. 

Stimuli. The stimulus items were trigram dou- 
blets constructed by combining 2 trigrams of com- 
parable m values selected from the Archer (1960) 
norms. For example, the trigrams RUM and WIG 
were combined to form the doublet RuMwiG. The 
average ociation values for the 4 high-, mod- 
erate, and low-m trigram doublets were 100%, 
56%, and 12%, respectively. Doublets used were 
RUMWIG, DENGIT, BANCOW, CUPNET, CIMBUX, JUM- 
HYR, PUZWEK, SUHTYC, TIJFEQ, NEJWOQ, YIWFEP, 
and wojyzuv. Each of the doublets was fractionated 
(spatially grouped) into 3 groupings of letters ac- 
cording to one of 4 grouping rules. For exa aple, 
the 4 high-m doublets, RUMWIG, DENGIT, BANCOW, 
and cuPNET, once fractionated were presented as 
groupings of R UMWI G, DE NGI T, BA NCOW, and 
CUPN ET, respectively. The 4 moderate-m and 4 
low-m stimuli were also fractionated according to 
the same rules. 

Four stimulus lists were prepared, each list con- 
taining the 12 fractionated stimuli presented in a 
different serial order. In the constant condition, 
each doublet was presented in the same fractionated 
fashion over successive presentations of the lists. 
In the varied condition, the doublets were frac- 
tionated according to a different rule Over succes- 
sive presentations of the 4 lists, For example, 
the doublet Bancow appeared as BANCO W, 
B ANCOW, BA NCOW, and BA NCO W, on each list, 
respectively. "Thus, the grouping rules specify letter 
groupings as follows: 1,4, 1; 1,3, 2; 2,2,2; and 
2,3,1. Variation of the nominal stimulus was 
thus defined in terms of the use of a different 
fractionation rule applied to the same doublet on 
Successive list exposures. Moreover, the intact 
trigram (e.g. BAN or cow) was never presented 
in either the varied or constant condition. The 
letters of a given doublet were always presented 
in the same order and sequence for both condi- 
tions, despite changes in the grouping structure 
for the varied condition. 

Procedure. There were 8 study-test trials ad- 
ministered in each condition, a trial consisting of 
the successive presentation of 4 stimulus lists of 
the 12-doublet letter strings. Thus, S saw a total 
of 48 (4 Lists X 12 Doublets) stimulus presenta- 
tions per trial, Each fractionated doublet was 
exposed for a 2-sec. interval on a Stowe memory 
drum, and S was instructed to remember the items, 
as he would be tested for recall. Since S saw each 
doublet fractionated 4 ways in the varied condi- 
tion, input variation occurred within each trial; 
on successive trials, the same 4 varied Versions 
were repeated. 

The Ss were given a recall task interspersed 
between each trial that required them to recall 
the letters in the order and sequence presented, 
The Ss were given a test sheet containing 12 hori- 

e zontal arrangements of 6 adjacent boxes, and 5 
: filled in each arrangement of boxes with the ap- 
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propriate stimulus letters. The Ss were i: 
to fill in all of the boxes (guessing unknown 
whether or not they were sure of their ri 
making the procedure one of forced reca 
though the task required that the letters b 
in the order and sequence in which th 
shown, Ss were allowed to free recall the let 


Results 

Recall. The principal data obtai: in 
recall are shown in Figure 1, whic! a 
plot of the probability of correct co te 
recalls (all 6 letters in correct ord nd 
sequence) for the constant and vai in- 
put conditions over 8 trials. Thi ire 
shows a substantial superiority in |< call 
due to varied input throughout the test 
sequence, F (1, 28) = 12.90, p < 005. 


Terminal recall probabilities for the ied 


and constant conditions were .73 and 43, 
respectively. Recall improved for | oth 
conditions as a result of practice, F (1, 196) 
= 81.08, 5 < .001, and the rate of im- 
provement was faster under the varied 
condition, as reflected ain the significant 
Type of Training X Trials interaction, 
F (7, 196) = 10.89, p < .001. The effect 


of meaningfulness was also significant, 
F (2, 56) = 58.73, 6 < .001. Mean proba- 
bilities of recall in the varied condition for 
the 3 levels of meaningfulness (high, mod- 
erate, and low) were .70, .45, and 4, 
respectively, whereas the comparable 
means of the constant condition were .41, 
21, and .11, respectively. Since meaning- 
fulness did not interact with type of 
training, F (2, 56) — .67, b > .20, separate 
recall figures showing this effect are not 
shown. The data of Figure 1 are thus 
based upon all levels of meaningfulness. 
Partial recall of sequences, Partial recalls 
of 5-, 4-, 3-, and 2-letter strings were cal- 
culated in order to examine the effect of 
varied-constant input on size of the re- 
called chunk. A partial recall was defined 
as the correct recall of 2 or more adjacent 
letters. Thus, a correct partial recall of a 
3-letter sequence (unit) for the trigram 
RUMWIG would be, for example, RUM, UMW, 
MWI, Or WIG. A given recalled sequence 
Was scored only once, so there were no 
duplicate or nonindependent scorings. For 
example, a partial recall of rumw was 
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sd as 1 4-unit recall, not as 2 3-unit 
lls or 3 2-unit recalls. As expected, 
varied group gave more 4- and 5-unit 
ings, F (1, 14) = 11.82, p < .005, 
reas the constant group gave more 
and 3-unit strings, F (1, 14) = 52.2, 
.001. This trade-off is necessary, 
ce the higher recall of 2- and 3-unit 
ings for the constant group is due to 
fact that the varied-group Ss use up 
eir potential 2- and 3-unit strings in 
cir more frequently recalled 4- and 5-unit 
ings. 
Single-letter recall. A third recall mea- 
are, the total number of correct single- 
tter recalls, was calculated. Single-letter 
ecalls were again scored only if they were 
n the correct serial position, and each 
recall was scored correct or incorrect regard- 
less of the recall of an adjacent letter. 
Thus, on a given trial there were 72 pos- 
sible correct single-letter recalls (12 Tri- 
gram Doublets X 6 Letters per String). 
The mean correct single-letter recalls, 
averaged over 8 trials, were 49.9 for the 
varied condition and 31.3 for the constant 
condition, F (1, 238) = 62.44, p < .001. 
l'his simply indicates that the superiority 
of the varied condition in recall holds 
regardless of whether single letters or the 
entire sequence of 6 letters is the scoring 
unit. 


EXPERIMENT II 


The second experiment examined the 
effects of varied vs. constant training on 
recognition memory where it was proposed 
that nominal-stimulus variation would lead 
to poorer recognition performance. 


Method 


Design. The Ss were 34 undergraduate student 
volunteers from the University of New Mexico. 
The Ss were given training with varied or constant 
presentations as in Experiment I, followed by a 
2-item forced-choice recognition test. Three modi- 
fications of the training procedure were introduced. 
The study trial consisted of only one presentation 
of the list (rather than 4), followed by the recogni- 
tion test. Thus, in the varied condition, S saw 
a different list on each trial for a block of 4 trials. 
Twelve study trials were administered (rather 
than 8), with lists varied within blocks of 4 trials. 
Thus, Ss in the recognition experiment were given 
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FicuRE 1. Probability of complete recall of the 
letter sequences as a function of trials, with varied 
vs. constant input of letter groupings as parameter, 


12 presentations/trial for a total of 144 presenta- 
tions (12 Items X 12 Trials), whereas Ss in the 
recall experiment were given 48 presentations/trial 
for a total of 384 presentations (4 Lists X 12 Items 
X 8 Trials). This was done simply because it was 
unnecessary to provide as much training in the 
recognition experiment, Each fractionated doublet 
was presented for only 1 sec. 

Procedure. The experiment again used a study- 
test procedure. After each study trial, Ss were 
shown the old stimuli singly, paired with a highly 
similar distractor item, Distractors were formally 
similar to the old (target) items, with a distractor 
consisting of a 1-letter substitution of one of the 
4 middle letters of the old doublet. A vowel was 
replaced by a vowel and a consonant was replaced 
by a consonant. For example, the old item 
R UMWIG was paired with the distractor R AMWI G 
on one trial and R uwNiG on another trial, for 
the constant condition. Thus, an old item and 
its distractors retained the same fractionation rule 
on study and test trials in the constant condition. 
A different distractor item was used on each of 
the 12 trials, and the right-left position of the 
pairs was randomly varied on each trial. The Ss 
were instructed to underline whichever item in 
the pair they had seen during the study trial. 
The procedure was the same for the varied condi- 
tion except that the distractor items were frac- 
tionated according to the same rule as the old 
item as it appeared on a given study trial. Thus, 
on a given recognition test, R AMWI G was paired 
with R UMWI G, and on a subsequent test, RU MN 1G 
was paired with RU MWI G. 


Results 


Recognition. The principal data ob- 
tained in the recognition test are shown 


in Figure 2, which is a plot of probability 
of correct recognition over test trials. 
This figure shows that constant training 
leads to superior recognition compared 
with varied training, F (1, 32) — 15.16, 
p «.001. Terminal probabilities of cor- 
rect recognition for the constant and varied 
conditions were .90 and .80, respectively. 
Recognition improved for both conditions 
as a result of training, F (11, 352) — 14.54, 
p < .001; however, there was no reliable 
Type of Training X Trials interaction, 
F (11, 64) = 1.20, p > .20. The effect 
of meaningfulness was also significant, 
F (2, 252) = 38.18, p < .001. Mean rec- 
ognition performances in the constant con- 
dition for the 3 levels of meaningfulness 
(high, moderate, and low) were .85, .85, 
and .73, respectively, whereas the com- 
parable means for the varied conditions 
were .75, .73, and .63 respectively. Again, 
since meaningfulness did not interact with 
type of training, F (2, 64) = 1.23, p > .20, 
only a single recognition plot is presented. 


Discussion 


The principal findings of the experiments 
were that varied input, in the form of altered 
groupings of letter sequences taken from 
trigram doublets, facilitated recall of the let- 
ters, whereas constant input facilitated stim- 
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ulus recognition. The superiority of ied 
input on recall occurred whether me: red 
by complete recall of all letters in the ap- 
propriate position or by total num of 
correct single-letter recalls. Thus thes ita 
indicate one restriction on any assur ion 
that sheer repetition of stimulus event he 
condition for achieving maximum recal! 

An explanation of the general supe ity 
of varied input on recall is reasonably st ht- 
forward and can be accounted for b hat 
we call a perceptual regrouping hyf sis. 
The process is viewed as operating the 
following manner. When S is shown ed 
groupings on successive presentati he 
eventually learns to ignore the Ji ed 
groupings, since they change on su ve 
presentations and are not a relevant { re. 
In addition, it is assumed that varied "ut 
leads S to adopt the strategy of seeking ew 
and overall grouping which, if successhi:, will 
facilitate recall because the particula ter 
sequences used can be grouped as a nit. 
Since the letter sequences contained in- 
herent grouping structure in the fori: of 
pronounceable or easy-to-integrate units, de- 
tection and encoding of this overall structure 


must facilitate recall. Moreover, the recall 
data suggest the general principle that vari- 
ability in nominal-stimulus input leads to 
greater stability in S's functional encoding 
under conditions in which S can achieve an 
overall or unitary encoding of the input se- 
quence. In contrast, if no existing or inherent 
structure were present, varied input would 
not be expected to facilitate recall, Search 
for such a structure and the corresponding 
development of S encodings would be fruit- 
less, and performance in such instances would 
be degraded. 

On the surface, the superiority of recall 
under varied input appears contrary to the 
results reported by Bower and Winzenz (1969) 
and may further imply that their reallocation 
hypothesis is limited. Critical differences be- 
tween the experiments appear, however, to 
make the superficially divergent results under- 
standable. The perceptual regrouping hy- 
pothesis can account for differences between 
this experiment and Bower and Winzenz's 
Experiments I and II regarding recall, where 
in the latter study it was necessary for S to 
have a stable Z-imposed structure for learning 
to occur. Moreover, a stable E-imposed struc- 
ture was necessary for learning even when 
letter sequences were used (Winzenz, 1972. 
Experiment V), if the groupings themselves 
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{ words. The perceptual regrouping 
esis assumes only that varied input in 
m of letter groupings, instead of being 
ved to separate storage locations, even- 
comes to be regrouped as an integrated 
nd is then shunted to the same storage 
on. The point is that S must discover 
a encode a sequence that contains an in- 
! ;t structure if variable input is to enhance 
" 
iere are other differences between these 
riments, including length of series and 
sunt of information processed prior to each 
In addition, differences in auditory vs. 
tal presentation may have contributed to 
pattern of differences in recall between 
2 studies. In the present experiment, 
saw the entire letter sequence and would 
esumably have greater opportunity to en- 
ide it as an integrated unit. In contrast, 
;wer and Winzenz (1969) presented the 
git series auditorily, either as single digits 
cing read with pauses between groups, or as 
successive numerical groupings being named. 
n either instance, auditory presentation 
vould appear to reduce the likelihood of S 
cetecting a larger structure or grouping even 
if such structure had been present. In short, 
sirhough there is no doubt that the con- 
si-tency of Z-imposed groupings is necessary 
for improvement in recall under conditions 
examined by Bower and Winzenz, structural 
variability of letter sequences aids recall under 
conditions in which S$ can detect and encode 
an underlying structure consisting of the 
entire unit. More generally, these findings 
further attest to the significance of encoding 
processes in human learning and memory and 
to the importance of structural properties of 
stimuli in learning tasks (cf. Ellis, 1973). 
The superiority of recognition under con- 
stant input is predictable from relatively 
simple frequency considerations. The recog- 
nition task does not require integration or 
organization of the letter string into a larger 
unit. Moreover, the test compared distractors 
conforming to the same grouping rule as the 
old items, so that S did not have to attend 
to or encode the particular grouping in the 
constant condition. Thus, the variable con- 
dition reduces the likelihood that S will 
develop a stable encoding necessary for recog- 
nition, Since the recognition task requires 
à choice between 2 grouped sequences, S would 
be less disposed to develop unitary encodings 
in the recognition task. All that would ap- 
pear to be necessary for correct recognition 


TES MU SECUS AT Hi a 


289 


is the development of a relatively stable 
encoding which is maximized under condi- 
tions of consistent presentation of a fixed 
structure. 

The positive relationships between meaning- 
fulness and recall and between meaningfulness 
and recognition (Ellis, Parente, & Shumate, 
in press; Ellis & Shumate, 1973) is well- 
established and is of no great interest here. 
What is of interest is the lack of an inter- 
action between varied-constant input and 
meaningfulness in recall. Although the su- 
periority of the varied condition occurred 
across all levels of meaningfulness, it might 
have been expected that the effect would 
have been attenuated at lower m levels, since 
these items are less easily unitized. Since 
this did not occur, it would seem reasonable 
to assume that even low-m trigram doublets 
possess sufficient inherent structure for the 
variability effect to appear. 

Finally, these experiments indicate that 
variably grouped presentations of letter se- 
quences affect recall and recognition differen- 
tially, This outcome is not readily handled 
by a strength theory of recall and recognition 
(cf, Kintsch, 1970), which assumes that recall 
and recognition involve essentially the same 
processes, The fact that varied input affects 
recall and recognition in different ways sug- 
gests that different proce: may underlie 
the 2 performances, Some caution must be 
used in making this inference, however, be- 
cause the 2 training procedures were not 
exactly comparable, with different amounts 
of practice, different stimulus durations, and 
differences in manipulating variability (within 
trial in recall and between trials in recogni- 
tion). Despite these differences, the effect of 
variability was reversed for the 2 conditions 
over the entire training-testing sequence. 
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Retention effects of retrieval practice and of predictability of recall delay 
were investigated in 2 experiments. Several 5-word lists were recalled ini- 
tially after no delay or delay filled with number subtraction; delay type 
was predictable for Ss in a precue but not in a postcue condition. Words 
from all lists were tested in final free recall (FFR). The advantage of filled 
over no delay in FFR was greater in the precue than in the postcue condition. 
This result was taken as evidence that attributes selected for encoding 
depended on the type of delay anticipated. In Experiment II only half of 
the lists were recalled initially. Initial recall aided FFR more for filled- than 
for no-delay words. It was concluded that retention benefits from initial 
retrieval to the extent that retrieval cues used at initial and final recall are 


similar. 


earsal has been used extensively as 
lanatory concept in recent analyses 
iory processes. In Atkinson and 
's (1968) 2-store theory of memory, 
ıl serves the dual purpose of main- 
items in short-term store (STS) and 
ermining the amount of information 
an item transferred to long-term 
'S) Serial position effects in- 
ing the negative recency effect found 
in final free recall (FFR), have been inter- 
preted in terms of the positive relationship 
between rehearsal frequency and transfer 
to LTS (Atkinson & Shiffrin, 1971; Craik, 
1970). More recent results suggest, how- 
ever, that rehearsal might only serve the 
function of maintaining items in STS 
(Jacoby, 1973; Jacoby & Bartz, 1972; 
Meunier, Ritz, & Meunier, 1972). In the 
Jacoby and Bartz study, 3 groups of Ss 
were presented with several 5-word lists; 
each list was recalled initially, either im- 
mediately after its presentation, after a 
silent-delay interval, or after a delay filled 
with number subtraction. Initial recall 
int he subtraction condition was well below 
that in the other 2 conditions, supporting 
the contention that rehearsal is necessary 
to maintain items in STS. However, in 
an unexpected FFR test for all presented 
words, recall after silent delays was lower 


* Requests for reprints should be sent to Larry 
L. Jacoby, Department of Psychology, Iowa State 
University, Ames, Iowa 50010. 


291 


than after number subtraction ; amounting 
to an inverse relationship between re- 
hearsal opportunity and storage in LTS. 
This and the finding that silent-delay items 
were not better recalled than no-delay 
items led to the conclusion that rehearsal 
frequency does not determine transfer to 
LTS. As possible explanations for the 
final-recall advantage of filled-delay items, 
the authors suggested differential encoding 
during study and differential retrieval 
practice at initial recall. 

According to the differential encoding 
interpretation, the coding of items during 
study was dependent on the nature of 
delay anticipated. For immediate recall 
or recall after a silent delay, it would be 
sufficient to keep list items in STS by 
means of rehearsal with no necessity to 
generate more permanent retrieval cues. 
The Ss in the subtraction condition were 
likely to organize items or process them 
to a "deeper level" (Craik & Lockhart, 
1972) to insure that they would be re- 
trievable after the filled delay. Thus, 
subtraction Ss generated longer lasting 
retrieval cues that more likely would be 
still accessible at the time of FFR. The 
second interpretation appeals to the in- 
fluence of retrieval practice on later free 
recall (e.g., Lachman & Mistler, 1970). 
Immediate recall or recall after a silent 
delay might only make use of short-lived 
retrieval cues such as the acoustic trace 
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of an item (Tulving, 1968), corresponding 
to retrieval from STS. In contrast, re- 
trieval after a filled delay might neces- 
sitate the use of more permanent retrieval 
cues, Filled-delay items would thus benefit 
more from retrieval practice since the re- 
trieval cues employed would be more sim- 
ilar to those accessible at the time of final 
recall. Since differential encoding is as- 
sumed to depend on S’s ability to anti- 
cipate the nature of the delay interval, 
removal of this ability should nullify the 
difference between filled-delay and no-delay 
items in final recall. Differential retrieval 
practice would be expected to operate 
regardless of S's ability to anticipate the 
nature of the delay preceding initial recall. 
Predictability of delay preceding initial 
recall was manipulated in the present in- 
vestigation in order to separate the effects 
of differential encoding and differential 
retrieval. 

In Experiment I, short lists were pre- 
sented and free recalled either immediately 
or after a period of number subtraction. 
In a precue condition, Ss were informed 
prior to list presentation concerning which 
of the 2 delay conditions would follow. 
Thus, they had the opportunity to encode 
words more permanently preceding a filled- 
delay interval. Final recall for the precue 
condition was expected to replicate the 
earlier reported superiority of filled over 
no delay. In a postcue condition, infor- 
mation about delay type was not available 
until after list presentation. Since delay 
type was not predictable during study, the 
final recall advantage of filled-delay items 
should be absent if differential encoding 
was the only factor responsible for the 
earlier results. No effect of the cuing 
manipulation would be expected if only 
differential retrieval was operative in the 
earlier experiment. 


EXPERIMENT I 
Method 


Materials and procedure. Two sets of 150 words 
with A and AA ratings were randomly selected 
from the Thorndike and Lorge (1944) wordbook, 
and used to form 2 replications of the basic design. 
Within each set, words were assembled into 30 
5-word lists and tape recorded for auditory pre- 
sentation at a 2-sec. rate. The Ss’ oral recall was 
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recorded by E. Half of the lists we called 
immediately (no delay, ND) after thei senta- 
tion while the other half were recalle ter a 


filled-delay (FD) interval. The seque Í im- 
mediate and delayed recall was rando: sepa- 
rately for each of 2 sets of lists. The tasi sloyed 
in FD intervals consisted of auditory p ation 
of randomly selected 2-digit numbers abers 
were presented at a 2-sec. rate, wit first 
number occurring 1 sec, after the last ina 
list; 7 numbers were presented withi: 5-sec. 
delay interval. The Ss were instruct sub- 
tract 1 from each number and say the + iloud 
prior to the presentation of the ne nber, 
For all conditions, the word “go” si | the 
beginning of the recall interval; recall v ken. 
The 7.5-sec. recall interval was terminau y the 
word "ready" which preceded the first v f the 
next list by 2 sec, 

The Ss in a precue condition were med 
prior to the presentation of each list rning 
the nature of the delay that wovld follow stack 
of 3X 5 in. note cards was placed befor Ss in 
that condition; each list was represented b: card. 
Printed on each card was either the word layed 


or immediate, indicating the nature : ay. 
Upon hearing “ready” preceding list preso- ation, 
S read the top card and then placed it «xt to 


the stack; the card read corresponded to (^e list 


that was to be presented. In a postcue condition, 
Ss learned about delay after presentation = each 
list by hearing either the signal for recall the 
first of the 2-digit numbers for subtraction 
Following recall of the last list, all were 


read 3 sets of 9 digits and asked to recoil each 
set in order. Next, Ss were instructed to write 
down all the words they could remember from all 


lists. Prior to these instructions there was no 
reason for Ss to anticipate the FER test. There 
was no time limit on the final test, The digit 


span task was employed as an attempt to minimize 
the effects of retrieval from STS in FFR. 

Subjects and analyses. The Ss were 20 volun- 
teers from undergraduate courses at Iowa State 
University; 10 Ss were assigned to each of the 
cue conditions according to a prearranged random 
schedule with the restriction. that the ratio of 
males to females must be constant across condi- 
tions. The Ss were tested individually and received 
extra course credit for their participation. For 
each S5, recalled words were classified according 
to delay condition and serial position. 

Analyses to be reported employed a 2 X 2 xs 
(Cue X Delay X Serial Position) analysis of vari- 
ance with repeated measures on the last 2 factors; 
initial and final recall were analyzed separately: 
Replications were not included as a factor since 
a preliminary analysis indicated that results did 
not differ across replications. 


Results and Discussion 


Initial recall. Recall probability as 8 
function of input position, delay, and cue 


^ 
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conc is displayed in Figure 1. The Ta ap 
FD i were recalled at a lower level Es POS Tare 
tha items, F (1, 18) = 313.67, p *9 
E cating that interpolated sub- .8 
tract iterfered with the maintenance = 
of li ms in STS. Recall probability ; 
was ‘ly higher in the precue (.74) than € S 
in t ;stcue (.67) condition, F (1, 18) 2.5 
=4 p < .05. The main effect of serial 2 4 
posi F (4, 72) = 14.72, and the Delay 2 
X Sc | Position interaction, F (4, 72) 5^? RORIS 
=9 were also significant (ps < .001). x O-O NO DELAY FINAL 
Reca! probability declined steadily across D MEUNHIPSEDBUEUAHESECAUS | 
seri: itions in the FD condition while à 
rem ig relatively stable in the ND i ] 
con ] 
Fi call. Final recall probability was zi 4 
high r FD than for ND items, F (1, 18) CN 
= 3 p < .001. As predicted by the 
diffe i] encoding hypothesis, the recall "ONUONSUIET ECHTE C IE YE EEG 
adva: : of FD over ND items was SERIAL POSITION 
E. e Pu E FIGURE 1. Probability of initial and final free 
ed Wi , , recall as a function of serial position, cue, and 
- 00, b «.01. Overall, recall proba- delay condition in Experiment I, 
bility declined across input serial positions, 
F (4, = 6.51, p < .001, Although the 
appropriate interaction was not significant, hypothesis. However, the use of different 
the decline across positions was almost retrieval cues after the 2 types of delay 
totally absent for recall of ND items in might develop gradually across the test 


the precue condition. 

, Results of the present investigation pro- 
vided support for the differential encoding 
hypothesis. When Ss could anticipate 
delay ty pe, FD items were coded in a more 
Permanent fashion than were ND items. 
Coding of the 2 types of items was neces- 
sarily the same when Ss could not predict 
delay type; coding when delay was un- 
Predictable appears to have been inter- 
mediate to the 2 forms of coding employed 
in the precue condition. ‘The decline in 
recall probability across input positions 
can be attributed to the study of earlier 
Items during the presentation of later ones 
(e.g., Atkinson & Shifrin, 1971). The 
absence of serial position effects when Ss 
anticipated an immediate recall test sug- 
ests that the cumulative study strategy 
Was primarily used to prepare for delayed 
recall, 

: The failure to obtain an effect of delay 
in the postcue condition can be taken as 
evidence against the differential retrieval 


E 


session. In addition, retrieval cues em- 
ployed after an FD would at best be 
expected to be only relatively permanent 
and might not survive the presentation 
of a large number of later lists, Both 
considerations suggest that an advantage 
for FD items might be more likely in final 
recall of lists presented near the end of 
the experimental session. Filled delay and 
ND in the postcue condition were there- 
fore compared separately for the first 8 and 
second 7 lists of each type. Final recall 
probabilities were nearly identical for FD 
and ND items presented in the first lists 
studied (.10, .11). However, recall proba- 
bility from the last study lists was higher 
for FD than for ND items (21, .12), 
t (18) = 2.83, p <.05. Thus, the dif- 
ferential retrieval hypothesis cannot be 
totally dismissed. 

Similar comparisons in the precue group 
reveal higher recall of FD over ND in the 
first lists studied (.15, .05), ¢ (18) = 3.54, 
p < .01, as well as in the last study lists 
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(.22, .07), t (18) = 4.78, b < .001. Ap- 
parently, in contrast to differential retrieval 
strategies, differential encoding is estab- 
lished early in the course of the experiment. 


EXPERIMENT II 


Experiment II was designed to further 
separate the effects of differential encoding 
and differential retrieval. As in Experi- 
ment I, Ss in a precue and postcue con- 
dition were exposed to lists followed by 
either an FD or ND. Now, however, 
Ss recalled only half of the lists initially; 
recall vs. no recall was factorially com- 
bined with cue and delay conditions. In 
all conditions, Ss were unable to predict 
whether or not recall would be required. 
Thus, study encoding should be constant 
across recall conditions. The effects of 
differential encoding can be observed un- 
contaminated by retrieval practice in the 
no-recall conditions. Comparisons of the 
final-recall effectiveness of initial recall in 
the 2 delay conditions yield information 
that is relevant to the differential retrieval 
hypothesis. Initial recall after an FD 
should have a larger effect in FFR if the 
differential retrieval hypothesis is accurate. 


Method 


Materials and procedure. A set of 200 words 
with A and AA ratings were randomly selected 
from the Thorndike-Lorge (1944)  wordbook. 
These words were assembled into 40 5-word lists 
and tape recorded for auditory presentation at 
à 2-sec. rate. The Ss’ oral recall was recorded 
by E. Half of the lists were followed by FD while 
the other half were followed by ND. Each half 
was further subdivided into an equal number of 
lists initially recalled (IR) vs. not initially recalled 
(NIR). Four delay-recall combinations resulted : 
FD-IR, FD-NIR, ND-IR, and ND-NIR. Ten 
lists were randomly assigned to each delay-recall 
combination; the presentation order of lists was 
also randomized. Four replications of the basic 
design were constructed so that each list repre- 
sented each delay-recall combination equally often. 
"The distractor task employed in FD intervals was 
the same as in Experiment I, as were the arrange- 
ments for recall in the FD-IR and ND-NIR 
conditions. In the FD-NIR condition, the word 
"ready" occurred 2 sec. aíter the last 2-digit 
number and signaled the beginning of the next 
list, The word "ready" served the same purpose 
in the ND-NIR condition but occurred 1 sec. 
after the last word of the preceding list, 
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Cuing arrangements for a precue postcue 
condition were the same as those in E iment E 
None of the Ss was precued with r to recall 
condition. The recall-no-recall mani on was 
explained to Ss by describing the i; tion as? 


being concerned with the effects of « tion om 
memory. Other procedural details 
rangements for digit span tests and 


were identical to those in Experiment 


recall 


Subjects and analyses, The Ss w volun- 
teers enrolled in psychology courses a State 
University and received extra cou dit for 
their participation. Twenty Ss wer ened to 
each cue condition according to : rranged 


random schedule with the restriction he ratio 
of males to females must remain co across 
conditions. 

Study lis 
size for prel 


s were divided into 2 blo: 
liminary analyses of init 


of equal 
id final 


recall. The 20 lists of Block 1 were th t 5 lists 
representing each of the 4 delay-re ombinas 
tions while those in Block 2 è the s l 5 lists 
from each combination of conditions. inalysis 
of initial recall failed to reveal any nificant 
effects of input block. In final recall proba- 
bility of recall was higher for lists prese: d in the 


second (.11) than in the first (.06) bloch 1, 266) 
= 67.94, p < .001; effects of other vari: oles were 
generally consistent across blocks but re pro- 
nounced in Block 2. The finding of re pros 
nounced effects in final recall of later presented 
lists is similar to that reported in Ex ment I 
and may be due to either recency of presentation 
or the development of study strategi Since 
effects in recall of early s were less clear and 
did not contribute additional information, results 
will be reported only for lists presented in Block 2 
Initial recall data were treated as in Experiment 1, 
Final recall scores were entered intoa 2 X 2 X 2 X $ 
(Cue X Delay X Initial Recall X Seria! Position) 
analysis of variance with repeated measures On 
the last 3 factors. 

Since in earlier experiments, including Experi- 
ment I, interactions of Ss with other factors did 
not reach significance (p = .05), it was decided 
a priori to use the pooled interactions with Ss aš 
the denominator for F tests, The appropriateness 
of this procedure was checked for the anal of 
initial and final recall. Only 1 of the 8 interactions 
involving Ss exceeded the .10 level, F (38, 152) 
= 1.39, indicating that the assumptions necessary 
for pooling were met. 


Results and Discussion 


Initial recall. Initial- and final-recall 
probabilities for each combination of con- 
ditions are plotted in Figure 2. As eX 
pected, recall was lower in the FD than 
in the ND condition, F (1, 342) = 336.10, 
p < .001. The Delay X Serial Position 
interaction was also significant, F (4, 34 
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=3 .05. Recall probability de- 
cline: ss input positions in the FD 
condi while showing a slight increase 
for ! us. Intrusion errors were in- 
freq: ntrusions from lists that were 
not : d were only slightly more fre- 
quen 2%) than those from recalled 
lists >). Thus, it appears that Ss 


lly differentiated lists. 

recall. The effects found in Ex- 
| were replicated. As shown in 
ecall probability declined across 

sitions, F (4, 722) = 15.20, p 
nd was higher for FD than for 
F (1, 722) = 18.59, p < .001. 


The l| advantage of FD over ND 
item larger in the precue (.14, .08) 
than he postcue (.12, .11) condition, 
F (1 — 9.68, p « .01, again support- 
ing 1 lifferential encoding hypothesis. 
For | iot initially recalled, final recall 
in thc cue condition was higher for FD 
than { D items (.10, .05);#(722) = 2.70, 
p< hus, the effect of differential 
encoc manifested itself in pure form. 
Lists were recalled initially showed 
higher Anal recall than did lists that were 
Not initially recalled, F (1, 722) = 29.20, 
b < .001. In addition, the differential re- 


trieval hypothesis predicts that FD items 
should benefit more from initial recall than 
should ND items. The corresponding 
Initial Recall X Delay interaction was sig- 
nificant, F (1, 722) = 4.53, p < .05, and 
showed that initial recall was most bene- 
ficial for FD items. As shown in Figure 2, 
the advantage of FDs was larger in the 
precue than postcue condition but present 
in both among items that had been re- 
called initially. The present investigation 
thus provided evidence of both differential 
encoding and differential retrieval. 

The results are also relevant to a third 
Possible interpretation of the final-recall 
advantage of FD items. It might be 
argued that the subtraction task was not 
demanding enough to completely eliminate 
rehearsal, and that the higher final recall 
of FD items was produced by this addi- 
tional rehearsal, This interpretation would 
Predict higher recall of FD items even for 
lists that were postcued and not recalled 
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Ficure 2. Probability of initial and final free 
recall a function of serial position, cue, delay 
and initial recall condition in Experiment 11. 


initially; neither differential encoding nor 
differential retrieval would predict an ef- 
fect of delay for these lists. The results 
showed that final-recall performance was 
nearly identical for FD and ND items 
(.09, .10) in the postcue condition that 
had not been tested initially. Thus, there 
was no evidence that rehearsal during sub- 
traction aided final recall. 


GENERAL DISCUSSION 


Among the most widely observed effects in 
studies of memory is the dependence of recall 
probability on study opportunity (e.g., Cooper 
& Pantle, 1967). One possible explanation 
of this effect is that rehearsal frequency de- 
termines the probability of delayed recall or, 
in the context of a 2-store theory, the transfer 
of information to LTS. While the term 
"rehearsal" has been defined in a variety of 
ways, it is used in this discussion to denote 
Ss’ overt or covert repetition of items. 

Memory theorists holding widely divergent 
orientations (e.g., Atkinson & Shiffrin, 1968; 
Underwood, 1972) have agreed concerning 
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the central role played by rehearsal frequency. 
The rehearsal frequency interpretation also 
has the advantage of parsimony and has been 
applied to a wide variety of phenomena. 
Despite its wide acceptance, there is no firm 
evidence of a causative relationship of re- 
hearsal frequency to delayed-recall probability. 
The most convincing evidence has come from 
experiments that have required overt re- 
hearsal and have related rehearsal frequency 
to recall probability (Atkinson & Shiffrin, 
1971; Rundus, 1971; Rundus & Atkinson, 
1970). It should be recognized, however, that 
overt rehearsal only renders observable Ss’ 
vocalization of study items; rehearsal might 
be accompanied by considerably more com- 
plex processing that is not observable. The 
relationship of overt rehearsal frequency and 
recall probability that has been found might 
be due to the correlation of both factors with 
underlying coding processes. 

Recent evidence suggests that explanations 
of long-term memory effects in terms of re- 
hearsal frequency are insufficient. Several 
studies (Jacoby, 1973; Jacoby & Bartz, 1972; 
Meunier et al., 1972) have found no effect of 
variations in rehearsal opportunity or fre- 
quency of overt rehearsal in long-term mem- 
ory performance. Moreover, the present in- 
vestigation found differences in final recall 
for conditions that were equated with regard 
to rehearsal opportunity during study; final 
recall in the precue condition was higher for 
lists followed by FD rather than ND. It 
might be argued that the subtraction task 
employed did not eliminate rehearsal so that 
the FD condition had additional opportunity 
for rehearsal. However, further analyses 
revealed that rehearsal was either absent 
during delays or did not aid final recall. 
These results can be accounted [or if re- 
hearsal, defined as repetition of items, is 
assumed to be only one of several types of 
processing available to Ss. Rehearsal may be 
sufficient to maintain items for immediate 
recall while additional processing is required 
to aid performance on a long-term memory 
test (Craik & Lockhart, 1972; Jacoby, 1973). 

The trace resulting from the encoding of 
an item can be conceptualized as a set of 
attributes, with the level of processing de- 
termining the particular attributes included 
in the set (Craik & Lockhart, 1972). Pro- 
cessing in addition to rehearsal will lead to 
the inclusion of attributes that are more 
resistant to loss. If an item is studied under 
the expectations of immediate recall, its 
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memory trace will contain fewer ıq less 
permanent attributes than if it tudied 
with the anticipation of delayed re More 
permanent attributes are encode luding 
organization of list items, when de! recall 
is anticipated. Consistent with t! eding 
statement are Ss’ descriptions o! study 
activities. When asked after th iment, 
several Ss in the precue conditi ported 
making up sentences, stories, rental 
images involving list items when : D was 
anticipated, and merely repeating items 
when they expected an immediate iL test. 
The nature of delay preceding rec: is not 
predictable in the postcue conditio The Ss 
in that condition may have eith: lopted 


an intermediate level of processin for all 


lists or vacillated between rehear- items 
and attempts to organize them. hould 
be noted that the relating of processi to Ss’ 
expectations and cognitive states is n> a new 
development, and gains support fron older 
literature. Müller (1911, pp. 11- listed 
a number of variables that he fourm to in- 
fluence study strategy; the time into val be- 
tween study and test was among thor» listed. 

Data from the present experime.‘s also 
provided support for the differential \trieval 
hypothesis. The final-recall effects o! initial 
recall can be interpreted within the :iemory 


attribute framework. Attributes used as re- 
trieval cues experience an increase in cue 
effectiveness as a consequence, The use of 
short-lived attributes requires less effort so 
that they are employed as retrieval cues 
when there is a choice; immediate recall 
primarily uses short-lived attributes such as 
the acoustic trace of an item. If an FD 
precedes recall, short-lived cues will not be 
available and initial recall will be based on 
more durable attributes. Later recall will 
benefit from initial recall only to the extent 
that both can use the same attributes as cues. 
Initial recall after an FD was more advan- 
tageous, since the attributes employed for 
retrieval were more likely to be also available 
at the time of final recall, than were those 
used in the ND condition. It would be 
predicted that the effectiveness of initial recall 
could be even further enhanced by increasing 
the duration of the FD interval. 

Recall differences observed in the present 
experiments cannot be explained in terms of 
amount of study opportunity or rehearsal 
frequency. Rather, recall performance ap- 
pears to be a function of the attributes that 
are accessible at the time of test. Accessi- 
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s bilit wn depends on 2 factors: (a) Dif- Jacosy, L. L., & Bartz, W. H. Rehearsal and 
foren ncoding during study determines transfer to LTM. Journal of Verbal Learning 
which © iributes are used originally to form and. Verbal Behavior, 1972, 11, 561-565. 
the | ! an item, and (b) the cue effective- ere 3 = MIROR Pr Paral bs 

E i 3 F S tials, and component processes in free recall, 

Ecos ributes d enhanced by their prior Journal of Experimental Psychology, 1970, 85 

use < ieval cues. Long-term recall per- 374-382. ef 1 

form will be aided by an earlier recall to Meunier, G. F., Rivz, D., & MEUNIER, J. A. 

the that the same attributes can be Rehearsal of individual items in short-term 

used 'trieval cues on both tests. memory, Journal of Experimental Psychology, 
1972, 95, 465-467. 
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VARIED FUNCTIONS OF PUNISHMENT IN DIFFERENTI 
INSTRUMENTAL CONDITIONING ! 


GEORGE T. TAYLOR * 


Slate University of New York College at Polsdam 


A simple discrimination setting—differential instrumental conditioning— 


was used to assess the functions of punishment in discrimination learning in a 
2 X 2 X 3 factorial design, plus 2 control groups, with 168 rats. The vari- 
ables manipulated were type of stimulus (tone or shock), presentation of the 
trials (forced-free or forced), and alley of stimulus presentation (postive 


stimulus, negative stimulus, or both). 
g., as (a) an aversive stimulus, (b) a distinctive, 


serve a variety of functions, 


The results indicated that shock can 


informative stimulus, (c) a secondary reinforcer, and (d) perhaps most inter- 


estingly, an alerting stimulus. 


These findin 


are interpreted as suggesting 


that the role of punishment in differential instrumental conditioning is complex. 


A response-produced aversive stimulus, 
ie. punishment, does not always reduce 
the probability of a response. A notable 
example is a study reported by Muenzinger 
(1934) which demonstrated that, in a dis- 
crimination setting, shocking rats in the 
presence of the positive stimulus (S+) may 
facilitate learning the discrimination. This 
finding, the shock-right phenomenon, has 
generated a large body of data and theo- 
retical speculations which relate to the 
more comprehensive properties of aversive 
stimulation, 

Muenzinger (1934; Muenzinger & Wood, 
1935) interpreted his data as implicating an 
effect of punishment which was general in 
nature, The function of noxious stimuli 
was suggested to be that of alerting the 
animal so as to make him respond 
more readily to the significant cues in the 
learning situation. The general alert- 
ing mechanism was proposed to be re- 
vealed behaviorally as vicarious trial-and- 
error (VTE) activity (Muenzinger, Bern- 
stone, & Richards, 1938). 


1 This article is based upon a dissertation sub- 
mitted to the University of New Mexico in partial 
fulfillment of the requirements for the PhD degree, 
The author would especially like to thank Frank A, 
| who served as the committee chairman. 
earch was performed while the author was a 
National Institute of Mental Health trainee and 
was supported in part by National Science Founda- 
tion Grant GB-12954 to Frank A. Logan. 

* Requests for reprints should be sent to George 
T. Taylor, Psychology Department, State University 
of New York College at Potsdam, Potsdam, New 
York 13676. 


Later, Wischner (1947) questic: id 
generality of these results and su! 
conclusions, 
Muenzinger (1934; Muenzinger al., 
1938; Muenzinger & Wood, 1935) «sing 


the 
quent 
especially with respo t to 


only correction procedures in wh the 
animal must retrace his incorrect responses. 
Using the noncorrection procedure, Wisch- 
ner found that punishing the correct re- 
sponse appeared to only slightly fa :litate 
discrimination efficacy relative to a non- 


shocked group, although the differences 
between the 2 conditions were smali and 
statistically unreliable. However, though 
the difference was not as great as that 
found earlier by Muenzinger (1934), Wisch- 
ner replicated the basic shock-right phe- 
nomenon. Thus, even in the noncorrection 
paradigm, punishment in the presence of 
S+ facilitated the discrimination rather 
than the expected inhibitory effect. 

An alternative interpretation of these 
data emphasized the specific effects of 
punishment (cf. Fowler & Wischner, 1969). 
For example, the shock, by virtue of its 
association with the reward, served as à 
discriminative stimulus and, therefore, as à 
conditioned stimulus (Azrin & Holz, 1966). 
Extending the discriminative stimulus anal- 
ysis, Fowler and Wischner (1965) have 
suggested that shock for either correct oF 
incorrect responses served to differentiate 
the alleys more quickly and thus delimit the 
generalization between the 2 alternatives. 
As a result, these authors (Azrin & Holz; 
1966; Fowler & Wischner, 1969) have re- 
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jected general alerting analysis of 
J g 


Muen et al. (1938) and have adopted 
the p that the facilitative conse- 
quence shock-right training is based 
upon serving as a secondary rein- 
forcer or as a distinctive stimulus. 
Alti h these 2 interpretations are not 
mutu xclusive, and indeed both may be 
implic theorists, in the interest of 
parsi have been reluctant to accept 
both count for the shock-right data. 
Itis p. le that theresulting controversies 
(cf. I r & Wischner, 1969) stem from 
the c ve use of the simultaneous dis- 
crimi! ı paradigm. ‘This procedure has 
sever isadvantages for an analysis of 
the « : of shock. First, it enables the 
anima! o compare directly the discrimi- 
nand hus, if VTE, or some comparable 
gener echanism, contributes to the 
shock ıt phenomenon, all Ss had similar 
oppor!;.oties to make such comparisons. 
Second, end perhaps more importantly, the 
paradicin provides only one measure of 
performance, choice behavior. This mea- 
sure r ents the differences between the 
response strengths of the alternatives, but 


does not indicate directly which is being 
affected by the shock. 

Accordingly, the present experiment at- 
tempted to make a unique contribution by 
including a simpler discrimination setting— 
differential instrumental conditioning—in 
which the discriminable cues are presented 
singly. This paradigm, therefore, provides 
@ procedure which limits the opportunity 
to compare the stimuli directly, as well as 
Providing an individual measure of response 
Strength to each alternative. In addition, 
4 more neutral tone stimulus was given to 
Some Ss to allow an assessment of the 
Aversiveness of shock. 


METHOD 


Subjects. The Ss were 168 female albino rats ob- 
tained from a commercial supplier. The experi- 
Mentally naive Ss were 90-120 days of age at the 

eginning of experimentation and were housed indi- 
Vidually with water freely available. A daily ration 
Of 12 gm, of laboratory chow was given immediately 
alter each day's experimental session. The Ss, 
Tandomly assigned to 1 of 14 groups with 12 Ss in 
Eh. Were run in replications of 2 Ss per cell; each 

Teatment was represented in each replication. 
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Apparatus, The apparatus was a trio of parallel — 
alleys, each 2.44 m. long. The 2 outside alleys were 
black and the middle alley was white, thereby allow- 
ing spatiality of stimulus presentation as a non= 
relevant cue on choice trials. On forced trials, the 
gray start box was positioned directly in front of the 
appropriate black or white alley and the adjacent 
alleys were blocked from S's view. The free-choice 
trials, however, involved placing the start box in 
such a position that § could view both a black and 
a white alley, Thus, assurances were made so that 
S could view both diseriminanda only in the frec- 
choice setting. The apparatus and the start-box 
arrangement are described in detail by Logan (1960), 
Photocells were located 10 cm. from the start door, 
after the first 1.22 m., and directly before the 
food cup. 

Grid floor inserts completely covering .61 m. of the 
alley and through which shock could be delivered 
were placed in cach of the 3 alleys directly preceding 
the final .61 m. of the runway. A speaker connected 
to a tone generator and amplifier, located outside 
the apparatus, was positioned beneath the grid floor 
inserts. The start door opened vertically by a 
spring when released by a solenoid, Goal-box doors 
lowered manually enclosing the final ,61 m. of the 
alley. Reward consisted of 45-mg. Noyes pellets 
delivered by Davis feeders. 

Shock was delivered by 120-v, ac passing through 
a variac into a step-up transformer with 120-kf? 
resistance in series with S. The grid floor consisted 
of .6-cm. stainless steel rods .3 cm. apart, The 
shock stimulus was 60 v. at ,2-sec, duration, a value 
used frequently in the relevant literature (cf. 
Fowler & Wischner, 1969), The 1,000-Hz,, 60-db. 
tone was presented at .2 sec, 

Procedure, Pretraining began 1 wk, prior to the 
beginning of experimentation when 5s were handled 
and placed on 23-hr. food deprivation. ‘The Ss 
then received 5 d of exploration, goal-box place- 
ments, and magazine training in the 3 alleys. 

Brightness-discrimination training commenced 
with the introduction of differential instrumental 
conditioning. ‘The S received 9 pellets in the S+ 
alley, black for half of the Ss in each group and 
white for the remaining Ss, and 1 pellet reward in 
the other (negative stimulus, S—) alley. hs 
received a total of 8 trials/day for 20 days. For Ss 
in one set of conditions (forced-frec), the first 4 
trials of each 8-trial block were forced according to 
one of the following rotated sequences: + + — =, 
= — ++, +- — +,— + + (Logan, 1969). 
Trials 5 and 6 were free-choice trials and were 
randomly switched so that over a block of 8 choice 
trials, the positive alley appeared an equal number 
of times on the left and the right. Trials 7 and 8 were 
then forced opposite to those chosen on the free- 
choice trials (Logan, 1969) The Ss in the other set 
of conditions (forced) received only differential 
conditioning trials in each 8-trial block. The first 4 
trials were sequenced as above and the last 4 trials 
were opposite to the first trials’ sequence, 

A trial began by placing S into the start box and 


: 
^ 


so 


— 


releasing the start door. 
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When broken, the first 
photobeam measured starting times on forced trials 
and choice times on the free trials. When the second 
photobeam halfway down the alley was broken, S 
received either a shock or a tone, if appropriate. 
Thus, for example, a shock-right S received a 
shock only when in the S4- alley, A shock-both or 
tone-both S, however, was Presented with the 
appropriate cue on half of the trials in each alley, 
thereby equating the total number of stimulus 
presentations. These Ss received the appropriate 
stimulus on one positive and one negative trial of 
the first 4 trials, randomly sequenced from day to 
day, and then on the Opposite positive and negative 
trials of the second 4 trials of the day. The Ss were 
run in squads of 4, providing an intertrial interval 
of 3-4 min, 
Experimental design. A 2 X 2 X 3 factorial, plus 
2 control groups, was the experimental design em- 
ployed. The variables manipulated were type of 
stimulus (tone or shock), presentation of the trials 
(forced-free or forced), and alley of stimulus presen- 
tation (S+, SC, or S). The 2 control groups re- 
ceived no stimulus, one group was given forced—free 


trials and the other group was given only forced 
trials, 


RESULTS 


Since all Ss received 4 initial differential 
conditioning trials each day, 2 in the posi- 
tive alley and 2 in the negative alley, those 
trials formed the primary basis of com- 
parison. The start speeds served as the 
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function of the data ger 
setting and in part on theore 
The shock was administered i 
of the alley. The data dem 
creased reductions in alley 
shocked Ss neared the shock loc 
data are consistent with the hyy 
approach-avoidance gradients 
1944), i.e., the avoidance gradier 
increasingly steep, relative to the 
gradients, as S nears the shock loca 
à result, the alley speeds were coni 
by the presence of shock within th: 
making comparisons between shock 
tions and between shock and 


spi 


non 
conditions based on alley speeds dit} «ilt, 
at best. 
The goa! speeds were confounded beca se 
shock served as an intense motivation il 


variable (Spence, 
ran into the goal 


stimulus. 


capacities. Finally, 


1956) and shocked .'s 
h compartment unusually 
fast following Presentation of the aversive 


The start speeds were the 
confounded measure and could bette 
flect differential performance of Ss wi 


in the 
ounds. 
niddle 
ite in- 
is the 
lhese 
sized 
Hiller, 
mes 
ich 
As 
ded 
ley; 
idi- 
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i [ s since the start speed 
major dependent variable, in part as a of Ss with the black stimulus as the S- 
TABLE 1 

ABSOLUTE MEAN START SPREDS (ix KECIPROCALS) OF SHOCKED A 5 ON FFEREN 
CONDITIONING TRIALS FOR Positive (S+) AND NEGATIVE (S) ALT ERENTIAL 
2-day blocks | PO 
Group — ————— 
E 2 3 4 5 | 6 | 7 8 9 10 
Shock-right (FF) | sr - 
4 1.13 1.11 1.28 1.43 1.33 1.2 23 
ine REA qud t fe m MET E I a 
S bic te (F) j j Ms AT 
30 | L33 | 123 | 117 L25 | 129 | 
SE 102 | i1 i do tis RESI EUIS 7] 122 
Shock-wrong (FF) y Ux) d AU e pL A.U y- 
SES in 23 1.39 1.38 147 | 1.50 1.51 1.53 
Shock-wrong (E) 3 : POE 565-1 «so |39 3r Je as 
+ otha fers sae me Siem ptu zl 
s 35 : 45 35 | 150 1.37 141 33 
Shock-both (FF) Ens 0 85 85 | 83 |: 39 | .69 
ER » ee $1 105 | 112 | 138 | 124 | 127 
Shock-hoth (F) i : Nk a r ft 57 
S+ 467 87 91 106 | | 
nr : j OF | 4, i i 
3 T3 | 288 | 338 | 506 |-74 | Su vg e: 
Nole. Abbreviations: F = forced trials, FF < forced-free trials. —— T ^ 
£ 


A 


TABLE 2 


wisoLUTE MEAN START SPEEDS (IN RECIPROCALS) FOR Tone GROUPS ON DIFFERENTIAL 


CONDITIONING TRIALS FOR Positive (S+) AND NEGATIVE (S—) ALLEYS 


2-day blocks 
Group — — — — - 
2 3 4 5 6 7 8 9 10 

l'one-right (FF) | | 

S+ 84 1.42 1.60 | 1.58 1.69 1.69 L8 1.84 1.83 

s- 83 1.33 | 144 | 133 1.08 .95 1.02 .82 85 
Tone-right (E) | 

S+ 1.09 147 1.65 1.63 1.73 1.71 1.80 1.83 1.78 

S- | 14.12 1.43 1.38 1.33 1.02 .92 83 .88 .91 
Tone-wrong (FF) 

S+ 90 98 1.28 1.62 1.78 1.63 1.56 149 1.57 

S- 8 93 142 1.20 1.19 .90 ,65 61 .63 
"T'one- wrong (F) 

S+ .99 1.28 1.30 1.51 1.64 1.70 1.61 1.58 1.55 

S- | 298 1.19 1.08 1.09 .99 76 ,80 79 T 
Tone-both (FF) 

S+ 1.13 1.22 1.19 1.31 1.27 1.42 1.40 1.39 1.41 

S= 1.08 1.13 1.17 1.2 1.12 88 .82 28 77 
Tone-both (F) 

S+ .95 1.12 1.28 1.32 141 1.42 1.52 1.39 1.30 

sS- | 92 99 1.09 1.20 1.23 1.00 E 63 59 


Note. Abbreviations: F = forced trials, FF = forced-free trials. 


did not differ with their respective counter- 
balanced Ss with the white stimulus as the 
S+, those subgroups were combined for 
the subsequent analyses. 

Aloe start speeds, The start speeds to 
S+ and S— were averaged and appear in 
tabular form (Tables 1-3). These data 
suggest that only in the shock groups did 
the forced-free procedure produce superior 
differential performance relative to the 
forced condition. Two analyses of variance 
of the S+ and S— start speeds over all 
trials, with all groups included, yielded 
significant values for both alleys, F (13, 


154) — 2.25, p « .05, and F (13, 154) 
= 7,59, p < .01, respectively. 

A posteriori comparisons (Schefié method) 
indicated that the forced-free trained 
shock-wrong Ss ran more slowly in the S— 
alley than the other Ss. The other forced- 
free Ss receiving shock in the S— alley 
(shock-both) ran slower in that alley than 
the nonshocked Ss, with the notable excep- 
tion of Ss with an informative tone stimulus 
at that locus, i.e., tone-wrong. Therefore, 
shock in the presence of the negative 
stimulus had a significant inhibiting effect, 
with respect to start speeds, in the S— 


TABLE 3 
ABSOLUTE MEAN START SPEEDS (IN RECIPROCALS) FOR No-STIMULUS, CONTROL ANIMALS ON 
DIFFERENTIAL CONDITIONING Trias FOR Positive (S+) AND NEGATIVE (S—) ALLEYS 


2-day blocks 
Group - : 
2 3 4 Sae emis 8 9 10 
No stimulus (FF) 
S+ 80, 1.23 147 142 1.51 141 142 1.32 1.37 
S= 94 1.19 141 1.30 1.15 1.04 A .66 70 
No stimulus (F) 
S+ 1.08 1,29. 1.38 1.36 145 147 144 1.35 1.37 
S- 1.13 1.35 | 1.29 -98 .91 .80 78 61 10 


Was aC S WM EROS VS 


Nole. Abbreviations: F = forced trials, FF = forced-iree trials. 
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alley. Subsequent comparisons of the 
groups’ performance in the S+ alley indi- 
cated no 2 groups differed. One may note 
that there was the suggestion of differences; 
however, the error term was sufficiently 
large to yield no statistically reliable 
differences. 

Relative siart speeds. Because shocked .Ss 
ran slower than nonshocked Ss, relative 
Start speeds to the 2 discriminanda are 
more appropriate than the absolute speeds 
for statistical comparisons of discrimination 
performance. Two relative measures were 
employed. One such measure (Goldstein 
& Spence, 1963) is simply the difference 
between the S+ and S— start speeds. 
"These difference scores are plotted in 2-day 
blocks in Figure 1. A 2 x 4 factorial 
analysis of variance was performed on the 
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FIGURE 1. The difference between start s 
on the differential conditioning trials in the posi 
(S+) and negative (S—) alleys. 
condition is plotted 
and forced groups.) 
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difference scores of the shox id 
stimulus groups (Figure 1A—11D | type 
of training (forced-free or fo E 
alley of stimulus presentation S- 


S, or neither SÆ), as the mai 


The results indicated that both ining 
and stimulus variables signifi in- 
fluenced responding, / (1, 88) 1.13, 
P < .01, and F (3, 88) = 47.49, .01. 
A posteriori comparisons indicate; that 
each shock group trained under a ed- 
free procedure yielded greater diíic.-«ces 
between S+ and S— speeds than re- 
spective shock group receiving only forced 
trials. Also, shock-wrong groups lea: ned 


the discrimination more rapidly than their 

comparably trained shock-right, shock- 

both, and no-stimulus groups, which cid 
not differ. 

A separate 2X4 factorial analysis with 
the same factors was performed on thi 
data of the tone and no-stimulus groups. 
As suggested in Figure 1 D-1G, the different 
Stimulus presentation procedures had no 
statistically reliable effect upon the various 
groups’ performance, F (1,88) < 1. How- 
ever, the alley of stimulus presentatior 
variable exerted significant effects upo: 
discrimination efficiency, F (3, 88) - 
121.01, p < Of. posteriori analyses 
Showed that the performance of tonc 
groups with information (tone-wrong and 
tone-right) was Superior to the groups 
lacking an added informative cue (tone- 
both and no stimulus). 

Thus, the start Speeds indicate that inter- 
spersing free-choice trials into differential 
conditioning training produced greater dif- 
ferences between S+ and S— responding 
only for Ss presented with an aversive 
stimulus. Furthermore, a redundant, in- 
formative stimulus, either tone or shock, 
enhanced differential responding relative to 
à no-stimulus control condition. 

Correct response measure of speeds— 
within stimulus conditions. A second w vay of 
measuring differential performance, which 
may be less affected by absolute levels, is 
the method Suggested by Grice (1966). 
Successive blocks of differential condition- 
Ing trials (8 trials in the present experiment) 
were ranked from fastest to slowest and 
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cd with respect to the U statistic. 
erect response was defined as S run- 
aster in the S+ alley than in the 
illey. The percentage of correct re- 
ises for the different stimulus groups is 
ited in Figure 2. Again the figure 

:gests that the shocked groups differ with 

spect to number of correct responses on 

he discrimination under the 2 types of 
training. A 2 X 4 factorial analysis on 
percentage of correct responses by the 
shock groups and the no-stimulus controls 
(Figure 2A-2D) revealed both type of 
training and stimulus presentation to be 
significant variables, F (1, 88) = 24.19, 
p < Ol, and F (3,88) = 38.33, p < 01. 
A posteriori comparisons indicated that 
each forced-free shock group made more 
correct responses than its respective forced 
shock group. Also the shock-wrong groups 
performed more efficiently than the com- 
parably trained shock-right, shock-both, 
and no-stimulus groups. Finally, the 
shock-right and shock-both Ss made 
similar numbers of correct responses and 
both were superior to the similarly trained, 
forced-free no-stimulus Ss. 

Next, a 2 X 4 factorial analysis was 
applied to the percentage of correct re- 
sponses by the tone and no-stimulus Ss. 
As suggested in Figure 2D-2G, alley of 
stimulus presentation yielded a significant 
value, F (3, 88) = 28.70, p < .01, which 
subsequent a posteriori analyses revealed 
to be a function of the information groups 
(tone-right and tone-wrong) making more 
correct responses than the noninformation 
groups. In contrast to the speed differ- 
ences scores, the type of training was also 
a significant factor, F (1, 88) = 7.38, 
p < .01. However, a posteriori compari- 
sons indicated that there were no differences 
between forced-free trained Ss and their 
comparable but forced Ss. 

Therefore the graphical evidence that 
the type of training was a significant 
variable only for the shock groups and 
alley of stimulus presentation was à 
significant factor for both shock and non- 
shock conditions was confirmed statistically 
with both difference scores and the measure 
of correct responding. 


senses 


FIGURE 2. 
on the first 4 differential conditioning trials in blocks 
of 2 days (see text for explanation), (Each stimulus 
presentation group is shown in relation to its respec- 
tive training condition, either forced-free or forced.) 


The percentage of correct responses 


Correct response measure of speeds—be- 
tween stimulus conditions. Since compari- 
sons of shock, tone, and no-stimulus groups 
with comparable stimulus presentations 
were of interest, the correct response mea- 
sure was employed. Three 2 X 3 analyses 
of variance with days as a repeated factor 
were performed with a major interest in the 
Stimulus Presentation X Days interaction. 
The assumption was that a significant inter- 
action would suggest different learning rates 
by the Ss, a finding relevant to a number 
of predictions. 

Analysis of correct responses of the 
shock-right, tone-right, and no-stimulus 
control Ss yielded significant differences 
between the various stimulus conditions, 
F (2, 66) = 17.38, p < .01. The Days X 
Stimulus interaction was also statistically 
reliable, F (16, 528) = 4.25, p < .01. Sub- 
sequent a posterior comparisons between 
the groups indicated that tone-right Ss 
trained under both procedures and forced- 
free trained shock-right Ss performed more 
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efficiently than no-stimulus and forced 
shock-right Ss. Thus one may conclude 
that a stimulus in the correct alley facili- 
tates learning the discrimination, with re- 
spect to fewer errors and faster rates of 
learning. However, when shock is used as 
the cue, this conclusion is valid only if Ss 
are allowed free as well as forced trials, 

A 2 X 3 factorial analysis was then per- 
formed, again with days as a repeated 
factor, for Ss with the informative stimulus 
in the S— alley and no-stimulus Ss. The 
results indicated that both the stimulus 
variable and the Days X Stimulus inter- 
action yielded statistically reliable results, 
F (2, 66) = 48.20, p < .01, and F (16, 528) 
= 6.40, p < .01. A posteriori analyses be- 
tween the groups indicated that shock- 
wrong groups were statistically superior to 
tone-wrong groups which, in turn, were 
more efficient than no-stimulus groups. 
Therefore, shock in the incorrect alley 
greatly hastened learning the discrimina- 
tion, and a tone stimulus at that locus 
produced facilitation relative to the no- 
information control groups. 

A final 2 X 3 factorial analysis with days 
as the repeated measure was performed 
on the correct responses of the noninforma- 
tive groups (shock-both, tone-both, and 
no stimulus). Although no overall statis- 
tical differences were found between the 
various groups, F (2, 66) < 1, the Stimulus 
X Days interaction was a significant factor, 
F (16, 528) = 2.25, P < .0l. A posteriori 
comparisons between the various condi- 
tions revealed that shock-both 5s trained 
under the forced-free procedure made 
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FIGURE 3. The percentage of correct choices on 
the free-choice trials blocked over 2 days for the 
forced-free groups. 
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significantly more correct respo than 
the other noninformative groups here- 
fore these data suggest that an : iable 
aversive stimulus enhanced discri ition 
performance if Ss were allowed fr« well 
as forced trials. 

The general conclusion of the dire. com- 
parisons of shocked Ss with the NE- 
stimulus counterparts and no-stimul;/ con- 
trol 5s indicates that shock in the i: ect 
alley and in both alleys improve: the 
differential conditioning preforman: er 
tone-wrong and tone-both traini: ex 


spectively. This generalization i: l- 
sonant with the data only insofar as he 
training of the shocked Ss allows for fi e- 
choice trials. Finally, shock in the cor: t 
alley does not improve performance rela 

to tone-right training, though both to; 
right and shock-right facilitated perfor: 
ance relative to no stimulus. 

Choice trials. "The results of the choic 
measure are plotted in Figure 3 alongside 
the results of the differential conditioning 
trials. The figure suggests similar ordering 
of the groups with respect to percentage 
of correct choices and percentage of correci 
responses, with the possible exception tha 
the shock-right and Shock-both perform 
ance appears somewhat poorer on the frc 
choice trials. 

An analysis of variance on the percentage 
of correct choices yielded a significant valuc, 
F (6, 77) = 26.03, b «.01. Subsequent à 
posteriori analyses indicated that the shoc! 
wrong group made significantly more 
rect choices than the other groups. Also, 
the 2 information groups (tone-right and 
tone-wrong) performed more efficiently 
than the no-stimulus control Ss. Finally, 
the Shock-right and shock-both, tone 
both, and no-stimulus groups performed 
comparably on the free-choice trials. 

Summary. Most of the results of the 
present study are clear and highly reliable. 
First, interspersing some free-choice trials 
into a predominantly differential condition- 
ing procedure facilitated discrimination 
performance, but only for those conditions 
involving an aversive event. This general- 
ization may be restricted to the rela- 
tively simple black-white discrimination 


cor- 


Ep 


ie 


c 
" 


stimulus. 


VAR 


employed in the present investigation, but 
it is noteworthy. Second, providing an 
informative, albeit redundant cue in the 
negative alley facilitated discriminative 
behavior, and more so if that cue was 
aversive. Providing an informative, re- 
dundant cue in the positive alley was 
facilitative in the case of the neutral, tone 
However, with respect to the 
aversive, shock stimulus in the S+ alley, 
only with one measure, percentage of faster 
speeds in S+ than in S—, were the effects 
positive. There was no difference either 
in raw speed differences or choice behavior. 
Finally, a noninformative cue produced 
improved discrimination performance, at 
least in one measure, but only if that cue 
Was noxious. 


Discussion 


The results suggest that the role of punish- 
m differential instrumental conditioning 
"There is extensive evidence in the 


15 Ci 

data upport the notion that shock in a 
relativ simple discrimination setting can 
serve variety of functions, e.g., as (a) an 
aversive stimulus, (b) an informative stimulus, 


(c) a secondary reinforcer, and (d) an alerting 
stimulus. 

Consider first the aversiveness or negative 
incentive value of punishment. Logan (1969) 
has presented data which suggest that an 
animal is choosing continuously over time to 
determine the net incentive motivation value 
associated with an alternative, including not 
only the parameters of reward, but also the 
parameters of punishment. Thus, an animal 
weighs the negative and the positive incentive 
properties associated with the various response 
alternatives and responds on the basis of this 
net incentive value. 

This role was clearly evident in several as- 
pects of the present data. The shocked Ss 
generally ran more slowly than their non- 
shocked counterparts. Furthermore, the S— 
speeds of Ss receiving the aversive stimulus in 
the negative alley were reduced relative to the 
S— performance of Ss shocked in the positive 
alley. Also, differential performance developed 
more rapidly for shock-wrong Ss, i.e., Ss with 
the greater difference in net incentive between 
the 2 response alternatives, than for any other 
condition. Finally, a shock stimulus in the 
positive alley was not as beneficial as a neutral, 
tone stimulus, suggesting that the differential 
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in incentive between the alternatives was 
thereby reduced for the shock-right 5s. Ac- 
cordingly, this study, including differential 
conditioning, substantiates the earlier studies 
(e.g, Fowler & Wischner, 1965) employing 
simultaneous discrimination learning para- 
digms in indicating that one of the roles of 
shock is to contribute negative incentive selec- 
tively to the alternativein which it is presented. 

A second hypothesized role of shock is its 
distinctive stimulus or information value, more 
specifically its secondary reinforcing value in 
the shock-right procedure. The present data 
are also generally consistent with this interpre- 
tation. The finding that an informative tone 
facilitated discrimination performance is strong 
evidence that a redundant, predictive cue is 
beneficial. An informative shock, as a dis- 
tinctive stimulus, certainly could have served 
the same function. Also, that tone-wrong Ss 
were as slow in the S— alley as shocked Ss, 
with the exception of the shock-wrong condi- 
tion, is further evidence of the cue value of a 
stimulus at that locus. Finally, the fact that, 
at least in one measure, the shock-right pro- 
cedure was superior to the no-stimulus pro- 
cedure further attests to this role, in spite of 
the negative incentive properties of the shock. 

Interest then centers on whether there is yet 
a third role of shock in discrimination learning. 
Several authors (Muenzinger, 1934; Tolman, 
1948) have proposed that shock serves an 
alerting function in making the animal more 
sensitive to the stimuli in the experimental 
setting. The unique contribution of the 
present study is employing differential condi- 
tioning to examine the properties of shock. 
The data of most interest, therefore, are the 
speeds on the differential conditioning trials. 
When the start speeds were transformed to a 
measure of correct responding, i.e., faster 
speeds in the positive than in the negative 
alley, to provide for comparisons of shocked and 
nonshocked Ss, the data were highly suppor- 
tive of an alerting role of aversive stimulation. 

The finding that providing shocked Ss free- 
choice trials facilitated differential responding, 
while not aiding the nonshocked Ss, is con- 
sistent with a general alerting interpretation. 
Muenzinger et al. (1938) contend that shock 
produces increased amounts of VTE resulting 
in more efficient discrimination performance. 
Reducing the opportunity to VTE and com- 
paring the stimuli by giving only single pre- 
sentations of the stimuli should retard di 
crimination efficacy. specific effects analys 
e.g., distinctive stimulus properties (Fowler & 
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Wischner, 1969), would make no such differ- 
ential predictions with respect to training pro- 
cedures. The present data, implicating a 
comparison of stimuli consequence of shock, 
Support an alerting role of punishment such as 
that conceptualized by Muenzinger et al. 

An important prediction from an alerting 
analysis is that shock, by virtue of its sensi- 
tizing effects, facilitates learning rates relative 
to comparably trained but nonshocked animals. 
This prediction was upheld by the data of 
Shock-wrong and shock-both Ss who had been 
given free-choice trials, for those Ss learned 
the discrimination more rapidly than the 
similarly trained tone-wrong and tone-both 
groups. However, a shock or tone stimulus in 
the correct alley produced. comparable learning 
rates, One may contend that specific effects, 
i.e., inhibition as a function of aversiveness, of. 
the shock contributed to the shock-wrong 
tion and the lack of facilitation in the 

shock-right condition relative to their respec- 
tive tone groups. As a result the shock-both 
findings take cn even greater importance. 
Providing a nonspecific stimulus, Le, not 
associated reliably with either the positive or 
negative alternative, hastens the learning rates, 
but only if that stimulus was noxious, "Thus, 
these data also implicate a general alerting role 
of punishment in discrimination learning. 

Of several theoretical formulations of a 
general effect of aversive stimulation, none 
appear entirely satisfactory, Tolman (1948) 
mentioned, but only briefly, that shock pro- 
duces "looking around" by the animal to see 
what it was that had hit him. Muenzinger 
(1934; Muenzinger et al., 1938) proposed an 
alerting concept, but one which is totally de- 
pendent upon VTE behavior. Although the 
present results implicate a comparison factor, 
other investigators (e.g., Farlie, 1937; Wisch- 
ner, 1947), and the present Æ through in- 
formal observation, found no reliable rela- 
tionship between VTE and discrimination 
efficacy, A more realistic Proposal is that 
the anticipation of shock slows the animal 
(Logan & Wagner, 1965) allowing it to vacil- 
late at the choice point This vacillation need 
not be overt, as is VTE, but could bea changing 
covert perceptual orientation (Spence, 1936) 
which has an effect similar to that proposed 

earlier for VTE (Muenzinger et al., 1938), i.e., 
S's increased sensitivity to the discriminanda. 

In conclusion, the novel contribution of the 
present study involves examining the function 
of shock in a simple discrimination setting— 
differential instrumental conditioning. Anal- 
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encies to 
le in the 


ysis of the individual response : 
each stimulus, a measur not a: 


simultaneous discrimination par: 1, indi- 
cated that shock serves an aversi. e which 
reduces response strength to the p d alter- 
native, Furthermore, examinatio the be- 
havior of animals with an inforn e, but 
neutral tone stimulus, a compari« t made 
previously, indicated that such imulus 
enhances discrimination performa "The. 
implication is clear that the Shock, as . distinc- 
tive stimulus, is serving a similar ction. 
Finally, a comparison of simultan and 
single presentations of the stimuli, 1 not 
attempted previously, suggested a eneral 
alerting role of shock. Thus, the present ex- 


periment added several unique conditi ns to 

the usual shock-right paradigm and substan- 

tiated the earlier speculations that -hock 
can serve a variety of roles in discrimination 

learning. k 
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MEANINGFULNESS, PERCEPTUAL GROUPING, AND 
ORGANIZATION IN RECOGNITION MEMORY? 
HENRY C. ELLIS? FREDERICK J. PARENTE, anv E, CHANDLER SHUMATE 
University of New Mexico 

Two experiments examined the role of meaningfulness (m), perceptual 
grouping, and organizational factors in recognition memory of conscnant— 


vowel-consonant (CVC) trigram stimuli, 
recognition memory of trigram doublets of the integrated form Cvc« 
is a direct function of the m value of the 
doublets were fragmented by physically separating the letters accord: 
thus disturbing the initial unitary perceptual organiz 


grouping rules, 
of the stimuli, 


d 
by training in which the 


rbal 


The fractionated vs. integrated nature of 
stimuli is thought to be related to the ease 
with which they can be placed in memory. 
Highly integrated stimuli (e.g., stimuli 
high in meaningfulness, 7) may be en- 
coded quite rapidly as they are processed 
às a single unit, whereas fractionated 
stimuli (e.g., low-m stimuli) are encoded 
more slowly because the encoding con- 
sists of more than 1 component. (Shepard, 
1963). "This particular account of the un- 
derlying dimensions of m bears a strong 
resemblance to Miller's (1956) concept of 
"chunking," wherein units that may be 
integrated into larger chunks (and there- 
fore fewer discrete ones) are more easily 
processed. In a similar vein, variations in 
stimulus m may produce corresponding 
demands upon S's memory load, with the 
m value of stimuli being negatively corre- 
lated with memory load (Ellis, 1973). If 
low-m stimuli are regarded as more difficult 
to process because of their fragmentary, 
unintegrated nature, it should be possible 
to alter the m value of a unitary or high-m 
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emonstrated that the fragmented high-m stimuli operated as if th 
low-m stimuli in an associative learning task; their recognition was 


ependent stimulus 


Experiment I demonstrated that 
VC 
stimuli, 


The second exper i 


associated with representative 
encoding. In general, 
labels are increasingly important in 
are low in meaningfulness, percepi] 


stimulus, and thus the ease with which it 
can be encoded as a unit, by varying the 
perceptual grouping of a sequence of letters 
normally perceived and encoded as a unit 

The primary purpose of is 
is threefold : (a) to demon ive 
relationship between stimu coge 
nition, (b) to determine ect of 
stimulus m depends upo: ree of 
integration of the stimuli, « e with 
which they may be perce; rouped 
into a unit, and (c) to d if frag- 
mented high-m stimuli op is low-m 
stimuli in the sense that th ‘cognition 
is, to a great extent, depend upon cones 
textual cues present duri ssociative 
training. 

The first experiment ex 
of stimulus m on recognit 
both integrated 


amined the effect 
ion memory with 
and fractionated letter 
sequences. The recognition memory test 
provides a sensitive measure of the extent 
to which S has encoded the stimulus in a 
unitary fashion, because the test requires 
recognition of a target stimulus compared 
with a distractor item high in formal 
similarity to the target item. Although - 
Some studies (e.g., Ellis & Shumate, 1973) 
have demonstrated 4 positive relationship 
between stimulus m and recognition, nega- 
tive relationships have also been reported 
(e.g., McNulty, 1965) ; moreover, the rela- “4 


4 duce an effect upon stimulus rece 


M! Ne 
nown to depend upon the 
Ellis, Muller, & Tosti, 

e, it was deemed necessary 

s relationship for purposes 
;seline data. The experi- 
n was: Does the effect of 


- tionship als 
scale of m 
1966). Th 
to establis) 
of obtainii 
mental qu 


stimulus is defined by conventional 
norms) © ition memory depend upon 
the inte; fractionated character 
of the le juences? 

The experiment provided evi- 


dence f ow that fractionated high-m 


stimuli u fact, behave as low-m 
stimuli associative learning task. 
Severa! iments have shown that 
recogniti mory for visual form stimuli 
(Ellis & f }, 1971, Experiments I&II) 
and vert iaterials (Ellis & Shumate, 
1973; Eliis * Tatum, 1973) may be en- 
hanced esult of associative training. 
When stir. Ji are paired with “representa= 
tive" ve responses (labels), the stimuli 
are subs ently better recognized than 
when the re paired with “nonrepresenta- 
tive" reponses or are simply observed 
without associative training. Ellis (1973) 
has proposed. that the facilitative effect 
upon recognition memory for stimuli paired 
with representative responses is dependent 
upon the initial codability of the stimuli, 


with relatively greater facilitative effects 
occurring with less codable stimuli. ‘This 
proposition asserts that ambiguous, unin- 
tegrated, or complex stimuli are more de- 
pendent upon contextual events for their 
encoding than are familiar, integrated, or 
simple stimuli, Ellis and Shumate (1973), 
viewing m in terms of codability, with 
low-m stimuli being less codable than 
high-m stimuli, found that recognition 
memory facilitation for trigram stimuli 
paired with representative responses was 
dependent upon the 7 value of the stimuli, 
with reliable facilitation in recognition per- 
formance occurring with low-m stimuli, 
Whereas no differences obtained with high-» 
stimuli, 

The question of interest in th 
periment was whether associative training 
with the degraded high-m stimuli, using 


representative response terms, would pro- 
ognition 


e second ex- 
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memory similar to the findings obtained 
with low-m stimuli by Ellis and Shumate 
(1973). The experimental question was: 
Can a response which is representative of 
the stimulus provide, during associative 
training, cues that enable S to organize 
or integrate nominally fragmented stimuli? 
The assumption was that a representative 
response can effectively instruct S in how 
to organize or integrate a fragmented stimu- 
lus so that it is more readily encoded as 
a unit. 


PERIMENT I 


Method 


Experimental design. The Ss were 40 under- 
graduate student volunteers from the University 
of New Mexico. Each 5 received 4 study-recogni- 
tion test trials to learn a mixed list of 12 trigram 
doublets containing 3 levels of m. The Ss were 
randomly assigned to 1 of 2 list conditions in which 
the doublets were cither “integrated,” i.e. appeared 
as intact trigram pairs, or “fractionated” (degraded), 
ie, broken up into particular letter groupings 
other than the original trigrams. The Ss assigned 
to the fractionated stimulus condition were tested 
1 wk. later than those in the integrated stimulus 
condition, although both were drawn from the same 
population of 5: 

Stimuli. 


e integrated items were trigram 
doublets constructed by combining 2 trigrams 
selected from the Archer (1960) norms, For ex- 
ample, the trigrams BA? and cow were combined to 
form the doublet BANCOW. Doublets rather than 
trigrams were used in order to require S to encode 
more information and to allow for construction of 
different d „tors on each recognition trial in 
accord with a single construction rule. Four doub- 
lets of each of the m levels (high, moderate, and low) 
were used, with average association values of 1004 b 
56%, and 12%, respectively. The doublets used 
were BANCOW, RUMWIG, CUPNET, DENGIT, PUZWEK, 
5, CIMBUX, NEJWOQ, TIJFEQ, YIWFEP, 


condition, in which each appeared as perce] 
groupings of 1 or more lett 
high-m stimuli are more ¢ sily encoded and hence 
better recognized than low-m stimuli because they 
can be more readily processed as a unit, then 
fractionation of the doublets should make the 
high- and moderate-m items more difficult to encode. 
Accordingly, recognition performance differences 
typically ‘associated with variations in stimulus m 
should be either reduced or eliminated.) The 6 
letters of each doublet were grouped according to 
1 of 4 fractionation rules that simply regulated the 
number of letters in each group. For example, the 
4 high-m doublets, BANCOW, CUPNET, RUMWIG, and 
DENGIT, were presented in the following forms, eac h 
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adhering to a different fractionation rule: BA NC OW, 
C UPN ET, R UMWIG, and DE NGI T. The moderate- 
and low-m doublets were also fractionated according 
to the same grouping rules, and each stimulus main- 
tained the same form or grouping rule throughout 
the study-test. sequence, 

Procedure. All Ss were given 4 study-test trials 
to learn either list. The order of items was randomly 
varied on each trial, The doublets were presented 
consecutively on a Stowe memory drum at a 1-sec. 
Presentation rate. The Ss were instructed to 
remember as many items as possible. After viewing 
all 12 items, Ss were given a 2-alternative forced 
Choice recognition test in which they were shown 
each of the old items, singly, paired with a highly 
similar distraction item, Distractors were formally 
similar to the old items, with a distractor consisting 
of a 1-letter substitution of 1 of the 4 middle letters 
of the original doublet. A vowel was replaced by a 
vowel and a consonant was replaced by a consonant. 

For Ss learning integrated items, the old item 

BANCOW appeared, for example, with the distractor 
BENCOW on one trial, BAZCOW on another, and so on. 
As Ss were presented with each old-new pair, they 
were instructed to underline the stimulus they had 
seen during the study trial. A different distractor 

item was used on each of the 4 trials, and the right- 

left position of the pairs was randomly varied on each 
trial. For Ss learning fractionated items, the same 
construction rules were used ; however, the distractor 
items were fractionated according to the same group- 
ing structure as the old items, For example, BA NC 

OW was paired with BE Nc ow on 1 trial, BA ZC ow on 

another, and so on, 


Results 


Figure 1 is à plot of the proportion of 
Correct recognitions of the trigram doublets 
for the conditions of the experiment. The 
left panel shows recognition performance 
curves for the integrated stimuli, and the 
right. panel portrays the results for the 
fractionated stimuli. The left panel clearly 


g? T T T T T ales 
loo} "INTEGRATED" STIMULI "FRACTIONATED" Srivuli 
2 90 
$ 
E 80 
E 
S 
i WSS 
i; 50 
g MEANINGFULNESS 
g 40| —t— HIGH d 
© 30) —O— MODERATE | 
20 => Low 
1 1 fi ER 
1 2 3 4 L =e 3 4 
TRIALS TRIALS 


FIGURE 1, Recognition learning as a function of 


stimulus meaningfulness with integrated and frac- 
tionated*trigrams, 
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shows the differential effect: 
recognition learning, with hi 
producing superior recognition 
followed by moderate-m stimu! 
stimuli showing the poorest | 
In contrast, the right panel in 
m produces no differential 
recognition when the items are f: 
The mean proportions of corr: 
tions of the integrated items, avcr- d over 
4 trials for the high-, moderate-, a. low-m 
conditions, were .86, .81, and ^ re 


"mance; 
es that 
upon 
nated, 
cogni- 


spectively. In contrast, for the froction- 
ated items, the recognition scores à iged 
-62, .60, and .60, respectively. The inter- 


action between m and stimulus grow ping 
was reliable, F (2, 76) = 451,5 .025] 
which is the finding of principal interest. 
The effects of grouping (integrated vs. 
fractionated stimuli) was also highly veli- 
able, F (1, 38) — 41.03, p < .001, anc the 
overall effect of m was reliable, F (2, 76) 
= 6.86, p  .01. Separate analyses oí the 
effect of m for the integrated stimuli yielded 
a reliable difference, F (2, 38) = 11.44) 
P «X .001, whereas the effect of m y ith 
fractionated stimuli was not significant, 
F (2,38) « 1p $05 

The figure shows the obvious effect of 
trials, F (3, 114) = 85.00, p < .01. The 
Ss learned the integrated items faster than 7 
the fractionated items, as reflected in the 
Trials X Grouping interaction, F (3, 114) 
87.50, p < .01, and the m x Trials 
interaction was reliable, F (6, 228) = 5.00, 
P « .0t. 


EXPERIMENT I 


The results of Experiment 1 provide 
direct support for the view that the effect 
of stimulus m on recognition performance 
is dependent upon the structural properties 
of the trigram doublets. High-m stimuli, 
when fragmented or grouped so that their 
unitary property is less readily detected, are 
recognized at approximately the same level 
as moderate- and low-m fragmented stimuli. 
It follows that if the high-m stimuli have 
indeed become functionally equivalent to 
low-m stimuli, then they should operate 
as low-m stimuli under other experimental 
conditions. Since it is known that low- 


"stimuli most susceptible to response- 
depend sncoding (Ellis, 1973; Ellis & 
Shum 973), it would follow that de- 
graded fractionated high-m stimuli 
should w a similar effect. Ellis and 
Shum (1973) have shown that the 
format’) of an association between stimuli 
and r nses can affect the recognizability 
of sti: li, and that this facilitative effect is 
evidi only with low-m stimuli. This 
effect : ccurs with responses that are repre- 
sentei ve of the stimuli, but not with un- 
repr -ntative response: In general, the 
cond ions under which contextual cues aid 
in si mulus encoding are those with stimuli 
tha: ore initially difficult to encode (Ellis, 
197 Therefore, it would follow that if 
fragmented high-m stimuli are paired with 
representative responses during associative 
training, such training would facilitate 


stiraulus encoding compared with associa- 
tive training in which the responses were 
noarepresentative. 


Method 


rhe Ss were 70 undergraduate student volunteers 
from the University of New Mexico. Twenty Ss 
were assigned at random to each of 2 experimental 
conditions and 15 Ss to each of the remaining 2 
conditions, All Ss were given 8 paired-associate 
(PA) tr to learn a list of 8 stimulus-response 
pairs, The stimulus terms were either integrated 
or fragmented high-m trigram doublets and were 
identical for both conditions except, of course, for the 
groupings, (The Ss assigned to the integrated 
stimulus conditions were tested 12 wk. later than 
those in the fractionated stimulus conditions. 
Nevertheless, Ss in both conditions were drawn from 
the same population of introductory psychology 
classes. Furthermore, the integrated stimulus condi- 
tions represent a replication of the immediate recog- 
nition condition of Ellis and Shumate (1973), and 
since quite comparable recognition results were 
obtained in that and the present experiment, the 
comparability of the groups is reasonably assured.) 
For 1 group of Ss the doublets were each paired with 
a representative response, and for the other group 
each doublet was paired with a nonrepresentative 
response. Each stimulus was exposed for a 4-sec. 
period consisting of a 2-sec. anticipation interval 
and a 2-sec, simultaneous presentation with the 
response. The intertrial interval was 4 sec. The 
Ss were tested individually, and the materials were 
presented via a Stowe memory drum. The response 
terms were common words, equated for frequency 
of occurrence between groups in accordance with the 
'Thorndike-Lorge (1944) norms. Response repre- 
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sentativeness (or belongingness) was determined by 
agreement among 6 judges as to whether or not a 
particular word selected “resembled” its stimulus 
mate, Judges were asked to rate each response as 
being representative in the sense of being descrip- 
tive, suggestive, or relevant to the stimulus. A non- 
representative response was judged as bearing little, 
if any, relationship to its corresponding stimulus 
mate. These procedures are comparable to those 
employed by Ellis (1968) in studies of visual shape 
recognition. In the case of verbal stimuli, these 
operations basically mean selection of response 
terms that are formally and/or acoustically similar 
to the stimuli. Although the defining operations 
ol response representativeness clearly imply a multi- 
dimensional concept, all that is necessary for the 
present experimental purposes is the selection of 2 
classes of pairs for associative learning: 1 set of pairs 
that is relatively easy to associate and 1 that 
is relatively difficult. An example of pair types is 
M EDVA N associated with MAIDEN as a representa- 
tive response, and with UMPIRE as à nonrepresenta- 
tive response. 

After Ss received PA training they were given a 
forced choice stimulus recognition test that required 
them to select 1 of 5 highly similar stimuli as being 
the old item seen during training. The 5 stimuli 
(the old item and 4 distractors) were typed in a 
single row ona white 5 X 8in. card. The distractors 
were constructed as outlined in Experiment | and 
were grouped in the same manner as the old item 
with which they appeared. 


Results 


The primary data were the scores from 
the recognition test. Response representa- 
tiveness exerted no effect on stimulus 
recognition with integrated high-m stimuli ; 
mean recognition scores following either 
type of associative training were iden tical 
(.83). In contrast, with fractionated 
high-m stimuli, the recognition scores for 
the representative and nonrepresentative 
response conditions were 79 and .44, re- 
spectively. The expected Stimulus Struc- 
ture X Response Representativeness in- 
teraction was present, F (1, 66) = 14.75, 
p < .01. Clearly, context effects during 
associative training are present, but only 
with the fractionated stimuli, which operate 
as poorly integrated units and thus benefit 
from contextual responses which effec- 
tively instruct S in how to unitize or 
organize the letter sequences. 

The effects of stimulus structure (in- 
tegrated vs. fractionated) and response 
representativeness were both reliable, F (1, 
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66) = 27.66, p < .001, and F (1, 66) = 
12.60, p < .001, respectively, 


Discussion 


Experiment I provided baseline data 
demonstrating the effects of m on recogni- 
tion memory for trigram doublet stimuli. 
The results indicated a positive and sys- 
tematic relationship between the recogni- 
tion of a stimulus and its m value. This 
finding is consistent with those of Ellis 
and Shumate (1973) using similar stimuli 
and with experiments using random shape 
stimuli (e.g., Ellis, 1968; Price & Slive, 
1970). The first experiment provided em- 
pirical support for Shepard's (1963) hy- 
pothesis that m is highly correlated with 
the degree of analyzability of the stimuli. 
This hypothesis contends that low-» stimuli 
are likely to be analyzed into separate com- 
ponents, ie., they are likely to be pro- 
cessed or encoded into more than 1 unit, 
whereas high-m stimuli are likely to be 
processed in à more unitary manner. Frac- 
tionation of high- and moderate-m stimuli 
produced a substantial reduction in recogni- 
tion performance. 

A straightforward interpretation of these 
findings is that the fragmented high-m 
stimyli have become, in effect, functional 
low-m stimuli. This position is reasonable 
if the view that the underlying dimension 
of m is the degree to which a verbal stimu- 
lus may be encoded as a unit is adopted. 
Fractionating a ‘stimulus, particularly a 
high-m stimulus, makes-it more difficult 
for S to encode that stimulus as a unit and 
therefore effectively reduces its m value. 
Relating these findings to Miller's (1956) 
chunking notions, a further interpretation 
of the results seems quite simple and 
straightforward. High-m stimuli may be 
regarded as involving fewer chunks of in- 
formation than low-m stimuli, even though 
the number of letters in each stimulus 
class is identical. Thus, Ss learning low-m 
stimuli attempt to place more units or 
chunks of information into memory than Ss 
learning high-m lists. This differential 


_ memory loading, attributable to varying 


levels of m, is reflected in recognition 
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memory performance, wii 
high-m material performing | 
learning low-» material. ! 
high-» stimuli are fractionat: 
of chunks of information req: 
learning to take place is inc 
increasing S's memory load 
quently lowering recognition | 

The purpose of Experime:: 
demonstrate that when a high- 
is fractionated it does, in fact, ù 
a low-m stimulus under differen: 
mental conditions. The positive ri 
the experiment suggest, again, th 
m effect is a function of the integ 
fractionated nature of the stimulus. T 
consequence of fractionating a hi h 
stimulus is to make it more difficult fo; 
to perceive and encode that stimulus as 
unit. The function of a representative : 
sponse may be seen as one of reintegr 
or organizing the stimulus by sugge 
a possible unitary encoding, More g 
ally, this experiment suggests that 
cedures that enable or encourage < 
encode stimuli as units may be rego: 
as increasing the m value of that stin: 
By the same token, procedures !! 
courage Ss to encode stimuli in | 
their components may be seen as d 5 
the m value of that stimulus. Finily, the 
results of Experiment II support the gene 
eralization that contextual event» (such as 
responses) are most likely to affect the en- 
coding of stimuli that are initially difficult 
to encode. 
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Encoding processes were 
hibition (PI) paradigm 
lation of evaluative connotation (E) 
indicated significant PI buildup for 


Proactive inhibition release 


investigated using the release from proactive in 
with word triads derived from the 
and concreteness (C). The recall data 
all 
concrete vs. abstract triads while no differential PI for good vs. bad triads, 
was found for shifts on either 


factorial manipu- 


EXC conditions with less PI for 


E or C, or both, 


compared to their appropriate no-shift controls only when postshift items 


were abstract. For 
E only. The 
for concrete 
hypothesis, 


Considerable evidence has accumulated 
over the last decade for proactive inhibi- 
ton (PI) and PI release with Successive 
trials on a Brown-Peterson (Brown, 1958; 
Peterson & Peterson, 1959) short-term 
memory (STM) task, The use of PI 
release as a potential indicator of how 
information is encoded in STM has perhaps 
received its most cogent explication in a 
recent review by Wickens (1970). He 
proposed that words are encoded into 
memory along a number of. dimensions or 
attributes and identified a large number 
of these dimensions, via PI release, gauging 
their relative dominance in terms of per- 
centage of release, 

The concern of the present study was 
with 2 such dimensions, connotative evalu- 
ation (E) as defined by the semantic dif- 
ferential (Osgood, Suci, & Tannenbaum, 
1957) and concreteness (C). The fact that 
the former is employed as an encoding 
class in STM has been fairly well estab- 
lished (Turvey, 1968; Wickens & Clark, 
1968). Evidence suggesting such encoding 
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postshift abstract. items, 
produced better recall than shifts on C only, 


simultaneous shifts on E and C 
but not better than shifts on 


data were evaluated in terms of a dual encoding mechanism 
word items and Paivio and 


egg’s offsetting interference 

for the latter, however, has been lacking, 
Wickens and Engle (1970) inve stigated 
differential encoding by means of PI release 
for words differing on thisydimension 5:4 
although their results Suggested differe - 
tial interference for abstract vs. concrei » 
word triads, their negative release result ; 
raised doubts concerning differentia! 
encoding. 

The differential interference interpreta 
tion proposed by these authors, in terms 
of greater semantic overlap for abstrac: 
items, was recently challenged by Paivio 
and Begg (1971) who failed to find sup- 
port for such an interpretation when inter- 
ference was defined in terms of interiten) 
associative relatedness.  Paivio and Begg 
proposed instead that imagery is a source 
of resistance to interference among con- 
crete items based on the assumption that 
high-imagery words are encoded differently 
from abstract words. According to t! 
dual encoding theory (Paivio, 1971), the 
availability of imagery as an alternative 
or additional code for concrete items might 
enhance their recall relative to abstract 
items by offsetting the interference eff 
of a verbal code, the only code available 
for abstract items, Unfortunately, Paivio 
and Begg did not include a shift in triad 
class in their study in which release could 

evaluated. Thus, the Wickens and 
Engle (1970) failure to obtain release with 
this dimension has remained 


à question. 


:NOTATIVE MEANING AND CONCRETENESS IN PI RELEASE IN STM 


The sent study sought to explore 
furth phenomena of PI buildup over 
trials | PI release with concrete and 
abst: vord triads by also including the 
conn ive evaluative dimension. By 
usir ords rated on both E and C, 
shif ; one or both of these dimensions 
cou e evaluated. The inclusion of the 
E nsion would provide a reliable en- 
Cot class under either an abstract or 
con cte triad condition and shifts on this 
dis ion should result in PI release 
re ss of C level. This dimension was 
als chosen since it was not controlled in 
the Wickens and Engle (1970) study and 
t was the possibility that it may have 
i confounded somewhat with their 
1 


gery-concreteness manipulation. This 
possibility arose from the fact that these 

thors selected word stimuli from the 
Paivio, Yuille, and Madigan (1968) norms 

lowing imagery and C ratings to covary 
ince they tend to correlate highly (.83). 
Mowever, since Paivio et al. noted that 
words which differ on these 2 scales with 

iagery ratings being higher than C ratings 
tended to be words with "strong emotional 
or evaluative connotations,” failure to 
control for E may have resulted in some 
confounding, 

Fhe present study also investigated the 
release phenomenon as à function of the 
number of prior PI buildup trials, since 
shifts in triad class occurred after either 
4 or & prior interference trials. This vari- 
able was included since it was thought 
that perhaps recall performance had not 
reached asymptotic level prior to a shift 
in class in the Wickens and Engle study 
where only 3 preshift trials were given. 
This assumption appeared reasonable in the 
light of an earlier study by Wickens, Born, 
and Allen (1963) where significant release 
from PI was obtained with a shift from 
number-number-number to consonant- 
consonant-consonant trigrams relative to 
a consonant-consonant—consonant control 
after 9 prior PI buildup trials but not after 
3 or 6 such trials. 


METHOD 


Design and materials, The design included 4 basic 
triad classifications derived from the 2 X 2 factorial 


315 


combination of E (good vs. bad) and C (concrete 
vs. abstract). There were 4 shift conditions: 
shifts on E only, € only, both dimensions simul- 
taneously, or no shifts on either dimension. Within 
each of the shift conditions there were 8 groups, 
with 2 groups representing each of the 4 triad classi- 
fications. One of these 2 groups received a shift 
after a block of 4 prior PI buildup trials, while the 
other group received a shift after 8 such trials. This 
design defined the 32 groups that were required to in- 
vestigate the PI buildup and release phenomena. 

In order to obtain the words needed for each. 
of the 4 triad conditions, 2 groups of 30 intro- 
ductory social psychology Ss rated words on each 
of 2 scales, good-bad and concrete-abstract, since 
available norms contained few words with ratings 
on both these dimensions. A total of 160 nouns 
were selected from available norms of C and E for 
rating by these Ss, Each S received a 40-page 
booklet containing 80 of these critical words 
(20 from each E X C combination as subjectively 
determined by E) plus 80 filler words. Each of 
the 2 groups of 30 5s received a different set of 80 
critical words but the same 80 filler words. Each 
page of the booklet contained 4 words with 2 
7-point rating scales, concrete-abstract and good- 
bad, below each word with the former scale always 
located above the latter for each of the words 
rated. The booklets were constructed. such that 
2 critical and 2 filler words appeared in random. 
order on each page. Five random page orders 
were employed yielding 5 different ordered booklets 
for each of the 2 sets of words. 

Means and standard deviations were calculated 
from the ratings for each of the critical words and 
were used to select and place them into one of the 
following classes: good-concrete (GC), bad-con- 
crete (BC), good-abstract (GA), and bad-abstract 
(BA). Good was arbitrarily defined as a mean 
rating ranging 1.00-3.20 on the good-bad scale 
while bad had ratings ranging 4,80-7.00. Similarly, 
concrete was defined as mean ratings ranging 
1.00-3.20 on the concrete-abstract scale, while 
abstract words had mean ratings ranging 4.80-7.00. 
‘A total of 27 nouns were chosen from each EXC 
combination. The overall mean ratings of the 
nouns chosen for each condition were as follows: 
for GC, 2.09 and 1.63; GA, 2.13 and 5.72; BC, 5.73 
and 1.86; and BA, 5.81 and 5.41. The 27 nouns 
selected for each triad condition were also matched 
as closely as possible across classes for "Thorndike- 
Lorge (1944) frequency. In constructing word 
triads, care was taken such that, within a triad 
there (a) was minimum interitem association, (b) 
was variable frequency, (c) were different initial 
letters and variable number of syllables for the 
words, and (d) were variable activity and potency 
connotative values, as subjectively determined 
by E. 

Subjects and procedure. Eight Ss were tested 
in each of 32 groups giving a total of 256 5s. All 
were undergraduate volunteers enrolled during the 
summer term at Southern Illinois University at 
Carbondale. For each of the 32 conditions there 
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were 2 random presentation orders of the triads 
since complete counterbalancing would have re- 
quired too many groups. For each of the condi- 
tions, these orders were determined in the following 
manner. First, 2 random orders for each of the 
no-shift control groups were defined. The shifted 
groups then received the same triads in the same 
order as their preshift controls on preshift trials 
and the same triads in the same order as their 
postshift controls on the trial of shift in materials 
and trials following. Within each condition, 4 Ss 
received each random-order replication. The Ss 
were tested individually and randomly assigned to 
the 32 groups in the order they came into the 
lab, with the restriction that the » + 1.5 was not 
assigned to a group until the th S had been as- 
signed to every other group. 

Each word was mounted on a 2X2 in. slide 
and rear-projected by means of a Kodak Carousel 
Projector. After S was seated in the experimental 
room, the typical instructions for the Brown- 
Peterson (Brown, 1958; Peterson & Peterson, 1959) 
STM paradigm were read. Two 20-sec. practice 
periods of backward counting by 3s were then 
given prior to the start of the experiment to fa- 
miliarize S with the counting task. A trial began 
With a 2-sec. presentation of an asterisk as a signa] 
that the first word of a triad was about to be 
presented. Following this signal, the 3 words of 
the triad were presented individually for 1 sec. 
each with a Projector change time of about .75 sec, 
The S was instructed to read each word aloud as 
it was presented, Following the last word of the 
triad, a 3-digit number (from a random-number 
table) appeared on the screen and remained for 
a duration of 20 sec, During this time, S was 
required to count backwards aloud from this 
number by 3s paced by a metronome set at a 1-sec, 
rate. After 20 sec. of counting, a question mark 
appeared on the screen and remained for 8 sec, 
as a signal to S to orally recall the words of the 
previously shown triad, Immediately after the 
recall interval, an asterisk once again appeared 
on the screen, signaling the beginning of the next 


TABLE 1 


ANALYSIS OF VARIANCE SUMMARY FoR TRIAD 
CONDITIONS OVER THE FOUR INITIAL TRIALS; 
Proactive INHIBITION Bvirpup 


Source df F 
Between Ss | 
Evaluation (E) 1 5.35 
oncreteness (C) vc 22.33* 
EXC 1 1.22 
Error (M.S) 252 (87) 
Within Ss 
Trials (T) 3 131.40* 
EXT 3 3.37 
CXT 3 7.15* 
BOG GIC 3 2.94 
Error (M.S) 756 (.59) 


*p «.0t. 
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FIGURE 1. Percentage of recall as a fi 
trials for each of the 4 triad conditions, (( 
cluded are those that received no change 
sion through the first 8 trials, » = 40: 
inhibition buildup, Abbreviations: GC 
concrete, BC = bad-concrete, GA = good 
and BA = bad- abstract.) 


trial, All Ss received a total of 9 trial iis 
manner. The presentation sequence was co; 
by a Gerbrands tape timer, 

Scoring. The E recorded the recalled re 
as well as intru 
the data, S rece 
rectly recalled 
resulting in a Possible tota] of 3 points, 


REsULTS 


Proactive inhibition buildup. The initial 
analysis was a 2 X2 X4 mixed des n, 
with C (concrete or abstract), E (good or 
bad), and trials (1-4) as the repeated 
factor, The results of this analysis are 
given in Table 1, 

In terms of percentage of recall, abstract 
triads (59.3%) were significantly lower 
overall than concrete (68.3%), and per- 
formance, summed over all conditions, 
dropped from Trial 1 through Trial 4: 90%, 
3%, 5595. and 48%, respectively. The 
Significant C X Trials interaction indicated 


abstract and Concrete triad groups with 
Percentage of reca] dropping more rapidly, 
; for the former. 
Simple main effect trials analyses for each 
indicated significant PI 
buildup across trials for all 4 triad condi- 
following F ratios: GC, 
F (3,756) = 25.54, GA, F (3, 756) — 31.53, 
BC, F (3, 756) — 30.63, and BA, F (3, 756) 
= 53.56, all bs < .01. Further analysis 


dicated that the groups did not differ 
ach other on Trial 1. The percent 
Jl scores for the 4 triad conditions over 
first 8 trials for those groups not re- 
ing a shift in triad class on Trial 5 can 
seen in Figure 1. 

^ne additional analysis, a 2* between-Ss 

ign, which compared the performance 

| the 4 E X C triad conditions on Trial 4 

roups that received 4 PI buildup trials) 

ith that on Trial 8 (groups that received 

PI buildup trials) failed to yield any 
ivnificant effects at the .01 level and thus 
indicated no further Pl buildup with ad- 

itional trials. 

Proactive inhibition release. A 25 be- 
iween-Ss design was used to examine the 
release phenomenon. This design included 
the 2 levels of E and C prior to a shift 
in triad class (preshift), 2 levels of each 
dimension following a shift (postshift), and 
trial of shift (5 or 9). Since no main 
effect of trial of shift or any interactions 
with this factor were significant (all F 
ratios < 1), the data were collapsed over 
this factor and are given in Table 2. 

This analysis yielded 4 significant ef- 
fects: a postshift C main effect, F (1, 224) 
= 20.84, a Preshift X Postshift C inter- 
action, F (1, 224) = 11.60, a Preshift 
X Postshift E interaction, F (1, 224) 
— 35.45, and a triple Preshift X Postshift 
E X Postshift C interaction, F (1, 224) 
= 11.60, all ps < .01, with MS error 
= 14. The 2 double interactions can best 
be seen by reference to Tables 3 and 4. 


TABLE 2 


PERCENTAGE OF RECALL FOR EACH PRE- 
X PosrsmrT MANIPULATION 
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TABLE 3 
PERCENTAGE OF RECALL FOR PRESHIFT x 
POSTSHIFT CONCRETENESS: PROACTIVE 
INHIBITION RELEASE 


| Postshift 
Pret — |— = : 
| Concrete Abstract 
| t 
Concrete 68.3 64.0 
Abstract 77.1 48.0 
M total 72.7 56.1 


From Table 3 it can be seen that recall 
was substantially better with concrete than 
abstract postshift triads and that shifted 
groups exhibited better recall than their 
appropriate nonshifted controls. Simple 
main effect analyses comparing the groups 
shifted with their appropriate nonshifted 
controls revealed that only the groups 
shifted from concrete to abstract triads 
recalled significantly more than their post- 
shift controls, F (1, 224) = 9.50, p < 01, 
while groups shifted from abstract to con- 
crete did not differ from their nonshifted 
controls. Therefore, significant C shifts 
were restricted to postshift abstract triads. 

Reference to Table 4, the Preshift 
X Postshift E interaction, also shows su- 
perior performance for the shifted E groups 
compared to appropriate nonshifted con- 
trols. However, this interaction also in- 
teracted with postshift C as seen by 
referring to Table 2 where percent recall 
scores are given collapsed over preshift C. 
Analysis of the simple Preshift X Postshift 
E interaction at each level of postshift C 
indicated a significant interaction effect 
only when postshift triads were abstract, 
F(, 224) = 43.24, p Zaa lh should 


Postshift condition 
i F i TABLE 4 
Preshift. = 
condition: ess Bee PERCENTAGE OF RECALL FOR PRESHIFT x 
A MEE. pel] PostsuirT EVAL PROACTIVE 
Conerete | Abstract | Concrete Abstract ]NHIB ^ SE 
Good i EE 
Concrete 58.3 60.3 79.3 87.7 | Postshift 
Abstract 75.0 31.3 77.0 62.7 Preshift —— SS m 
M 66.7 | 458 | 781 | 75.2 EET du 
Bad á 3 
Concrete 73.0 66.7 62.7 41.3 Good 56.2 16.1 
Abstract. 79.3 75.0 71.0 | 25.0 Bad 73.2 51.5 
M 76.1 | 70.8 69.8 | 33.1 M total 64.1 74.1 
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perhaps be noted that the direction of the 
interaction for the postshift concrete triads, 
although not Significant, also indicated 
better recall for the groups shifted on E 
than those not so shifted. Furthermore, 
inspection of the Preshift x Postshift E 
interactions indicated similar differences 
between shifted groups and their respective 
postshift controls, i.e., relatively sym- 
metrical shift effects regardless of direction 
(from good to bad or vice versa). Con- 
Sonant with these observations, simple 
main effect analyses comparing groups 
shifted on E with appropriate postshift 
control groups indicated significant. shift 
effects from bad to good and good to bad 
with postshift abstract triads, F (1, 224) 
= 12.16, and F (1, 224) = 33.74, respec- 
tively, both PS < .01, but no significant 
effects in either direction. with postshift 
concrete triads, 

Another way to evaluate the triad shift 
data was to Partition the groups into 
4 different shift conditions (no shift, shift 
on E only, C only, and both E and G) 
and analyze recall on the 4 È X C post- 
shift triad types with the additional factor 
of trial of shift (5 or 9). In this anal s, 
the effects of shifts on a particular dimen- 
Sion Were isolated from shifts on other 
dimensions, whereas in the Previous anal- 
ysis, the effects of shifts on one dimension 
Were averaged over shifts and no shifts on 
the other dimension, 

Neither postshift E, trial of Shift, nor 
any interactions involving these factors 
Were significant, Significant effects were 
obtained for shift condition, F (3, 224) 
= 16.63, Postshift C, p (1, 224) = 20.84, 


TABLE 5 
PERCENTAGE OF RECALL rog THE Various Sur 
Conpitions UNDER Born I 5 OF Posrsmpr 
Concrr 


Shift condition 


0 shift 

ncreteness (C) only 
Evaluation (E) only 
Both C and E | 


Note, Data Averaged over Trials 5 and 9, 


and the Shift x Postshift C interact 
E (3.1994) — 4.16, all Ps < .01, with 7 


error — .74. Mean percentage of rec.) 
for this analysis, collapsed over postshif: & 
and trial of Shift, is given in Table 5. 

Simple shift conditions main effect an 
‘ses under e 


consistent with the previous analys 
Pair-wise comparisons of the various sl: 
conditions under postshift abstract yield. 
significant differences between each of 1 

shifted conditions and the no shift cond 
tion as well as à significant difference |; 
tween the double (E-C) shift conditio 
and the C only condition with an over; 
-05 alpha level; 


Discusston 


by the results of the Present study, The 
finding of greater PI accumulation over trials 
for abstract triads compared to concrete triads 
ikewise Consistent with available evidence 
è Wickens & Engle, 


1970), 
creteness and/or Imagery has been demon- 


ations Where this dimension has been 
manipulated (Paivio, 1971) and thus, would 
not be unexpected here, 

If PI release is defined in terms of the 
difference in recall performance between a 
group shifted on one or more dimensions and 
its postshift control, then reliable release was 
Cemonstrated for shifts on both E and Cras 
indicated by their respective Preshift X Pos 
shift interactions. 

Reliable release for shifts on E simply 
replicates the results of previous Studies 
(Turvey, 1968; Wickens & Clark, 1968) and 


E : UN 
Wickens and Engle (1970) implied, abstract 


and concrete words are not encoded differently 
and represent a dimension of relative unim- 
portance in an STM experiment, then release 
should have obtained with shifts om E irre- 
spective of the level of C. If instead, imaginal 
encoding offsets the interference effects of a 
verbal code (such as connotative evaluation), 
as has been suggested by Paivio and Begg 
(1971), then little release would be expected 
with shifts on E when the words are highly 
concrete contrasted with a large release with 
shifts on E when the words are highly ab- 
stract. The results of the present study are 
entirely consistent with this latter prediction. 

Furthermore, the present study also indi- 
caud reliable release for shifts on C (for 
cc-erete to abstract triad shifts) which 
Wictens and Engle (1970) failed to obtain. 
al reasons for such a discrepancy might 


Ve be given. The present study contained a 
ur number of procedural differences from that 
n of Wickens and Engle such as the amount 


of time given for recall (6 vs. 8 sec), re- 
questing Ss to recall triad items in serial 
order of presentation vs. free-recall instruc- 
tions, and the presentation of all 3 items of 
the triad at one time for 3 sec, vs. presenta- 
tion of each item of the triad separately for 
1 sec. each. One might argue that this last 
difference is crucial in the sense that more 
total time was available in the present ex- 
sent than in the Wickens and Engle 


3 tudy to encode the words of a triad prior 
; to the distractor task (due to projector change 
time), thus allowing more time for image 
formation. However, as Paivio (1971) has 


pointed out, evidence would seem to suggest 
that image latency to single words is in the 
order of .6 sec. which would tend to minimize 
the importance of such a distinction. 

A major difference between the studies was 
the control for E, This may be important 
in the sense that a reliable dimension (E) 
was available in the present experiment for 
encoding regardless of the C manipulation, 
whereas in the Wickens and Engle (1970) 
experiment, no consistent dimension other 
than C was available, 

When one considers the very nature of the 
C dimension, it becomes apparent that the 
situation with respect to shifts on this dimen- 
sion is considerably more complex than shifts 
on other dimensions. Concrete and abstract 
words may be considered alike in that both 
may be encoded verbally and, as such, there 
should be no differential encoding. However, 
provide for an additional en- 


concrete words 
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coding mechanism, imagery. This dual en- 
coding possibility for concrete words does not 
exist for other encoding dimensions investi- 
gated by Wickens (1970) and his colleagues 
and makes this dimension unique in that 
respect. Thus, when one shifts from abstract 
to concrete words, an additional encoding 
mechanism is made available to the one 
employed preshift. Performance on such a 
shift might reasonably be expected to im- 
prove relative to that preshift but not neces- 
sarily be significantly different from a control 
which always had the opportunity to employ 
a dual encoding mechanism. A shift from 
concrete to abstract (dual encoding to single 
encoding mechanism) should result in a de- 
cline in performance relative to that preshift 
due to the loss of an encoding mechanism, 
Performance on such a shift might not be 
expected to differ from an abstract control 
unless one of the following prevailed, First, 
if substantial interference buildup occurred 
for images in addition to such buildup for a 
verbal code, then a shift from concrete to 
abstract items would eliminate imaginal inter- 
ference resulting in PI release relative to an 
abstract control, Such a hypothesis seems 
unlikely since it would also predict greater 
PI buildup over trials for concrete vs, ab- 
stract triads, a situation which has been 
repeatedly shown to be the reverse. 

"The second. possibility, which would predict 
a difference in performance in favor of a group 
shifted from concrete to abstract relative to 
an abstract control, as well as superior per- 
formance preshift for concrete items, involves 
Paivio and Begg's (1971) offsetting inter- 
ference hypothesis. If the availability of an 
imaginal code preshift offsets the interference 
effects generated by a verbal code, then a 
shift from a dual encoding mechanism to à 
single encoding mechanism (verbal) should 
result in a performance decrement from that 
demonstrated preshift but to a level somewhat 
above an abstract control where interference 
buildup along a verbal code would have been 
great. This latter possibility would predict 
an empirical demonstration of PI release. 
The C shift results of the present study would 
appear consistent with this offsetting inter- 
ference hypothesis and Paivio's (1971) dual 
encoding theory for concrete words. 


REFERENCES 


Brown, J. Some tests of the decay theory 
of immediate memory. Quarterly Journal of Ex- 
perimental Psychology, 1958, 10, 12-21. 


e. 
320 


Oscoon, C. E., Suci, G. J., & TANNENBAUM, P. H. 
Measurement of meaning. Urbana: University 
of Illinois Press, 1957. 

Parvio, A. Imagery and verbal processes. 
York: Holt, Rinehart & Winston, 1971. 
PaArvio, A., & Brae, I. Imagery and associative 
overlap in STM. Journal of Experimental Psy- 

chology, 1971, 89, 40-45. 

Paivio, A., YUILLE, J. C., & MADIGAN, S. A. 
Concreteness, imagery, and meaningfulness val- 
ues for 925 nouns. Journal of Experimental 
Psychology Monograph Supplement, 1968, 76 
(1, Pt. 2). 

PETERSON, L. P., & PETERSON, M. J. Short-term 
retention of individual verbal items, Journal of 
Experimental Psychology, 1959, 58, 193—198. 

‘THORNDIKE, E. L., & LomcE, I. The teacher's 
wordbook of 30,000 words. New York: Columbia 
University, Teachers College, Bureau of Pub- 
lications, 1944. 


New 


GEORGE D. GOEDEL 


Turvey, M. T. Analysis of augmented re: 
short-term memory following a shift in co: 
tion. British Journal of Psychology, 1968, 
131-137. 

Wickens, D. D. Encoding categories of wor 
An empirical approach to meaning. Psycholog 
Review, 1970, 77, 1-15. 

Wickens, D. D., Born, D. G., & ALLEN, C 
Proactive inhibition and item similarity in sho 
term memory. Journal of Verbal Learming an 
Verbal Behavior, 1963, 2, 440—445. 

WickENs, D. D., & CLARK, S. Osgood dimensic 
as an encoding class in short-term memo 
Journal of Experimental Psychology, 1968, 
580-584. 

WickENs, D. D., & ENGLE, R. W. Imagery and 
abstractness in short-term memory. J^: of 
Experimental Psychology, 1970, 84, 268-27 


(Received June 29, 1973) 


(Continued from page 313) 


Hypothesis Behavior as a Function of Amount of Pretraining. Raymond M. White, Jr. (Department of 
. Psychology, Miami University, Oxford, Ohio 45056). 
The Locus of the Retention Differences Associated with Degree of Hierarchical Conceptual Structure. 


Benton J. Underwood (Department of Psychology, Northwestern University, Evanston, lllinois 
60201), John J. Shaughnessy, and Joel Zi 


yh hn 1 Zimmerman. z 3 >] 4 2 
The Effect of Number of Stimuli on the S ady-State Generalization Gradient: A Test of the Summation 
Hypothesis. Jus C. Malone, Jr. (Department of Psychology, University of Tennessee, Knoxville, 


Tennessi 
Effects of Evaluation Apprehension on Memory over Intervals of Varying Length. Russell G. Geen (De- 


Barnett of Psychology, 209 McAlester Hall, University of Missouri-Columbia, Columbia, Missouri 


Strength of Auditory S-R Compatibility as a Function of Task Complexity. 
of Psychiatry and Behavioral Sciences, University of Oklahoma Health 
Oklahoma City, Oklahoma 73190), Diane Klisz, and Oscar A. Parsons. 

Target Location Aftereffe s for Various Age Groups. Benjamin Wallace (Department of Psychology, 

T Western Illinois University, Macomb, lilinois 61455) and Scott P. Anstadt. 

Extended Mec on in Children's Paired-Associate Learning. Lewis P. Lipsitt (Walter S. Hunter Labora- 
tory of Psychology, Brown University, Providence, Rhode Island 02912) and Judith E, Sims-Knight. 


James R. Callan (Department 
ences Center, P.O. Box 26901, 


jurnal of Experimental Psycholo, 
1974, Vol. 102, No. 2, 321-323 T 


SHORT REPORTS 


PRONUNCIATION AND APPARENT FREQUENCY IN A 
BETWEEN-SUBJECTS DESIGN * 


LARRY WILDER, JOEL R. LEVIN, 


ELIZABETH S. GHATALA, AND SANDRA McNABB 
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College students were administered a list of middle-frequency words, in which 


individual words were presented 1-6 times. 
list, while the other half remained silent. 


Half of the Ss pronounced the 
On a subsequent frequency judg- 


ment task, pronunciation Ss failed to differ significantly from silent Ss on mean 
judgments of items presented only once; however, there were differences on 


these items as indexed by variability 


and mean total correct measures. 


Anal- 


ysis of judgments of sero items (not seen on the study list) indicated differences 


between silent and pronunciation Ss or 
effects tended to be eliminated on the 
results were contrasted with previous 


e quency theory perspective. 

Situational frequency, as described by Ekstrand, 
Wallace, and Underwood (1966), has been shown to. 
be a principal attribute in verbal discrimination 
learning (VDL) and recognition memory. In VDL, 
situational frequency is in part a function of re- 
hearsal of the correct response, which leads to an 
increased frequency differential between the correct 
and incorrect members in a pair. In recognition 
memory, frequency of exposure during the learning 
trial is a significant factor affecting performance 
(Underwood & Freund, 1970). 

Spoken rehearsal has been found to be superior 
to control (i.e., silent) performance in VDL (Wilder, 
1971). Frequency theory can account for thi 
finding if it is assumed that pronouncing during re- 
hearsal increases the situational frequency differ- 
ential between the correct and incorrect item. 
Hopkins, Boylan, and Lincoln (1972) tested this 
assumption in a frequency judgment task. In one 
experiment, they found no difference in mean fre- 
quency judgments between Ss who pronounced all 
words on the study trial and Ss who did not pro- 
nounce (Experiment 3). However, when this 
manipulation was made within Ss; pronunciation 
did increase apparent frequency (Experiment 4). 
That is, when Ss pronounced some words but not 
others in the same list, they judged the pronounced 
words to have a higher frequency of occurrence. 
"These results led the authors to conclude that pro- 
nunciation influenced (increased) apparent fre- 
quency only “f § has also had experience with 
silently studied items in the same experimental con- 


ta 
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items presented more than once, 


n all 3 measures, while the pronunciation 


These 
findings and explained within a fre- 


text [Hopkins et al., 1972, p. 1127," and at the 
same time permitted a frequency explanation of the 
facilitative effect of pronouncing the correct response 
in VD. 
However, Underwood (personal communication, 
November 1971) has justly pointed out that an 
absolule mean increase in the situational frequency 
associated with the "correct" response in VDL, or 
with previously seen items in recognition memory, is 
not the only way in which an increased. frequency 
differential can occur. For example, if. pronun 
tion Ss exhibit lower variability than silent Ss on list 
items with the same situa ional frequency, then an 
equivalent increased. frequency differential among 
items with differen! situational frequencies would be 
the result. Thus, while Hopkins et al. (1972) found 
no mean differences in an independent groups design, 
it is possible that pronunciation increases item dis- 
criminability by reducing the v; ce of mean fre- 
quency judgments for items of the same situational 
frequency. In other words, while apparent fre- 
quency units may not be added as a function of 
pronouncing the items on a study list, more con- 
sistent frequency estimates of items with the same 
situational frequency would be expected to produc 
the same net result in VDL and recognition memory 
tasks. 
Accordingly, the present experiment was intended. 
to assess the effects of pronunciation on frequency 
judgment variability in a between-Ss design. In 
addition, a measure found to be correlated with, but 
more sensitive than, within-S variability—total 
number of correct frequency judgments (Ghatala & 
Levin, in press)—was considered. ‘The experiment 
itself was similar to Hopkins et al. (1972), Experi- 
ment 3; however, in addition to just study list 
items, a number of filler items were presented during 
the second (test) list. It was predicted that if 
pronunciation decreases variability on items pre- 
viously seen, this would result in increased dis- 
criminability between these items and items not 
presented, as in recognition memory. 
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Method. A total of 42 upper division University 
of Wisconsin communication arts students partici- 
pated in the experiment for partial course credit. 

The stimuli consisted of 2 lists of middle-frequency 
(10-20 on the general Thorndike-Lorge, 1944, list) 
2-syllable nouns. The study list contained 90 words. 
Fifty of the words were presented once; 25 were 
presented twice; 10 were presented 3 times; 3 were 
presented 4 times; one was presented 5 times; and 
one was presented 6 times. In all, there were 153 
presentations, "The list was divided so that each 
word occurred equally often in equal divisions of the 
list (e.g., for the one word that occurred 6 times, 
the list was divided into sixths). The second list 
was the test list, which included all of the 90 words 
from the study list plus 10 additional words that 
had not been presented on the study list. 

The first list was presented on a memory drum at 
a 2-sec. rate, Two groups of Ss were used. A pro- 
nouncing group was told : 


This is an experiment on word memory. You 
will be shown a long list of words, Some words 
will occur once, while others will occur twice on 
up to six times, Your task is to look at each 
word carefully and pronounce it. The is too 
long to actually count and remember all of the 
words, so just try to get an impression of the 
number of times each word has occurred. 


A silent group was given the same instructions, but 
they were told only to look at each word carefully, 
A test list was presented once on a memory drum 
at a 3,5-sec. rate to both groups. All of the Ss 
reported verbally how many times they had seen 
the word before, "They were told that they could 
report zero for any item that they had not seen 
previously, 

Results. A summary of Ss’ frequency judgment 
performance on the one and zero items, as well as the 
average number of correct frequency judgments 
(out of 100), is presented in Table 1 with the 


TABLE t 


SUMMARY MEASURES AND Owr-TAILED Sh 
ABILITIES CORRESPOND NCIATION 


^ Pronunci- 
Measure Control ation t (40) | $ 
dad z- ee 
One items 
M 1.090 47 | A 
Variance -603 456 | <.05 
Zero items 
M MB ALS 3.63 «.001 
Variance .326 452 -2.31 «025 
PE TT, I - i 
All items 
45.19 $5.40 | 318 | «.005 
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corresponding statistics based on pronunciation- 
control differences. Since the predictions derived 
from frequency theory were all quite explicit (i.e., 
pronunciation should result in larger means and 
smaller variability on one items, in both smaller 
means and variability on zero items, and in a 
greater number of correct frequency judgments), all 
comparisons made were directional and thus the 
significance probabilities reported in Table 1 are 
1-tailed. 

Mean apparent frequencies were computed for 
each S by averaging his frequency judgments on 
the 50 one items, The same was done for each S on 
the 10 zero items. The mean in Table 1, therefore, 
represents the average mean across Ss in the same 
condition. As may be seen, the difference between 
the pronunciation and control conditions for mean 
judgments of one items was in the wrong direction 
and consequently nonsignificant, which  corre- 
sponds to the Hopkins et al. (1972) result. Mean 
judgments for zero items did result in significant 
differences between the 2 conditions, however. 

Within-S variances were obtained on one and zero 
items by computing each S’s mean squared deviation 
(about his apparent frequency mean). The entries 
in Table 1 represent the average of these within-S 
variances within each condition, Pronunciation 
resulted in lower average within-S variances, par- 
ticularly on the zero items. 

The total correct measure was computed by 
counting the total number of times S’s frequency 
judgment corresponded exactly to the appropriate 
actual frequency. It can be seen that pronunciation 
5s averaged about 10 more correct responses than 
did control Ss (this effect was found to hold for both 
one and zero items when analyzed separately). 

Analysis of means and variances of items presented 
more than once detected only one other statistically 
significant pronunciation-control difference: lower 
within-5 variances for pronunciation Ss on the four 
items, ¢ (40) = 2.12, p < .05. 

Discussion. These results are consistent with 
those of Hopkins et al. (1972) in that there was 
no significant difference between pronunciation ard 
silent 5s on mean frequency judgments, Howe: 
going beyond mean frequency judgments, significant 
differences between silent and spoken performance 
were detect lin a between-Ss design, There was 
less variability within Ss, as well as a greater mean 
total correct, for Ss who pronounced the one items. 
F urther, the prediction that Ss who pronounced the 
items on the study list were better able to dis- 
"nate items they had not seen before was 
confirmed. 

From these results it would appear that Hopkins 
et al. (1972) were premature in discounting the 
effects of pronunciation in a between-Ss design, and 
that frequency theory can account for these results 
if it is assumed that reduced variability contributes 
to improved frequency discriminations. Also, it is 
not surprising that these effects tend to be elimi- 
nated when items are presented more than once, 
since tkstrand et al. (1966, p. 575) have suggested, 
according to Weber's law, that adding frequency 


e 


$ 


pL 
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‘its to items with greater situational frequency 
ould be less noticeable. 
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This study attempted to discriminate between 2 theories about strategies Ss 
en trying to remember a linear ordering. 
d as points along an imaginary spatial 
End-term anchoring theory suggests that serial orderings are 
A scaling analysis of proportions correct 


use wh 


that Ss represent the items to be learne 


continuum, 
Jearned from the end points inward 


suggested that neither theory alone is sufficient, The Ss appe 
ordering information along 2 dimensions, the first representing the ¢ 


between items on the imaginary continuum and the second representing the 
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Rating e theory argues 


ently represented 
tance 


distance of the items from the center of the ordering. 


In an attempt to clarify the encoding strategies 
Ss use when trying to remember a linear ordering, 
Potts (1972) had Ss learn a pa graph that presented 
an ordering of 4 terms along à meaningful dimension. 
Such a 4-term ordering (A>B>C> D) can be 
broken into 6 pairs of terms, 3 of which (the "ad- 
jacent" pairs, A > B, B »C,andC > D) represent 
relations between adjacent elements in the ordering 
and are therefore essential to the establishment of 
thatordering. The remaining 3 pairs (the “remote” 
pairs, A > C, B > D, and A > D) are redundant 
in the sense that they can be deduced from some 
subset of the adjacent pairs. Potts found that 
responses to the remote pairs were both faster and 
more accurate than responses to the adjacent pairs. 
This was true even when the remote pairs were never 
presented and thus had to be deduced from the 
adjacent pairs, Th data were shown to be 
inconsistent with several possible theories regarding 
the form in which such information is stored, Two 
theories were proposed that could account for this 
result. 

The first of these theorie 
locher's (1968) st tial-image 
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an extension of Hutten- 
; model, argues that 
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ssent the items to be learned as points along 
an imaginary spatial continuum and will be referred 
to herein as rating-scale theory. In such a model, 
forgetting is represented by variability in the place- 
ment of the items on this scale. The Ss incorrectly 
identify a particular pair whenever the variability 
between the 2 terms comp ng that pair is sufi- 
ciently large to permit the 2 terms to cross € ch 
other on the le, This model would predict 
proportion correct on a pair to be à monotonic 
increasing function of the distance between the 2 
terms comprising that p: ir. Since the terms of a 
remote pair wovld have to be placed farther apart 
on the scale than the terms of any adjacent pair 
necessary to deduce it, the theory would necessarily 
predict better performance on the remote pairs than 
on the adjacent pairs. 

An alternative explanation of Potts’ (1972) result 
is provided by research (DeSoto & Bosley, 1962; 
Feigenbaum & Simon, 1962; Wishner, Shipley, 
& Hurvich, 1957) suggesting that serial lists are 
learned from the end points inward, with the first 
and last terms serving à 'anchors" for the other 
terms in the list. If one accepts the argument that 
there is less variability in the identification of the 
ordinality of end terms, it follows that performance 
on the remote pairs should surpass performance on 
the adjacent pairs since, on the average, the remote 
pairs contained more end terms than did the adjacent 
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pairs. In what follows, this will be referred to as an 
end-term. anchoring theory. 

A straightforward test of these 2 alternatives 
would involve examining proportion correct as a 
function of distance, with number of end terms held 
constant. If proportion correct still increased as a 
function of distance under this condition, we would 
conclude that end-term anchoring was not im- 
portant. If, on the other hand, holding the number 
of end terms constant eliminated the effect, the 
importance of end terms would be affirmed. Un- 
fortunately, with the 4-term orderings employed 
by Potts (1972), the effects of distance and number 
of end terms were completely confounded. The 
present experiment examined proportions correct 
on 6-term orderings in an attempt to unconfound 
these 2 factors. 


METHOD 


Subjects. The Ss were 40 Indiana University 
undergraduates who participated to fulfill a course 


requirement. Each S served in a single 30-min. 
session. 

Apparatus. The experiment was conducted in 
the Mathematical Psychology Laboratory at 


Indiana University, Stimuli were presented on an 
{1-in. Sony video monitor that was controlled by 
an Applied Digital Data Systems MRD-280 video 
character generator. True-false responses were 
obtained using a 2-button response box located 
immediately below each video monitor. Each 
button was appropriately labeled with the word 
true or false. Stimulus presentation and response 
collection were controlled by an IBM 1800 process- 
control computer. 

Materials, Stimuli consisted of 9 English para- 

graphs presented one at a time on the video monitor. 
Each paragraph represented a linear ordering among 
6 items along a unique dimension. The syntax for 
all 9 paragraphs was identical and presented the 
intended ordering as 5 pair-wise relations. Several 
researchers (Clark, 1969; Huttenlocher, 1968; 
Jones, 1970; G. R. Potts and K. W. Scholz, manu- 
Script in preparation) have shown that differences 
in the linguistic markedness of certain oppositional 
adjective pairs affect the speed with which sentences 
using these adjectives are processed. In order to 
avoid any unintentional confounding by lexical 
marking, the 5 relational terms used in each para- 
graph were identical. In order to reduce variability 
in the case of paragraph comprehension, the surface 
structure of each paragraph was held constant. 
Each of the 9 paragraphs had the structure, “Mary 
is smarter than Jim, and Jim is smarter than Betty. 
Betty is smarter than Tom. Tom is smarter than 
Alice, and Alice is smarter than Bob." If the 
capital letters A-F are used to represent the 6 items, 
and the symbol > used to represent the relation 
then the surface structure of each paragraph can be 
represented as "A » B and B >C. CD 
D >E and E >F.” 

The 6-term orderings used in the present experi- 
ment permit the testing of 15 distinct pairs of 
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terms. Each test statement consisted o single 
sentence such as “Jim is smarter than Ali For 
each true test sentence of the form “ ter 


than B," there was a corresponding false nce 
of the form “B is smarter than A" that ited 
the same 2 terms in reverse order. In this A 
total of 30 statement types was obtained. Eacl 
statement type was tested 3 times in a session. Th 
90 test statements were randomly assigned to the 9 
paragraphs with the constraint that 5 true and 5 
false statements followed each paragraph. 
Procedure. From 1 to 4 Ss participated in each 
experimental session. Prior to the beginning of the 
session, Ss were given a brief training trial to 
acquaint them with the apparatus and procedure, 
This training trial consisted of presenting Ss a 
sample paragraph followed by 2 sample test ques- 
tions which they were required to answer. 
Following this training trial, the first of 


the 9 


test paragraphs was presented on the viewing 
screen. The paragraph remained in view for 90 se 
or until all Ss indicated (by a button press) that 


they could remember the information in the para- 
graph. The time required to study each par: 
wi not specifically recorded, but 
revealed that this time rarely exceeded 60 sec. As 
soon as all Ss in a session indicated that they were 
ready, the screen was erased and the first test 
statement presented. The Ss were instructed to 
respond to the statement (by a button press) as 
quickly as possible. As soon as all Ss had responded, 
the test statement was erased, and the next state- 
ment was presented after a. 1-sec. intertrial interval. 
The procedure was identical for the 10 test state- 
ments following each of the 9 paragraphs. 


raph 


spot check 


RESULTS AND DISCUSSION 


Mean proportions correct for each of the test 
pairs, averaged over true and false sentences, are 
presented in Table 1. The entry in each cell 
represents the proportion correct on the pair made 
up of those terms indicated by the row and colum: 
headings. Each pair can be characterized a ! 
to the number of end terms it has (0, 1, 
its step size (1, 2, 3, 4, or 5), where the 
of a pair refers to the number of adjac irs 
necessary to deduce it. Clearly, step € isa 
monotonic function of the distance separating th 
terms of the pairs, 

A 2-way analysis of variance (step size by true 
false) indicated a significant main effect for step 
size, F (4, 156) = 9.84, p < -001, but not for true 
vs. false, F (1, 39) — 98, p = .09. Orthogonal 
polynomial trend analysis revealed a signific 
linear trend, F (1, 156) = 29.71, p < .001, for 
size. Higher order trends were not significant, 
observation that performance is better for more 
widely separated items is consistent with Potts' 
(1972) data and conforms to the predictions of the 
rating-scale model. However, the average number 
of end terms included in the test pairs increases 
monotonically as a function of step size. Thus, the 
apparent step size effect could also be explained by 


step 


Fhe 
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the assumption that end terms are more readily 
accessible. 

Similarly, average proportion correct is higher for 
pairs that include an end term than for pairs that 
do not, ¢ (39) = 5.04, p < .001. Though this effect 
is consistent with the predictions of the end-term 
anchoring theory, it is confounded with step size. 

In an attempt to distinguish between the 2 
models, proportion correct as a function of step 
size was examined, with number of end terms held 
constant, First, proportion correct as a function 
of step size was examined for those test pairs which 
did not include any end term (i.e., for the suborder- 
ing BCDE). The mean proportions correct for 
pairs with Step Sizes 1, 2, and 3 were .713, .758, 
and .771, respectively. This represents a significant 
linear trend, F (1, 78) — 3.84, p — .05, which 
accounted for almost all (89%) of the variance 
between the cell means of the step size variable. 
Thus, as predicted by the rating-scale theory, even 
with the number of end terms held constant at zero, 
proportion correct is still a monotonic increasing 
function of step siz 

Second, proportion correct as a function of step 
was examined for those test pairs that included 
exactly one end term. The mean proportions correct 
for pairs with Step Sizes 1, 2, 3, and 4 were .771, 
.850, .858, and .804, respectively. These proportions 
correct are clearly not a monotonic increasing func- 
tion of step size, The quadratic trend in this case 
was highly significant, F (1, 117) = 18.05, p « .001, 
and accounted for almost all (88%) of the variance 
between the cell means of the step size variable. 
The linear trend was not significant, F (1, 117) 
= 2,33, p = .13, and accounted for only 11% of 
the variance. 

The extent to which the present data are in- 
consistent with the predictions of a rating-scale 
theory can also be seen by examining the exact 
predictions of such a theory. Let the first through 
sixth terms in the ordering be represented by the 
numbers 1 through 6, respectively; let D(ij) repre- 
sent the spatial distance between the items 7 and j 
on the scale; and let P(1j) represent the proportion 
correct on the pair consisting of the items 7 and j. 
The basic prediction of a rating-scale theory is that 
P(ij) is a monotonic increasing function of D (ij). 
In other words, for any 7, j, m, and n, P (ij) = P(mn) 
if and only if D(ij) = D(mn), and P(ij) > P(mn) 
if and only if D(ij) > D(mn). 

If one examines the nature of a linear scale, it 
becomes apparent that D(ij) > D(mm) whenever 
iX m and j 2 s. (The only exception is the 
trivial case where both i = m and j = n, and this 
case can be ignored in the present analysis.) Thus, 
a rating-scale theory would predict that P(ij) 
> P(mn) whenever i € mand j > n. Examination 
of Table 1 indicates that this prediction is completely 
consistent with the data for those pairs that do not 
include either end term (i.e., pairs included in the 
subordering BCDE), but fails among those pairs 
that do include one of the end terms. Though 
performance is best on the pair A > F, which is 
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TABLE 1 
PROPORTIONS CORRECT ON 15 TEST PAIRS 
Pair A | B | Ç | D E | F 
A =k: 
B 771 = 
c 858 738 = 
D 842 «763 713 x 
E 783 771 -154 688 = 
F 871 825 875 842 TA =. 


the most distant, performance is better on the pairs 
A >C and A » D than on the pair A > E; and 
performance is better on the pair A > C than on 
the pair A > D. Similarly, performance is better 
on the pairs D >F and C >F than on the pair 
B»F. 

Thus, it appears that although a rating-scale 
theory can account for the proportions correct on 
pairs relating to the inner terms (BCDE), its 
predictions are incorrect with respect to the propor- 
tions correct on pairs which contain one of the 
end term In order to clarify the nature of the 
discrepancies, a multidimensional scaling analysis 
was performed on the proportions correct. In doing 
this, the scaling assumption that proportion correct 
is a monotonic increasing function of distance was 
accepted, but the dimensionality of the resulting 
cognitive space was viewed as an empirical question. 
The scaling solutions that follow were obtained 
using TORSCA, a FORTRAN IV program, de- 
scribed by Young and Torgerson (1967). 

The rating-scale theory predicts that the cognitive 
space is unidimensional, since it argues that Ss 
place the terms along a line. Thus, the unidimen- 
sional scaling solution based on proportions correct 
should reflect the true spatial distances between 
the items, and thus would have to result in those 
items being ordered correctly on the scale. To the 
extent that the unidimensional solution does not 
correspond well to the actual ordering of the terms, 
it would indicate that the unidimensional rating 
scale is not an accurate representation of the way 
Ss store or access the ordering. 

The unidimensional solution is shown in the top 
frame of Figure 1. It is clear that this solution 
does not accurately represent the true ordering of. 
the terms. The stress values for the 1-, 2-, and 3- 
dimensional solutions were .274, .027, and .001, 
respectively. The poorness of the unidimensional 
fit relative to the 2-dimensional fit is readily ap- 
parent, In order to investigate the possibility that 
the 1-dimensional solution shown in Figure 1 was 
the consequence of entrapment in a local minimum, 
stress was estimated for the ABCDEF ordering, 
assuming both equal spacing and spacing that 
emphasized the remoteness of the end terms. 
Using the numerical method described by Kruskal 
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c D B E A F 
ONE DIMENSIONAL SOLUTION 


WW 


ORDERING OF TERMS ON THE CRITICAL DIMENSION 
TWO DIMENSIONAL SOLUTION 


DISTANCE FROM CENTER OF THE ORDERING 


FiGURE1. The 1-dimensional (upper frame) and 2-dimensional 
(lower frame) scaling solutions for proportions correct. 


(1964), we discovered no set of distances that would 
yield a stress of less than .2, 

The problem with the unidimensional solution 
becomes apparent when one examines the best 
2-dimensional solution, shown in the bottom frame 
of Figure 1. A pair of easily interpretable orthog- 
onal axes is included in the figure. The projection 
of points onto one of these axes gives the actual 
linear ordering of the terms on the critical dimension 3 
the projection on the other axis gives the distance 
of each item from the center of the ordering. Itis 
clear that the end terms are very different from the 
inner terms (BDCE) of the ordering on both 
dimensions, On one dimension, however, these end 

terms are the items which are farthest apart; on 

the other dimension, they are closer to each other 
than they are to any other item, 

The clustering analysis presented along with the 
2-dimensional solution was obtained ng the 
diameter method. ‘The result again demonstrates 
the uniqueness of the end terms relative to the other 
terms in the ordering. Use of the connectedness 
method yielded minor differences in the clustering 
among the inner terms, but showed essentially the 
same effect. 

Our interpretation of the 2-dimensional 
solution is that one dimension represents the di: 
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between items on an imaginary scale, The ordering 
of the items along this dimension is consistent with à 
rating-scale theory. The other dimension represents 
the distance of the items from the center of the 
ordering. On this dimension, items more distant 
from the center of the ordering (hence nearest the 
ends) are grouped together. The grouping is 
consistent with the assumption of end-term anchor- 
ing theory that end terms are learned best. 

The conclusion drawn from the results is that 
neither a rating-scale theory nor an end-term 
anchoring theory is adequate in itself as an explana- 
tion of how Ss process linear orderings. The 
predictions each theory would make about perform- 
ance were described earlier. The results suggest 
that the theories be integrated. Apparently, the 
accuracy of S's attempt to identify the truth or 
falsity of a test pair is a function of the ease with 
which the relative position of each item can be 
identified (as in the rating-scale theory). In 
addition, position identification is facilitated if the 
item is an end term in the complete ordering (a 
in an end-term anchoring theory). These 2 com- 
ponents could reflect either differences in how the 
information is structured in memory, or differences 


in the strategies by which the information is re- 
necessary to 


trieved. Further work would be 
discriminate these 2 alternatives.? 


3 We note that there is a s 
task and various psychophy: Our task was 
designed to investigate memorial representation and retrieval 
of linearly ordered meaningful material, and as such we feel 
that an attempt to relate our findings to existing psychophysical 
models would be unwarranted and potentially misleading, 
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Two studies were conducted to e: 


samine the b; 
Experiment I, 5 groups of 20 Ss each learned an 8-item A-B list where the A 
and B terms were single alphabetical or numerical symbols. Learning w. 
to à criterion of either 0, 2, 4, 6, or 8 correct anticipations. Subsequently, 
were trained to an 8/8 criterion on the B-A list. 


ward-learning function. In 


Experiment II was designed 


to control for the required temporal delay between A-B and B-A learning 
in Experiment I. The data revealed a number of similarities between the A-B 
and B-A functions, but provided no support for the associative symmetry 
hypothesis or for one of its important variants, 


Systematic studies of the acquisition of ba 
ward associations are rare. Jantz and Underwood's 
(1958) study today has only heuristic value due 
to methodological difficulties resulting from their 
use of nonsense syllable-adjective pairs. In gen- 
eral, studies using these materials have not ad- 
dressed adequately the problem of stimulus-response 
availability. It could be argued that on the com- 
pletion of forward (A-B) learning an S would have 
learned B and the association between the A and 
B terms. But since the A term had not been 
specifically learned as such, and indeed S may not 
have attended to all of the nominal attributes of 
the A term, it would not be as available as the 
previously learned B. Several methods have been 
used to control for availability of stimulus-response 
terms, including prelearning and use of double- 
function s. However, the former method has 
the possibility of producing interitem connections 
which could greatly modify subsequent paired- 
associate learning and the latter results in lower 
rates of B-A learning than when other types of 
lists are learned. Consequently, a simpler solution 
to the problem of the unequal availability of A 
and B terms is to avoid using highly complex 
stimuli (e.g., nonsense ables) and to use instead 
very simple, highly available items such as single 
digits or letters of the alphabet (Ekstrand, 1966). 

A recent study (Baumeister & Campbell, 1971) 
attempted to avoid the design problems discussed 
above by using black and white drawings of highly 
familiar objects in order to equate their availability 
as both stimulus and response terms. Baumeister 
and Campbell observed an orderly relationship 
between the forward and backward associative 
functions, although the relationships were not 
identical for the normal and retarded children 
studied. In their first experiment, where the focus 
was the amount of B-A recall as a function of 
the degree of A-B learning, the number of A-B 
trials was the principal variable manipulated. In 
contrast, the proportion of the A-B list correctly 
anticipated in the present studies was varied in 
order to examine its influence upon the course of 
B-A learning. 

1 Requests for reprints should be sent to C. Michael Levy, 


Department of Psychology, Psychology Building, University 
of Florida, Gainesville, Florida 32605. 


EXPERIMENT I 


Method. The list were composed of 8 single 
digit and letter pairs selected to maximize item 
availability, The digits 2-9 were chosen, all of 


which rank in the upper quartile in association 
value for the numbers 0-100. The letters used 
were B, M, S, T, H, D, R, and F, which are among 
the 9 letters highest in meaningfulness. The 
materials were presented on a memory drum at 
a 1;1 rate to minimize intratrial rehearsal and to 
prevent Ss from "running through" the A-B se- 
quence during B-A learning (Ekstrand, 1966), 

During Stage 1, Ss engaged in forward anticipa- 
tion learning. Group 8 learned a list to a criterion 
of 8 out of 8 correct responses. Subjects in Groups 
6, 4, and 2 were taken to criteria of, respectively, 
6 out of 8, 4 out of 8, or 2 out of 8 correct responses, 
Group 0 was not presented with the A-B list, 
but proceeded di 

Stage 2 consi ackward anticipation 
learning. A 1-min, period intervened between 
es 1 and 2 during which the instructions were 
given for the second stage, All groups learned 
the B-A list to a criterion of 1 perfect errorless 
anticipation. 

List order was systematically varied so that half 
of the Ss in each group received digits as A terms 
and letters as B terms, while the reverse con- 
tingency was used for the remaining Ss, Four 
different orders of items occurred before the lists 
repeated, There was a 2-sec. interval between 
each trial. 

The Ss were 105 female undergraduates fulfilling 
psychology requirements, 20 being randomly as- 
signed to each of the 5 groups. Five Ss (1 each 
in Groups 0, 2, and 8, and 2 in Group 4) failed 
to reach criterion on either the A-B or B-A list 
within 30 t and were excused, being replaced 
by the next arriving Ss. 

Results. In an analysis of total trials to cri- 
terion, no main effect for group, F <1, or for 
order, F <1, was found. It appeared as if the 
groups learned the lists at the same rates overall 
and were thus comparable. There was, however, 
a highly significant difference between the total 
trials to criterion in Stages 1 and 2, F (1, 90) 
= 57.71, p <.001. Since Ss were required to 


327 


38^ 


learn to 1 of 3 possible criteria during A-B learn- 
ing but to a criterion of 1 errorless trial during 
backward learning, the average number of trials 
to reach criterion on A-B was suppressed relative 
to B-A. 

The degree of learning during Stage 1 had a 
reliable effect on performance during Stage 2, 
F (4, 90) = 42.15, p < .001. The mean number 
of trials required by the major groups to reach 
criterion in Stages 1 and 2 were, respectively, 
Group 2, 5.1 and 19.9; Group 4, 7.7 and 18.9; 
Group 6, 14.0 and 10.8; and Group 8, 17.9 and 7.9; 
Group 0 required 21.0 trials to learn the materials 
to 1 errorless trial. The inverse relationship be- 
tween trials to criterion required for Stages 1 and 2 
suggests that some B-A associations were being 
formed during A-B learning. 

The Stage X List Order (letter-numeral vs. 
numeral-letter) interaction was also significant, 
F (1, 90) = 6.3, p < .001. The Ss given letters 
as A terms and numerals as B terms performed 
proportionally better during A-B learning, requir- 
ing an average of 7.84 trials to criterion, than 
their counterparts who were presented numeral- 
letter combinations, requiring 9.98 trials, and 
proportionally worse during Stage 2 (16.86 vs. 
14,52 trials, respectively). This suggests that the 
letter-numeral combinations were easier than the 
reversed pairings, reflecting either greater absolute 
availability for the numerals than the alphabetical 
characters or a higher expected value for correct 
responses due to guessing, since the number of 
potential items in the pool of single digits is less 
than the number of potential letters. 

Prior to reaching the criterion necessary for a 
shift to Stage 2, Groups 4, 6, and 8 each passed 
one or more of the critical criteria. By combining 
appropriate groups during Stage 1, it was possible 
to determine the performance on the first post- 
criterion trial, a determination that was necessary 
for the evaluation of Trial 1 performance during 
B-A learning for those proceeding to Stage 2. 
"Thus, for example, the scores of Groups 6 and 8 
were averaged on the first trial after the criterion 
of 4 correct responses was achieved during Stage 1 
in order to evaluate the B-A strength of Group 4 
on the first B-A anticipation trial during Stage 2. 
At all comparable points there was a reliable 
decrement (p «.01) in performance averaging 
28.6% at Criterion 2, 25.6% at Criterion 4, and 
23.8% at Criterion 6 when Ss were shifted from 
A-B to B-A learning. When only the anticipa- 
tions correct on both the criterion trial and first 
posteriterion trial were examined, the correspond- 
ing decreases were 38.7%, 214%, and 24.0%, 
Thus, as Wollen, Allison, and Lowry (1969) have 
shown, the principle of associative symmetry was 
neither supported on an individual item-by-item 
basis nor in the more general case of the average 
number of items anticipated. 


EXPERIMENT II 


In Experiment I, a 1-min. delay between the crit- 
ical trial in Stage 1 and the first trial of Stage 2 was 
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necessary in order to change lists on the mc; 
drum. But the corresponding interval betwee: 
criterion trial and the first postcriterion tria 
the comparison groups continuing in Stage 1 | 
only 2 sec. Additionally, in the previous c 
parisons, there was a potential bias in favor « 
continuing A-B learning who were not subjc 
the loss of set. A second experiment was ther 
designed in which 3 groups were interrupted du 
Stage 1 learning after reaching a given criter 
and were engaged in conversation for 1 min, A 
quisition of Stage 1 was then resumed, 

Method. The materials were identical to those 
in Experiment I. 

The Ss were 31 female undergraduates ta! 
from the same pool and during the same acade: 
quarter as those who served in Experiment 
One S failed to reach the Stage 1 criterion wit 
30 trials and her data were voided. 

Three groups of 10 Ss each participated 
Stage 1 until a given criterion of learning w 
reached, after which they were engaged in co 
versation with E for 1 min. Training then resur 
for Stage 1. Group 4A was interrupted af 
making 2 correct responses and proceeded to 
criterion of 4 out of 8 correct anticipatic 
A second delay interval then occurred du: 
which E changed the lists on the memory dr 
and after which Stage 2 began. The procedi 
for Group 8A was similar except that a l-mi: 
interruption for conversation occurred after each 
of the first 3 criteria (2, 4, or 6 correct responses) 
were reached. The group was subsequently trained 
to a criterion of 1 perfect recitation (and give 
a final interruption) prior to commencing Stage 2. 
Groups 4A and 8A learned the B-A list to a cri- 
terion of 1 errorless trial. Group 8B was trained 
to a criterion of 1 errorless trial in Stage 1, and 
was then interrupted for 1 min. of conversation 
before receiving 2 additional trials of Stage 1 
learning. It received no training on Stage 2. 

Results. Prior to determining the effects of the 
l-min. interruption, the preinterruption perform 
ance of Groups 4A, 8A, and 8B was compar: 
to Groups 4, 6, and 8 used in Experiment I. 

6 groups entered one analysis of variance of th 
number of trials to reach criteria of 2 and 4 cor- 
rect anticipations in Stage 1. Groups 6, 8, 8A, 
and 8B were compared on the number of trials to 
eee ine and 6 e anticipations. Data 
ue. UR 5 an pe were used in a third 
Sta ers a 6, and 8 correct 
fects fell far short of si ie WURDE E 
; : „Significance in these analyses 
ADU ihe DIEI DOHOR A-B performance 
"e sateen n xperiments I and II were pooled 
ent comparisons. 
_ dn order to evaluate the effects of the 1-min 
interruption on subsequent A-B and B-A ere 
formance, 2 post hoc groups were formed. Forward- 
ME CDD 55, composed of Groups 4A, 8A, and 
die CON. l or more l-min. interruptions 
5 1 arning. Backward-interrupted Ss, 
including Groups 4, 6, and 8, had : 
interruption immediately prior to B 


their major 
-A learning. 
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After each learning criterion was attained, the 
number of correct responses was determined on 
the first postinterruption trial. Table 1 indicates 
that there was an inverse relationship between 
the Stage 1 criterion and postinterruption im- 
provement. The decrement at Criterion 8 and the 
increment at Criterion 2 in the number of correct 
anticipations after the interruption were highly 
reliable, yielding 2s of 3.67 and 2.98, respectively. 
When Ss were shifted to the first trial of Stage 2, 
their correct anticipations decreased by 2595-305 
relative to Ss who were given a 1-min. interruption 
before resuming Stage 1 learning. Except at Cri- 
terion 4, all of the decrements were statistically 
reliable. 

Table 1 also presents the mean number of antici- 
pations that were correct on both the criterion 
trial and the first postinterruption trial. Except 
for the limiting case at Criterion 8, there were 
fewer correct anticipations in this analysis than in 
the preceding one. But, as in the previous analysis, 
the first-trial performance on Stage 2 was signifi- 
cantly inferior to that after a 1-min. interruption 
for those continuing in Stage 1 in each comparison 
except Criterion 4. 

Discussion, Although the methodologies used in 
the present study and by Baumeister and Campbell 
(1971) were quite different, the data are strikingly 
similar, When Ss were given highly available 
materials, a considerable amount of backward 
learning occurred at each test criterion in forward 
learning. The A-B and B-A functions were ap- 
proximately mirror images. 

The present data, however, ofler strong con- 
traindications for the associative symmetry hy- 
pothesis (Asch & Ebenholtz, 1962) and for the 
revision of this hypothesis proposed by Wollen 
et al. (1969). 1f it may be assumed that the 
l-min. interruption used in Experiment Il con- 
trolled for the disruption of set and/or for the 
opportunities to forget in Ss shifted to Stage 2, 
then the associative symmetry hypothesis remains 
untenable. Regardless of whether the dependent 
variable was the average number of postcriterion 
anticipations or the average number of anticipa- 
tions correct on both criterion and postcriterion 
trials, performance on the B-A task was poorer 
than it was at the comparable points in A-B 
learning. 

It is equally important to note that, even with 
rapid rates of presentation used to minimize the 
opportunities for backward rehearsal, an appre- 
ciable amount of B-A learning occurred during the 
process of A-B learning. The backward associative 
strength, after correcting for the effects of the 
l-min. interruption, was estimated at 7005-7506 
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TABLE 1 
MEAN NUMBER CORRECT ANTICIPATIONS 


Criterion trial and first 


First postcriterion trial posteriterion trial 


Stage 1 
criterion | For- | Back- For- | Back- 
ward | ward 5 ward | ward $ 
inter- | inter- inter- | inter- |. ^ 
| rupted | rupted rupted | rupted 
2 1.9 1.6 9 F «0t 
4 2:9 2.8^ 2.2 ns 
6 ES 448 34 «05 
8 49 OAS 49 «01 


a The m = 10; otherwise, n = 20. 


of the strength of the originally learned materials, 
independent of the level of A-B learning. 

Obviously, some pairs were either learned bi- 
directionally or at least had achieved sufficient 
bidirectional associative strength to permit suc- 
cessful elicitation at the time of testing. These 
particular pairs were, in general, idiosyncratically 
determined. There was a tendency for the letter- 
numeral combinations to produce superior per- 
formance than the numeral-letter pairs, particu- 
larly during the initial trials of Stage 2, suggesting 
that (a) despite the high association value of the 
materials, the equivalence of availability of the 
letters and numerals was not achieved at a func- 
tional level, and/or (b) the expected chance value 
of correct responses due to guessing depended upon 
the class of responses required. 

Given the utility of the concept of backward 
associations as a mechanism for understanding 
many phenomenon in verbal learning (see Ekstrand, 
1966), and the similarities between the growth of 
forward and backward associations in the present 
experiments, future research should be directed 
toward investigations of the susceptibility of back- 
ward learning to the major variables in forward- 
learning situations, 
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OPERANT CONDITIONING OF VASOCONSTRICTION : 
A VERIFICATION! 


ROBERT M. STE 


N? axd RAYMOND P. PAVLOSKI 


Pennsylvania State University 


A verification of operant conditioning of unelicited vasoconstrictions was 
obtained for 2 separate sets of data. Significant differences were found between 
30 experimental Ss and 60 yoked control Ss, half of whom received reinforce- 
ment identical in time and number to a given experimental S and half of whom 
received the same number of reinforcements per minute but who were never 


reinforced when they were vasoconstric 
ficant group differences in explanations of the response reinforce- 


revealed sig 
ment contingencies. 


Several studies have reported success in operant 
conditioning of vasomotor responses in animals, but 
only 2 published papers (Kimmel & Kimmel, 1967; 
Snyder & Noble, 1968) have presented evidence for 
operant conditioning of vasoconstriction in human 
ind in the Kimmel and Kimmel study, relevant 
statistical tests are not provided. Because of the 
important theoretical and methodological issues 
involved and the possible application of these tech- 
niques to psychosomatic disorders, it was deemed 
desirable to repeat the Snyder and Noble study. In 
the 2 experiments to follow, attention was given 
to possible cognitive mediators and to the role of 
reinforcement for the so-called noncontingent con- 
trol groups (Stern, 1967). 


EXPERIMENT I 


Method. The Ss were 27 female and 18 male 
students from introductory psychology classes at 
Pennsylvania State University. Right index-finger 
blood volume was recorded using a Beckman photo- 
plethysmograph transducer, and respiration was 
recorded by means of a Parks Electronics silastic 
mercury strain gauge transducer. The reinforcing 
stimulus was a 28 v. pilot light, which was presented 
for 3 sec. 

The Ss were divided into 3 groups of 15 each; all 
were read the instructions used by Snyder and Noble 

(1968) except for the absence of arm electrodes and 
for the shortening of session length from 40 to 30 
min. (no extinction period). Basically, the instruc- 
tions told Ss to relax and not try to make any 
voluntary responses. After the instructions were 
read, a 5-min. baseline period was followed by a 
25-min. acquisition period, after which a. question- 
naire was administered. The criterion for a vasocon- 
striction was set independently for each § during 
the baseline period according to the procedure 
utilized by Snyder and Noble (1968). During this 
period, responses were recorded and reinforcement 
was omitted; the minimum amplitude response re- 
inforced during the acquisition period was the largest 
baseline period vasoconstriction free from respiratory 


1The second author was an NSF Undergraduate Re: 
Participant during the course of this study. gei 
? Requests for reprints should be sent to Robert M. Stern, 
Department, of Psychology, Pennsylvania State University 
University Park, Pennsylvania 16802, » 


g. Postsession questionnaire data 


and muscular artifacts, This means that during the 
baseline period each S made 1 criterion response 
except for the unusual case where S made 2 or mort 
"largest" responses of equal size. 

Only experimental (Group 1) Ss were reinforced 
for vasoconstricting during the conditioning period, 
Any vasoconstriction recorded within 15 sec. follow- 
ing a reinforcement "was not reinforced. he 30 
control Ss were divided into 2 groups: each S in 
Group 2 (truly yoked) received reinforcement at the 
same relative time as his experimental counterpart, 
while each S in Group 3 (partially yoked) received 
the same number of reinforcements per minute, 
but never when S was vasoconstricting. At the end 
of cach session, S was given a questionnaire to com- 
plete that contained the following queries: (a) What 
do you think made the light come on? and (5) If | 
told you that it was a response that you made that 
determined when the light came on, what do you 
think it wa An extinction period was not in- 
cluded in the sessions because of the interest in Ss’ 
ability to state the response reinforcement con- 
tigency. If Ss had in fact successfully tested hy- 
potheses about what provoked the reinforcement, an 
extinction period would have thoroughly confused 
them and invalidated the questionnaire data, 

Results and discussion. Figure 1 shows the mean 
number of artifact-free vasoconstrictions that met 
the amplitude criteria for successive 5-min. periods. 

The mean levels of responding for the 3 groups 
during the acquisition phase were 3.9, 1.8, and 0.9. 
Analysis of variance of the acquisition period data 
revealed significant group diflerences, F (2, 42) 
— 12.05, p « .001. Individual comparisons indi- 
cated that the level of responding of the experi- 
mental group was significantly greater than both 
control groups at each of the 5 time periods. The 
truly yoked group responded at a significantly 
higher level than the partially yoked group during 
Periods 4 and 5. Although no significant overall 
time effect was found, there was a significant Group 
X Time interaction, F (7, 144) = 2.89, p < .01. 
The latter is given with adjusted degrees of freedom 
(Huynh & Feldt, 1970). 

Results from the postsession questionnaire data 
were analyzed to see whether or not the different 
reinforcement contingencies created group differ- 
ences in regard to expectancy of reinforcement, The 
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FiGURE 1. Mean number of criterion vasoconstrictions of 
contingent and control groups, Experiment 1. 
majority of Ss in the contingent group thought that 
their nulating thoughts or bodily activity made 


the reinforcement light come on. Most Ss in both 
of the control groups, on the other hand, reported 
that the reinforcement was random or the con- 
tingency was unknown to them. The results of the 
questionnaire data, while not significant, were in 
the predicted direction, The truly yoked control 
group gave the expected high percentage of responses 
of contingency unknown. It is interesting to note 
that despite this reply their level of vasoconstriction 
increased considerably from Trial Block 2 to Trial 
Block 5 of the acquisition period, as shown in Figure 
1. Conceivably, this increase in sympathetic re- 
sponding may have been due to Ss persistently 
testing various hypotheses concerning the response 
reinforcement contingency, 


EXPERIMENT 11 


The vasomotor and subjective report data of 
Experiment I were thought to be sufficiently inter- 
esting to repeat the experiment. More specifically, 
the experimental group did make significantly more 
vasoconstrictions than the control groups, as Snyder 
and Noble (1968) reported, and the questionnaire 
data, as mentioned above, were in the predicted 
direction but insignificant. 

Method. The design as described above was 
repeated using 45 new Ss. 

Results and discussion. Figure 2 shows the mean 
number of artifact-free vasoconstrictions that met 
the amplitude criteria for successive 5-min. periods. 

The mean levels of responding for the 3 groups 
during the acquisition phase were 4.0, .9, and .1. 
Analysis of variance of the acquisition period data 
revealed significant group differences, F (2, 4) 
= 71.60, p < .001. Individual comparisons indi- 
cated once again that the level of responding of the 
experimental group was significantly greater than 
both control groups at each of the 5 time periods. 
There were no significant differences between the 
control groups except at Time Interval 4, where 
truly yoked Ss made more vasoconstrictions than 
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partially yoked Ss. Neither the overall time effect 
nor the Group X Time interaction were significant. 

The results of the postsession questionnaire data 
indicated that Ss in the contingent group tended to 
report that their stimulating thoughts or bodily 
activity made the light come on, whereas Ss in the 
2 control groups reported either that they did not 
know what the contingency was or that reinforce- 
ment was dependent on their relaxing. These group 
differences were found to be highly significant, 
x? (2) = 17.00, p < .001. 


CONCLUSIONS 


The 2 sets of data from the present experiments, 
collected under identical conditions, show successful 
modification of the rate of vasoconstrictions. This is 
in support of the earlier findings of Kimmel and 
Kimmel (1967) and Snyder and Noble (1968). 
Postsession questionnaire data revealed that most 
experimental Ss hypothesized correctly that the rein- 
forcing light was contingent upon their stimulating 
thoughts or bodily activity, whereas most control Ss 
either thought that it was contingent on their 
relaxing or they did not know. 

A rather curious finding which can be seen in 
both Figures 1 and 2 was that the performance of 
experimental groups reached almost the highest level 
during the very first 5-min. conditioning period. 
No ready explanation is available for this finding, 
even though it was observed in some of the earliest 
studies of operant conditioning of autonomic re- 
sponses (Fowler & Kimmel, 1962). 

Snyder and Noble (1968) reported a much higher 
level of responding than do the present authors 
this may be attributed to a methodological differ- 
ence, In the former study "any definite vasocon- 
striction not preceded by bodily movement .. , 
[p. 264]" was reinforced; only in the results were 
respiratory components and amplitude criteria taken 
into account. Of interest is the fact that in the 
present study, in which considerably fewer reinforce- 
ments were presented (since amplitude and re- 
spiratory criteria were considered during each 
session), results again point to successful modifica- 
tion of the rate of vasoconstriction. The mecha- 
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nisms by which this change comes about are not 
known at this time; however, an important aspect of 
these findings is that vasoconstriction, an autonomic 
response of great relevance to hypertension and 
other psychosomatic diseases, can be relative easily 
controlled and/or conditioned. 


REFERENCES 


FOWLER, R. L., & KrMMEL, H. D. Operant conditioning of the 
GSR. Journal of Experimental Psychology, 1962, 63, 563-567. 


Journal of Experimental Psychology 
1974, Vol. 102, No. 2, 332-334 


WHOLE-LIST RETENTION FOLLOWING WHOLE-PART LEARNING ! 
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After 6 trials of whole-list learning and 5 trials of part-list learning, Ss were P 
asked to reproduce (Experiment I) or relearn (Experiment II) the items from 

the wholelist. Experimental Ss, whose part list was taken from the whole list 

reproduced (Experiment I) and relearned (Experiment II) more whole-list 

items than control Ss, whose part list did not contain any items from the whole 

These results are incompatible with Tulving and Osler's 1967 theory of 

the reorganization of whole-list subjective units during part-list acquisition. 


list. 


Tulving and Osler (1967) have argued that the 
whole-part paradigm provides data that strongly 
support the organizational or interdependence 
hypothesis of free-recall learning. In this paradigm, 
Ss initially learn a list of words (whole list) under 
conditions of free recall. Then, experimental Ss 
receive further trials on a part list that is composed 
of a subset of the whole-list items, while control Ss 
learn a part list that does not contain items from 
the whole list. The critical finding is that experi- 
mental Ss exhibit less efficient learning of the part 
list than do control Ss. This counterintuitive find- 
ing, Tulving and Osler maintain, results from the 
fact that the formation of new subjective (S) units in 
the part list is hindered by the prior organization 
of the words in different S units in the whole list. 
Thus, during part-list learning, experimental Ss 
“must modify the units formed during whole list 
learning or else form completely new units [Tulving 
& Osler, p. 255." Ehrlich (1970) also favors the 
view that the organization of the whole list is de- 
structured during part-list learning in order that 


1 This research was supported by Grant A8266 
National Research Council of Canada to the Bret au ta ul 
a National Research Council fellowship to the second author. 
The authors wish to thank Doug Smith and Bill Kirkwood for 
Eonar for zis testing of us "n 

equests for reprints shoul e sent to Ronal k: 

Department of Psychology, Atkinson College, York Una aon 
Downsview, Ontario, Canada. wo 

3 Now at the University of Alberta, Edmonton, Alberta, 
Canada. 2 


SHORT REPORTS 
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. dependent upon the retrieval of S units (Tulving & 
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new units can be formed; however, he maintains 
that the whole-list organizational structure is not 
permanently disrupted but only temporarily in- 
hibited. Ehrlich reached this conclusion following 
the demonstration that whole-list retention was not 
affected by several intervening trials of second-list 
learning in which the second list was comprised of 
alternate subsets of whole-list items. 

If the organization established during whole-list 
learning is not permanently disrupted during part- 
list learning, one would expect that experimental 
Ss should prove as able as control Ss in their 
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learning. However, if experimental Ss must re- 
organize the words in the part list, and if recall is 


Osler, 1967), then experimental Ss must prove in- 
ferior to control Ss at subsequent whole-list recall. 
The present studies were designed to compare whole- 
list reproduction (Experiment I) and whole-list 
relearning (Experiment II) for experimental and 
control Ss after each group had learned its respective 
part list. Since there is no apparent reason to 
postulate that control 5s must reorganize their whole 
list during part-list learning, this condition is an 
appropriate control for assessing whole-list retention 
following part-list learning. 

Method. Fifty students of both sexes at the Brant 
Senior Public School in Toronto served as Ss in 
Experiment I. They were randomly assigned to 2 
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groups of 25 each with the restriction that each of 
the groups have an equal number of males and 
females. All Ss were seventh-grade students ranging 
in age 12-14 yr. 

Forty-eight students of both sexes at the Bra- 
malea Secondary School in Brampton served as Ss 
in Experiment Il. They were randomly assigned 
to 2 groups of 24 with the restriction that each of the 
groups have an equal number of males and females. 

In Experiment I, all Ss learned a common whole 
list prior to the part list. Experimental Ss learned 
a part list that consisted of items randomly selected 
from the previously learned whole list. Control Ss 
learned a part list that had no words in common 
with the whole list. All lists consisted of 2-syllable 
nouns from the 1,000 most common words in the 
Thorndike-Lorge (1944) word count. The whole 
list contained 24 words; the part list contained 12 
words. In Experiment II all details were the same 
as in Experiment I except that experimental and 
control Ss learned a different whole list and a 
common part list, as in the Tulving and Osler 
(1967) study. 

The experimental and control groups were treated 
identically during the learning of the whole and 
part lists in both experiments. After standard free 
recall instructions were read, the first whole list was 
presented via tape recorder at a 1-sec, rate. Ninety 
seconds were allowed for written recall. There were 
6 alternating training and test trials with the items 
computer randomized on each trial. Before the 
first presentation of the part list, both groups were 
told that “we will now go on to a shorter list. Listen 
carefully to each word and write down the words 
in any order after the list has been read." As in the 
Tulving and Osler (1967) study, experimental Ss 
were not informed about the relationship of the 
second list to the first list. The part list was pre- 
sented via tape recorder at a I-sec. rate. Forty-five 
seconds were allowed for written recall. There were 
5 alternating training and test trials with items 
computer randomized on each trial. 

In Experiment I, after the test phase following the 
fifth presentation of the part list was completed, Ss 
in both groups were given the following instructions: 
“You are now to write down the words from the 
first or longer list that you studied in this experiment. 
Do not write any of the words from the second or 
shorter list you heard. You may write these longer 
list words in any order, but remember, do not write 
the words from the second or shorter list you 
studied." After 2 min. of free recall of the whole 
list, Ss turned to a new sheet of paper. They were 
then again given the instructions to recall the whole 
list in an attempt to determine if there was an inter- 
action between final whole-list recall trials and 
conditions. The Ss were given 2 min. for their 
second recall phase 

In Experiment II, after the test phase following 
the fifth presentation of the part list was completed, 
Ss in both groups were given the following instruc- 
tions: “You are now going to hear the words from 
the first or longer list that you studied in this 
experiment. Listen carefully to each word and 
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acquisition and final wholeJist reproduction (Experiment I) 
and relearning (Experiment II). 


write down the words in any order after the list 
has been read." Ninety seconds were allowed for 
written recall. 

Results and discussion, The whole-list learning 
data were virtually identical for both conditions in 
Experiment I. An analysis of variance revealed 
that only trials was significant, F (5, 240) — 115.17, 
p < .01. On the sixth trial of the whole list, experi- 
mental Ss recalled 12.7 items, while control Ss re- 
called 12.4 items, F (1, 48) < 1, ns. 

The part list data are presented in the left panel 
of Figure 1. Both the trials effect, F (4, 192) 
= 32.44, p < .01, and the Condition X Trials inter- 
action, F (4, 192) = 9.86, p < .01, were significant, 

An analysis of the final whole-list reproduction 
data (see left panel of Figure 1) revealed that 
conditions and trials were significant, F (1, 48) 
= 37.70, p < .01, and F (1, 48) = 6.11, p < .05, 
respectively. However, the Condition X Trial 
interaction was not significant, F (1,48) = 3.44, ns. 

The performance of experimental and control Ss 
did not differ during whole-list learning; however, 
during part-list learning, experimental Ss exhibited 
less efficient learning than control Ss, ‘These results 
are consistent with previous studies (Tulving &Osler, 
1967). With regard to final whole-list recall, Figure 
1 shows that the ability of experimental Ss to ulti- 
mately reproduce whole-list items was not impaired 
by several trials of part-list learning and was greater 
than that of the control group. These findings are 
inconsistent with the hypothesis that experimental 
Ss must reorganize existing S units acquired during 
whole-list learning and form new S units in order to 
learn the part list. However, it might be argued that 
the inferiority of control Ss in reproducing the 
whole list was due to the inaccessibility of the whole- 
list structure as a result of Ss’ recent preoccupation 
with an unrelated part and consequent response set 
suppression of the whole list. 

The results of Experiment II are relevant to this 
point. In this study, a relearning trial on the whole 
list followed whole-part learning to determine 
whether a single reinstatement of the whole-list 
context would benefit control Ss in retrieving items 
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from the whole list. The whole-list learning data 
were virtually identical for both groups in Experi- 
ment II. An analysis of variance revealed the only 
significant effect to be that of trials, F (5, 230) 
= 137.75, p < .01. On the sixth trial of the whole 
list, experimental Ss recalled 16.5 items, while 
control Ss recalled 16.9 items, F (1, 46) < 1, ns. 

"The part-list data are presented in the right panel 
of Figure 1. Both the trials effect, F (4, 184) 
= 80.32, p < .01, and the Condition X Trials 
interaction, F (4, 184) = 7.29, p < .O1, were 
significant. 

On the relearning trial of the whole list (see the 
right panel of Figure 1), experimental recalled 
18.4 items, while control Ss recalled 14.3 items, 
F (1,46) = 13.93, p < .01. 

Again as in Experiment I, the typical negative 
transfer effects were found during second-list learn- 
ing. Nevertheless, on the subsequent relearning 
trial of the whole list, experimental Ss recalled more 
whole-list items than control Ss. Thus, the addi- 
tional presentation of the whole list does not alter 
the fact that experimental Ss recall more whole-list 
items than do control Ss. For the reasons stated 
earlier, this finding is contrary to the predictions 
from Tulving and Osler's (1967) interdependence 
hypothesis, which states that whole-list S units 
which mediate recall are modified or completely 
new S units are formed during part-list learning. 
If new S units are organized during part-list acquisi- 
tion, why then do experimental Ss show no loss of 
whole-list items in subsequent whole-list recall and 
achieve higher recall performance than control 5s, 
who had no reason to modify their S units during 
part-list learning? 

Slamecka, Moore, and Carey (1972) offered an 
alternative account of negative transfer in the whole— 
part paradigm. This account proposes that the 
greater discrimination that experimental Ss are 
required to perform between first and second lists 
during part-list learning is primarily responsible for 
the negative transfer, Inability to discriminate 
among part-list items and whole-list items would 
depress part-list performance, producing negative 
transfer during part-list learning. 

A list-discrimination hypothesis could also ac- 
commodate the reproduction and relearning data 
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following whole-part learning. Since ex 

Ss have an opportunity to retrieve whol ens 

during part-list learning, discriminatin 

them for part-list items on the basis of 

of occurrence, these Ss should be adept 

ducing or relearning the whole list follow: 

part learning. Control Ss, in learning a n 

items during part-list learning, have no r 

retrieve whole-list items and are not hind: 

the list-discrimination task (and thus acquir 

part list more readily). However, when recall of th 

whole list is required following whole-part learning, 

the frequency information that was established and 

utilized during whole-list learning (situational fre- 

quency) has begun to assimilate to its preexperi- 

mental level (background frequency) during the 

interpolated part-list learning (see Underwood & 

Freund, 1970). Thus, control Ss exhibit forgetting 

of whole-list members following part-list learning. 
In summary, the superiority of experin | Ss 

at reproducing (Experiment 1) and r 

(Experiment II) the items from the wi 

following whole-part learning may derive from 

more frequent retrieval of whole-list i! for 

experimental Ss. In any event, since expe: 

Ss retain more of the whole list following part-li 

learning than do control Ss, it cannot be the case 

that the organizational structure that mediates 

retrieval is permanently disrupted. Similar conclu- 

sions were reached in the part-whole paradigm by 

Carey and Okada (1973). 
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USAGE AND 


MARILYN A. BORGES 


San Diego State University 


TERPRETATION ! 


BARBARA K. SAWYERS? 


University of California, San Diego 


The Ss made judgments about the meanings of 6 common quantifiers com- 


bined with different set sizes. 


judgments; Experiment 1I, verbal numerosity judgments. 


showed that Ss' judgments followed a simple multiplying 


integration analysi 


Experiment I involved nonverbal quantity 


An information 


rule: the meanings of the quantifiers increased linearly with respect to the 
set size of the population considered. The Ss used the quantifiers in a highly 
consistent manner, the only exception being the quantifier several, which was 


found to have 2 distinct usages, 


The present experiments studied the usage of 
6 quantifiers: few, several, some, lots, many, and 
most. These quantifiers are all words occurring 
with high frequency in the English language 
(AA count, Thorndike & Lorge, 1944). They are 
independent from one another in etymological 
origins (there are no common cognates or roots), 
and their meanings are not precisely defined in 
a numerical sense, e.g., few is more than 2, but 
not many (Onions, 1955). In addition, the use of 
these quantifiers is, in most cas learned prag- 
matically rather than pedagogically. 

Because these quantifiers are frequently used, 
a high degree of agreement about their meanings 
should exis The main experimental questions to 
be answered were as follows. (a) Does an indi- 
vidual use these quantifiers in a consistent manner? 
(b) What is the extent of agreement among adults 
on the uses of these general quantifiers? (c) How 
do Ss modify their interpretations of each quan- 
tifier depending upon the size of the set on which 
they operate? 

"The .Ss' tasks were to make judgments about 
the use of each quantifier. Judgments were based 
on 2 sources of information: (a) S's conception or 
understanding of the meanings of the quantifiers 
and (6) S’s perception of the size or numerosity 
of the sets from which he chose. In Experiment I, 
values for each quantifier were determined on 
the basis of quantity, i.e., S formed subsets from 
various set sizes of marbles to represent each 
quantifier. In Experiment II, each judgment was 
a cognitive rating of "count" or numerosity, with 
the set information supplied to S and the quantifier 
response given by S in the form of specific integers. 

Previous investigators have used statistical mul- 
tiplying models for evaluating word combinations. 
Cliff (1959), extended by Howe (1962), studied 
combinations of adverbs and adjectives. However, 
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this previous work has a serious shortcoming 
(Anderson, 1973) because it depends on Thursto- 
nian scaling methods. For verbal stimuli, this 
method relies completely on pooled S data, making 
no allowance for individual differences. Integra- 
tion theory, which takes individual differences into 
account, has been used in a variety of judgmental 
tasks (Anderson, in press; Sawyers & Anderson, 
1971). The present study applied this general an- 
alytic approach to determine how Ss integrate 2 
diverse sources of information, quantifier meaning 
and set size, in making their judgments. 


METHOD 


The S's task was to make a series of judgments 
about the quantitative meanings of 6 different 
quantifiers combined with 4 (Experiment I) or 5 
(Experiment II) different set sizes. In Experi- 
ment I, judgments were made by taking marbles 
from one box and placing them in another box. 
In Experiment II, Ss made judgments by specifying 
a particular integer for each combination offquan- 
tifier name and numerical set size. 

Subjects, The Ss were 56 students (26 in E 
periment I and 30 in Experiment II) from San Diego 
State University. 


Design and Procedure 


re 72 judgments for 


Experiment I. There w 
The 


each S, with 6 quantifiers and 4 set sizes. 
quantifiers were 
most. Sets of 8712, 2 
Each S served in 3 replications of this design. 
For each replication, § judged all the quantifiers 
in random order in 1 set size for each of the 4 set 
sizes. Within each replication, set size was pre- 
sented in a different random order for each S. 

The Ss were run individually and instructed to 
make judgments by taking marbles from one box 
and putting them into another box, without actually 
counting the marbles. The Ss were allowed to 
take as many handfuls of marbles as necessary 
and could rearrange the marbles between the 2 
boxes until they were satisfied that their judgments 
were accurately reflected. Although no time limit 


ew, several, some, many, lots, and 
36, and 48 were used. 
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cautioned not to refer back to past judgments or 
change a judgment on a page that they had already 
turned. 


ResuLts 


Filot work suggested that Ss were relatively con- 
sistent in their usage of 5 of the quantifiers. For 
several, however, there were large individual dif- 
ferences, and Ss could clearly be partitioned into 
a “low” group and a "high" group. Based on 
these preliminary results, several was assumed to 
have 2 distinct meanings, and Ss were subdivided 
into a high-several or low-several category. In 
periment I, only 2 of 26 Ss were in the high-several 
category ; for simplicity, their data are not included. 
In Experiment II, Ss were equally divided between 
the 2 groups. 
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Ficure 1, Mean numerical judgment of the 6 quantifiers 


for Experiment I. 


was imposed, it was suggested that S work at a 
fairly rapid pace. Before the experiment began, 
Ss were shown a list of the 6 quantifiers. However, 
Ss were not informed of the set sizes to be used 
in the experiment, Two 9 X 16 X 7.5 mm. wooden 
boxes were placed in front of S. One of the boxes 
contained 1.5-mm. marbles, of the first set size to 
be judged. For each trial, a statement of the 
following type was read to S by E: “Take some 
of the marbles from this box [E pointed to first 
box] and put them into this box [E pointed to 
second box]'" After each judgment, E recorded 
the number of marbles transferred by S to the 
second box, then returned the marbles to the first 
box for the next judgment. After each series of 
6 judgments on one set size, E changed the number 
of marbles in the first box, and the next series of 
judgments was made. 

Experiment II. Each S made a total of 90 judg- 
ments. The quantifiers used were those of Ex- 
periment I. Sets of Size 12, 36, 60, 84, and 108 
were used, Each S judged the 30 quantifier-set- 
size combinations over 3 replications. As in 
periment I, for each replication, all the quantifiers 
were presented together in random order, for 1 set 
size at a time, Set size was presented in a dif- 
ferent random order within each replication for 
each 5. 

‘The Ss were run in a group and were instructed 
to specify a numerical value for each quantifier- 
set-size combination. The quantifiers to be used 
in this experiment were shown to Ss before the 
experiment began; however, set sizes were not 
disclosed, Each S was given 4 packets, with 5 
booklets in each packet. Each booklet contained 
6 pages, with a statement of the following type on 
each page: “If you were to take some of the marbles 
from 84 marbles, you would take mar- 
bles." The S filled in the blank with his numerical 
judgment. Only specific integers were allowed as 
responses. The first packet was given as practice 
to familiarize Ss with the task. The Ss were 


Main Empirical Results 


Experiment I. Figure 1 shows the mean judg- 
ments of the 6 quantifiers for the 4 different set 
sizes for 24 Ss. The relative ordering of the judg- 
ments for the 6 quantifiers from low to high was 
few, several, some, many, lots, and most. As can 
be seen in Figure 1, the judgments of the quanti- 
fiers increas near linear functions of the ob- 
jective se A Newman-Keuls test on all 
ordered pairs of means for each of the quantifiers 
showed each quantifier to have a unique semantic 
meaning (p < .01). 

Experiment II. Figure 2 shows the mean judg- 
ments of the 6 quantifiers for the 3 different set 
sizes. Two curves are shown for several: low 
several, the mean of 15 Ss who consistently gave 
low estimates of this quantifier (9965-2596 of the 
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Ficure 2. Mean numerical judgments of the 6 ii 
for Experiment Il. (For few, some, many, lots, E 
n = 30; for low several and high several, x —15.) i 


SHORT REPORTS 


set size), and high several, the mean of 15 Ss who 
consistently gave high estimates (4905-7795). 
Newman-Keuls tests showed both high- and low- 
several Ss to judge lots and many as having similar 
semantic meanings; also, low-several Ss did not 
differentiate between few and several. However, 
all other quantifiers were judged as semantically 
unique in meaning. The ordering of the 6 quanti- 
fiers for low-several Ss was few/several, some, 
many/lots, and most. For high-several Ss, the 
ordering was few, some, several, many/lots, and most. 


Model Analysis 


Goodness of fit. The multiplying model from 
integration theory makes a simple graphical pre- 
diction: the curves of Figures 1 and 2 should form 
fans of straight lines, diverging as a function of 
i ing set size. Inspection of the curves in- 
that this prediction is approximately 


correct. 

An exact statistical test of the model is also 
available. Theoretically, 2 or more variables that 
combine in a multiplicative manner will show an 
interaction concentrated in the bilinear (Linear 
Linear) component, but the residual should be 
nonsignificant. A test on the interaction residual 
is primary, therefore, in that a significant residual 
is direct evidence against the model. In Experi- 
ment I, the overall interaction (Set Size X Quanti- 
fier) was found to be very large, F (15, 345) 
= 101.36, p <.01. The residual interaction was 
found to be nonsignificant, F (14, 345) = 1.08. 
The overall Set Size X Quantifier interactions of 
Experiment Il for both the low-several and high- 
several groups were also large, F (20, 280) = 203.52, 
and F (20, 280) = 75.19, respectively, both ps 
<.01, The residual interaction of the low-several 
group was only marginally significant, F (19, 280) 
= 1.70, p = .05. The residual interaction of the 
high-several group was nonsignificant, F (19, 280) 
= 47. Thus, the data give considerable support 
to the multiplying model: each quantifier is inte- 
grated with set size by a simple multiplying rule. 

Functional scale values. Since the marbles were 
given as a heap in the box, it was not possible to 
assume without evidence that the actual objective 
number of marbles in the box was equal to the 
effective subjective number of marbles that S used 
in making his response. Thus, factors of visibility 
or manual accessibility might produce differences 
between the subjective and objective value: 
functional measurement analysis from int 
theory provides subjective values for both stimulus 
variables. This analysis showed that, for set size, 
subjective number was nearly a perfectly linear 
function of actual number. The functional scale 
values of the quantifiers few, several, some, many, 
lots, and most, respectively, were, for Experiment I, 
.16, .26, .33, .60, .68, and .81; and for Experi- 
ment II, .15, .17 (low several) and .63 (high several), 
.34, .77, .79, and .92. 
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DISCUSSION 


In both Experiments I and II, the multiplicative 
model of integration theory was found to be highly: 
descriptive of the way in which Ss integrate the 
meanings of each quantifier with set-size informa- 
tion. A high level of consistency was found in 
the use of verbal quantifiers for each individual S 
and, with the exception of several, across Ss as well. 
There were also strong indications that Ss judged 
each of the quantifiers to represent a consistent 
proportion of the total set size. This proportional 
relationship existed regardless of whether the task 
was a quantity judgment task (Experiment 1), 
or a verbal numerosity judgment (Experiment II). 

That several was found to have 2 distinct mean- 
ings was quite unexpected. A number of Ss were 
interviewed, and each stated his strong belief that 
his particular use of several was the only “correct” 
usage, but no clues were obtained to explain this 
difference. However, the data show that both 
low-several and high-several Ss use the word con- 
sistently in a variety of contexts. Specifically, 
high-several Ss always judged several to mean more 
than low-several Ss, regardless of context. 

In Experiment I, all 6 quantifiers were judged 
to have distinctly different meanings. However, 
in Experiment II, the meanings of low several and 
few did not significantly differ, nor did the judg- 
ments of lots and many. The most likely explana- 
tion for this difference lies in the nature of the 
tasks. In Experiment I, Ss made an intuitive 
judgment of relative quantity for each quantifier 
based on the perceptual mass of the total set of 
marbles. In Experiment II, Ss had to give an 
exact number for each quantifier based on only 
verbal information about the total set size. Pre- 
sumably, this was more difficult in that (a) usually, 
no specific number or percentage is associated with 
quantifiers, and (b) such precise numerosity judg- 
ments fall outside Ss’ normal range of experience 
with the quantifiers, It is possible that the neces- 
sity of making numerosity responses to the quanti- 
fiers rather than quantity judgments produced the 
less specific distinctions among the quantifiers. 

In summary, adult Ss were found to use the 
quantifiers in a highly consistent manner, with the 
exception of the quantifier several, which had 2 
distinct interpretations. Information relating quan- 
tifier and set size was seen to follow a simple 
multiplying rule, i.e, usage of these quantifiers 
seems to be determined by each quantifier's pro- 
portional relationship to the total set si While 
small variations in the meanings of words across 
people is to be expected, the surprising bimodal 
distribution of the meaning of several raises an 
important question about the objectivity of word 
meanings, 
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A test was made of the hypothesis that short-term motor forgetting could be 
found on 1 trial, in the absence of interference, if covert rehearsal were suffi- 
ciently restricted. The covert-rehearsal process was impaired by either 
reducing response-produced feedback and/or requiring attention toward a 
rehearsal-preventing activity during the retention interval. The task used was 
linear lever positioning. Two conditions of feedback and 3 levels of inter- 
polated activity gave 6 independent treatments, with 19 Ss each. Feedback 
involved the presence of audition, proprioception, and vision (APV) or their 
absence (—APV). Interpolated activity was a 5-sec, rest or immediate recall, 
a 90-sec. rest, and a 90-sec. digit classification. Under —APV, there was 
forgetting over the unfilled retention interval and no increased forgetting with 
interpolated activity. Under APV, there was no forgetting at rest, but recall 
decrements occurred following interpolated activity. The results were inter- 
preted in terms of trace decay theory, and the importance of covert rehearsal for 


motor retention was discussed. 


The trace decay theory of forgetting has received 
little experimental attention in short-term motor 
memory research. Only 2 studies have used a 
single-trial learn-recall design, which eliminates 
the possibility of interference. In these studies, 
Williams (1971) reported no forgetting in the ab- 
sence of prior learning, while Adams, Marshall, 
and Goetz (1972) found evidence of trace decay 
under minimal feedback conditions. Adams et al. 
suggested that impoverished feedback produced 
a weak image or perceptual trace and, although 
they made no attempt to measure the strength of 
the trace, they concluded that the weak trace 
decayed over time. According to Brown (1958), 
decay is not a function of time per se, but it is 
rather a consequence of covert rehearsal restrictions. 
Thus, impoverished feedback appears to impair 
covert rehearsal, 

If one assumes that S is unable to rehearse a 
weak perceptual trace, but that he is able to 
rehearse a stronger image, one would expect on 
the basis of the trace theory that interpolated 
rehearsal-preventing activity would differentially 
effect recall under impoverished and augmented 
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feedback conditions. In support of this contention, 
Posner and Konick (1966) and Posner (1967) found 
that blind movements (minimal feedback) were 
forgotten over an unfilled retention interval and 
unaffected by interpolated activity, while visually 
guided movements showed forgetting only after 
rehearsal-preventing activity. Posner and Konick 
and Posner used a repeated measures design 
(48 trials in 50 min.); consequently, their results 
do not provide conclusive evidence of trace decay, 
since such designs contain a built-in potential for 
proactive interference (cf. Adams et al., 1972). 
In the present study, a single-trial learn-recall 
design was employed to eliminate the possibility: 
of interference and to provide a clean test of the 
trace decay theory, which predicts forgetting under 
impoverished feedback and interpolated noninter- 
fering activity because of covert-rehearsal restric- 
tions. [n addition, the strength of the perceptual 
trace was assessed under impoverished and aug- 
mented feedback conditions. 

Method. One hundred and fourteen right-handed 
male undergraduate students were randomly as- 
signed to 1 of the 6 independent conditions of the 
2 X3 factorial design, with 19 Ss per cell. The 
2 levels of the first factor involved the presence 
of audition, proprioception, and. vision (APV) or 
their absence (—APV), and the second factor was 


SHORT REPORTS 


a S-sec. rest or immediate recall, a 90-sec. rest, and 
a 90-sec. digit classification, 

Linear lever positioning was used as the memory 
task, and the apparatus was essentially the same 
as that described by Adams et al. (1972). On top 
of the lever, which ran along a 15-in. trackway, 
was a button that S depressed to move the carriage 
and released to lock it in place for the purpose 
of measurement. The apparatus was located 
behind a 1-way mirror and E was able to ma- 
nipulate visual feedback by altering the lighting 
conditions on either side of the mirror. All lever 
movements were from right to left and were essen- 
tially frictionless (—APV) or were resisted by a 
coiled clock spring with a pull of 1.75 Ib. (APV). 
Auditory feedback was climinated when necessary 
(—APV) by white noise played through earphones 
at 120 db. 

The experiment was conducted in 2 stages, the 
first of which involved instructions and a 20-sec. 
practice trial on the digit classification task. The 
digits were printed on a sheet of paper located on 
the wall to the left of S’s chair at approximately 
eye level. On the command “digit,” S turned his 
head and began reading down the first of 4 columns 
of 2-digit numbers, verbally indicating as fast as 
possible whether each was high or low (above or 
below 50) and odd or even. 

The second phase included linear-lever-position- 
ing instructions, a demonstration (14 in.) and 2 
movements, which were separated by a retention 
interval of 5 or 90 sec. The first movement was 
to the criterion. (8 in.), which was defined by a 
stop, and the second was to where S thought the 
stop had been. At the end of each movement, 
S released the button on top of the lever and 
maintained his grasp for another 2 sec. before 
being instructed to place his right hand on the 
pad located on the table in front of him, The 2 
movements were made under identical feedback 
conditions, with vision and white noise being 
manipulated for a period that extended from 2 sec. 
prior to movement initiation through 2 sec. after 
button release. The retention interval was timed 
from button release, and during this period, digit 
classification Ss performed the interpolated task, 
while rest Ss were required to face the mirror. 
All Ss kept their right hands on the pad throughout 
the retention interval, and Æ returned the lever 
to the start. 

After the 2 movements, S was asked to estimate 
the direction and magnitude of his error from the 
criterion. The absolute difference between the 
actual and estimated algebraic error scores w: 
taken as an indication of S's ability to accurately 
estimate his own performance. This score was as- 
sumed to reflect the strength of the perceptual 
trace since S is unable to accurately estimate per- 
formance without a strong trace. Finally, an in- 
terview was conducted to gather information on 
the strategy S used to position the lever. 

Results. Analysis of variance of the absolute 
error scores found the main effects of both feed- 
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TABLE 1 
MEAN ABSOLUTE RETENTION TEST ERROR (}-IN, UNITS) 


Feedback 
condition 
Interpolated activity Grand M 
APV —APV 
None (immediate recall) 3.63 6.16 4.90 
90-sec. rest 3.42 10.53 6.97 
90-sec, digit classification 7.68 10.47 9.08. 
Grand M 4.91 9.05 


. Noe. Abbreviations: APV indicates the presence of audi- 
tion, proprioception, and vision; —APV indicates their absence, 


back and interpolated activity to be significant: 
F (1, 108) = 12.55, p < .001; and F (2, 108) = 4.27, 
p = .02, respectively. Table 1 indicates the overall 
superiority of the APV condition (4.9) over the 
—APV treatment (9.1). Post hoc analysis, using 
Duncan's new multiple range test, revealed that 
the interpolated digit classification condition (9.1) 
recalled significantly (p < .05) less than the im- 
mediate recall group (4.9), and that there were no 
other significant differences between the inter- 
polated activities, 

The Feedback Condition X Interpolated Activity 
interaction was not significant, F (2, 108) = 1.61, 
p —.20. Although the interaction was not sig- 
nificant, it was expected on the basis of the trace 
decay theory that recall decrements would occur 
over an unfilled retention interval under —APV, 
and that forgetting would occur only after inter- 
polated rehearsal-preventing activity under APV, 
To test these theoretically based a priori assump- 
tions, 2 orthogonal tests were made, using the ratio 
with the mean square error from the analysis of 
variance as the error term to give 108 df, The first 
test indicated that under —APV, decrements in 
recall were significantly (p < .05) larger after the 
90-sec. rest (10.5) than at immediate recall (6.2). 
The second test revealed that under APV, there 
was significantly (p < .05) more forgetting after 
the 90-sec. digit classification (7.7) than after the 
90-sec. rest (3.4). 

The same analysis was run on algebraic error as on 
absolute error, and none of the tests were significant, 
Averaging over all interpolated activities, the mean 
algebraic error was —1.0 for APV and —2.9 for 
—APV. Averaging over feedback conditions, the 
means for interpolated activities were —1.1, —2. 
and —2.1 for the 90-sec. digit classification, 90-se 
rest, and 3-sec. rest, respectively. 

The analysis of variance of S's ability to ac- 
curately estimate his own performance (absolute 
difference between actual and estimated algebraic 
error scores) found both main effects significant: 
for feedback, F (1, 108) = 7.71, p = .01; and for 
interpolated activity, F (2, 108) = 4.57, p = .0t. 
Table 2 indicates that —APV produced a weak 
perceptual trace (7.6) compared to that developed 
under APV (4.7). Furthermore, post hoc analysis 
revealed that the perceptual trace was significantly: 
(p < .05) weaker following interpolated activity 
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TABLE 2 


ABSOLUTE DIFFERENCE ({-IN. UNITS) BETWEEN ACTUAL AND 
ESTIMATED ALGEBRAIC ERROR SCORES 


Feedback 
condition 
Interpolated activity Grand M 
APV —APV 
None (immediate recall) 2.95 6.00 


90-sec. rest 6.90 
90-sec. digit cl ification 9.84 
Grand M 4.67 7.58 


Note. Abbreviations: APV indicates the presence of audi- 
tion, proprioception, and vision; —APV indicates their absence, 


(8.3) than at immediate recall (4.8). None of the 
other differences were significant. 

Discussion. The present results lend, credence 
to the trace decay theory of forgetting, since recall 
decrements occured with 1 learn-recall trial in the 
absence of apparent interference. In support of 
Adams et al. (1972), there was no forgetting over 
the unfilled retention interval when feedback chan- 
nels were intact (APV), but significant decrements 
in recall were induced by attenuating feedba 
(—APV). The manipulation of feedback also in- 
fluenced Ss ability to accurately estimate his own 
performance, This ability was assumed to reflect 
the strength of the perceptual trace, and, as Adams 
et al. hypothesized, the trace under impoverished 
feedback was significantly weaker than under APV. 
Apparently, minimal feedback produced a weak 
perceptual trace, which decayed over the unfilled 
retention interval because it was not rehearsed. 

If one assumes that covert rehearsal was not 
possible following —APV but was possible after 
APV, then one would predict on the basis of the 
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trace decay theory that interpolated rehearsal- 
preventing activity would produce no more for- 
getting than would rest under —APV, but that 
it would cause increased forgetting under APV. 
The results, which provided support for the hy- 
pothesis and additional evidence of trace decay, 
were in accord with the findings of Posner and 
Konick (1966) and Posner (1967) and although 
these studies, because of their built-in potential 
for proactive interference, do not offer direct 
evidence of trace decay, they certainly do not 
contradict the theory. 

Short-term motor memory is apparently in- 
fluenced by covert-rehearsal opportunities. ‘The 
mechanism for such rehearsal is not clear at present, 
although it appears plausible to suggest, on the 
basis of the introspective reports of Ss, that verbal 
labels or imaginal mediators are necessary for 
motor retention. The present interview data con- 
curred with the findings of Posner and Konick 
(1966) and favored the imagery position, but ad- 
ditional research is required to substantiate this 
visual encoding hypothesis. 
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ABSTRACTION OF UNIDIMENSIONAL CONCEPTS FROM 
LARGER CONCEPTUAL SYSTEMS! 


VITO MODIGLIANI? 


Wesleyan University 


Stimuli consisting of 2 s 


consisted of 3 bivalued dimensions and 2 constant values. 
2 systems made up a meaningful figure, a "face smoking a pipe.” 
, one of the dimensions was made relevant in a concept 
In the subsequent concept conservation task, 48 


initial training phas 
identification type ta 


stems, a face and a pipe, were used. Each system 


Together, the 
During an 


Ss were found to conserve less in response to transformations operating on 
the system containing the relevant attribute than to identical types of trans- 


formations operating on the other system. 


The findings were interpreted as 


indicating the importance of stimulus structure in determining the abstrac- 
tion of unidimensional concepts from larger conceptual systems. 


Recently, Modigliani (1971) studied how bounded- 
s affected the conservation of simple unidimen- 
nal concepts. In that study bounded stimuli 
nsisting of 4 bivalued dimensions were potted 
wers, each made up of flower pot, stem, leaves, 
d blossom. Unbounded stimuli were obtained 
rom the bounded ones by redrawing the latter 
ı à disjointed manner. Concept conservation was 
iower with bounded than with unbounded stimuli. 
‘he problem with this manipulation of bounded- 
ess is that it is not clear to what extent the effect 
was determined by the perceptual nature of the 
stimulus manipulation, rather than by conceptual 
boundedness per se. 
In the present study, boundedness was manipu- 
lated both at the stimulus level, as in Modigliani 
1971), and conceptually, as follows. Consider 
drawing depicting a "face smoking a pipe." 
h a stimulus is an overall bounded set of at- 
utes and could be perceptually unbounded ac- 
ding to the Modigliani procedure. On the other 
hand, the stimulus could also be considered as 
consisting of 2 separate systems, the face and the 
pipe, each of which constitutes a bounded set of 
attributes in itself. If this were so, it could be 
expected that a higher degree of boundedness ob- 
tained between 2 stimulus elements belonging to 
the same system, eg, the pipe, than between 
2 elements belonging to different systems, e.g., one 
to the pipe, the other to the face. Thus, the 
hypothesis tested in the present study was that 
if the concept to be conserved was a pipe element, 
then transformations operating on other pipe 
elements should be more disrupting (lead to less 
conservation) than similar transformations oper- 
ing on face elements, and vice versa. This 
hypothesis will be referred to as the conceptual 
boundedness hypothesis because it depends solely 
on how S interprets the stimulus, not on any 
perceptual manipulation of it. 


1 This research was made possible by a Wesleyan University 
grant in support of scholarship. The author wishes to thank 
Marjory S. Daltry, Clem M. Green, Harold Kaplan, and 
Anthony LaBella for their collaboration in providing Ss for 
this research, and Robert S. Feldman for running the Ss. 

? Requests for reprints should be sent to Vito Modigliani, 
Department of Psychology, Wesleyan University, Middletown, 
Connecticut 06457. 
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Method. The design of the experiment was a 
3X2 18 factorial, the independent variables 
being age, perceptual boundedness, and trans- 
formations, respectively. The 3 age levels included 
children from the first, third, and sixth grades. 
The 2 perceptual boundedness levels were those 
used by Modigliani (1971), stimuli being either 
bounded or unbounded, and the transformations 
are described below. The first 2 factors were 
betweer and the third withi bles. 

The Ss were 48 children, 16 from each of the 
3 grades, drawn from 2 schools in Middletown, 
Connecticut. Within each age, Ss were randomly 
assigned to the experimental conditions. 

The experiment followed the same pattern as 
that of Modigliani (1971) and consisted of a training 
phase followed by a conservation test phase. In 
the training phase, Ss learned to categorize stimuli 
according to a single relevant dimension—the shape 
of the eyebrows for half of them and the brightness 
of the pipe bowl for the other half. The training 
stimuli consisted of +32 outline figures drawn on 
21.7 X 28 cm. sheets of white paper. Each drawing 
consisted of 10 values, of which 4 were constant, 
ie, the same for all stimuli, and the remaining 
6 were values from bivalued dimensions. Each of 
the 2 stimulus systems, the face and the pipe, 
consisted of 3 (varying) dimensions and 2 constant 
values. The 3 face dimensions were the shape of 
the eyebrows, the shape of the nose, and the shape 
of the mouth. The shape of the eyes and the face 
contour were constant face values. The 3 pipe 
dimensions were the brightness of the pipe stem, 
the brightness of the bowl, and the shape of the 
smoke the pipe emitted. The constant dimensions 
of this unit were the overall shape of the pipe and 
the shape of the bowl rim. Values were drawn 
at fixed positions in both conditions of perceptual 
boundedness. The 32 training stimuli comprised 
a random sample of the 64 possible stimuli, with 
the restriction. that half contained one relevant 
value and the other half, the other relevant value. 

The conservation phase stimuli consisted of 28 
drawings, 10 of which were randomly selected 
from those used in the training phase, and the 
rest representing 1 transformation each. Trans- 
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TABLE 1 


MEAN PERCENTAGE OF CONSERVING RESPONSES AS A FUNCTION 
or Other VERSUS Same SYSTEM TRANSFORMATIONS 
AND PERCEPTUAL BOUNDEDNESS 


Perceptual boundedness 
System operated on 
by transformations 


Unbounded Bounded 
Other 79.2 75.0 
Same 80.0 53.3 


formations were obtained by adding, deleting, and 
substituting new values for old ones in the training 
stimuli. For present purposes, only the first 10 
transformations will be described in detail. Five 
of them operated on elements belonging to the 
system not containing the relevant attribute, re- 
ferred to as the other system, The 5 remaining 
operated on elements belonging to the system 
containing the relevant attribute, the same system. 
For each transformation in the ofer system group 
there was an exact replica in the same system 
group, with 1 exception. The 5 transformations 
in the other system group were: (a) deletion of 
1 irrelevant value, (5) deletion of 2 irrelevant 
values, (c) deletion of the entire other system, 
(d) deletion of the system's 2 constants values, 
and (e) substitution of a new value for one of the 
old constant ones. The same transformations were 
used to operate on the same system, the exception 
being that instead of deleting the entire same sys- 
tem, the relevant value was never deleted. 

A test of the conceptual boundedness hypothesis 
entailed a comparison between these 2 groups of 
transformations as a function of perceptual bounded- 
ness. When the stimuli are perceptually bounded, 
the distinction between other and same systems is 
meaningful, but is effectively lost when the stimuli 
are perceptually unbounded. The conceptual 
boundedness hypothesis, therefore, predicts an in- 
teraction between perceptual boundedness and 
transformations operating on the other vs. the same 
system: there would be a significant difference due 
to these 2 types of transformations for bounded 
but not for unbounded stimuli. 

Two random sequences of the 28 conservation 
test cards were used with the restriction that no 
more than 2 training cards appear consecutively. 
Half of the Ss were randomly assigned to each 
order, A 28 X 28 X 45 cm. box, divided into 3 
compartments, the middle of which was painted 
gray, a lateral one blue, and the third green, was 
used to test Ss. 

The procedure was almost identical to that used 
by Modigliani (1971). ‘The only difference was 
that while Modigliani presented the task as one 
in which Ss had to tell whether cards were taken 
from boxes of different colors, the present task 
was one of putting cards info such boxes, In 
outline, S was told in the training phase that 
each card would go into either the green or the 
blue compartment of the box in front of him. 
He was shown an example consisting of a pair of 


stimuli differing only along the relevant dimension, 
and was told which of these 2 stimuli went into 
the green and which into the blue compartment. 
After S examined the stimuli for as long as he 
wished, he was run to a criterion of 31 out of 32 
correct classifications of the training stimuli, one 
at a time. He was then told that more cards 
would be shown and that he could now put any 
of them into the gray compartment if he so wished. 

Results and discussion. A placement of a test 
card into the appropriately colored (blue or green) 
compartment was defined as conserving and scored 
as 0. A placement in the gray box was a non- 
conserving response and was scored as 1, All Ss 
but 2 classified all old cards in the conservation 
phase correctly. In each of the 2 exceptions, 
SS made one error that consisted in each case of 
placing a green card (different for each S) in the 
blue compartment. These errors were considered 
as temporary slips of memory, and these Ss' re- 
sponses to the actual test cards were included in 
the analysis of the data. 

An overall analysis of variance was performed 
on the data. Only results pertinent to the con- 
ceptual boundedness hypothesis will be reported. 
There were no age effects related to the hypothesis 
being tested. The mean percentages of conserving 
responses as a function of perceptual boundedness 
and transformations in the other and the same 
systems, are shown in Table 1. Planned com- 
parisons showed that the 2 mean percentages in 
the top row did not differ from one another (F < 1) 
but that those in the bottom row did, F (1, 714) 
= 19.86, p <.01. Furthermore, the 2 means in 
the left column did not differ from one another 
(F <1), while those in the right column did, 
F (1, 42) = 21.33, p «.01. Thus, the bottom 
right mean is significantly different from the other 
3 means, which do not differ from each other. 

The above results entirely confirmed the con- 
ceptual boundedness hypothesis. It seems that 
the most important aspect of this confirmation is 
the demonstration that the internal structure of 
a stimulus, considered as a conceptual system, 
may affect the way Ss abstract a simple concept 
from the larger conceptual framework. Stimulus 
structure, with the exception of salience (e.g., 
Trabasso & Bower, 1968) has received little atten- 
tion in the experimental literature on conceptual 
behavior. By and large, stimuli are considered as 
a collection of attributes, some being more salient 
than others, but having no structural organization 
of a more complex sort. One exception to this 
statement is provided by Sutherland (1968). His 
views were later summarized as follows: “a picture 
of a man can be decomposed into head, trunk, 
arms, etc.; head into cheeks, eyes, ears; and at 
the lowest level, objects are decomposed into seg- 
ments (or edges) and textures [Sutherland & 
Mackintosh, 1971, p. 53]" This hierarchical 
description is remarkably similar to one indepen- 
dently favored by the author with regard to the 
stimuli used in this experiment. Here the per- 
ceptually bounded stimuli are thought of as con- 
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sisting of (at least) 3 levels. The top level repre- 
sents the entire stimulus, a face smoking a pipe. 
The middle level represents the 2 independent 
systems, the face and the pipe. The bottom level 
represents the single attributes, the shape of the 
eyebrows, the brightness of the pipe, etc. Trans- 
formations in the present study operated on elements 
at this bottom level. However, their effect de- 
pended on the locus of the attribute(s) operated 
upon, with respect to the next higher level in the 
hierarchy, as was the case with perceptually bounded 
stimuli, While the present results demonstrate in 
only 1 instance that the abstraction process de- 
pends on a level above the one at which the element 
to be abstracted is located, these results do raise 
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the problem of stimulus structure as a. potentially 
important one in the experimental analysis of con- 
ceptual behavior. 
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SHORT- AND LONG-TERM MEMORY! 


C. JAMES SCHEIRER 


2 AND MICHAEL J, HANLEY 


State University of New York at Binghamton 


The Ss were required to retain a short list of items for an entire experiment 
while processing successive different short lists of items in the Sternberg 


paradigm. 


for both the permanent list and the immediate li 
function depended on whether the type of material retained permanently w: 
sively processed lists. The results 


similar to or different from the suc 


It was found that reaction time was linearly related to set size 


s, but that the slope of the 
s 
re 


discussed in terms of Sternberg's exhaustive-scanning hypothesis. 


The scanning paradigm has been used by Stern- 
berg (1969) and others to investigate search and 
retrieval processes in memory. In the scanning 
paradigm, S is presented, serially, with a short list 
of items. Immediately after the presentation of the 
list, S is presented with a single item (probe) and is 
required to state whether the probe was a member of 
the list just presented. In this task, performance is 
practically error free, and response latency is the 
dependent variable of major interest. 

Logically, one would expect the items presented 
in a scanning task to reside in short-term store 
(cf. Atkinson & Shiffrin, 1968). Sternberg (1969, 
Experiment V) and Forrin and Morin (1969) have 
investigated retrieval of materials in short-term 
store (STS) and long-term store (LTS) using the 
scanning paradigm. Sternberg's results led him to 
postulate that material residing in LTS at the time 
of the test is transferred item by item to STS, where 
it is scanned exhaustively. Forrin and Morin, on 
the other hand, found evidence for independent 


1 This research was supported by Grant MH-23231-01 from 
the National Institute of Mental Health, and by Grant 040- 
7228A from the Research Foundation of the State of New York 
to the first author. x S 

2 Requests for reprints should be sent to C, James Scheirer, 
Department of Psychology, State University of New York at 
Binghamton, Binghamton, New York 13901. 


access to STS and LTS. It must be noted that, 
although an. STS-LTS distinction is discussed by 
many writers, there does not appear to be an empiri- 
cal distinction between processing in S and pro- 
cessing in STS in the scanning paradigm. Forrin 
and Morin presented materials assumed to ri le in 
LTS and STS; they found that Ss were able to 58 
the sets of materials independently. Kaminsky anc 
DeRosa (1972) found that when Ss were presented a 
mixed 6-item list of letters and numbers they were 
able to partition this list, and they appeared to 
operate on each sublist independently. Thus, for 
these studies, the utility of a distinction between 
STS and LTS on other than operational grounds is 
unclear. Glanzer (1971) has made much the same 
point in his distinction between LTS and STS as 
theoretical constructs and long- and short-term 
memory (LTM and STM) as operational distinctions. 

The question thus arises as to the possibility of 
empirically separating scanning functions in LIS 
and STS. The present study attempted such a 
differentiation. It was felt that a critical factor in 
producing the independent processing found by 
other Es might be the discriminability of the stimu- 
lus sets used. Therefore, in this study, the discrimi- 
nability of stimulus sets was varied systematically. 
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METHOD 


The materials used in this study were 2-digit num- 
bers, with each digit chosen randomly from the pool 
2-9, and letter bigrams, with each letter chosen 
randomly from the pool P-W. In no case were both 
items of a pair identical. Two complete sets of ma- 
terials were constructed, and half of the Ss received 
each set. Items were presented on pairs of IEE 
display units controlled by an Iconix system con- 
troller for .8 sec., with .2 sec. between items. After 
all of the items on a list had been presented, there 
was a 2-sec. pause until the test item (probe) ap- 
peared. One second prior to the presentation of the 
probe the word /esl, written in red, appeared above 
the display units. The probe terminated after 3 sec. 
There followed an intertrial interval of 11 sec., 
at which time the word ready appeared above the 
display units. One second later the presentation of 
the next list of items began. One hundred and 
twenty such lists were presented to each S. The S 
responded to the test stimulus (probe) by depressing 
1 of 2 buttons located on a box in front of him. The 
buttons were labeled yes and no, corresponding to 
positive and negative probes. 

Experimental conditions. There were 3 experi- 
mental conditions, control (C), same (S), and dif- 
ferent (D). Condition C was a replication of the 
basic Sternberg (1969) paradigm. The Ss received 
lists of 1, 2, or 3 items, followed 2 sec. later by a 
probe, An S saw only a single type of material. 
Sixteen Ss were run in this condition; for 8 Ss, all 
lists were composed of digits, while for the other 
8 Ss, all lists were composed of letters. This condi- 
tion can be conceptualized as a 0 item in the LTM 
set condition. 

Condition S Ss were required to memorize a list 
of 1, 2, or 3 items prior to the start of the experiment; 
this list constituted the LTM set. During the course 
of the experiment, Condition S Ss received lists of 
1, 2, or 3 items of the same type of material they had 
previously memorized in the LTM set; these items 
constituted the STM set. Half of the Ss received 
lists of numbers in both the STM and LTM sets, 
and half of the Ss received lists of letters in both the 
STM and LTM sets. In Condition S, a positive 
probe was defined by the instructions to include 
items in either the LTM or STM set; that is, items 
that had been memorized prior to the start of the 
experiment and items on the list most recently pre- 
sented were to be responded to when they were 
presented as probes by pressing the yes button. 
Negative probes were defined as items not presented 
in either the LTM or the STM set. Negative probes 
were, however, always of the same material as that 
in the LTM and STM sets. Twenty-four Ss were 
run in Condition S, 8 with each of the 3 different 
length LTM lists. The procedures for Ss in Condi- 
tion D were identical to those for Ss in Condition S 
except that the STM set lists were composed of ma- 
terial of the opposite type from that memorized in 
the LTM set. That is, half of the Ss received lists of 
numbers in the LTM set and lists of letters in the 
STM set, while the other half received lists of letters 
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in the LTM set and lists of numbers in the STM set. 
Long- and short-term memory items were tested 
equally often. 

For Ss in Conditions S and D, the LTM s 
written in block capital letters on a single 5 
index card. The Ss were instructed to men 
the items on the card and were allowed to ke 
card with them during the course of the exper 
The Ss were instructed to respond to probes as 
rapidly as possible, consistent with errorless perform 
ance. There were 64 Ss, all introductory psy- 
chology students at the State University of New 
York at Binghamton who participated in the experi- 
ment as part of the course requirement. 


RESULTS AND Discussion 


The data from the first 12 trials for each S were 
discarded. Mean and harmonic mean latencies for 
each condition were then computed on the remaining 
108 trials for each S. Only data from correct re- 
sponses were utilized; error rates were uniformly 
low, ranging from 3.6% in Condition C to 2 6% in 
Condition S and 2.2% in Condition D. Identical 
patterns of results were found for mean and harme 
mean latencies, so harmonic mean latencies w 
used throughout this report. These measures werc 
preferred, since they weight short latencies d 
portionately. This was felt to be desiral 5 
latency scores tend to be positively skewed. Har 
monic means tend to reduce the size of scores relat’ 
to arithmetic means. In general, they differ : 
from arithmetic means as the size of the mean de 

s. Thus, while differences were small, har- 
mean latencies tended to yield smaller values 
of reaction time (RT) and slopes Steeper than arith- 
metic means. For example, the best fitting (least 
squares) straight line relating reaction time to set 
size (s) for Condition C was RT — 637 + 160s for 
harmonic mean latencies, and RT = 668 + 172s, 
for arithmetic mean latencies. 

Condition C. The results of this condition, based 
on the responses of 16 Ss, replicated the basic Stern 
berg (1969) results. As is usual in that parac 
RT was a linear function of list length, with 
linear component of the trend accounting for 99,‘ 
of the variability in the case of positive probes and 
99.6% of the variability in the case of negative 
probes, The best fitting (least Squares) straight 
line was RT = 588 + 158s for positive probes, and 
RT = 686 + 161s for negative probes. It should 
be noted that these slopes are considerably larger 
than those found by Sternberg with single-digit 
numbers or by Swanson, Johnsen, and Briggs (1972) 
with pairs of digits. They are also substantially 
larger than those found previously by this E using 
similar procedures. No reason for the differences 
obtained in this study are apparent. 

Condition D. An overall analysis of variance of 
the harmonic mean latencies in Condition D showed 
that positive probes were responded to faster than 
negative probes, 1029 msec. vs. 1148 msec., F (1, 12) 
= 59.39, p < .01; and that list length in the STM 
Set was a significant source of variance, F (2, 24) 
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(RT) for correct responses from the long- and short- 


term memory (LTM and STM) sets in Conditions different (D) and same (S) as a 


function of set. 


= 68.11, p < .01, with 97.2% of the variability 
counted for by the linear trend. There were no 
ignificant deviations from linearity, F (1, 24) 
= 3,74, p > .05. It might be noted that list length 
in LTS was also a substantial source of variability, 
but since this factor was between Ss, the design 
lacked power in the test of this component, F (2, 12) 
<1. The only other sources of variability to reach 
significance were the LTM Set List Length X Locus 
of Test (LTM or STM) interaction, F (2, 12) 
= 16.90, p < .01, and the STM Set List Length 
X Locus of Test interaction, F (2, 24) = 26.90, 
p <.01. These interactions, illustrated graphically 
in the top portion of Figure 1, indicate that memory 
scanning occurred for both the LTM and STM 
items, but that changing the size of the memory 
set in the nontested locus had no effect on speed of 
recall. The slope of the scanning function in the 
LTM set was 124 msec/item, while that in the STM 
set was 104 msec/item, A specific planned compari- 
son, using a combined within-Ss error term, indi- 
cated that these slopes did not differ reliably, 


F (1, 132) = 1.86, p > .05; and that the size of the 
memory load in the nontested store did not affect 
I, F (4, 132) < 1. 
e data presented in Figure 1 suggest that Ss 
access the STM and LTM material independently 
when that material is highly discriminable, This 
conclusion follows from the fact that in Condition D 
there is no effect of LTM set size on scanning in the 
STM set, and no effect of STM set size on scanning 
in the LTM set. These results thus replicate the 
results of Forrin and Morin (1969). It should be 
noted, however, that although it would seem likely 
that STM and LTM sets would reside in different 
memory states (STS and LTS) because of the 
amount of information and operations involved in 
the task, this conclusion is not forced by the data. 
Itis possible that all items are held in active memory, 
but that discriminable subsets of items are acted on 
independently (Kaminsky & DeRosa, 1972) 
Condition S. An overall anal of variance on 
harmonic mean latencies revealed that list length 
in the LTM set was a significant source of variance, 
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F (2, 12) = 5.53, p < .05, with 97.0% of the vari- 
ability in the effect accounted for by the linearity of 
thetrend. Deviations from linearity failed to reach 
significance, F (1, 12) < 1. Harmonic mean laten- 
cies of the 3 types of test items also differed signifi- 
cantly, F (2, 24) = 21.82, p < .01, with means of 
1071 msec. for positive probes in the STM set, 1102 
msec, for positive probes in the LTM set, and 1243 
msec. for negative probes. Specific comparisons 
among these conditions revealed that negative probes 
were responded to reliably slower than positive 
probes, F (1, 24) = 42.15, p < .01, but RT to posi- 
tive probes did not differ according to memory set, 
F (1,24) = 1.24, p > .05. List length in the STM 
set was also a significant source of variability, 
F (2, 24) = 20.35, p < .01, with the linear com- 
ponent of trend accounting for 99.98%, of the vari- 
ability in the effect. The only other sources of 
variability to reach significance were the LTM Set 
List Length X Test Conditions interaction, F (4, 24) 
= 15.83, p < .01, and the STM Set List Length 
X Test Conditions interaction, F (4, 48) = 3.14, 
b <.05. These effects are presented graphically 
in the bottom portion of Figure 1. 

The function relating RT to set size in the LTM 
set in Condition S was RT = 380 + 361s. This 
function was basically linear in form, accounting for 
99.3% of the variability associated with difference 
in set size. The function obtained for this condition 
has a slope substantially larger and an intercept 
substantially smaller than that found in any other 
condition. Using Sternberg's (1969) interpretation 
of slope and intercept values, the L'TM items in 
Condition S are accessed faster than those in any 
other condition (including Condition C, access to 
a single set of items), but are scanned more slowly. 
While the slow scanning rate for LTM items appears 
plausible, and in fact replicates Sternberg (Experi- 
ment V), the very fast access time appears difficult 
to handle within Sternberg's model. 

One possible explanation of this data is that Ss 
might be disregarding the classification of items into 
L'TM and STM sets. The best fitting straight line 
through these points is RT = 657 + 104s, which 
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accounts for 91.2% of the variability in prediction. 
One interpretation of this function is that material 
from the LTM set is held in the same locus as items 
in the STM set; the total set is then scanned exhaus- 
tively before a decision is made concerning the ST ^? 
set items. The material from the LTM set could 
not be the actual LTM items, however, since thi 

would predict identical scanning functions for th 

LTM and STM sets. 

An alternative explanation of the data from Con 
dition S suggests that Ss were able to differentiat: 
between the LTM and STM set only when there 
was a single item in the LTM set. This interpreta- 
tion is suggested by the data of Darley, Klatzky, 
and Atkinson (1972), which indicate that Ss are 
able to process items which have a high probability 
of being tested, independent of the remaining 
items on a list. In this study, when there was a 
single item in the LTM set, it was tested 25% of 
the time (50% positive probes, 50% L'TM tests) a 
proportion of testing that might lead to this item 
being treated as a high-probability test item. This 
explanation is not supported, however, by the fact 
that a similar facilitation did not occur in Condi- 
tion D. 


REFERENCES 


ATKINSON, R, C., & SuirrRIN, R. M. Human memory: A 
proposed system and its control processes. In K, W, Spence 
& J. 1 pence (Eds,), The psychology of learning and motiva- 
tion. Vol. 2. New Yor) 

DARLEY, C. F., KrATZKY, 
of memory load on reaction time. 
Psychology, 1972, 96, 232-234. 

FonniN, B., & Morin, R, E, Recognition times for items in 
np and long-term memory. Acta Psychologica, 1969, 30, 
126-141. 

GLanzer, M. Storage mechanisms in recall. In G. H. Bower 
(Ed.), The psychology of learning and motivation, Vol, S. 
New York: Academic Press, 1971. 

Kaminsky, C, A., & DeRosa, D. V. Influences of rehearsal 
cues and set organization on short-term recognition memory, 
Journal of Experimental Psychology, 1972, 96, 449-454. 

Swanson, J. M., JORNSEN, A. M., & Bricos, G. E. Recoding 
in a memory search task. Journal of Experimental Psy- 
chology, 1972, 93, 1-9. 

STERNBERG, S. Memory-scanning: Mental processes revealed 
Be DAC He experiments, American Scientist, 1969, 57, 


: Academic Press, 1968, 
R. L., & ATKINSON, R. C, Effects 
Journal of Experimental 


(Received March 26, 1973) 


| 


Journal of Experimental Psycholo; 
1974, Vol. 102, No. 2, 3472349 d 


INFLUENCE OF ACTIVE AND PASSIVE VOCALIZATION ON SHORT-TERM RECALL? 


PHILLIP M. TELL? ann ALEXANDER M. FERGUSON 


Florida Technological University 


The Brown-Peterson distractor technique was used to investigate the effects 
of active and passive vocalization vs. silent visual presentation on recall at 
various retention intervals. Under active vocalization conditions, visually 
presented stimulus items were voiced aloud by S, and under passive vocaliz. 
tion conditions, these items were aurally presented by E. The findings indi- 
cated that both active and passive vocalization produced better recall than 
the silent presentation condition at retention intervals of up to 5 sec. Also, 
the active and passive vocalization conditions differed mainly at the longer 


retention interv: 


with recall much higher under the p. 
condition. There were only minimal differences between ac 


ve vocalization 
e and passive 


vocalization at the short retention intervals. 


There is currently much theoretical interest in 
the effects of various auditory and articulatory 
stimulus presentation conditions on short-term 
recall. In order to facilitate understanding, the 
present study will maintain Crowder's (1970) dis- 
tinction between active and passive vocalization 
presentation conditions. Active vocalization refers 
to presentation conditions where the to-be-remem- 
bered stimulus items are voiced aloud by S as 
they are visually presented. Under passive vocal- 
ization conditions, S listens to E read each stimulus 
item as it is visually presented. 

The findings of many experiments indicate that 
both auditory presentation of stimulus items by Æ 
(passive vocalization) and vocalization of these 
items by S (active vocalization) have significant 
advantages over silently read, visually presented 
items (Cooley & McNulty, 1967; Crowder, 1970; 
Murdock, 1966, 1967; Murray, 1965; Tell, 1971). 
This advantage of active vocalization over silent 
visual presentation has been demonstrated both 
with the Brown-Peterson distractor task (Brown, 
1958; Peterson & Peterson, 1959) and the im- 
mediate memory task. It seems reasonable to 
assume that the same memory systems can be 
investigated through the use of the distractor task 
or the immediate memory task. Thus, recall per- 
formance at short retention intervals (distractor 
task) is assumed to reflect the same memory char- 
acteristics as recall at recency positions of the serial 
position curve. Likewise, it is assumed that recall 
after longer retention intervals should be com- 
parable to the primacy positions of the serial posi- 
tion curve. When measured with the distractor 
task, active vocalization is superior to silent visual 
presentation mainly at retention intervals of less 
than 7 sec. (Tell & Voss, 1970). When measured 
with an immediate memory task, active vocaliza- 
tion is generally superior to silent visual presenta- 


iThis research was supported by National Institute of 
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tion on the last few serial positions only (Conrad 
& Hull, 1968). The same general types of results 
also occur when passive vocalization is compared 
with silent visual presentation (Corballis, 1966; 
Crowder, 1970; Grant & McCormack, 1970). Thus, 
it strongly appears that the same information 
system can be investigated with both the distractor 
task and the immediate memory task. 

Currently, the only available direct comparison 
of the effects of active and passive vocalization on 
immediate recall is by Crowder (1970). His most 
relevant finding in terms of the present discussion 
was that passive vocalization was superior to active 
vocalization for only the first few serial positions 
(primacy positions). Both of these conditions 
were superior to the silent visual presentation 
condition at the recency positions only. 

It has previously been speculated that the 
auditory feedback that results from active vocal- 
ization is functionally equivalent as to the auditory 
input resulting from passive vocalization (Tell & 
Voss, 1970) as a source of information.  Crowder's 
(1970) data indicate that this is clearly not the 
case. As Crowder speculates, active vocalization 
during presentation could interfere with effective 
rehearsal or encoding strategies compared to a pas- 
sive condition, in which S is free to engage in 
whatever type of encoding activity is most effective. 
This may account for the apparent superiority of 
passive vocalization at the primacy positions. 

The present study was designed to provide a finer 
analysis of the effects of active and passive vocal- 
ization vs. silent visual presentation on recall at 
various retention intervals through the use of the 
distractor technique, If recall performance at short 
retention intervals indeed reflects the operation of 
the same memory characteristics as does recall of 
recency items, then it would be predicted that both 
active and passive vocalization should be superior 
to silent visual presentation at these short retention 
intervals. Likewise, it is predicted that passive 
vocalization will be superior to active vocalization 
at longer retention intervals, which correspond in- 
formationally to the primacy positions of the serial 
position curve, 


347 


348 


90. 


ES PASSIVE 
AN b VOCALIZATION 


ITEM RECALL 
E] 


40 


.30, 


MEAN PROPORTION CORRECT 


iy rapa es BEN MES 
RETENTION. INTERVAL 


FIGURE 1, Mean proportion correct i 
" i item recall as n 
9f presentation condition and retention inter ed 


Method. The Ss were 90 undergraduates at 
Florida Technological University, All Ss were 
tested individually and received course credit for 
participating in the experiment. 

Twenty-eight consonant (C) trigrams were se- 
lected from the Witmer norms in Appendix B of 
Underwood and Schulz (1960) in such a way as 
to minimize association value and acoustic simi- 
larity. Repeated occurrences of any consonant 
were separated by at least 2 CCCs, Four of the 
28 CCCs served as practice items and were not 
used. in analysis of data or counterbalancing of 
conditions, "The CCCs were photographed on 
35-mm. film and mounted in slide holders. Blue 
gelatin filters mounted in slide holders marked the 
onset of each trial, and a series of 3 question 
marks served as the recall signals. 

The slides were projected by means of a Kodak 
Carousel 800 projector. Series of 2-digit numbers 
for the interpolated task were filmed on 16-mm. 
motion picture film and projected with a Dunning 
Animatic film strip projector. 

Both of these projectors were operated by tone 
pulses recorded on magnetic tape by the use of 
electromechanical programming equipment, One 
channel of a Sony Model 630 stereo tape recorder 
was used for this purpose. A Kodak Carousel 
programmer and sound synchronizer were used for 
both the recording and the playback of tones, 

The second channel contained E's recorded vo- 
calizations of the CCCs, calibrated to coincide 
approximately with the visual display of the CCCs. 
This channel was turned on in the passive vocal- 
ization condition and was turned off under the 
silent and active vocalization conditions, 

Using the Brown-Peterson distractor technique 
(Brown, 1958; Peterson & Peterson, 1959), Ss at- 
tempted to retain CCCs while performing an inter- 
polated task. The interpolated task involved the 
presentation of digit pairs at a rate of 1 sec/pair, 
and all Ss were tested after 1, 3, 5, 8, 11, and 14 
pairs of digits. The interpolated activity required § 
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to verbally classify each member of the digit 
as odd or even, from left to right. 

Each S received 28 trials: the first 4 were 
tice and the rest were test trials, The or 
trigram presentation was the same for all Ss. 
confounds practice completely with individua! 
grams, but balances practice against reter 
interval. 

"The 90 Ss were divided into 3 presentation 
ditions. In the silent presentation condition, 
terials were presented visually and Ss were request 
to silently read the trigrams to themselves. The ac 
tive vocalization condition required Ss to read the 
trigrams aloud while they were on the screen. 
Under the passive vocalization conditions, Ss heard 
E read the trigrams aloud while they were presented 
on the screen, Thus, during the 24 scored trials 
in each of the 3 presentation conditions, a given S 
received 4 test trials at each of the 6 retention 
intervals. 

A 6 X 6 Latin square was used to order the 6 
retention intervals within the 24 test trials. | 
received an ordering of retention intervals d 
mined by 4 rows of the Latin square. Ti 
separate samples composed of 4 rows each w: 
used, so that each retention interval followed 
other retention intervals an equal number of t 
and never followed itself. Within any presentation 
condition, 10 different Ss received each of the 3 
retention interval orders. 

Briefly, a single trial proceeded in the following 
manner. A blue ready light illuminated the screen 
for 1 sec. Following this by .75 sec., a CCC was 
presented on the screen for .75 sec., during which 
time SS either read it silently, spoke it aloud, or 
listened to it read by E over a speaker. After 
the presentation of each CCC, Ss were tested at 
one of the 6 retention intervals (1, 3, 5, 8, 11 or 
14 sec.). The interpolated activity was continually 
scored, and when necessary, Ss were reminded to 
do as well as possible, Three question marks ap- 
peared after the last digit pair was removed from 
the screen. These remained on the screen for t 
10 sec. during which .S attempted to recall the CC 
A rest interval of 10 sec. separated the trials. 

Results and discussion. The data of each S were 
scored in terms of the number of consonants cor- 
rectly recalled, regardless of whether they appeared 
in the correct intratrigram position. With low- 
association-value trigrams, it appears that the item 
score is the most meaningful and sensitive unit of 
analysis (Wickelgren, 1965), Results are not pre- 
sented for ordered scores because they are in close 
agreement with item scores in all important respects. 

An overall analysis of variance was performed 
on the recall scores. This analysis indicated that 
the 3 types of presentation conditions resulted in 
significantly different recall functions, F (2, 87) 
= 4.65, p <.05. As can be seen in Figure 1, 
recall performance for all groups decreased as a 
function of retention interval, F (5, 435) = 316.4, 
b «.001. It is apparent that the types of pres- 
entation conditions had different effects at the 
various retention intervals tested, F (10, 435) = 4.0, 
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p <.01. Both active and passive vocalization con- 
ditions are clearly superior to silent visual pri 
entation at retention intervals of up to 5 sec. This 
might be assumed to result from the additional 
echoic information provided by the auditory input 
from these presentation conditions (Neisser, 1967). 
However, the initial differences between active 
vocalization and silent visual presentation disappear 
at the longer retention intervals (8, 11, and 14 sec.). 
It is also apparent that the passive vocalization 
condition provides higher recall than either of the 
other 2 presentation conditions at these longer 
retention intervals, 

The present findings closely parallel those ob- 
tained with the immediate memory task (Crowder, 
1970), First, both active and passive vocalization 
conditions produced better recall than the silent 
presentation condition at retention intervals of up 
to 5 sec, Second, the active and passive vocaliza- 
tion conditions differed mainly gt the longer reten- 
tion intervals. There are just minimal differences 
due to active or passive vocalization at the short 
retention intervals. The evidence strongly indicates 
that the same memory systems can be investigated 
using either supraspan memory units (immediate 
memory task) or subspan memory units (dis- 
tractor tas 

It appears that the requirement of active vocal- 
ization at presentation interferes with effectiv 
encoding operations. Active vocalization may 
demand more attention at presentation than does 
passive voci tion, The effect may not involve 
just the disruption of rehearsal strategies, but may 
instead lessen the selective attention capabilities 
that are necessary for effective encoding in memory. 
Thus, the advantages of passive vocalization are 
mainly at the longer retention intervals, when the 


p." 


349 


information in echoic memory has dissipated. The 
a clearly indicate that active and 
ization conditions are not comparable 
hput to the memory system when 
ith silent visual presentation. 
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BELOW-ZERO CONDITIONED INHIBITION OF TI 


= RABBIT'S 


NICTITATING MEMBRANE RESPONSE! 


HORACE G. MARCHANT, III, AND JOHN W. MOORE? 


University of Massachusetts 


Albino rabbits received Pavlovian conditioned inhibition training of their 
nictitating membrane response to a light (positive conditioned stimulus, 
CS+) and the same light plus a pure tone (negative conditioned stimulus, 


CS): 


Subsequent acquisition to the tone was retarded relative to naive 


and "sit" controls as well as groups receiving differential conditioning to 
noise (CS+-) and light plus tone (CS —), and light (CS+) and light plus 


noise (CS—). 


The results were interpreted as demonstrating that condi- 


tioned inhibition training results in a CS with a negative (below-zero) value. 


Although interest in Pavlovian conditioned in- 
hibition has increased in recent years, there remain 
problems of interpretation of this phenomenon 
(Hearst, 1972). Conditioned inhibition is usually 
defined as a response tendency opposite that of 
excitation, where excitation is the increase in the 
strength of a conditioned response (CR) due to 
pairings of the conditioned stimulus (CS) and un- 
conditioned stimulus (UCS). This definition of 
excitation implies that conditioned inhibition is a 
tendency to not respond resulting from presenta- 
tion of the CS without reinforcement. The question 
of interpretation is whether failure to respond in 
this case may not be more parsimoniously concep- 
tualized as resulting from a loss of excitation rather 
than from an active inhibitory pro (cf. Brown 
& Jenkins, 1967). 

'The assumption that both excitatory and in- 
hibitory forms of conditioning exist, separated by 
a point of zero-response strength, is a crucial feature 
of a number of learning theories, including familiar 
Hullian theories of discrimination learning (Spence, 
1937). The theory of classical conditioning de- 
veloped by Rescorla and Wagner also assumes that 
CR strength (V in their model) may take on either 
positive or negative value (cf. Wagner & Rescorla, 
1972). The demonstration that conditioned in- 
hibition results in a CS of negative value would 
seem to be of some importance for both of these 
theories. 

The following 3 criteria should be satisfied in 
order to assert that a given training experience 
results in a CS with inhibitory properties: (a) a zero 

point must be established, (b) CR strength must 
lie below this point, and (c) this below-zero or 
negative effect must be the result of learning rather 
than some nonassociative factor such as an atten- 
tion decrement. Marchant, Mis, and Moore (1972) 
investigated conditioned inhibition of the rabbit's 
nictitating membrane response (NMR) using 
Pavlov's procedure of reinforcing one stimulus 
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tion Grant GB-24557. This experiment was based on the 
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(CSi) while systematically nonreinforcing a com- 
pound consisting of CS; and another stimulus 
(CS2). Although successful in demonstrating in- 
hibitory summation and retardation (cf. Re 
1969), this experiment failed to establish condi- 
tioned inhibition as a below-zero effect. One reason 
for this was that all Ss were conditioned to CS:, 
the potential inhibitor, prior to differential condi- 
tioning in an attempt to circumvent the develop 
ment of latent inhibition to CSa Therefore, 
retardation was assessed in terms of rate of re- 
acquisition following training rather than rate of 
initial acquisition. Secondly, control Ss received 
prior conditioning to CS» followed by pseudo- 
conditioned inhibition training in which each of 
the 2 trial types was paired with the UCS 50% 
of the time in a random sequence, Under these 
circumstances it seems unlikely that CS» in the 
control group defined a  zero-reference point. 
Wagner and his associates have also investigated 
conditioned inhibition of the rabbit's eyeblink, 
These studies, while supporting the Rescorla and 
Wagner model, did not address the question 
whether conditioned inhibition is a below-zero 
effect (cf. Wagner & Rescorla, 1972). 

The present experiment sought to establish co: 
ditioned inhibition of the rabbit's NMR as a be! 
zero effect by utilizing essentially the same trai; 
procedures employed by Marchant et al. (197 
but by comparing initial acquisition of the con 
ditioned inhibitor with a reference group not 
subject to the complications mentioned above. 
Specifically, conditioning to CS; following inhibitory 
training was contrasted with a group which merely 
sat restrained in the conditioning chambers during 
the conditioned inhibition phase. Additional con- 
trols, not employed previously, were included to 
assess the role of a variety of potential artifacts 
contributing to retardation of acquisition to the 
inhibitory CS, 

Method. The Ss were 49 experimentally naive 
rabbits, Up to 4 Ss were run simultaneously in 
ventilated soundproofed file drawers, and the details 
of recording the conditioned NMR have been de- 
scribed elsewhere (Marchant et al., 1972). The 
CSs were a 1,200-Hz. pure tone (T) of 86 db. 
SPL, 2 4.5-v. incandescent lights (L) behind milk 
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TABLE 1 
SUMMARY OF EXPERIMENTAL DESIGN 


Stage 
Group ere 
1 2 
Naive (V = 7) E S 
Sit (N sit T 
LLT- (N =14) L T+ 
19 
LLN- (N - 7) Lr T+ 
N+LT— (N =7) N+ T 
os 
Nole, Abbreviations: T = tone, L = light, N = noise 
burst, + = reinforced, — = nonreinforced. 


white screens located in front of S, and an 86 db. 
SPL burst of white noise (N). The duration of 
each CS was a constant 550 msec. The UCS was 
a 2-ma. ac shock of 50-msec. duration delivered 
through stainless steel wound clips attached to the 
infraorbital region of the right eye. The intertrial 
interval was a constant 30 sec. and the interstimulus 
interval was 500 msec. throughout. 

The experimental design is summarized in 
Table 1. One day prior to Stage 1 training (or, 
in the case of Group Naive, Stage 2 training), 
‘Ss had their right NMs sutured and were habituated 
to restraint and the apparatus for a period of 
approximately 15 min. For Ss in Group L + LT—, 
Stage 1 training consisted of 50 reinforced presenta- 
tions of the L stimulus, unsystematically inter- 
mingled with 50 nonreinforced presentations of the 
simultaneous compound L plus T, in each daily 
session. This training was expected to produce 
an inhibitory T stimulus and was continued until 
the following criteria had been met: Between 
Sessions 6 and 12, if (a) percentage of CRs to 
CS— had first increased and then decreased over 
sessions and (b) percentage of CRs to CS+ was 
better than 80% and the percentage of CRs to 
CS— was less than 30%, then Stage 2 training 
began for that S midway through the next day’s 
session, Group Naive did not participate in the 
experiment until Stage 2. 

Group Sit served as a control for any possible 
effects of handling, suturing, sitting in restraint, 
experience with static apparatus cues, etc. In 
Stage 1, Ss in this group were yoked to Ss in 
Group L + LT— and were treated in an identical 
manner but without CS and UCS presentations. 
Group L + LN — was trained to the same criterion 
as Group L + LT—, but the discrimination was 
between L+ and (L 4- N)— as opposed to L+ 
and (L + T)— as in the Group L + LT—. Group 
L 4- LN — served as a control for experience with 
CSs, the UCS, conditioning and, more specifically, 
inhibitory training (i.e., inhibitory training per se 
may be sufficient to produce a deficit in acquisition 
of all stimuli). Group N + LT— was also run 
to the same criterion as Group L + LN—, and 
served ^is a control for experience with the LT— 
compound and generalization decrement in Stage 2. 
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The same nonsystematic order of CS+ and CS— 
trials in Stage 1 was used for Groups L + LT-, 
N+LT-, and L+LN-, 

Stage 2 training for all groups began with 50 
reinforced trials to T in the last half of the last 
session of Stage 1. This was followed by 100 
trials 5 (2-3) hr. later and 2 100-trial sessions the 
following day at approximately the same times as 
the previous day’s sessions. 

Results. All Ss in Groups L + LT—, N 4- LT—, 
and L + LT— met the discrimination criteria in 
Stage 1 within the allotted 12 sessions. Acquisi- 
tion to T in Stage 2 was assessed in terms of (a) 
trials to a criterion of 80% CRs over 3 consecu- 
tive bloc f 10 trials and (b) percentage of CRs 
over the initial 150 acquisition trials. Table 2 
shows the mean of each measure of Stage 2 con- 
ditioning for each group. 

Analysis of variance indicated significant 
(p < .001) differences among groups for both mea- 
sures with F (4, 44) = 19.56, and F (4, 44) = 13.91, 
for trials to criterion and percentage of CRs, 
respectively. 

Individual planned ¢ tests based on the analyses 
of variance (df — 44) indicated that Group 
L+LT— was significantly slower to reach cri- 
terion than Group Sit, £ = 4.15, p < .001. Group 
L + LT— was also slower to acquire than Group 
Naive, t = 4.07, p < .001. Description of the 
results of individual pair-wise contrasts among 
groups is confined to the trials to criterion measure 
snce similar comparisons for percentage of CRs 
over the first 150 acquisition trials were significant 
at better than the .025 level. 

Groups N+LT— and L+LN-—, which did not 
differ significantly from each other, conditioned to 
T significantly more quickly than Group Sit, 
t = 3.81, p < .001, and ¢ = 3.94, p < .001, respec- 
tively. Groups N--LT— and L--LN-— also con- 
ditioned to T significantly more quickly than 
Group Naive, t = 2.58, p < .01, and ¢ = 2.70, 
p «.005, respectively. Similar comparisons for 
percentage of CRs over the first 150 acquisition 
trials were also significant at better than the 
.005 level. 

Discussion. The principal findings of this ex- 
periment were as follows. (a) Pavlovian condi- 
tioned inhibition training (L--LT-—) resulted in 
retarded acquisition of the inhibitor (T) when 
compared with "sit" and naive controls. In this 
sense, conditioned inhibition produced a CS with 
negative or below-zero value as required by theo- 


TABLE 2 


MEAN TRIALS TO CRITERION AND PERCENTAGE OF 
ConpitionNeD Responses (CRs) IN STAGE 2 


| Trials to criterion | Percentage of CRs 


Group 
L+LT— 134.4 35.3 
Sit 81.6 48.5 
Naive 68.6 56.2 
L+LN— 21.3 81.8 
N+LT— 19.1 82.5 


ries (Spence, 1937; Wagner & Rescorla, 1972) 
which assume the existence of active inhibitory as 
well as excitatory proces (b) Differential con- 
ditioning in an N+LT— paradigm and condi- 
tioned inhibition training of a noise burst (L+LN—) 
both resulted in faster subsequent acquisition to T 
than in the sit and naive control groups. 

The below-zero effect of conditioned inhibition 
training cannot be readily interpreted as resulting 
from an attention decrement since the criteria of 
differential conditioning imposed in Stage 1 of 
training assured attention to the conditional in- 
hibitor right up to the onset of acquisition in 
Stage 2. If attention decrement were responsible 
for retardation of acquisition to T in Group 
L+LT—, retardation would have been observed 
in Group N+LT— as well. Failure to observe 
retardation to T in the latter group cannot be 
attributed to “overshadowing” of T by L since 
independent observations in our laboratory have 
established that T is the more effective CS in 
simple conditioning and hence the more “salient” 
of the 2 stimuli, 

Repeated preexposure to the CS or UCS is 
known to retard conditioning of the rabbit's NMR 
(e.g, Mis & Moore, 1973). Groups N--LT— 
and L+LN-— served as controls for these factors 
in the present experiment. The fact that condi- 
tioning to T in Groups L--LT— was substantially 
retarded relative to these 2 groups would seem 
to eliminate preexposure as the mechanism of 
retardation. 

Failure to observe retardation to T in Groups 
N+LT— is consistent with the prediction of the 
Rescorla-Wagner model that inhibition develops as 
à consequence of nonreinforcement against a base- 
line or background of excitation (Wagner & Ri 
corla, 1972). In the present experiment, T in 
Group L-+LT— satisfied this stipulation by virtue 
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of the fact that nonreinforcement of T occurred 
in the presence of an excitatory CS, L, and retar- 
dation of acquisition to T was observed in this 
case. By contrast, L could not provide the ex- 
citatory component necessary for the development 
of inhibition to T in Group N3-LT—. 

Finally, Groups L+LN— acquired a CR to T 
more rapidly than any group other than N-+L 
Since both N and T are in the same modality, 
one might have expected inhibition to transfer 
from Stage 1 (N) to Stage 2 (T) and result in 
some retardation. That such transfer did not 
occur may have been due either to (a) excitatory 
transfer overcoming any inhibitory transfer present 
or (b) overshadowing of the L by the N so that 
Group L+LN— would be, in effect, a simple dif- 
ferential conditioning group. In the latter case 
no active inhibition would be produced due to the 
absence of an excitatory component. 
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THE APPARENT MAGNITUDE OF NUMBER SCALED 


BY RANDOM PRODUCTION ? 


WILLIAM P. BANKS? AND DAVID K. HILL ? 
Pomona College 


A technique for generating a ratio scale for any psychophysical continuum 
by having Ss generate subjectively random productions on that continuum 
is described and used to scale numerical magnitude, duration, and line 
length. The perceived magnitude of numbers grows as an increasingly 
decelerated function of their absolute magnitude, but it is approximately 
logarithmic from 10 to 1,000. The range of 1—500 is fitted well with a power 
function with an exponent of about $. Several experiments defend the 
position that the psychological scale of number is compressive and explore 
a number of possible artifacts in the random-production technique. Cross- 
modality matches between numbers, line lengths, and durations are predicted 
from the random-production scales derived for each of these dimensions. 
The predictions give power functions whose exponents are in some agreement 
with previous experimental estimates. Random-production scales for hand- 
grip force, intensity of electric shocks, and subjective area are consistent 
with their magnitude-estimation scales, given that number has an ex- 
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ponent of 3. 


! 
| 
In magnitude-estimation experiments, S 
' assigns numbers to stimuli according to 
| some rule, and Æ must take him at his 
word on the rule he is using. Of course, 
! j there is no reason to assume that S is 
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authors must apologize for not following their 
advice in every case. 

? Requests for reprints should be sent to William 
P. Banks, Department of Psychology, Pomona 
College, Claremont, California 91711. 

3 Now at the University of Oregon Law School. 


lying about his perceptions, but there is 
no way we can be sure that he has the same 
conception of the magnitude of the response 
numbers he is using that Æ (or arithmetic) 
does. Thus, if S calls a given stimulus 
“20,” the magnitude-estimation technique 
assumes its subjective magnitude is twice 
that of the stimulus called “10.” But S’s 
conception of the numerical magnitude 
could be such that a stimulus subjectively 
twice as great as the one called “10” 
would have to be called “30.” In this case, 
the resulting scale, while replicable and 
capable of entering properly into cross- 
modality matches, would not reflect sen- 
sory magnitudes accurately. 

The question of the form of the psycho- 
logical scale of number is central to under- 
standing the meaning of direct-scaling 
experiments; yet, while these experiments 
continue to proliferate and conclusions 
about sensory function continue to be 
drawn from them, no agreement on this 
important question has been reached. Per- 
haps the simplest and most convenient 
approach is to assume that the magnitude 
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FIGURE 1, Psychophysical mapping between O, 
the observable continuum, and S, the continuum 
of subjective experience, 


scale of number is linear. There is, how- 
ever, evidence that it is not (cf. Atteneave, 
1962; Curtis, 1970; Ekman & Hosman, 
1965; Fairbank, 1969; Moyer & Landauer, 
1967; Rule, 1969; Sekuler, Rubin, & 
Armstrong, 1971), and in any event it 
would be better to demonstrate its linearity 
before proceeding on the assumption that 
itislinear. (For evidence for linearity, see 
Dawson, 1971, and Goude, 1962.) 

Previous attempts to scale the psycho- 
logical number continuum have shown it to 
be compressive, but they are in some dis- 
agreement over the precise form of the 
function. Some seem to have shown that 
the scale is logarithmic (Moyer & Land- 
auer, 1967), and others, that it is a power 
function (Curtis & Fox, 1969). These con- 
flicting results reflect the difficulties in- 
trinsic to the problem. Conventional 
direct-scaling techniques cannot be used in 
unmodified form because with them, one 
dimension must be scaled in terms of an- 
other. Whatever function is obtained will 
be a result of the input and output dimen- 
sions working together, Conventional in- 
direct methods cannot be applied in un- 
modified form to scaling the numbers, 
either. One obvious difficulty is that there 
is no just noticeable difference for number 
(2 numbers are either equal or they are not), 
and thus there is no discrimination problem 
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to exploit in constructing a scale. (How 
ever, for an ingeniously devised Thurston 
ian scale of number, see Rule, 1969). 

This article reports experiments utilizi 
a new technique for studying how | 
magnitude of numbers is perceived. Th 
technique can be used to scale any dime: 
sion S can control, and it is not limited : 
the numerical continuum. The technique 
is very simple: Ss are asked to produce 
random quantities—in this case, numbe 
as rapidly as possible, with no fixed upper 
limit, in no special order, and as randomly 
as possible. The assumption behind the 
random-number-production technique is 
that Ss will display, by the form of the 
distribution they produce, their conception 
of numerical magnitude. It is as though 
Ss refer to their internal "ruler"—pre- 
sumably the same one that is used in 
magnitude estimation—and select numbers 
from this source in a nonsystematic fashion. 
"Their sampling from this line is assumed to 
be rectilinear and random. By this rea- 
soning, intervals will be represented in 
proportion to their psychological size. If, 
for example, the numbers 105 and 115 
represent points that are about as far 
apart psychologically as 5 and 15, Ss 
should generate, on the average, as many 
numbers in one of these intervals as in the 
other. 

The assumption that Ss sample recti- 
linearly from their subjective scale in the 
number-production task permits a power- 
ful analysis of the results. If their sampling 
is in fact rectilinear, the expected value 
of the order statistic for the n random 
samples can give a ratio scale metric for 
the psychological continuum over which the 
samples were made, as the following con- 
siderations show. 

In a random-production experiment on 
any continuum, S produces n subjective 
randomly selected quantities on that con- 
tinuum. If the productions of each S are 
put in order from smallest to largest and 
an average of the productions at each 
ordinal position is taken across Ss, then 
the resulting » averages estimate the ex- 
pected values of the order statistics for 
those samples (Hogg, 1969). If sampling 


on the continuum is uniform, that is if 
f(x) = 1, 0 <x « 1, then the probability 
density function of Y, the kth of the n 
order statistics on 0 < x < 1, is 


n: 
20) = Gigs 5i 
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The expected value of Y, can thus be 
calculated as follows : 


1 
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Thus, the order statistics divide the 0-1 
line into » equally spaced points with a 
unit difference of 1/(n +1). That is, the 
kth order statistic will be & units of size 
1/(n +1) from the bottom. If the line 
extended from zero to N, the unit difference 
would then be N/(n + 1). 

It will be assumed throughout this article 
that Ss map all continua, in the process 
of perceiving them, onto a "subjective" 
scale which, for the sake of convenience, 
will be represented as a line going from zero 
to one. Figure 1 illustrates this mapping. 
In the figure, the O scale represents the 
physical (observable) continuum, and the 
S scale represents the subjective mapping 
of this continuum. The mapping function, 
M (0), assumed in this figure is monotonic 
and compressive but, in general, it may not 
be compressive for all continua. In the 
case of numbers, “infinity” may be mapped 
on the subjective continuum; if it is, the 
subjective. continuum, being finite, must 
become infinitely compressed at the top. 
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When Ss are asked to sample rectilinearly — 


from a continuum, they must do this sam- 


pling on the Scontinuum, for this is all that — 


is available to them. Thus, to the extent. 
that Ss can obey the rectilinearity instruc- 
tions, the order statistics derived from n 
random samples will determine n points on 
the S continuum, equally spaced with a 
unit proportional to 1/(n + 1). The S 
values are not, of course, directly available, 
but the order statistics that result from a 
random-production experiment provide n 
values on the O continuum that correspond 
to m equally spaced values on the S con- 
tinuum. The transformation of the O 
continuum that spaces the order statistics 
on O linearly is M(O), the psychophysical 
mapping function. This scale estimated 
by the order statistic is determined except 
for the origin and has a constant unit, It 
thus provides an interval scale of subjec- 
tively spaced O values if the O scale is 
interval or better, If a zero point can be 
located on the S scale that corresponds to 
a known point on the O scale [i.e., M(Opo) 
= So], then the order statistic gives a 
ratio scale of the S continuum. 


EXPERIMENTS I-III 


These experiments illustrate the basic 
random-number-generation task as applied 
with some variations, The experiments 
are reported in chronological order and 
show some evolution in technique, but the 
findings are essentially the same in all 3, 


Method 


The Ss were asked to produce random whole 
numbers rapidly and with no upper limit They 
were instructed not to produce long sequences ol 
numbers in the same neighborhood and not to count 
or give more numbers in order than would be ex- 
pected by chance. In Experiment I, instructions 
to this effect were given in an informal manner to 4 
psychology graduate students at Johns Hopkins 
University, These Ss were friends of E but were 
kept ignorant of the intent of the experiment, Their 
first 26 responses were written down, 

In Experiment II Ss were 30 male and female 
undergraduates at the University of California, 
Berkeley, who performed the task individually 
while waiting to serve in another, unrelated experi- 
ment. The Ss spoke their responses into a tape 
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FiGURE 2. Ordinal magnitude vs. absolute 


magnitude for random numbers produced by 4 Ss— 
SC, NH, PH, and BP: Experiment I. (Semilog 
coordinates.) 


recorder after the following instructions were read 
to them: 


This is an experiment on how people think about 
numbers. What I would like you to do is give 
me any integers from one to the highest number 
you can imagine. I would like you to think of 
the numbers as going from one up, with no upper 
limit. Just skip about in the numbers and speak 
whatever number comes to mind into the micro- 
phone. Do it as quickly as you can and do not 
try to think about the numbers you are giving. 
Don't count or try to follow any kind of system. 
Try to make the numbers as random as you can. 
I will tell you when you have given me enough 
numbers, 


Questions were answered (usually by referring to 
the instructions), the tape recorder was turned on, 
and S was asked to begin. Hesitancy on the part 
of some Ss was overcome by repeating or para- 
phrasing parts of the instructions. Once S began 
giving numbers, E looked away and assumed a 
bland expression to avoid giving subtle hints about 
what was expected (cf., Greenspoon, 1955; Orne, 
1962). The Ss were, however, told to “be more 
random” or to observe no limits if they appeared to 
be counting or limiting themselves to numbers in a 
small interval, and they were urged to go faster if 
their rate of number production was less than 1 per 
2 sec. or so. The Ss were allowed to generate 30 
numbers or more before being stopped. Numbers 
above 1 billion were discarded from analysis, and 
the first 25 acceptable numbers given were trans- 
cribed from the tape. 

Experiment III was a replication of Experiment 
II with slightly different instructions. The Ss 
were 20 male and female Pomona College under- 
graduates interviewed individually during breaks 
in an introductory psychology laboratory section. 
The instructions differed from those of Experiment 
II only in that Ss were asked to pretend they were 
a “random-number-generating machine” and were 
asked to deliver the numbers “as if free associating.” 
The Ss gave about 30 responses, all of which were 
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tape recorded. The first 15 responses unde» 19,000 


were transcribed for analysis. 


Results 


The responses given by each .S i: 
experiments were put in order befor: 
analysis was performed. Thus, il 
sequence as generated was 3, 97, 29, 
it was put in order as 3, 8, 29, 97. (This 
is, of course, the first step in calculating 
the order statistics.) Numbers in the 
ordered sequence tend to increase ex- 
ponentially in absolute magnitude as a 
function of ordinal position. For ex- 
ample, the geometric mean of the smallest 
number given by each S in Experiment II 
was 1.65, and the geometric mean of the 
largest (twenty-fifth in the order) was 3770, 
but the geometric mean of each S’s median 
number was 67.2. This dramatic increase 
in the size of the numbers with ordinal 
position can be seen in Figure 2, where the 
ordinal position of the numbers given by 
Ss in Experiment I is plotted against the 
absolute size of the numbers. The log- 
arithmic abscissa in this figure reduces the 
increase of the numbers to an approxi- 
mately linear one for each S. Two aspects 
of the functions of Figure 2 are of interest. 
First, the plots for each S show an overall 
logarithmic trend for most of their ex- 
tent, but have regions of scalloping at 
various points. Second, the functions show 
some systematic departure from the log- 
arithmic trend for numbers in the rec: 
below 10 or 15. Almost every S wh 
formed in the random-number-prodiction 
task displayed these features in the num- 
bers he gave. 


*Intuition seems to demand that the x and y 
coordinates of Figure 2 be reversed so that size of an 
emitted number would be plotted as a function of 
rankorder. Sucha plotting would make the variable 
that can vary freely—size of number—the depen- 
dent variable, but this fact is not sufficient to 
make number the logically dependent variable. 
The relationship between size and rank (or between 
the magnitude and rank order of order statistics) 
resists logical division into independent and de- 
pendent variables. The arrangement of Figure 2 
will be followed in subsequent figures because the 
function it shows is M(O). Interchanging co- 
ordinates would depict M^!(O) and might lead to 
some confusion. 


v 
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The Averaging Problem 


Each S's set of n random productions 
represents but 1 sample of the order 
statistics and consequently gives a noisy 
estimate of M (0). Averaging across Ss is 
clearly necessary for an accurate picture of 
the average M(O), but how to perform this 
averaging is a problem not easily resolved. 
The value of each of the productions ob- 
tained from any random-production ex- 
periment is necessarily on the O scale (see 
Figure 1). The “values” on the S scale are 
inaccessible, and the S continuum and its 
associated psychophysical function are, in 
fact, only intervening variables that serve 
to explain why subjectively equally spaced 
points on O are not equidistant in the 
units of O. In order to arrive at the best 
estimate of the order statistic, however, 
averaging of Ss’ responses should take 
place on the S scale, because rectilinearity 
of sampling can hold only on the S con- 
tinuum. Sampling is not rectilinear on the 
O scale, and the expected values of the 
order statistics cannot be estimated ac- 
curately by taking arithmetic means of 
untransformed O values. An obvious solu- 
tion to the problem would be to transform 
the O values to numbers proportional to 
the S values, average these numbers, and 
then reconvert the averages to O values by 
the inverse transformation. Of course, for 
a precise estimate of the S; that corres- 
ponds to each O,, the transformation used 
must be M(Q), and this is the unknown 
function to be estimated by the values of 
the order statistics. 

Estimating order statistics with medians. 
Fortunately, there are a number of solu- 
tions to the averaging problem. The 
siniplest is to take the median of the pro- 
ductions at each ordinal position. These 
are based on values on the O scale, but 
converting O to S by any monotonic M (O), 
taking medians of the S values, and then 
transforming the S medians back to O 
values by M=(S) will result in O values 
very nearly the same as the simple median 
of the original O values. Thus, medians 
taken on the O scale can provide an esti- 
mate of the O values that correspond to the 
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FIGURE 3. Rank of order statistic plotted against 
its magnitude estimated by the median of the con- 
stant-corrected numbers: Experiments II and III. 
(Semilog coordinates.) 


order statistics on S without requiring that 
any particular M (O) be assumed. 

The median has, however, a striking de- 
fect as a measure of central tendency in 
this situation. Unfortunately, with the 
present data, medians tend not to give an 
average scale but give rather the scale of 
the small number of "average" Ss whose 
productions tend to fall consistently in the 
middle range at each ordinal position. The 
medians can, however, be made much more 
representative of the average scale if 
individual differences in range and size of 
numbers are reduced by a correction pro- 
cedure before medians are taken. An ap- 
propriate correction can allow all Ss to 
contribute equally to the median. Ac- 
cordingly, each S's numbers in Experiments 
IL and III were normalized by a multiplica- 
tive constant chosen so that the geometric 
mean of S’s transformed numbers was equal 
to the overall geometric mean of numbers 
given in the experiment in which he per- 
formed. (This procedure is similar to the 
"modulus equalization” technique.) The 
constant correction was successful on at 
least 2 grounds: the standard deviation 
associated with each order statistic was 
reduced by approximately 20%, and nearly 
every S had a corrected number directly 
involved in the computation of one of the 
medians. A power transformation was also 
tried out as a method for normalizing the 
numbers, but the order statistics had 
standard deviations larger than those ob- 
tained with the constant transformation. 
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FiGURE 4, Rank vs. magnitude of order statistic 
for Experiments H and II. (In A the magnitude 
js estimated by the geometric mean of the constant- 
corrected numbers; in B, by the geometric mean of 
the untransformed numbers, Semilog coordinates.) 


Figure 3 shows the relationship between 
the order statistics, estimated by medians 
of the constant-corrected numbers, and 
ordinal rank for Experiments II and HI 
The first 20 of the 25 order statistics of 
Experiment IH and all 15 of Experiment III 
are shown. The general trends seen in 
Figure 2 show up strikingly in Figure 3. 
The functions are linear at the smaller 
numbers and show some acceleration in the 
semilog coordinates until a magnitude of 
about 12 is reached, where they become 
logarithmic until numbers near 1,000 are 
reached. Within this central range, the 
magnitude of the numbers from one ordinal 
position to the next increases by a very 
nearly constant percentage. For the high- 
est numbers, the function becomes more 
compressed than a logarithmic function, 
something not seen in all the individual 
functions. 

Estimating order statistics with geometric 
means. Still another averaging method is 
to estimate the order statistics with means 
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of the corrected numbers at each ordinal 
position in the ordered output. The geo- 
metric mean seems the best one to use in 
this case, largely because the numbers 
found at each ordinal position are posi- 
tively skewed, even in the corrected data. 
A further reason for using the geometric 
mean is that the functions of Figures 2 and 
3 are approximately logarithmic, indicating 
that M(O) may be approximately log- 
arithmic. If M(O) is a log function, tak- 
ing geometric means of the O values is 
equivalent to taking arithmetic means on 
the S scale and transforming the means by 
M~ (0) to obtain estimates of values on O 
that correspond to the order statistics on 
S. If the M(O) for each S is a power 
function, geometric means will give an 
overall power function whose exponent is 
the arithmetic average of the exponents for 
the individual functions (Bruvold & Gatley, 
1965). 

Figure 4A shows the relationship between 
the rank of the order statistic and the 
magnitude of the order statistic, estimated. 
by the geometric means of the constant- 
corrected numbers for Experiments Il and 
III. These functions show all the features 
of those in Figure 3, and they are somewhat 
smoother. In addition, they are very 
similar to the functions obtained by esti- 
mating the order statistics with the geo- 
metric means of the uncorrected data, as 
Figure 4B shows. 


Cumulative Distributions of Numbers 


The cumulative distribution of the num- 
bers produced by Ss is a concise summary 
of the data, although such a distribution 
does not bear a simple relationship to the 
order-statistic analysis. Table 1 shows 
the cumulative proportional distribution 
of numbers given in Experiments II and 
III. Table 2 shows the results when steps 
are taken to reduce individual differences 
in the numbers given. For this table, only 
the first 10 numbers given below 1,000 
were taken from each S's protocol. The 
numbers were then normalized by a multi- 
plicative constant that equalized the geo- 
metric mean of Ss’ 10 numbers. 


TABLE 1 


CUMULATIVE PROPORTIONAL DISTRIBUTION. OF 
NuwnERS: EXPERIMENTS HE AND H1 


Cumulative proportion 


Experiment 11 Experiment 111 


These distributions show that small 
numbers are more likely to be selected 
than large ones. Such a bias in the selec- 
tion of numbers is to be expected from the 
compressive scales of number shown in 
Figures 3 and 4. As the subjective dis- 
tance between arithmetically equidistant 
numbers decreases, the numbers Ss select 
will be spaced farther apart, and thus 
fewer numbers will be selected from an 
interval of a given width, 

The cumulative distributions emphasize 
à hypothesis about the results that will 
be discussed later; viz., that Ss perceive 
numbers in a linear manner, but sample 
from them nonlinearly, This hypothesis 
runs counter to the hypothesis maintained 
here—that Ss sample rectilinearly from a 
nonlinear subjective representation of the 
numbers—but it is impossible to decide in 
favor of one or the other on the basis of 
random productions alone. Experiment IV 


=) 
hypotheses, and the results of other experi- 
ments below are relevant to the decisi 
between them. 


Discussion 


Under the assumption that Ss generate ra 
dom integers by selecting quantities uniformly 
from a psychological number space (the § 
continuum), the results of Experiments I-II 
indicate that this space is compressive: 
cessive numbers must lie closer and ch 
together as they become larger. Figure 
shows values on the O dimension corresponds — 
ing to the order statistics on S that would 
obtained by uniform sampling from the 
scale with various M (O)s. 3 

The function labeled L would result if the — 
psychological distances between numbers were 
proportional to their logarithms. According 
to this distribution, the subjective ruler is 
calibrated in the same way as the C or D 


TABLE 2 


Comvutanive PRorORTIONAL DisTRIBUTION or 
Nuungas Liwrrko ar 1,000 AND. AnJUsTED 
by MULTIPLICATIVE CONSTANT TO EQUATE 
Growkriuc. Mean Nuwpirs GIVEN: 

Exrexments H asp HT 


E — 


= 


Cumulative proportion 


Experiment 11 Wogeriment (11 
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Ficure 5, The rank vs. magnitude functions 
that would result for the order statistics if rectilinear 
samples were taken on a logarithmically compressed 
continuum (L), a partly logarithmic, partly linear 
continuum (A-L), and a continuum compressed 
according to a power law with an exponent of .67 
(Paz). 


scale of a slide rule; that is, M(O) = k log O. 
Numbers selected randomly from such a ruler 
and put in order will show an exponential 
increase with ordinal position, and the rank 
of the order statistics and the logarithm of 
their absolute size will be linearly related. 
The slope of the function is determined by the 
length of the ruler over which the sample is 
taken. The slope is not, however, at issue here. 
The point is that a logarithmic M(O) leads 
to a linear plot of the ordered samples in semi- 
log coordinates, and numbers obtained from 
Ss in the number-generation task give such a 
plot for much of their range. That the ob- 
tained plots are not strictly linear in semilog 
coordinates is clear, and other assumptions 
about the psychological scale of number come 
closer to representing the full range of data. 

The function labeled A-L in Figure 5 shows 
what happens if it is assumed that the psycho- 
logical scale is linear for the numbers 1-10 
and logarithmic thereafter. This assumption 
is not intuitively unreasonable, although it is 
mathematically cumbersome. We have a good 
understanding of the integers under 10 and of 
the relative distances among them, but above 
10 or so the numbers become less distinct, and 
we are more willing to accept a percentage error 
of “rounding off" when quantities are expressed 
in terms of these numbers. If, in fact, we de- 
mand amount accuracy for small numbers but 
prefer percentage accuracy for larger numbers, 


a subjective scale of number like the one lead- 
ing to the A-L function is implied. 

A scale of numerical magnitude that has 
excellent empirical support is the power func- 
tion (e.g., Curtis, 1970; Curtis & Fox, 1969). 
The Ps; function in Figure 5 is based on a 
scale of number constructed according to a 
power function with an exponent of .67. For 
P.s, M(O) = k:O 9. Numbers on this scale 
are spaced in such a way that their 3 powers, 
if written next to them, would be linearly 
spaced. This power function results in order 
statistics that closely parallel the functions of 
Figures 3 and 4 for numbers below 500 or 
1,000, but at the upper end of the range, the 
divergence of the empirical functions from a 
power function is great. 

Fitting any function to the order statistics 
obtained from the number-production task is 
complicated by the flattening of the plots for 
numbers above 500 or 1,000. This decelera- 
tion would seem to indicate that the subjective 
scale of number becomes very compressed 
around 1,000 and is even more compressed for 
higher numbers. This apparent compression 
may simply result from a very poor under- 
standing of large numbers on the part of Ss. 
The Ss tended, for example, to give numbers 
such as a billion, a trillion, and a zillion (or a 
quadrillion, etc.) with approximately equal 
frequency. It is likely that numbers of this 
size, and perhaps those as small as 1,000, are 
much less meaningful to Ss than smaller num- 
bers, and they emitted the names of these 
numbers only in order to touch upon the 
vaguely understood realm of very large 
numbers in their output. 

It is clear that numbers that are really only 
names and do not stand for magnitudes should 
be eliminated from consideration in attempts 
to discover the best M(O). Not only do the 
meaningless numbers put an unrealistic com- 
pression at the top of the scale but they also 
affect the function for lower numbers by enter- 
ing into the averages that estimate many of 
the order statistics. In an attempt to eliminate 
the effect of large numbers on the estimate of 
M(O), the data of Experiments II and III 
were retabulated with numbers above 1,000 
excluded. This procedure arbitrarily assumes 
that numbers become meaningless at about 
1,000. The choice of 1,000 or any number is 
arbitrary because the lower limit of “meaning- 
lessness" in numbers is difficult to define, and 
Ss probably differ widely in the range of num- 
bers for which they have an intuitive under- 
standing. Whether or not 1,000 marks the 
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limit of meaningful numbers, however, it is 
at the upper limit of numbers normally given 
by Ss in magnitude-estimation experiments 
and thus, at the upper end of the subjective 
ruler that the order statistic is intended to 
calibrate. 

Figure 6 shows the relationship between the 
order statistics estimated by the geometric 
mean of constant-corrected numbers and the 
ordinal rank of the statistics in Experiments II 
and III, when all numbers above 1,000 are 
eliminated from Ss’ output before the numbers 
are put in order for averaging. Only 10 order 
statistics were computed for this figure because 
the upper limit of 1,000 reduces the number 
of acceptable numbers that can be obtained 
from every S. Also plotted in Figure 6 is the 
function given by 5 male Johns Hopkins 
undergraduates who were asked, while waiting 
to serve in another experiment, to give random 
numbers, with the restriction of an upper 
limit of 1,000. They were allowed to emit 26 
numbers, and all 26 were used. The order 
statistics were estimated by the geometric 
means of the untransformed numbers, 

This figure is plotted in log-log coordinates 
to show that a power function describes the 
results, In log-log coordinates, a power 
function is a straight line whose slope is equal 
to the exponent of the power function. The 
functions of Experiments II and III in Figure 
6 are fairly linear, although they have some 
curvature, especially at the upper end. Evi- 
dently, for at least some Ss, 1,000 is well into 
the range of meaningless numbers. The linear 
portion of the curves is best described by a 
power function with an exponent of .60 for 
Experiment II and of .67 for Experiment III. 
The M (O) may, therefore, be a power function 
with an exponent around 2. Interestingly, 
this value is close to being an average of the 
various exponents for number that Curtis and 
his colleagues (Curtis, 1970; Curtis & Fox, 
1969) have found in fits of their 2-stage model 
to judgments of differences between psycho- 
physical quantities. It is also very close to 
the exponent of .63 for number that Rule and 
Curtis (1973) found with nonmetric scaling. 
The fact of individual differences in output 
exponents (Curtis, Attneave, & Harrington, 
1968) and task variables that may alter the 
output exponent (Poulton, 1968) indicates that 
the value of $ can be at best a first-order ap- 
proximation to the exponent. 
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Ficure 6. Rank ys. magnitude of order statistic 
for Ss who generated integers with an upper limit 
of 1,000, labeled 1— 1,000 and for Ss of Experi- 
ments II and III when numbers above 1,000 were 
excluded, labeled E II (<1,000) and E III (<1,000), 
respectively. (Log-log coordinates.) 


Experiments IV-VII 


The results of the random-number- 
generating task have been taken as evi- 
dence for a psychological scale of number 
that is negatively accelerated. Two func- 
tions relating perceived size of number to 
absolute magnitude have been considered— 
a power function and a partly linear, 
partly logarithmic function (A-L). Both 
functions cover the scale up to about 1,000, 
where numbers may become meaningless, 
but this limit is above the upper limit of 
numbers commonly used in magnitude 
estimation. 

While deciding between the power and 
A-L functions is a theoretically important 
task, determining that the obtained func- 
tions reflect the subjective scale of number 
and not some irrelevant factor in the 
number-generation task is a more im- 
mediate problem. ‘There are a host of 
alternative explanations for the results, 
and the following experiments are directed 
at some of them. 
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Experiment IV tests the notion that the 
numerical continuum is subjectively linear 
and that nonuniform sampling results in 
the obtained functions. The method used 
allows linear sampling from a compressed 
"number line" to be distinguished from 
nonlinear sampling along a linear con- 
tinuum. 

Experiment V investigated the plausi- 
bility of a simple number-generation strat- 
egy that would lead to results of the sort 
obtained. Using this strategy, S would 
generate random single digits and decide, 
more or less randomly, whether to cut the 
string of digits at a 1-, 2-, 3-, or 4-digit or 
longer number. If each length of string 
were used equally often, about as many 
numbers would be produced among the 
units, tens, hundreds, and so on, and the 
ordered output functions would be ap- 
proximately linear in semilog. In this 
experiment, Ss were asked to give random 
numbers from the range >100. If Ss were 
simply cutting digits into strings of random 
length for output, the interval 100-999, 
which contains numbers that are all of the 
same length, should be equally represented 
from end to end and should show a linear 
increase in magnitude as a function of 
ordinal position in the ordered output. 
This experiment also tests a “least effort" 
hypothesis of number generation. Ac- 
cording to this hypothesis, Ss find it 
easier and quicker to give small numbers. 
They therefore concentrate on the smaller 
ones, but throw in large numbers from time 
to time to satisfy the somewhat vague 
requirements of the task. Such a least 
effort strategy should produce a linear 
function over the range 100—999, since all 
these numbers are equally troublesome to 
produce. 

Experiment VI investigated some effects 
of reinforcement and instructions on the 
distribution of numbers obtained in the 
random-number-generation task. Subtle 
cues from E can have a strong effect on 
behavior in a situation as unstructured as 
the number-generation task (Greenspoon, 
1955; Orne, 1962), and it is possible that E 
was unconsciously cuing S to produce a 
distribution of the desired form. In Part 1 
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of this experiment, the effect of deliberately 
asking for larger or smaller numbers was 
determined. In Part 2, Ss were presented 
with either an arithmetic or a geometric 
series (in random order) as a “good ex- 
ample" of a random series, and then pro- 
duced their own set of numbers. If subtle 
cues have any effect on Ss’ number- 
generation behavior, these explicit in- 
structions should have a strong effect. 
Experiment VII tested the notion that, 
in generating their numbers, Ss merely 
reflect the "spew" principle (Underwood & 
Schulz, 1960) by producing numbers in 
proportion to the frequency with which 
they encounter them in daily life. Small 
numbers are used more frequently than 
large ones in almost every application of 
numbers—from street addresses in Ameri- 
can Men of Science to physical measure- 
ments (Benford, 1938; Raimi 1969)—and 
the decrease in usage of numbers is a 
logarithmic function of their size. This 
decrease, sometimes called Benford's law, 
is a sort of Zipf's (1949) law for numbers. 
In this experiment, Ss were asked for ran- 
dom numbers with zero as the lower limit, 
and they produced fractional and decimal 
numbers in addition to integers. Frac- 
tions and decimals are probably less com- 
mon in everyday occurrence and should 
appear less often than integers in the output 
if Ss' numbers merely reflect their daily 
experience with numbers. Furthermore, 
if the scale for number is not changed 
when nonintegers are permitted, an ex- 
planation solely in terms of the spew 
principle is impossible because of the rela- 
tively low language frequency of any 
arbitrary noninteger. à 


EXPERIMENT IV 
Method. 


The 20 Ss who participated in Experiment III 
were used. After generating the random numbers 
in that experiment they were shown a graph like 
that of Figure 7A drawn on a 83 X 11 in. sheet. 
They were told that this picture expressed their 
instructions for the number-generation task: the 
abscissa represented the numerical continuum, and 
the horizontal line indicated that their sampling 
procedure gave every number an equal chance of 
being selected. (Every S but 1, a physics major 
who pointed out that only integers were sampled, 
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accepted this as an adequate description of what 
he did.) They were then told that they should use 
a new sampling rule and were shown a graph similar 
to Figure 7B and 7C. According to this new rule, 
Ss were to sample numbers randomly, as before, 
but were to limit the range of numbers from which 
they sampled. 

Every .$ generated numbers in 6 intervals defined 
in terms of figures like 7B and 7C drawn on 8% X 11 
in. sheets, The intervals were of 3 widths, “narrow” 
(1.91 cm.), "medium" (3.82 cm.), and “wide” 
(7.64 cm.), and were placed with their midpoints 
at 1 of 2 positions on the 20-cm. abscissa (number 
line). One position for the intervals was described 
as being "among the smaller numbers" and was 7 
cm. from the origin. The other was 14 cm. from the 
origin and was described as being “among the larger 
numbers." Thus, the instructions for generating 
numbers in, for example, the 3.82-cm. interval with 
its midpoint 14 cm. from the origin were, "Now I 
would like you to give me random numbers from an 
interval of medium width among the larger numbers." 
(The S was shown the drawing to illustrate the 
instructions and had it before him while giving 
numbers.) The instructions for all 6 width—posi- 
tion combinations were constructed according to 
this formula. 

The Ss often asked such questions as “How wide 
is *wide'?" and were always answered with ‘That's 
completely up to you.” Half of the .Ss first gener- 
ated numbers from the 3 intervals placed at the 
smaller end of the line, then from the 3 intervals 
placed at the higher position. The other half 
generated larger numbers first, then smaller ones. 
At each position on the number line, all Ss gener- 
ated numbers first from the narrow interval, then 
from the medium-sized interval, and then from 
the widest one. Each S gave 10 numbers for each 
interval presented before going on to the next one. 

Of course, neither the numerical span the intervals 
were to contain nor the magnitude of the midpoint 
of the intervals was specified in the instructions. 
This was the whole point of the experiment: ‘Ss 
themselves specified both the width and the mid- 
point of the intervals, and the question of interest 
was whether they felt an interval of a given width 
encompassed more numbers if it was placed at the 
high end of the number range than if it was placed 
at the low end. Since all Ss were told that the ab- 
scissa was open ended (i.e., that it went to infinity 
off the page), any compression they assumed in the 
mapping of numbers on this line resulted from their 
own visualizations of the continuum and was not 
forced on them by the instructions. 


Results 


As Figure 8 shows, Ss assumed that a 
given interval width, whether narrow, 


medium, or wide, spanned a greater numeri-_ 


cal range when it was centered among the 
higher numbers than when it was centered 


MAGNITUDE OF NUMBER FROM RANDOM PRODUCTION 363 


absolute value of numbers 


w 
> 


density 


— 
absolute value of numbers 


O 


density > 


—> 
absolute value of numbers 


FIGURE 7. Examples of drawings used to 
instruct Ss in Experiment IV. 


among the smaller numbers. For ex- 
ample, when .$s saw the wide interval 
placed among the smaller numbers, they 
assumed it spanned a range of 70, but 
among the larger numbers, the span jumped 
to 155. All but 3 of the 20 Ss followed the 
trends shown in Figure 8, and no consistent 
differences between Ss in the 2 counter- 
balancing conditions could be found. 

An Ss X Interval Width X Large vs. 
Small Numbers analysis of variance was 
performed on the logarithms of the range 
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interval was called narrow, medium, or wide, 
s assumed it spanned a greater numerical range 
s described as being “among the larger 


scores obtained from each S. The log 
transformation was used to reduce the 
differences in variability among the cells 
of the experiment. The large vs. small 
numbers variable was significant, F (1, 19) 
= 12.7, p < .01, when tested against its 
interaction with Ss. 


EXPERIMENT V 
Method 


Twelve Ss from the population sampled in Experi- 
ment II were given the same number-generation 
task administered in that experiment, except that 
they were allowed to give only 20 numbers before 
being stopped. After giving the 20 random num- 
bers, they were asked to do “the same thing again” 
but to maintain a lower bound of 100 instead of 1. 
The Ss were urged to emit their numbers as quickly 
and as randomly as before, but were reminded that 
they should sample from ‘100 on up" this time. 
Approximately 25 numbers given by each S were 
tape recorded. 
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Results 


Numbers over 20,000 were rejected from 
analysis, and the first 19 numbers given 
under this limit were transcribed from the 
tape. Three Ss had to be rejected: 2 be- 
cause they apparently interpreted the 
instructions as giving a lower limit of 1,000, 
and 1 because he gave only 12 numbers 
under the limit of 20,000 and would have 
reduced the range over which the data 
could be averaged. The responses of the 
remaining 9 Ss were put in order, and 
geometric means were taken at each ordinal 
position. Figure 9 shows how the magni- 
tude of the order statistics (geometric 
means of untransformed numbers) is re- 
lated to their rank order. The results of 
both tasks performed by Ss in this experi- 
ment are plotted so that the results found 
with the 2 ranges can be compared. The 
Ss gave results similar to those of Experi- 
ments II and IHI when the lowest acceptable 
number was 1.0. Imposing a lower limit 
of 100 produced an almost perfectly loga- 
rithmic function, with no curvature at t! 
lower limit. It would seem from th 
result that the curvature at the bottom oí 
the other random-production scales of 
number is not an “end effect," but rather 
reflects the form of the subjective scale 
for small numbers. 
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FIGURE 9. Rank vs, magnitude of order statistic 
when Ss could give an unlimited range of integers 
(1— ©) and when they could not give integers 
below 100 (100— =); Experiment V. (Semilog 
coordinates.) 
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EXPERIMENT VI, PART 1 
Method 


Experiments VI and VII were run concurrently 
with a total of 50 male and female Ss, all Pomona 
College undergraduates, none of whom had served 
in similar experiments previously. The Ss were 
assigned in rotation to the experiments in the order 
in which they arrived. The first 2 Ss were assigned 
to Experiment VI, Part 1, the next 2 Ss to Part 2, 
the next S to Experiment VII, and so on. 

A total of 20 Ss served in Part 1 of Experiment VI. 
All Ss were initially given instructions for generat- 
ing random numbers similar to those used in 
Experiment III, and all responses were tape re- 
corded, After 10 responses had been given, half 
of the Ss were told that they were giving too many 
small numbers and should try to give more large 
ones, The other half were asked to give more small 
numbers because their numbers were too large. 
Some Ss were allowed to emit more than 10 numbers 
before the biasing instructions were given because 
they were in the process of giving a series of large 
(or small) numbers at the tenth item, The Ss in 
the larger biasing condition had to give at least 1 
response under 10,000 after the tenth response 
before the biasing instructions were given, and Ss 
in the smaller condition had to give at least 1 
response over 30 after the tenth before instructions 
were given, 


Results 


Figure 10A shows the order statistics 
(geometric means of the constant-cor- 
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Ficure 10. Rank vs. magnitude of order statistic 
before biasing instructions were given (A) and after 
Ss were told to give more small numbers (open 
circles) and more large numbers (filled circles) (B): 
Experiment VI, Part 1. (Semilog coordinates.) 
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MAGNITUDE of ORDER STATISTIC 
Ficure 11, Rank vs. magnitude of order sta- 
tistic for Ss who had heard a geometric "good ex- 
ample" of random numbers and for Ss who heard a 
linear “good example": Experiment VI, Part 2. 
(Semilog coordinates.) 


rected numbers) at each ordinal position 
before biasing instructions by Ss in the 
larger and smaller biasing conditions, re- 
spectively. In Figure 10B, the results of 
the bias toward larger and smaller numbers, 
respectively, are seen. The instructions in 
both cases had the intended effect of rais- 
ing or lowering the overall range of num- 
bers emitted, but the general form of the 
function seems little altered by the instruc- 
tions. Both groups show curvature near 
the bottom and top of the range, with a 
logarithmic section in the middle, but 
larger instructions led to a more compressed 
scale than smaller instructions, especially 
at the top of the range. To a poor first 
approximation, the /arger order statistics 
equal the squares of the smaller order 
statistics of the same rank. 


EXPERIMENT VI, PART 2 
Method 


Twenty Pomona College undergraduates served as 
Ss in this experiment, All were given random- 
number-generation instructions, The Æ then played 
a tape-recorded series of integers, with the explana- 
tion that the series was a “good example" of what 
was expected of them. Two taped series were used, 
One contained 20 numbers ina linear series extending 
in equal steps from 1 to 130,000, The linear series 
was presented in random order, with small quanti- 
ties added to or subtracted from each member to 
conceal the orderliness of the numbers. The other 
contained 20 numbers from a geometric series 
extending from 1 to 40,000, (The difference in the 
upper limits of the 2 series is the result of an error 
discovered too late, The effect is that the geometric 
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FIGURE 12, Rank vs. magnitude of order sta- 
tistic when only integers could be produced and when 
the same Ss were later allowed to give fractions and 
decimals in addition to integers: Experiment XII, 
(Function for the fractions and decimals alone is 
also shown, Semilog coordinates.) 


series is missing 3 or 4 members at its upper end.) 
The geometric series was also Presented in random 
order, after small quantities were added to or 
subtracted from each member, Ten Ss had the 
arithmetic random series played to them, and 10 
Ss heard the randomized geometric series. After 
hearing the series appropriate to his condition, 
each S dictated 20 numbers into the tape recorder, 


Results 


ka Figure 11 gives the order Statistics 
(geometric means of constant-corrected 
numbers) at each ordinal position for both 
conditions, The tandom series derived 
from an arithmetic Progression implies a 
subjective ruler graduated linearly, but it 
seemed to induce number-generation þe- 
havior consistent with the same compressed 
subjective ruler seen in Experiments MII. 
The geometric random-number example is 
consistent with a subjective ruler graduated 
logarithmically. The numbers Ss produce 
after hearing this example imply a slightly 


EXPERIMENT VII 


Method 
In this experiment 10 Ss, all Pomona College 
undergraduates, had random-number-generation 
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instructions read to them and 
sponses which were tape recorded 
asked to do “the same thing" a 
that this time they could give frac 
and numbers of any size (above zero 
and fractional parts in addition to v 
All responses were tape recorded, Ss y 
to give 20 numbers, and all response 
scribed from the tape and put in order. 


Results 


Figure 12 shows the relatio rank 
and the absolute size of the « tistic 
before and after the instruct wing 
fractions were given. The fu: covers 
a wider range when fractions wed, 
but the form of the psycholog f 


number implied is little alter 

Figure 12 is a plot of the ord 

based only on the nonintegers 

though 67% of the numbers . 
nonintegers, 2 Ss gave only a f. 

and decimals and could not be 1 
the curve for fractions and deci only. 
Of the remaining 8 Ss, 2 Ss gave only 10 
nonintegral numbers, and the re sulting 
function therefore gives the results for 
only 10 order statistics for the 8 Ss. The 
functions in Figure 12 show that the M(O) 
for numbers does not depend critically on 
the use of integers. Furthermore, the 
function for nonintegers alone is very 
similar in form to that for integers, 


Discussion: Experiments I V-VII 


Experiment VII eliminates several inter- 
pretations of the results of Experiments III 
based on the Spew principle. It cannot be 
maintained that Ss merely reflected their 
frequency of experience with numbers in the 
numbers they gave. Integers are 
more frequent in everyday life than 
with decimal parts, yet whole num! r re 
outnumbered 2 to 1 by fractional numbers in 
the responses when fractions were allowed. 
Except for a few common decimal numbers, 
such as 3.1416 and 1.414, and the fractions j 
and 1, the fractional and decimal numbers Ss 
gave seemed Very capricious, arbitrary, and 
uncommon; yet the Psychological scale of 
number derived from only the fractions given 
by Ss is virtually the same as that found w hen 
only integers are allowed. 

A modified form of the spew principle could 
be considered, according to which Ss feel 
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there are somehow "more" small numbers— 
whether small integers or small fractions. This 
version of the law reduces to the assumption 
that Ss sample nonlinearly from a linear 
numerical continuum. Experiment IV shows, 
however, that Ss assume that a given interval, 
whether narrow, medium, or wide, must span 
a greater numerical range as its mean increases. 
Such an assumption would not be made if the 
numerical continuum were psychologically 
linear. 

Still another line of reasoning would have it 
that the numbers Ss generate simply obey 
Benford's “law of first digits" (Benford, 
1938; Raimi, 1969).  Benford's law is a 
generalization based on the analysis of num- 
bers found in a great many scientific and 
statistical tables. According to this law, the 
probability that a randomly selected number 
begins with a digit D is log (D 4- 1) — log D. 

If Ss simply use or obey this law in generat- 
ing their numbers, the derived subjective scale 
for any decade interval will be logarithmic. 
If each decade interval from the units to, say, 
the ten thousands is sampled equally often by 
Ss, then the subjective scale derived from 
numbers generated in the entire range of 
1-99,999 will be logarithmic. While the ob- 
tained numbers are not perfectly consistent 
with a logarithmic subjective scale and are 
perhaps better described with a power law, 
the law of first digits comes very close to 
providing an explanation of the data. 

A test of the first-digits artifact can be per- 
formed by examining the relative frequencies 
of numbers in a range wherein all begin with 
the same digit. If this artifact were solely 
responsible for the obtained results, the num- 
bers given in any such range should be evenly 
distributed. Table 3 shows how numbers are 
distributed in the ranges 10-19 and 100-199 
for Experiments II and III. All numbers given 
by all Ss were included in the count, not just 
the first 25 or 15 used for the order statistics. 
Both cumulative distributions deviate signif- 
icantly from linearity according to the Kolmo- 
gorov-Smirnov 1-sample test (Siegel, 1956). 

In fairness to Benford (1938) it should be 
said that he felt that his law was really a 
reflection of a more general law of nature. 
Nature, it seems, counts exponentially or, in 
the terms of this study, generates quantities 
that are spaced linearly on a logarithmic scale, 
not a yardstick. Thus, the law of first digits 
is only 1 consequence of this law of nature, 
and numbers within any interval examined 
should be spaced with progressive compression 
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of the scale toward the top. If this more gen- 
eral law is applied to behavior in the number- 
generation task, however, it reduces to the 
same assumption being made here and cannot 
be construed as an explanation in terms of an 
artifact. The Ss agree with nature, so to 
speak, in using a negatively accelerated ruler to 
measure things. The subjective ruler probably 
became graduated as it is by Ss’ experience 
with quantities, but it cannot be argued that 
the effects of this experience appear in Ss’ 
number-generation behavior in unmediated 
form. The Ss do not simply produce numbers 
with frequencies proportional to their experi- 
ence with them, nor do they sample with a 
bias toward small numbers from a linear con- 
tinuum. Rather, it seems that they have a 
nonlinear analog conception of numbers from 
which they sample as linearly as they can. 

The other artifacts considered in Experi- 
ments IV-VII are more easily dismissed. 
Experiment VI showed that subtle reinforce- 
ments provided by Æ (Greenspoon, 1955) or, 
to phrase it differently, the "demand char- 
acteristics" of the task (Orne, 1962) can in- 
fluence the range of numbers over which S 


TABLE 3 


DISTRIBUTION OF NUMBERS GIVEN IN INTERVALS 
10-19 AND 100-199: EXPERIMENTS lI AND IIT 


Frequency 
Number or A3. 
interval 
Experiment II | Experiment III | Cumulative 
Interval 10-19 
10 | 7 12 19 
il 6 7 32 
12 10 13 55 
13 9 10 74 
14 2 9 85 
15 5 7 97 
16 5 9 111 
17 v 8 126 
18 3 5 134 
19 2 12 148 
Interval 100-199 
100-109 14 11 25 
110-119 14 5 44 
120-129 5 3 52 
130-139 5 5 62 
140-149 2 3 67 
150-159 4 0 71 
160-169 0 4 75 
170-179 1 0 76 
180-189 1 3 80 
199-199 1 7 88 


TABLE 4 


Goopness or Fit (r?) or PowER AND Loc Fune- 
TIONS TO INDIVIDUAL NUMBER SCALES 


All 10 order statistics First 8 order statistics 
Log ower func- | LR | power funce 
s. fins -| eh | enn Rose 
" i Expo- " Expo- 
"oL" eects] n | ^ Ems 
1 970 |.878 | .45 | .955 | 937 | 61 
2 986 | .931 | .54 | .982 | ‘072 | ‘65 
3 976 |.943 | 47 | .955 | (978 | 56 
4 974 | .955 | .71 962 | .976 | .82 
5 966 |.972 |.76 | .964 | 974 | 82 
6 :902 | 996 |.77 | .916 | ‘904 | 76 
7 .984 |.918 | 38 | .972 | 962 | 48 
8 867 | .990 |.46 | .886 | 990 | 45 
9 978 |.960 | .63 | .968 | 974 | 71 
10 | .945 | 974 | .43 | .980 | (964 | 43 
11 | .960 | .980 | .76 -970 | .980 | .79 
12 970 | .949 | 61 945 | 988 | .73 
M | .956 | .954| 58 | .954 | :974 | ‘651 
SD | 144 | 143 
| 


* Equation: M(O) = A + B log O. 
b Equation: M (O) = AO», 


samples in generating his output and the over- 
all compression of the scale, but they do not 
radically alter any conclusions about the form 
of the scale. Experiment V controlled for the 
possibility that Ss simply produce random 
lengths of digit strings. This strategy could 
produce the results obtained if Ss generated a 
roughly equal number of 1-, 2-, 3., and 4-digit 
numbers. Such a Strategy would, however, 
produce a linear psychological scale for the 
range 100-999, and the results of Experiment 
V show this not to be the case. (The distribu- 
tions in Table 3 reinforce this conclusion.) 
Experiment V also renders untenable an ex- 
planation of the results in terms of a least 
effort strategy. 


Individual Differences in Number 
Production 


Since the functions for numbers reported 
here were obtained by averaging over Ss, it is 
important to determine that individual dif- 
ferences in number production do not somehow 
conspire consistently, through the evils of 
averaging, to produce a group function that has 
little relevance to the number-generation be- 
havior of individual Ss, Two approaches to 
individual differences are reported below, 

Individual scales for 12 subjects. The pro- 
cedure of averaging over Ss was used because 
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the productions given by individual Ss seemed 
noisy. It is possible, however, to achieve 
fairly smooth functions for individual Ss by 
taking repeated observations and averagin 

within Ss. To obtain these functions, 12 male 
and female University of Oregon students 
were interviewed individually each day for 5 
days in a row. On each day they dictated 
20-30 numbers into a tape recorder under 
standard number-production instructions. he 
first 10 numbers under 1,000 given on each 
day by each S were transcribed and put i 

order. Order statistics for each S, averaged 
over the 5 days, were estimated with geometric 
means. 

Table 4 presents the goodness-of-fit measure 
r°, for each S for the logarithmic and power 
M(O) functions for number. The table also 
lists the exponents for each S for the power 
function. Because the overall scale derived 
from this experiment showed the extra com- 
pression that may indicate meaningless num- 
bers at the top end, the fitting procedure was 
repeated for only the first 8 order statistics 
for each S, and the results of these fits are also 
reported in Table 4. 

It seems clear from Table 4 that either a 
log or a power function provides an excellent 
approximation to the subjective scale of num- 
ber. Both account for an average of over 95% 
of the variance in the individual scales, De- 
ciding between the 2 
still difficult. 
are considered, the log function gives a better 
fit than the power function in 7 of the 12 cases, 
but the power function does better in 8 of 12 


I > exponents compares 
favorably with the variability found in ex- 


power functions fit to individual 
Ss’ magnitude-estimation scales. "| 

dard deviation of about .14 is Somewhat less 
than that found by Stevens and Guirao (1964) 
in a study of individual loudness functions for 
11 Ss, but it is larger than the standard devia- 
tions for brightness exponents Marks and 
Stevens (1966) obtained in a series of 4 experi- 
ments. Itis also less than the variability found 
by Jones and Marcus (1961) and Bruvold and 
Gaffey (1965). The range from smallest to 
largest exponent is not an ideal measure of 
variability, but for some studies it is the only 
one available for comparison. The present 
ratio of about 2:1 between largest and smallest 
is about the same as or less than ranges found 
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in a number of studies reviewed by Marks and 
Stevens, less than the ranges found by Stevens 
and Guirao, and less than the ranges found by 
Stevens and Mack (1959), as reported by 
Stevens (1961a). It is greater than ranges 
found by Marks and Stevens and by Bruvold 
and Gaffey. 

Individual scales from Experiment II. In a 
further attempt to assess the generality of 
the group M(O) function for individual Ss, 
power and logarithmic functions were fit to the 
data given by each 5 in Experiment II. Fit- 
ting was done only on the numbers under 190 
in order to reduce as much as possible the 
effect of meaningless numbers on the individual 
scales. Since about 70% of the numbers 
given and 18 of the 20 order statistics re- 
ported fall in this range, this upper limit 
should not distort the relationship between 
group and individual scales very much, at 


.least not for the more important region. 


The individual functions were quite well 
fit by a 2-parameter power function, Y — AXE, 


-The smallest goodness-of-fit measure was .86 


(only 4 of the 30 7? values were below.90), and 
the mean was.938. A 2-parameter logarithmic 
function, Y = A + B log X, also fit the data 
well, but not as well as the power function. 
The mean 7? for the log function was .893, 
and the log function was superior to the power 
function for only 8 of the 30 Ss. The goodness- 
of-fit measure for the log function is signif- 
icantly worse than for the power function at 
the .01 level, ¢ (29) = 3.28. 

The mean of the power function exponents 
was .646, a value close to that of approximately 
2 suggested here for the power function M(0), 
and also close to the mean exponents reported 
in Table 4. There was also a high degree of 
uniformity among Ss in their exponents. The 
standard deviation of the exponents was .105, 
and the range was .444-.964. The standard 
deviation is about the size to be expected in 
individual fits of power functions to magnitude- 
estimation data. Also, the ratio of slightly 
more than 2 to 1 between the largest and smal- 
lest exponents is within the range one would 
expect if individual power functions were fit 
to the magnitude-estimation scales given by 30 
different Ss. 

The results of the individual curve fittings 
indicate that power functions fit individual 
number scales about as well as they fit in- 
dividual magnitude-estimation scales. It 
therefore seems reasonable to consider the 
group average scale for number to have the 
same logical status as the group scale obtained 
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in a magnitude-estimation experiment. The 
analysis of individual differences in scales 
derived by Bruvold and Gaffey (1965) seems 
as appropriate for the present number scales 
as it is for magnitude estimation. According 
to this analysis, the mean of the individual 
power function parameters equals the param- 
eters fit to the data averaged over Ss. 


EXPERIMENTS VIII AND IX: SCALING 
Line LENGTH AND DURATION BY 
RANDOM PRODUCTIONS 


The random-production technique is not 
limited in principle to numbers. It can 
be used on any dimension whose quanti- 
ties S can produce at will. The following 
experiments use this technique to scale 2 
dimensions, line length and duration, 
which admit of easy random production. 
Although random-production scales for 
continua other than number are of interest 
in themselves, the present reason for 
generating these scales is to use them in a 
model, discussed below, to predict magni- 
tude-estimation scales for psychophysical 
continua from their random-production 
scales. 


Method. 


Random line lengths and random durations were 
produced in Experiments VIII and IX, respectively. 
Ten Pomona undergraduates served individually as 
Ss in these experiments, half assigned to Experi- 
ment VIII first and then Experiment IX, and half 
assigned in the reverse order. Instructions were 
essentially the same as in Experiment Ill except 
that reference was made to random line lengths 
and periods of time instead of to numbers. 

For Experiment VIII, Ss were given a pencil 
and a roll of 5-cm.-wide adding-machine paper on a 
.[5 X L5 m. desk. They were told to think of 
line length as a continuum extending from a point 
to the longest line they could conceive of and were 
asked to draw lines that represented randomly 
selected lengths on this continuum, They drew 
their lines from the loose end of the paper toward 
the roll and were allowed to unroll the paper as 
they needed more. They were not allowed, how- 
ever, to hold the pencil down and pull the paper by 
beneath it. Pilot Ss who used this strategy soon 
fell into frantic paper pulling, and it was decided 
they did not have much feeling for the line lengths 
they produced. After S had finished drawing each 
random line, the paper was unrolled about 20 cm. 
and he was asked to produce another line, All Ss 
drew 20 random lines. 
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RANK of ORDER STATISTIC 


100 cm 
Ol ' 10 sec 100 
MAGNITUDE of ORDER STATISTIC 


FiGugE 13. Rank vs. magnitude of order sta- 
tistic for random productions of line lengths and 
durations: Experiments VIII and IX.  (Semilog 
coordinates.) 


In Experiment LX, Ss were given a microswitch 
that caused a Standard Electric Centisecond timer 
(out of Ss’ sight) to accumulate time when it was 
depressed. "They were told to think of durations 
as falling on a continuum extending from those of 
infinitesimal length to those as long as they could 
conceive of. They were asked to produce random 
durations from th ntinuum by holding the micro- 
switch down for random periods. When S had 
completed à random duration, E recorded the timer 
reading to the nearest 1/100 sec., reset the timer, and 
asked for another random duration, All Ss produced 
20 random durations. 


Results and. Discussion 


Figure 13 shows the rank order that cor- 
responds to each of the order statistics for 
both line length and duration. The func- 
tions for the 2 continua are quite similar, 
both being compressed and having a form 
that is in fact similar to that found with 
random productions of numbers. It is 
interesting that a compressed subjective 
scale for length is consistent with judg- 
ments and productions of sums of lines 
(Kreuger, 1970), even though judgments 
of length are linear (Stevens & Galanter, 
1957). Equivalent evidence for a com- 
pressed duration scale does not seem to 
exist. 

Randomly produced numbers have a 
considerably greater range than random line 
lengths or durations. The value of the 
order statistic in free number production 
often spans 5 log units, but the order 
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statistics for both line length and durat: 
fit comfortably within 3 log units, Ma» 
mum quantities given by individual < 
also reflect this difference: the large 
numbers were often in the billions or highe 
but the longest random line drawn was ? 
cm. and the longest duration produced w 
just over 2 min. It is possible that ; 
in these random production tasks feel, 
they do not feel when generating number 
that they should limit themselves to fair! 
small quantities, It also may be that va 
magnitudes are easier said than done. | 
any event, the following analysis shows tha 
the scales derived from the random-pro 
duction tasks are not inconsistent with 
behavior seen in magnitude-estimatio: 
experiments. 


Simulation of Magnitude Estimation and 
Cross-Modality Matching 


A magnitude-estimation task has been 
characterized as one in which S matches 
numbers to the perceived size of his sensa- 
tion according to a rule. If the random- 
production scales derived for number, line 
length, and duration reflect the perceived 
magnitudes of these dimensions, then the 
scales specify the rule S uses for matching. 
Magnitude-estimation behavior can thus be 
simulated by artificially matching numbers 


1000, 
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FIGURE 14. 


Simulated psychophysical function 
for the match between number and line length 
and between number and duration, using data from 
Experiments HI, VIII, and IX. (Log-log 
coordinates.) 


to quantities on the estimated dimension 
in such a way that the subjective magni- 
tudes of the numbers are paired with 
equivalent subjective quantities on the 
dimension. Since the order statistics are 
assumed to be equally spaced on the S 
dimension, the order statistic of Rank N 
on one dimension should be matched with 
the order statistic of the same rank on the 
other dimension to create the x-y pair that 
simulates a single magnitude estimation. 

Figure 14 shows the scales that result 
when numbers are matched to equivalent 
subjective quantities of length and dura- 
tion according to the random-production 
scales. The numbers used in these matches 
are from the scale of Experiment HI when 
numbers above 1,000 were omitted (see 
Figure 6 and associated discussion), and 
the lengths and durations are from the 
scales of Experiments VIII and IX. Be- 
cause only 10 order statistics were available 
for the scale of number used for the matches 
and 20 had been taken for duration and 
line length, order statistics of Rank N for 
number were matched to order statistics of 
Rank 2N for length and duration. When 
the random-production scale for the full 
set of numbers produced in Experiment IHI 
is used, functions similar to those of 
Figure 14 emerge, but they show accelera- 
tion at the upper end. Presumably, this 
acceleration results from the distorted 
scale for the very large numbers. 

The linearity of the functions of Figure 14 
in log-log coordinates indicates that power 
functions describe them well. The curva- 
ture in these functions occurs near the 
lower end and might be removed by a 
threshold correction, In this situation, 
however, the curvature probably means 
that the matching procedure does not line 
up the 2 continua in exactly the same way 
Ss doing magnitude estimation would. The 
slope of the linear portion of the simulated 
apparent length function is close to 1.0, 
and that for the duration function is about 
1.3. The simulated apparent length scale 
is thus quite close to that found by magni- 
tude estimation, but the duration function 
is somewhat steeper than the 1.1 slope found 
in magnitude estimation (Stevens & Gla- 


LINE LENGTH (cm) 


o.l 
Ol 


1 
DURATION (sec) 
Figure 15. Simulated cross-modality match 


between line length and duration. —(Log-log 
coordinates.) 


lo 


anter, 1957). This difference between ob- 
tained and predicted slopes for duration 
may result from an artificial compression 
of the time dimension in random produc- 
tions—even 1 min. of holding the button 
down seems like a very long time—but it 
may also be a result of the fact that there 
is no regression effect to lower slope in the 
simulated matches (Stevens & Greenbaum, 
1966). 

The random-production scales for line 
length and duration also allow the cross- 
modality match (Stevens, Mack, & Stevens, 
1960; Stevens, 1959) for these dimensions 
to be simulated. Figure 15 shows the 
simulated result of a task where line lengths 
are drawn to match the apparent length of 
periods of time. For this simulation, order 
statistics of Rank N for length were paired 
with order statistics of Rank N for dura- 
tion, and 20 simulated length-duration 
matches were thereby made. The function 
these simulated matches trace out is evi- 
dently well fit by a power function, as is 
shown by its near linearity in log-log co- 
ordinates. The curvature near the bottom 
seems to indicate a needed threshold cor- 
rection, but again it may mean that match- 
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MAGNITUDE OF ORDER STATISTIC (ARBITRARY UTS) 


FIGURE 16, Random-production scales for sub- 
jective intensity of electric shock, force of handgrip 
squeeze, and areal extent, all arbitrarily placed on 
the X axis, (Log-log coordinates.) 


ing behavior is not perfectly simulated. 
The linear portion of this function has a 
slope of about 1.4. This slope is somewhat 
steeper than it should be and may be too 
steep for a number of reasons, including the 
lack of a regression effect. 

Random productions for other continua. 
Random-production scales have been ob- 
tained for subjective force of handgrip 
Squeeze, subjective intensity of a 60-Hz, 
electric shock, and subjective areal ex- 
tent (Banks, 1973). Figure 16 shows these 
scales with threshold corrections of 7 kg. 
for the handgrip scale and .1 cm." for the 
area scale. The technique used to obtain 
these scales was very similar to that used 
for the length and duration scales pre- 
sented in Figure 13. The Ss used a hand 
dynamometer to produce the handgrip 
squeezes, and a variable transformer, con- 
nected through a step-down transformer to 
saltwater finger electrodes, to produce 
random shocks. Areas were produced by 
freehand drawing of squares on unlined 
84 X 11 in. paper. : 

A power function provides an excellent 
fit to each of these scales, accounting for 
over 98% of the variance of the means in 
every case. The exponents for the hand- 
grip, shock, and area scales are 1,1, 2 
and .30, respectively, These exponents 
are smaller than magnitude-estimation ex- 
ponents for the same continua, but their 
small size is not consistent with a least 
effort (or least pain, in the case of shock) 
hypothesis of random-production behavior. 
First, the shock scale (which should be 
most compressed by such a hypothesis), 
is the most accelerated one, and the hand- 
grip scale is the next most accelerated, 


WILLIAM P. BANKS AND DAVID K, HILL 


while the area scale (whose large quanti 
are, if anything, easier to produce t! 
small quantities) is the most compress: 
Thus, Ss chose to give themselves more | 
than little shocks and more hard than easy 
handgrip squeezes, but they drew mor: 
little squares than big ones. 

Another version of the least effort hy- 
pothesis, that Ss produce a compressed 
version of the "true" psychophysical scale 
as given by the magnitude-estimation 
function, is also inconsistent with the 
results. The shock scale (which should be 
the most relatively compressed by this 
account) underestimates the magnitude- 
estimation exponent the least, the handgrip 
scale the next least, and the area scale the 
most. 

It seems best to view random. produc- 
tion as a scaling technique, especially since 
the least effort hypothesis of the results 
is disconfirmed and nothing plausible 
seems available to replace it. The fact that 
the random-production scales for shock, 
handgrip, and area are consistent with 
magnitude-estimation scales under the 
assumption of a nonlinear scale for number 
is discussed in the following section, 


GENERAL DISCUSSION 


Implications of scales of number, 
Previous research (e.g., Curtis, 1970; Rule, 
1969; Rule & Curtis, 1973) and the present 
evidence point to a nonlinear psychophysical 
function for number, the question of precisely 
what this function is has not been answered. 
The evidence seems to indicate a power func- 
tion with an exponent of approximately 3, 
but a logarithmic function with an inelegant 
linear disturbance near the bottom cannot be 
excluded. Both functions cover the range of 
numbers used in magnitude estimation well, 
and both show too little compression for the 
very large numbers, 

Similar as they 
random-production 
functions have quite different implications for 


Although 


and power functions are obtained in magnitude 
estimation, then it 
tion for the estimated dimensions is anything 
but logarithmic. Ekman (1964) has shown 
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how power functions will result for magnitude 
estimation if all continua, including number, 
have logarithmic psychophysical functions. 
With a logarithmic function for number, the 
psychophysical law for the estimated continua 
cannot be a. power function, and whatever 
psychophysical law they do obey must be 
virtually indistinguishable from  Fechner's 
(1860). 


1f, on the other hand, the subjective size - 


of numbers grows as the $ power of their 
absolute magnitude, then the power functions 
found in magnitude estimation imply that the 
perceived magnitudes of the continua being 
judged also rise as a power function of the 
physical measure. The magnitude-estimation 
function will, however, have an exponent 1.5 
times too large. can be seen from the 
following considerations. The preceived size 
of numbers, by this argument, obeys the 
following function: 


Wn = RENI, 


where Wa is the perceived size of numbers, 
is the absolute size of the numbers, £j; is an 
arbitrary constant, and } is the exponent in 
the power function for apparent size of number. 
If the perceived size of magnitudes on any 
other dimension, ¢1, grows according to the 
function 
Yı = kihi’, 

where B is the exponent in the power function 


for the apparent size of di, then the match 
between numbers and $; will give the function 


N= koi?! = kp". 


Even if all exponents from magnitude esti- 
mation misrepresent the subjective psycho- 
physical exponent by a common factor, 
cross-modality matches will still be correctly 
predicted. Suppose, for illustration of this 
fact, that S’s task is to match 2 continua, $1 
and $», and that magnitude estimation gives 
the following psychophysical power functions: 


WV, = hor 
and 

Va = hops’. 
The predicted match for these 2 continua, 
when @ is adjusted by 5 to equal the apparent 
size of magnitudes of $», will be 


$i = hubs" 


(Stevens, 1959). 
If the overestimation of the exponents 
due to the nonlinear scale of number used in 
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Ficure 17. Relationship between random-pro- 
duction exponent (open circles) or input (sensory) 
exponent (filled circles) and that obtained by 
magnitude estimation for a number of continua. 
(Linear coordinates.) 


magnitude estimation is taken into account, 
the exponents used for prediction should be 
2(a) and $(8), and the predicted match should 
therefore be 


a= kapt ARa) = hapshle, 


which is the same prediction obtained from 
the magnitude-estimation exponents. 

It should be noted that few experiments 
employing magnitude estimation depend on 
the precise size of the exponent. Rather, 
they usually set out to prove that a power 
function does (or does not) fit a given estima- 
tion scale, or else they study relative values 
of power function exponents. Thus, the con- 
clusions of very few of these experiments 
would have to be changed if the psychophysical 
function for number is a power function. 


Predicting Magnitude Estimation Exponents 


If magnitude-estimation scales overestimate 
the psychophysical exponent by a factor of $ 
and if random production yields a scale for 
the continuum that is unaffected by the 
numerical response scale, then the magnitude- 
estimation exponent should equal 1.5 times 
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the random-production exponent. The degree 
to which this prediction holds can be seen in 
Figure 17. In this figure, the open circles 
show magnitude-estimation exponents for the 
continua scaled here, plotted as a function of 
their random-production exponents. Random- 
production exponents for handgrip, shock, 
and area were obtained from Banks (1973), 
and exponents for duration (.75) and length 
(.68) were estimated from the random-pro- 
duction scales of Experiments VIII and IX. 
The magnitude-estimation exponents are those 
reported in Stevens (1961a) except for area, 
which is an average of the values obtained by 
Stevens and Guirao (1964), Teghtsoonian 
(1965), and Curtis and Rule (1972). If the 
magnitude-estimation exponent is 1.5 times 
larger than the random-production exponent, 
these points should fall on a line with a slope 
of 1.5. As is seen, the points are well approxi- 
mated by this line. 

Also plotted in this figure are points showing 
the relationship between “input” exponents 
of the 2-stage model of magnitude estimation 
(cf. Curtis et al., 1968). These points have 
been taken from a number of papers on the 
2-stage model (Curtis, 1970; Curtis et al., 
1968; Curtis & Fox, 1969; Rule & Curtis, 
1973) and they summarize results for several 
continua. Since the input exponent in this 
model is an estimate of the sensory scale of 
the continuum without the effect of the 
numerical response scale, the relationship be- 
tween the input and the magnitude-estimation 
exponents should approximate a straight line 
with a slope of 1.5 and, as seen in Figure 17, 
it does quite well. 

Finally, the N in the figure relates the subjec- 
tive exponent for number, as estimated in this 
paper and by Rule and Curtis (1973), to the 
exponent found in magnitude estimations of 
number by Rosner (1965), Stevens (cited in 
Marks, 1968), and Banks (unpublished experi- 
ment, 1968). Of course, numerical magnitude 
estimations of numbers lead to an almost per- 
fectly linear scale, and the N plotted in this 
figure serves only to determine the slope of 
the line on which the other points should fall. 


Indirect Evidence for a Nonlinear Numerical 
Response Scale 


Given that the psychophysical law for 
number is a power function with an exponent 
of about $, it still remains to be shown that this 
law governs the use of numbers as responses 
in a magnitude-estimation experiment, The 


main reason for studying the subjective scale 
of number is to determine its effect on magni- 
tude-estimation scales. The psychophysical 
scales of number generated here may, however, 
apply only to unconstrained numbers. 

The work of Curtis and his colleagues 
(Curtis, 1970; Curtis et al, 1968; Curtis & 
Fox, 1969) with the 2-stage model provides the 
best direct evidence that a decelerated power 
function of number is at work when estimates 
of sensory magnitude are made, but there 
exist some other less direct indications that 
this is the case. Equisection and related scal- 
ing tasks that do not give specific numerical 
instructions to S presumably operate entirely 
on the subjective continuum. There is no 
need to assume that Ss interpret numerical 
instructions correctly, only that they under- 
stand the meaning of equality of sense dis- 
tances (Torgerson, 1958). If the numerical 
response scale causes the psychophysical 
power function derived from magnitude estima- 
tion to have too large an exponent, then the 
scales from equisection and related techniques 
should be power functions with exponents 
smaller than those of magnitude estimation. 
This is usually the case (Stevens, 1961b, 
1971a), but the fact that these techniques are 
subject to bias effects that can lead to internal 
inconsistencies and to various different pre- 
dicted psychophysical functions (Garner, 
1954b, 1958; Stevens, 1957) makes a more 
precise comparison with magnitude-estimation 
scales impossible. 

In category scaling, S is instructed to space 
the categories evenly over the stimulus con- 
tinuum. Although the instructions usually 
designate the categories with numbers, the 
task requires only that S break the continuum 
into a number of apparently equal intervals. 
Here, too, there are a number of sources of 
bias operating, and the form of the category 
scale is not solely determined by the individual 
stimulus magnitudes. Marks (1968) has 
Shown that power functions can describe 
category scales obtained for a wide variety of 
continua if it is assumed that the average 
category ratings are on an interval scale and 
must be readjusted for unit and zero point. 
The parameters in the power functions Marks 
fits to category scales reflect the sources of 
bias of various experimental procedures, but 
the exponents always remain below those 
found in magnitude estimation. 

Fractionation and ratio-production scales 
Should, when power functions describe them, 
have exponents approximately the same as the 
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magnitude-estimation function for the con- 
tinuum. This is so because the experimentally 
prescribed ratios will be interpreted nonlinearly 
by S, as are numbers in magnitude estimation. 
"These techniques give the same exponent as 
magnitude estimation when production and 
fractionation are used together (Scharf & 
Stevens, 1961; Stevens, 1962), but, individu- 
ally, the techniques give scales that seem 
overly subject to procedural biases (Garner, 
1954b). It is of interest that Garner (1954a) 
was able to equate fractionation and equisec- 
tion scales of loudness by assuming that Ss 
fractionated according to a ratio of 1:1.7 
instead of the prescribed ratio of 1:2. If the 
power function for perceived number is $, 1:2 
will define a subjective ratio close to 1:1.7. 
As à further bit of evidence, Stevens (1970, 
1971b) has noted that when power functions 
are fit to neuroelectric correlates of stimulus 
intensity, the exponent is typically smaller than 
that for magnitude estimation of the same 
intensity continuum. 

Finally, it should be noted that Stevens 
(1971a) uses the term "virtual exponent" to 
refer to the undersized exponents generally 
produced by the scaling techniques discussed 
above, and he provides a more thorough review 
of this topic. Stevens does not, however, give 
an explanation of why the virtual exponent 
should be smaller than the “actual” exponent 
obtained in magnitude estimation. An im- 
portant advantage of the conclusion that num- 
bers are used nonlinearly in magnitude estima- 
tion is that it predicts the general relationship 
between the virtual exponent and the exponent 
obtained in magnitude estimation. 
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held for written recall. 


*,.Martfh (1972) has presented a view of 
preal-time information processing in which 
the Berception and, production mechanisms 
“underlying temporally unfolding behavior 
are linked "by the concept of rhythmic 
action. Very briefly, the basic notions are 
thése.. Natural constraints on human 
moveynent entail that sequences of move- 
ment-produced sounds, e.g., connected 
speech and music, are rhythmically pat- 
terned, i.&., they are sequences of "ac- 
cented'" and “unaccented” elements tem- 
.porally organized im q certain way. A 
descriptive, rule has been proposed’ for 
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Zachai helped with the experiments. Experi- 
ment II was reported at the meeting of the 
Psychonomic Society, St. Louis, November 1971. 
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G. Martin, Department of Psychology, University 
of Maryland, College Park, Maryland 20742, 
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$ In 3 experiments, Ss heard continuous sequences of 14 or 16 binary elements 
(high- and low-buzz tones). These were either 7-element or 8-element pat- 
terns repeated once. Dummy sequences were included also to provide a same- 
different (repeating or changed) judgment task. 
sequence, Half of the patterns of each length followed a rule for rhythmic 
structure and half did not. Of particular interest is that 7-element patterns, 
though shorter, than 8-element patterns, cannot have a simple rhythm. 
> In general the results were that rhythmic patterns were more easily recog- 
nized as repeating than nonrhythmic patterns, and 8-element patterns were 
more easily recognized as repeating than 7-element patterns. Similar results 
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characterizing the organization of these 
natural patterns and shows that their 
temporal structure is based upon relative 
timing constraints which go beyond the 
simple ordering (concatenation) of ele- 
ments. The rule also shows the way in 
which rhythmic patterns have a hierar- 
chical structure. 

Of relevance here is the relation between 
characteristics of a stimulus, in this case 
auditory temporal patterns, and the way 
the stimulus is processed (Garner, 1970b). 
Jf auditory stimulus sequences normally 
occurring in the natural environment are 
charatteristically organized in a certain 
way, e.g. they are rhythmic, then one 
might reasonably expect that perceptual 
mechanisms as they have evolved will be 
biased to listen for sounds organized in 
this certain way. This basic notion un- 
derlies the present work. 

If an auditory pattern is rhythmic, one 
of its properties is that certain elements 
in the pattern, main accents in particular, 
are temporally redundant. The potential 
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. 
consequence for perception of such an 
organization of the input sequence is that 
once early accents in the pattern are 
heard, later accents, and the end of the 
pattern as well, can be anticipated in real 
time. The question of interest then is 
whether perceptual mechanisms are in 
some way designed to make active use of 
these temporal redundancies in the pattern 
during the on-line input of the pattern 
itself. Alternatively, one could postulate 
a mechanism that treats the sequence of 
elements in a pattern as a concatenated 
string, so that even if there are temporal 
redundancies in the sequence these redun- 
dancies are ignored during listening and 
the elements registered one by one as they 
are heard, in left-to-right fashion. A fur- 
ther alternative, if elements occur at rates 
too fast to be handled sequentially, as in 
speech (Liberman, Cooper, Shankweiler, 
& Studdert-Kennedy, 1967), is to suppose 
that processing of the sequence consists 
in first accepting a whole string of elements, 
e.g. those making up a phrase or other 
"decision unit" (Miller, 1962), and then 
processing the whole string simultaneously 
upon reaching the end of the unit, the 
occurrence of which has just been signaled 
by a pause or other terminal cue, This 
alternative also does not take account of 
any temporal redundancies in the sequence, 
and, like the analysis-by-synthesis (Halle 
& Stevens, 1964) and similar models, 
would appear to require extremely rapid 
processing between decision units. By 
contrast, the present theory suggests that 
the mechanisms for on-line listening in real 
time are optimally designed to make use 
of the temporal redundancies always pres- 
ent in humanly produced, i.e., rhythmically 
structured, sound inputs. The redundan- 
cies are highly correlated with the distri- 
bution of information (melodic or lin- 
guistic) in the sequences and hence are 
not ignored but. are made use of during 
ongomg perception in a highly efficient 
way (Martin, 1972). 

Since human sound patterns are in- 
herently rhythmic, however, these notions 
cannot easily be tested using intact spoken 
or other humanly produced sounds. One 
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possibility is to temporally disto 
nected speech sequences in some a ial 
way. Another is to use electrome - 
ally timed nonspeech sounds, as w 

in the present experiments. Same-d 
judgments were used in the experimen 
to get at the more perceptual aspects of 
input processing. However, inputs in 
memory experiments also are fre tly 
sequential so that, in the present view, 
the results of memory experiments 
can be affected by the temporal structure 
of the input. A conventional memory 


ilso 


task—written recall—was used to show 
that this is the case. 

The stimulus inputs used in the experi- 
ments were repeating auditory temporal 
patterns consisting of sequences of evenly 


spaced high- and low-buzz tones. ch 
pattern was either 7 or 8 elements in 
length, and began with a high tone and 
ended with a low tone. The first (high) 
tone was assumed to be heard as accented, 
as were all following run-initial high tones. 
Some patterns were rhythmic as defined 
by the rhythm rule presented elsewhere 
(Martin, 1972), and some were For 
instance, a sequence of 8 equidisiant ele- 
ments is rhythmic for present purposes if 
accents are located on elements in Serial 
Positions 1 + 3 and/or 5, i.e., accents are 
2 elements apart or some very simple 
multiple thereof. In consequence, hearing 
the early accents in a pattern generates 
expectancies concerning later accents. Ad- 
ditionally and just as important, these 
patterns generate expectancies concerning 
their termination (an even-numbered ele- 
ment), hence expectancies concerning when 
they begin repeating (an odd-numbered 
element). By contrast, an 8-element pat- 
tern is nonrhythmic if accents are located 
on Elements 1 + 4, since this pattern of 
accents generates initial expectancies for 
triplets Such expectancies will be vio- 
lated, since triplets are appropriate for 
9- (or 6-) element patterns but not for 
8-element patterns. Further examples of 
nonrhythmic patterns are those having 
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Obviously there cannot be accents on Positions, 
3 or 5 if there is one on Position 4, if accents are 
defined as run-initial high tones. 


RHYTHMIC STRUCTURE IN AUDITORY TEMPORAL PERCEPTION 


accents on Elements 1, 4, and 6, or on 1, 
3, and 6. These patterns mix duples and 
triplets, hence generate either conflicting 
expectancies or else no expectancies in 
particular, including: expectancies concern- 
ing the termination of the pattern. Thus, 
receiving no impression of a coherent pat- 
tern during listening, it should be more 
difficult for S to recognize the pattern as 
repeating or to recall it. 

Besides rhythmic and nonrhythmic 8-ele- 
ment patterns, matching 7-element pat- 
terns were also used in the experiments. 
These are interesting because while they 
are shorter than 8-element patterns, hence 
on most theories should require less pro- 
cessing, they cannot have a simple rhythm, 
ie., accents cannot be equidistant. To 
demonstrate to yourself that 8-element 
patterns are simpler and more natural 
than 7-element patterns, tap an 8-element 
pattern on your desk, continuously and 
repeatedly, as fast as you can, accenting 
Taps 1, 3, and 5. Then try a 7-element 
pattern, again accenting Taps 1, 3, and 5. 
If not immediately convinced, practice a 
bit. For most people it may take some 
time before the 7-element patterns become 
smoothly and evenly repeated. On the 
present view, the relative naturalness of 
evenly spaced accents during production 


has consequences for perception also, 
which can be demonstrated under ap- 
propriate experimental conditions. For 


instance, a repeating 7-element pattern 
with accents on Elements 1 and 5, or on 
Elements 1, 3, and 5, generates expectan- 
cies during listening for an 8-element pat- 
tern which, however, will be violated since 
the expected pattern begins repeating one 
element too soon. In consequence, it 
should be harder to recognize the pattern 
as repeating. Additionally, with respect 
to memory, 7-element patterns should also 
be harder to recall than 8-element patterns, 
even if, as in these experiments, 7- and 
8-element patterns are equivalent in num- 
ber and location of accents, and in number 


4Of course, there are 7-beat musical rhythms, 
but these require more sophisticated responses in 
perception or production than do simpler rhythms. 


379 


TABLE 1 


EXPERIMENTAL PATTERNS 


Good 


8 element | 7 element 8 element | 7 element 
10001110 | 1000110 10011100 1001100 
11001000 | 1100100 11010000 1101000 
11101000 | 1110100 11100100 1110010 
11001110 | 1100110 | 11000110 1100010 
10101000 1010100 10110110 1011010 
10101110 | 1010110 | 11010100 1101010 


Note. 1 = high tone and 0 = low tone. C 
mic patterns and poor refers to nonrhythmic patterns, 


fering only in that one group (or chunk) 
is smaller. 

Experimental patterns were presented 
continuously, but only twice, i.e., repeated 
only once. The 7- and 8-element patterns 
were mixed. There were also dummy pat- 
terns that changed somewhat, i.e., did not 
repeat exactly, so that S's first task was 
to judge same-different, i.e., repeated or 
changed. The second task was to write 
the pattern(s). The 3 experiments were 
identical except for presentation rate and 
response delay time, and will be discussed 
together whenever convenient. 


METHOD 
Experiment I 
Subjects and design. The Ss were 44 under- 
graduates from the University of Maryland. 


Rhythmic structure (rhythmic vs. nonrhythmic) 
and pattern length (7 vs. 8 tones) were combined 
factorially as within-Ss variables. 

Materials. The buzz tones were approximately 
180 Hz. (high) and 150 Hz. (low) presented at 
a 3.3/sec rate. The tones were 200 msec. long, 
with 100-msec. silent intervals between them. 

The experimental, or repeating (same), patterns 
are shown in ble 1. Each begin with a high 
tone as preferred by most Ss (Royer & Garner, 
1970) and assumed to be heard as accented (Garner, 
1970a). Looking at 8-element patterns first, the 
“good” (rhythmic) patterns had accents (single or 
run-initial high tones) on Serial Positions 1 and 5, 
and in 2 cases also on Position 3. The “poor” 
(nonrhythmic) patterns had accents on Positions 
1, 4, and/or 6. 

As Table 1 shows, each 8-element pattern was 
matched by a 7-element pattern constructed by 
deleting one item from the end of a run in its 
matching 8-element pattern at Serial Position 6 
or 7. Thus, the 7- and 8-element patterns were 
exactly equivalent in the number of runs and in 
the number and location of accents. They were 


TABLE 2 
MEAN PROPORTION Correct JUDGMENTS 


| Pattern 


| E- 

Experiment A x Az = 
element- | element- | element- | element- 

good | poor | good | poor 

PN =a | 83 | og |OGro| gi 

(N= 24) | 80 | 24 | 273 | Jot 

IEN 242) | ‘30 | :38 | ‘69 | ‘53 


Note. Good refers to rhythmic patterns and poor refers to 
nonrhythmic patterns, 


also equivalent in number and content of groups, 
chunks, ete., except that one group (or chunk) in 
the 7-element pattern was smaller (shorter). Each 
pattern was recorded twice continuously (repeated). 
Thus, each sequence was presented as a series of 
14 equal-interval tones (the 7-element patterns) 
or 16 equal-interval tones (the 8-element patterns), 

In addition to the 6 repeating (same) patterns 
of each type in the 2 X 2 design there were also 
4 dummy patterns of each type that did not repeat 
but were changed slightly (different), These dummy 
Sequences are not shown, For each of the dummy 
subsets, as in the experimental subsets (Table 1) 
there were 2 3-accent patterns and 4 2-accent 
patterns; in each of the subsets, half of the elements 
were high tones and half were low tones. 

The 40 patterns (24 experimental and 16 dummy) 
were combined in a mixed list and were divided 
into 2 blocks of 20 patterns each with the following 
restrictions. Half of each t of items was in 
each block and only one of each pair of matching 
7- and 8-item patterns was in each block. In order 
to avoid repetition effe whenever (half of) a 
dummy sequence was the same as an experimental 
pattern it alw. followed the experimental pattern 
in the block. Half of the Ss heard Block A first; 
the other half heard Block B first. 

Procedure, The Ss first heard practice sequences 
with instructions to write the pattern, A non- 
repeated pattern of each length was presented first, 
then 4 repeating patterns, one each of 7- and 
Selement pattern repeated (same) and c ged 

(different), After hearing each sequence, made 
their judgments, wrote the entire pattern, and 
were given feedback about the correct responses, 
No feedback was given during the presentation 
of the experimental patterns that followed. 

The Ss in small groups heard the 
monaurally through headphones and responded 
after each on an answer sheet which for each se- 
quence provided the letters S and D and 2 solid 
lines for writing the 2 patterns. The Ss’ first task 
was to judge same or different and circle S or D. 
The Ss’ second task was to write the entire sequence 
whether repeating or changed with 1 representing 
a high tone and 0 a low tone. They were told 
that each pattern began with a high tone and 
ended with a low tone, to guess if uncertain, and 
to mark an X when unable to guess, 


Sequences 
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Experiment II 


The Ss were 24 undegraduates fr 
versity of Maryland. This experiment \ 
to Experiment | except that the tones 
sented at the rate of 4.4/sec. The toi 
150 msec. long, with 75-msec. silent 
between them, 


inter 


Experiment ITT 


The Ss were 42 undergraduates from the Uni- 
versity of Maryland, This experiment was iden- 
tical to Experiment II except that after each 
presentation of the sequence there was a 4- 
unfilled forced-delay interval before Ss could re- 
spond, The answer sheet had 3 lines, labeled A, 
B, and C, for each sequence. Four seconds after 
the end of the sequence E held up a card showing 
either A, B, or C to indicate the line on which 
S was to write his responses. 


RESULTS 


The following results are for the experi- 
mental sequences only; data for dummy 
sequences are not reported. 


Judgments 


The mean proportions of correct judg- 
ments over the 2 trials are shown in 
Table 2. For all experiments there were 
more correct judgments for good patterns 
than for poor patterns, as predicted: Ex- 
periment I, F (1, 43) — 7.90, p «€ .01; 
Experiment II, F (1, 23) = 11.01, p < .01; 


and Experiment Ill, F (1, 41) — 56.38, 
p €.001. For Experiment I the judg- 
ments for the 8-element patterns v ] 
significantly better than for the 7- 

patterns, F (1, 43) = 2.94, p < .10. Kor 


Experiments II and III this difference was 
significant: Experiment Il, F (1, 23) 
= 11.20, p < .01, and Experiment III, 
F (1, 41) = 9.55, p < .01. There were 
no interactions, In short, good patterns 
Were more easily recognized as repeating 
than were poor patterns in all 3 experi- 
ments. Except for Experiment I (slower 
presentation rate), 8-element patterns were 
more easily recognized as repeating than 
T-element patterns, as predicted. 


Paltern Recall 


Two measures were analyzed. The first 
was the number of repeating patterns 


e 
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TABLE 3 


N CORRECT PATTE 


RECALL 


Trials 1 and 2 combined 


| Trial 2 plus judgment correct 


Experiment 


es Leucine |? T-element- | 7-element- | S-element- | 8-element- | 7-element- | 7-element- 
good | poor good (Do meer good | poor | good | poor 
m eis | TE |- 
I | 2.15 3.78 | 2.88 | 2.30 | 1.14 | 2.02 1.59 
Il 1.63 2.25 1.67 1.75 | .96 1.00 n 
Iu | 1.14 2.73 1.12 1.79 | .69 | 1.43 48 


Note. Good refers to rhythmic pattern and poor refers to nonrhythmic pattern. 


completely correct over the 2 trials com- 
bined, ignoring the second half of the 
written response and also whether or not 
the same-differenl judgment was correct. 
The Trials 1 and 2 combined data in Table 3 
show that in all 3 experiments there was 
better pattern recall for the good than for 
the poor patterns: Experiment I, F (1, 43) 
= 29.27, p< .004; Experiment II, F 
(1, 23) = 11.51, p< .01; and Experi- 
ment. III, F (1, 41) = 58.70, p < .001. 
Using this response measure, there was no 
significant overall difference between 8- 
and 7-element patterns on any of the 
experiments (although the direction was 
in favor of 8-element patterns in the last 
2 experiments). In Experiments I and I 
there was a significant interaction: Ex- 
periment I, F (1, 43) = 8.71, p < .01, and 
Experiment LI, F (1, 23) = 6.16, p < .025. 
These interactions mean that the longer 
patterns gain relatively more when the 
patterns have a good structure, namely, 
rhythmic. 

The second written recall measure was 
the number of patterns completely correct 
on Trial 2 when S also had accurately 
judged the pattern as repeated. They have 
the advantage of practice and of more 
confident judgments and should be some- 
what more reliable. The Trial 2 plus 
judgment correct data in Table 3 show 
that for all 3 experiments the good pat- 
terns were easier than the poor patterns: 
Experiment I, F (1, 43) = 16.59, p < .001; 
Experiment II, F (1, 23) = 6.51, p < .05; 
and Experiment III, F (1, 41) = 52.36, 
p < .001. In Experiment I there was an 
interaction, F (1, 43) = 7.76, p < .01, 
again showing the relative disadvantage 
of greater length when the patterns were 


nonrhythmic. For Experiments II and II 
the 8-element patterns were recalled sig- 
nificantly better than the B oS pat- 
terns: Experiment II, F (1, 23) = 8.46, 
p < .01, and Experiment III, P (1, 41) 
= 7.43, p < .01. 

In short, rhythmic patterns are generally. 
easier to recall than nonrhythmic patterns. 
Extra length depresses recall for non- 
rhythmic (nonstructured) sequences but 
not for rhythmic sequences. 


DISCUSSION 


Under the conditions of these experiments, 
auditory temporal patterns generally were 
more difficult to recognize as repeating when 
they were nonrhythmic than when they were 
rhythmic, and were often less easily recalled 
when nonrhythmic than when rhythmic. This 
result generally held even when the rhythmic 
patterns (8-element good) were made non- 
rhythmic, or at least rhythmically more com- 
plex, by simply reducing their length (7-ele- 
ment good). Considering the recognition 
(judgment) task first, these results appear to 
go against element-by-element registration as 
a useful perceptual strategy, either deliberate 
or "automatic," particularly when the input 
rate is fairly rapid (Experiments II and IIT). 
(This is not to say that the listener could 
never profit by attempting such a strategy.) 
It is also unlikely that the listener in any 
sense compares the 2 patterns in the sequence 
by somehow holding them as units in echoic 
memo! Also unlikely is that the differences 
between 7- and 8-element patterns could be 
explained by some measure of sequential 
complexity other than rhythmic complexity, 
since only on rhythmic grounds is it easy to 
see why taking one element out of a run (or 
subgroup) should make the pattern more 
complex. More likely is that the listener 
“feels” the coherence of a rhythmic pattern 
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SERIAL POSITION 


FiGURE 


Serial position curves for the second trial of Experiment II], 


(The first 8 positions are shown, so that for 7-element patterns the last 


position represents the fi 


rhythmic pattern.) 

as he hears it, and since this includes the 
"knowledge" that the pattern has terminated, 
hence the beginning of the next, the remainder 
of the sequence is more readily recognized as 
& repetition of the pattern just heard. This 
conclusion, that rhythmic patterns are some- 
how "easier" or more "natural," supports the 
hypothesis that listening mechanisms are 
optimally designed to handle sequential inputs 
that have a rhythmic organization. 

It should be mentioned that an alternative 
experimental method would have been to 
present patterns of a given length in blocks, 
so that S would know in advance which 
length to expect, and hence could use various 
deliberate strategies. There is little doubt 
that 5 would do better with 7-element pat- 
terns than 8-element patterns under such 
conditions (though not better with poor pat- 
terns than with good patterns). This con- 
clusion is hinted at in Table 1, which shows 
insignificant main effects of pattern length on 
judgments with the slower rate. in Experi- 
ment I. But results favoring shorter patterns 
are not interesting in themselves since they 
are expected on nearly any theory of informa- 
tion processing. 

Turning to written recall, consider the serial 
position curves in Figure 1, which were ob- 
tained from the second trial of Experiment III 
(similar curves were obtained from Experi- 
ments I and II). Position 8 was scored as 
correct for the 7-element patterns only when 
left blank. For 8-element patterns it was 


st element of the pattern as it was repeated. Ab- 
breviations: G = good, ie, rhythmic pattern, and P = poor, i.e 


, non- 


scored correct if 0 was recorded but inc 
either for 1 or a blank. 

"These curves are not quite conve: 
several respects. For one, they repre 
the first half of the sequences act € 
sented to S and his responses. For a r 
the point representing the first tone is 
connected to the following points, sinc 
knew it was always high, and Position 8 in 
the sequence is not connected. The S would 
know whether this was high or low if hc 
knew the length of the pattern he had just 
heard. Obviously he often did not, as can 
be seen by noting the poor scores at Posi- 
tions 7 and 8. 

Several aspects of these curves are of 
interest. For one, when Positions 1 and 8 are 
disregarded, the curves for good patterns are 
relatively flat. Poor patterns, on the other 
hand, show the dip in the middle typical of 
serial position curves for arbitrary sequenci 
of memory items. 
be in part a respo 


rect 


nal in 


result, however, may 
bias toward writing 
good patterns, e.g., toward marking Position 5 
as accented. Such a bias would result in an 
elevation at that position for good patterns 
and a depression for poor patterns, both 
tendencies of which can be seen in Figure 1.5 
Further suggestions for a response bias can 


* This point was tested by linear contrasts of 
Position 5 against the average of Positions 4 and 6 
for each of the 4 string types. All but the 8-cle- 
ment-good patterns were reliable (p < .01). 
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be seen in Table 3. Here, although the 
input stimuli were the same in Experiments 
II and III, the delay in Experiment III 
resulted in lower scores for 8- and 7-element- 
poor patterns, compared to Experiment II, 
but higher scores for 7-element-good patterns 
(8-element-good patterns remained un- 
changed). These results suggest a "drift" 
toward good patterns during the delay in- 
terval. Both of these tendencies, a flat serial 
position curve and a response bias in favor 
of good patterns, are to be expected on the 
view that rhythmic patterns are gestaltlike 
holistic entities. 

Another result seen in Table 2 is worth 
mentioning. While the input sequences were 
exactly the same in Experiments II and III, 
the latter required 5 to make a delayed 
response. Comparing Experiments II and III, 
note that the delay depressed correct judg- 
ments for nonrhythmic patterns. Why should 
this happen, since presumably the judgments 
are made immediately even if the written 
response must wait? Most likely, the answer 
is that when patterns are nonrhythmic the 
impression of a coherent pattern is less firm 
and evaporates quickly over time, with the 
result that S now reports that the pattern 
did not repeat. 

Finally, speculating on the implications of 
these experiments, the main point concerns 
the hypothesis that meaningful auditory pat- 
terns occurring in nature generally have a 
temporally redundant (rhythmic) organiza- 
tion, and that perceptual mechanisms have 
evolved to expect and make use of these 
redundancies. The importance of temporal 
structure can be seen in the fact that simple 
musical tunes remain recognizable when pitch 
variations are eliminated with timing rela- 
tionships intact, but not vice versa (Lenne- 
berg, 1967). A sequence (concatenation) of 
notes is not a tune unless the notes are in 
the right temporal places. Similarly, speech 
appears to be organized in such a way that 
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the more informative linguistic elements are 
temporally predictable, i.e., organized around 
the accent structure of the sequence (Martin, 
1972). It is natural to speculate that the 
temporal redundancies in speech at the level 
of sound facilitate relatively “automatic” 
processing at that level, thus freeing psy- 
chological resources for "attending" to the 
level of meaning. The fact that spontaneous 
speech departs from the received grammar, 
yet is understood nevertheless ( Martin, 1967), 
is in line with this view. The listener can 
depend upon temporal, if not syntactic, regu- 
larities in the speech he hears. 


RE CES 


GARNER, W. R. Good patterns have few alter- 
natives. American Scientist, 1970, 58, 34-42. (a) 
Garner, W. R. The stimulus in information 
processing. American Psychologist, 1970, 25, 


350-358. (b) 

HALLE, M., & Stevens, K. N. Speech recognition: 
A model and a program for research. In J. A. 
Fodor & J. J. Katz (Eds), The structure of 
language: Readings in the philosophy of language. 
Englewood Cliffs, N.J.; Prentice-Hall, 1964. 

LEN RG, E. H. Biological foundations of lan- 
guage. New York: Wiley, 1967. 

LIBERMAN, A. M., Cooper, F. S., SHANKWEILER, 
D:-P3 DDERT-KENNEDY, M. Perception 
of the speech code. Psychological Review, 1967, 
74, 431-461. 

Martin, J. G. Hesitations in the speaker's pro- 
duction and listener's reproduction of utterances. 
Journal of Verbal Learning and Verbal Behavior, 
1967, 6, 903-909, 

Martin, J. G. Rhythmic (hierarchical) vs. serial 
structure in speech and other behavior. Psy- 
chological Review, 1972, 6, 487—509. 

MiLLER, G. A. Decision units in the perception 
of speech. J.R.E. Transactions on Information 
Theory, 1962, 8, 81-83, 

Royer, F. L., & Garner, W. R. Perceptual 
organization of nine-element auditory temporal 
patterns. Perception & Psychophysics, 1970, 7, 
115-120. 


(Received February 5, 1973) 


Journal of Experimental Psychology. 
1974, Vol, 102, No. 3, 384-392 


MEASURES OF INFORMATION PROCESSING 
IN CONCEPT IDENTIFICATION ! 


DALE E. BERGER? 


Claremont Graduate School 


A method was developed to show multiple-hypothesis behavior and states of 
information during concept identification (CI). Compared to Levine's 1966 
blank-trial method, the new method provided more detail about information 
held b at each stage of a problem and allowed examination of process 
assumptions of current CI models, Most Ss attempted to process information 
on all dimensions simultaneously; some initially selected a subset of dimen- 
sions to proce Processing errors were made following positive as well as 
negative feedback. No CI model could account for the data in detail, al- 
though protocols from individual Ss were consistent with aspects of the 
various models. A random-reversal assumption was suggested as a mechanism 
to account for departures from perfect processing. 


Recent theoretical approaches have dis- 
cussed behavior in concept identification 
(CI) tasks within the framework of hypoth- 
esis testing. Much of the work (e.g., 
Bower & Trabasso, 1964; Falmagne, 1972; 
Levine, 1966; Trabasso & Bower, 1968; 
Wickens & Millward, 1971) grew out of the 
“strategy selection” theory formulated by 
Restle (1962). This theory postulates 
sampling with replacement from a known 
set of hypotheses following each error. 

Since the original investigation by 
Bruner, Goodnow, and Austin (1956), 
researchers have been concerned with the 
difficulties of externalizing strategies used 
in attaining concepts. This remains a 
most difficult problem. In a recent review 
of research on strategies in Cl, Taplin and 
Jeeves (1972) concluded that valid and 
reliable indices of the processes of CI must 
be established if information-processing 
theories are to remain a promising approach 
to understanding concept attainment. 

1 This article is based on a PhD dissertation sub- 
mitted to the University of California, Los Angeles, 
A shorter version was presented at the meeting of 
the Western Psychological Association, Portland, 
April 1972. The research was facilitated in part by 
grants from the University of California, Los 
Angeles, Graduate Student Patent Fund and the 
Campus Computing Network to the author, and 
U.S. Public Health Service Grant MH-08741 to 
T. Trabasso. The author wishes to express his 
thanks to T. Trabasso, M. Friedman, and T. 
Wickens, who offered many valuable suggestions. 

? Requests for reprints should be sent to Dale E. 
Berger, Department of Psychology, Claremont 
Graduate School, Claremont, California 91711, 


Levine (1966) introduced a technique 
that provides some information about 
hypothesis behavior of individual Ss i 
Cl task. Assuming S's responses between 
errors are determined by a single hypothesis, 
the pattern of responses to a set of blank 
trials (trials with no feedback) can be used 
to identify the hypothesis. Levine's blank- 
trial technique has been used extensively 
in testing implications of mathematical 
models of CI (cf. Erickson, 1968; Levine, 
1969). 

Levine's (1966) hypothesis-tracking tech- 
nique will be called multiple-cue (MC) to 
contrast it with an alternative single-cue 
(SC) technique introduced in experiments 
to be reported here. The difference is in 
the complexity of the stimuli used for blank 
trials: The MC method uses complex 
stimuli which each have a value on every 
dimension, whereas the SC method uses 
stimuli which each have only one value on 
à single dimension (see Figure 1). 

With the SC blank-trial technique, each 
stimulus dimension is represented in the 
blank-trial set by a single value presented 
in isolation. The S is given 3 response 
alternatives to SC stimuli: If he feels the 
cue is potentially relevant to solution, he 
is allowed to classify it into the correct 
category ; if he has eliminated the dimension 
from the set of possible solutions, he would, 
respond irrélevant; if he does not remember 
how the cue should be classified, he would 
respond don’t remember. When a set of 
SC blank trials follows every feedback 
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MULTIPLE-CUE 


FIGURE f. 


trial (FT), it is possible to recover the 
information .S has on each dimension at 
each stage of the problem. 

The SC blank-trial procedure allows 
detailed examination of processing errors 
during the course of solving a CI problem. 
Eight types of errors account for all possible 
deviations from perfect processing. Table 1 
shows the possible combinations of infor- 
mation held on a given dimension before 
an FT, the direction of the feedback, classi- 
fication after the feedback, and the type of 


SINGLE- CUE 


Examples of multiple-cue and single-cue blank-trial test sets. 


processing error, if any. If a dimension is 
thought to be relevant, it can be classified 
in either of 2 directions, labeled A and B in 
Table 1. Note that for the purpose of 
classifving errors, a don't remember response 
is here considered equivalent to irrelevant 
since both indicate that S is unable to 
classify the dimension. 

The 8 types of errors or deviations from 
perfect processing in a simple binary CI 
task are as follows: 

1. Initial omission—a failure to classify 


TABLE 1 


PossrBLE ERRORS IN SIMPLE 
CONCEPT IDENTIFICATION 


PROBLEMS 
Classification | (eeedbéck, | Classification | Type of error 
before feed- "er after feed- Ut any) 
back trial MET back trial 
gory A or B) | 
No. hypothesis A | (Correct) 
(before first A B Initial 
feedback reversal 
trial) Irrelevant Initi; 
omission 
A | (Correct) 
A B Inconsistency 
| reversal 
A Irrelevant Premature 
| elimination 
A Failure to 
eliminate 
B B Consistency 
reversal 
Irrelevant | (Correct) 
A sistent 
resample 
Irrelevant A B Inconsistent 
resample 
| Irrelevant | (Correct) 


a dimension in either direction following 
the first FT (FT), because information has 
either been forgotten or was never encoded. 

2. Initial reversal—classification, follow- 
ing FT), of a value on a dimension Opposite 
to the direction supported by FT;. 

3. Failure to eliminate—retaining a hy- 
pothesis following an FT where the hypoth- 
esis could logically be eliminated. 

4. Premature elimination—elimination of 
a dimension before it could logically be 
eliminated, 

5. Consistency reversal—reversal of the 
previous classification on a dimension, 
making the new classification consistent 
with the last FT. Since information from 
the last FT was inconsistent with informa- 
tion held previous to the FT, the dimen- 
sion should have been eliminated, 

6. Inconsistency reversal—reversal of the 
previous classification on a dimension, 
making the new hypothesis inconsistent 
with the last FT. Since information on the 
last FT was consistent with the previous 
hypothesis, this hypothesis should have 
been maintained. 

7. Consistent resample—classification of 
a dimension which had previously been 
considered irrelevant to the solution ina way 
that is consistent with the last FT. 
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8. Inconsistent resample—ciascification of 
a dimension that had previously been con- 
sidered irrelevant in a way that is incon- 
sistent with the last FT. 

Distributions of processing errors from 
individual Ss can be used to test assump- 
tions of various models of Cl. For example, 
the hypothesis-manipulation model (Chum- 
bley, 1969) assumes perfect 
following a correct response 
resampling following a proj: 
of error trials. Thus, resam; 
the only errors predicted by t 
they are expected to occur oni; 
an error trial. 

It should be noted that the MC blank- 
trial technique does not provide enough 
information to allow a comparable detaile 
examination of the assumptions of mod 
Only one hypothesis can be re 
each MC blank-trial-set, so it npossible 
to determine which other hypotheses 5 ma y 
be tracking at that time. 

The present experiment was designed to 
compare the MC and SC blank-trial tech- 
niques and to generate distributions of 
processing errors so models of CI could be 


processing 
but massive 
ion (1 — 1) 
‘rrors are 
nodel, and 
following 


led by 


evaluated. In a preliminary study with 
4-dimensional CI problems, practiced 
college students performed so wel! that 


most made too few processing errors of 
any type to allow detailed analysis (Berger, 
1970). However, performance was poorer 
on problems with interpolated MC blank 
trials than on problems with SC blank 
trials. The present experiment s an 
extension to more difficult probiems with 
8 dimensions. Differences between the SC 
and MC test procedures were reduced by 
signaling the trials on which S would re- 
ceive feedback, and the effect of the interpo- 
lated tests on solution rate was evaluated 
by comparison with a control condition 
which had no interpolated tests. 
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Subjects. The Ss were volunteers from introduc- 
tory psychology classes at the University of Cali- 
fornia, Los Angeles, who were fulfilling a class re- 
quirement, They were paid $1.50/hr for any time 
beyond their class requirement. Twenty-four Ss 
participated in the experimental group, and 10 Ss 
were used in a control condition, 
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Apparatus and materials, Stimuli were rear pro- 
jected onto a screen in front of S by a Sawyer 
550ER projector. A Davis timer was used to con- 
trol the presentation of feedback. The response 
panel contained a button labeled “next” to advance 
the projector, and 4 labeled response choice buttons. 
There was 1 response button for each of 2 clas 
tion categories, a button labeled “irrelevant,” a 
a button labeled “don't remember." The 2 category 
buttons had lights over them to indicate feedback. 
Also, there was a small light directly below the 
screen to signal the FTs. 

The FT and MC test patterns each contained 
8 binary attributes, and the SC stimuli each con- 
sisted of a single value (see Figure 1). 

Problems. The stimulus sets used for FTs were 
arranged so that within each set every pattern 
differed from every other pattern on exactly half of 
the dimensions. Thus, an S who processed informa- 
tion perfectly would eliminate half of the remaining 
solutions on each FT. Since there are 16 elementary 
hypotheses in a problem with 8 binary dimensions, 
an S who eliminated half of the hypothesis following 
each FT would be left with a single hypothe 
following FT, Five FTs were given in these expe 
ments, since it was anticipated that most Ss would 
not solve these problems as rapidly as was logically 
possible. The problems were arranged so that each 
of the elementary hypotheses was the correct solu- 
tion equally often. 

Patterns in each test set for the MC blank trials 
met the requirements that (a) each stimulus differ 
from all other stimuli in the set on exactly half of 
the dimensions, and (b) both values of each dimen- 
sion occur at t once in each test set. These 
restrictions insure that within each subset it is 
impossible for S to be correct every time if he 
responds on the basis of a response bias, and if S 
responds on the basis of a single hypothesis, his 
response protocol for any subset is unique for each 
of the possible hypotheses, Only 4 patterns are 
actually necessary in the subset to determine the 
hypothesis with which S responded, but following 
Levine (1966), a fifth pattern was included to pro- 
vide a consistency check (see Figure 1). 

The set of patterns for the SC test trials consisted 
of 9 stimuli, each composed of 1 value on a single 
dimension. The first 8 patterns included 1 value 
from each of the 8 dimensions, and the ninth pattern 
served as a consistency check. With equal fre- 
quencies, this last pattern was the complement of a 
value which had occurred earlier in Position 1, 3, 5, 
or 7 in the test set. 

The problems alternated between those with SC 
and those with MC tests following each FT. The 
control condition had no interpolated tests, but both 
types of tests were given after the last FT in all 
conditions, and S was asked what he knew about 
the problem. 

Procedure, Since the focus of the experiment was 
on asymptotic behavior, extensive training was 
given to Ss. Every Sattempted 7 practice problems 
with 4 dimensions and 4 practice and 16 experi- 
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TABLE 2 


COMPARISON OF B AND Pool 
PROPORTION OF PROBLEMS SOL’ 
INCONSISTENCIES ON MULTIPLE-CUE 
(MC) AND SINGLE-CUE 
BLANK-TRIAL SETS 


| Solution rates 


s GAA ee AD 
les | 2 3 | 4 
Beste — | .50 | .66 | .70 | .70 | 198 | .041 
Poorest* | .08 | .06 | .19 | .23 | .242 | 173 
Control? | .20 | 38 | 43 | 45 |— - 
| 


+ Blocks of 4 problems, 

gue 

n= 
dn = 10, 
mental problems with 8-dimensional patterns, The 
Ss in the control condition were given the same 
instructions and practice problems. Most Ss re- 
quired less than 33 hr. 

After S responded to a pattern, he pressed a 
central button labeled “next.” This stepped the 
projector and brought S's hand to à neutral position, 
On FTs, this also caused a light to appear for 1 sec. 
between the label and button of the correct cli 
The feedback light appeared just after the stimulus 
was removed, and was off before the next stimulus 
was projected. 

In response to an MC pattern, S had to press 1 
of the 2 classification buttons, Given an SC pattern, 
Shad 3 options. If he thought the value had always 
been consistent with only 1 category, he was to 
press the corresponding button, If the value had 
been paired with both categories, he was to press the 
irrelevant button. If he could not remember to 
which category the value belonged, he was to press. 
the don't remember button. 


RESULTS AND DISCUSSION 


Each .S attempted 16 experimental 
problems. After the last FT in each prob- 
lem, every S was given an SC and MC test 
series and was asked what he knew about 
the problem. An S was considered to 
have solved a problem when the results 
from both cue tests agreed with the correct 
hvpothesis. Individual differences were 
very large, with Ss solving 0-13 of the 16 
problems. The 11 best Ss solved at least 
7 problems, and none of the remaining 
13 Ss solved more than 4 problems. Be- 
cause of this natural break, Ss were divided 
into 2 groups of unequal numbers for sub- 
sequent analyses. Overall performance 
improved rapidly over the first half of the 
experiment, and was relatively stable 
during the last half (Table 2). By the 
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second block of 4 problems, the better Ss 
were performing nearly as well as they did 
during the rest of the experiment. The 
poorer Ss, however, did not show much 
improvement until the third block of 
problems, and it is likely that they were 
still improving at the end of the experiment. 

Comparison of tests. The 2 types of 
interpolated tests yielded similar accounts 
of the composition of the hypothesis sets 
after FTs. Of the 384 problems (16 prob- 
lems from 24 Ss), both tests agreed that 
the correct solution had been reached on 
142 problems, and on 62 problems, both 
tests agreed that a particular incorrect 
hypothesis had been reached as a solution. 
On 149 other problems, the SC test indi- 
cated that SS was tracking more than 1 
hypothesis following FT;. The remaining 
31 problems showed inconsistencies on the 
MC test, failure to track any hypothesis 
on the SC test, or disagreement between 
the 2 tests. 

Effect of interpolated tests. In contrast to 
the pilot studies where SC problems were 
easier than MC problems, the proportions 
of SC and MC problems solved were 
exactly the same, 71/192 = .37. The 
signal that Ss in the present experiment 
received at the beginning of FTs to dis- 
tinguish them from MC blank trials is 
probably a relevant factor here. On each 
trial, the stimulus was removed before 
feedback was given, so the signal probably 
helped S to avoid processing irrelevant 
information in the MC blank trials and 
thus made the 2 probe tests more 
comparable. 

The effect of interpolated test series on 
overall rate of problem solving can be 
gauged by comparing the proportion of Ss 
who solved each experimental problem 
with the control condition. Ten Ss in the 
control condition solved the same problems 
as experimental Ss, but the 5 FTs had no 
tests interpolated between them. There 
was no difference in the proportion of 
problems solved by control Ss (58/160 
= .36) and by experimental Ss(.37), so the 
interpolated tests did not influence overall 
rate of solving. 


Inconsistencies in multiple-cue tests. Vhe 


DALE E. BERGER 


overall proportion of MC test protocols 
that were not consistent with any single 
hypothesis was There was no dif- 
ference between the best and poorest Ss, 
U (11, 13) = 56.5, p > .25 (see Table 2). 

Inconsistencies in MC test protocols have 
been interpreted as being the result of 
random response errors, sometimes called 
"oops errors" (Levine, 1969). However, 
s of the inconsistent response 
showed that they were not all 
due to random response errors since many 
could be attributed to a response rule used 
by Ss. The "response bias” rule of respond- 
ing with the same category name to each 
stimulus in the MC test series was used 
more often than chance by 19 of the 22 Ss 
who showed MC inconsistencies (one-ta 
binomial, p < .001). Also, the likelihood 
of an inconsistency on the last MC 
depended on the number of hypotheses ihat 
S was tracking. When the SC test follow- 
ing the last FT indicated that more than 
1 hypothesis was considered to be possibly 
relevant, .161 of the MC test protocols 
were inconsistent, whereas only .041 were 
inconsistent when only 1 hypothesis was 
tracked on the SC test, X*(1) = 4.48, 
p < 001. 

Inconsistencies in single-cue tests. Nine 
blank trials were used in each SC test sc: 
Each of the first 8 trials tested a differ: 
dimension, and the ninth provided 
consistency check by retesting 1 of 
8 dimensions with the value opposite th 
Which had occurred earlier. An incon- 
sistency was said to have occurred if the 
test and retest did not give the same results. 
The best Ss had a much lower proportion 
of SC inconsistencies over the 5 FTs than 
did the poorest Ss, an average of .041 com- 
pared with .173 per test, U (11, 13) — 4, 
b < .001 (see Table 2). An SC inconsis- 
tency could indicate guessing, a response 
error, a hypothesis change, forgetting 
during the course of the set of blank trials, 
or that S had not integrated the informa- 
tion available on the given dimension so 
that it could be applied equally to both 
values. 


It should be noted that an inconsistency 
does not necessarily have the same meaning 
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for an SC and MC test. An SC incon- 
sistency means .S failed to classify the 2 
complementary values from a dimension in 
a consistent manner, while an MC incon- 
sistency means S did not respond on the 
basis of a single hypothesis possibly be- 
sause he used some other response rule. 

Size of hypothesis set. The collection of 
hypotheses that S considers to be poten- 
tially relevant at any given time will be 
called his hypothesis set. This set does not 
include values from a dimension that has 
been ignored or forgotten. The SC tests 
reveal directly the contents of the hypoth- 
esis set at each point in the problem, and 
it is a simple matter to count the number of 
hypotheses. Figure 2 shows the mean 
number of cues in the hypothesis set for 
the best and poorest Ss following positive 
and negative feedback on each FT. On 
average, the poorest Ss started with fewer 
hypotheses in their sets, but were less 
successful in eliminating hypotheses than 
were the best Ss. The size of the hypothesis 
set was independent of the kind of feed- 
back, in opposition to theories which 
assume that resampling is more likely to 
occur or will always occur following an 
error. lt might be noted, however, that 
Ss in the present study were highly 
experienced. 

Distributions of processing errors. An S 
who processes all information perfectly 
will start with 8 hypotheses after FT, and 
will eliminate all hypotheses which are not 
consistent with new information presented 
with each subsequent FT. As described in 
Table 1, there are 8 types of possible 
processing errors, Distributions of errors 
from each S as a function of positive or 
negative feedback on the preceding FT are 
shown in Table 3, with Ss arranged in 
descending order of success in solving the 
problems. 

Overall, processing errors were more 
likely to follow negative feedback than 
positive feedback, s = 4.06, p < .001. This 
may appear to be consistent with many 
information-processing theories of CI which 
assume that most or all processing errors 
occur following negative feedback (cf. 
Chumbley, 1969; Trabaso & Bower, 
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best Ss (+ feedback) 
best Ss (- feedback ) 


poorest Ss (« feedback ) 
poorest Ss (-feedback ) 


NUMBER OF HYPOTHESES 


FEEDBACK TRIAL 


FrGURE 2. Size of the hypothesis set as a function 
of feedback trial number and feedback for the best 
and poorest „$s, 


1968). However, the large proportion of 
errors following positive feedback (447) 
must be considered evidence counter to 
these theories. 

Strategies of individual Ss and departures 
from perfect processing can be identified 
from the distributions of errors. A par- 
ticularly straightforward example is Su, 
the last S in Table 3. His protocol clearly 
shows that he reduced his hypothesis set 
size to an average of 2 hypotheses at the 
beginning of the problems, and then used 
a strategy which required at least local 
consistency on resampling following errors. 
This S produced a total of 48 initial-omis- 
sion errors on the 8 SC problems, for an 
average of 6 of these errors per problem. 
Since there were 8 dimensions in these 
problems and he eliminated an average of 
6, Se, was tracking an average of only 
2 of 8 dimensions following FT;. This 
analysis was verified by a verbal report at 
the end of the experiment. When asked 
how he proceeded in his attempt to solve 
the problems, this S said he always started 
"with 1 or 2 parts." Most of the rest of 
the errors made by Ss, were resampling 
cues which had previously been eliminated. 
These almost always followed only trials 
with negative feedback, as would be ex- 
pected from an S who resamples only when 
the hypothesis he had been tracking was 
shown to be incorrect. The hypotheses he 
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selected for his new hypothesis set were 
always consistent with the last feedback 
(no inconsistent-resampling errors). This 
is the "local consistency" strategy as 
described by Gregg and Simon (1967). 

An S who used a selection strategy (cf. 
Restle, 1962) would select a subset of 
possible hypotheses following FT;, produc- 
ing many initial-omission errors. Only six 
Ss had over 5 initial-omission errors, but 
these 6 Ss produced 8-48 each (Table 3). 
Judging from the scarcity of initial-omis- 
sion errors in the remaining protocols, the 
most popular initial strategy was a “global” 
strategy (cf. Bruner et al., 1956), which 
requires S to process information on all 
dimensions simultaneously. Although the 
best Ss used this strategy with considerable 
success, the poorer Ss were overextended 
and generally made many premature- 
elimination and failure-to-eliminate errors. 

It is apparent, however, that no S per- 
formed perfectly in accord with any current 
CI model. Failure to predict error pro- 
tocols in detail is not surprising in that 
models typically do not include mecha- 
nisms to account for departures from 
idealized protocols. Such a mechanism is 
suggested by the current data and could 
be included in any model. 

The random-reversal assumption. Infor- 
mation held by Ss attempting to solve a 
CI problem may be considered in 3 parts: 
classification of a given dimension before 
an FT, cue value and feedback received 
from the FT, and classification of the 
dimension following the FT. The random- 
reversal assumption states that there is a 
small probability that information at any 
of these 3 steps may be lost or reversed. 
An alternative, suggested by Falmagne's 
(1972) model, is that .S has a confidence 
hierarchy for the information he holds on 
the various dimensions, and the probability: 
of information loss or reversal on a dimen- 
sion is inversely related to the confidence. 

The consequences of information loss or 
reversal at any 1 of the 3 steps may be 
examined with reference to Table 1. 
Reversal of the classification before an FT 
can lead to 1 of 2 types of errors, depending 
on subsequent feedback: If the feedback 
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is consistent with the new (reversed) classi- 
fication, the error will be consistency re- 
versal; if the feedback is inconsistent with. 
the reversed classification, the error will be 
premature elimination. Reversal of the 
classification following FT; leads to an 
initial-reversal error; reversal following a 
subsequent FT leads to an inconsistency- 
reversal error. 

Perhaps the most interesting predictions 
of the random-reversal assumption concern 
the consequences of reversal of information 
presented on an FT. If the information is 
changed to be consistent with the previous 
classification, the error will be failure to 
eliminate; if the information is changed to 
be inconsistent with the previous classifica- 
tion, the error will be premature elimina- 
tion. These 2 types of errors were by far 
the most common ; together, they accounted 
for almost half of the total errors (Table 3). 

Reversal of information could be due to 
incorrect encoding of the cue value or of 
the feedback direction; it could indicate 
a logical error in combining new informa- 
tion with old, or a memory failure followed 
by an incorrect guess. Any of these errors 
would contribute to degradation of ideal 
performance predicted by a simple model. 

An initial-omission error is probably less 
an error than the result of a decision to 
process fewer than the total number of 
available dimensions. Since Ss differ 
widely on this strategy (Table 3), it may 
be necessary to include initial sample set 
ize as a parameter in Cl models. 

The SC procedure on highly trained indi- 
vidual Ss was successful in providing direct 
evidence on a number of processing errors 
that have relevance to the kind of in- 
formation-processing strategy that Ss use 
in selecting and testing hypotheses in 
inductive reasoning such as Cl. The 
present analysis has shown that individual 
Ss might exhibit any of several alternative 
strategies, each of which is formalized in 
particular assumptions of existing models. 
In a sense, no model is entirely correct or 
wrong, since some protocols favor or con- 
tradict each model in turn. What appears 
to be lacking in the models are mechanisms 
such as forgetting or reversing information 


so that the idealized or formal strategies 
are not consistently exhibited. The present 
experiments and specific information- 
processing model indicated what erroneous 
behavior occurs and which mechanisms 
seem to be responsible for the departures 
from perfect or consistent processing. 
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ABSTRACT CONCEPT LEARNING IN THE PIGEON ' 


THOMAS ZENTALL? Ap DAVID HOGAN 


University of Pitishurgh 


In 1961, Cumming and Berryman found that color matching-to-sample train- 
ing with pigeons did not transfer to a new color. In Experiment I in the 
present study, pigeons were trained on either a 2-color matching-to-sample 
task or a 2-color oddity-from-sample task. In the transfer phase, the 2 train- 
ing colors were replaced by 2 novel colors for all Ss and half of each training 
group was shifted to the other task. Nonshifted Ss learned the transfer t 
4 faster than shifted Ss. In Experiment II, using the same design, similar 
results were obtained when Ss were trained on either brightness matching or 
brightness oddity and transferred to color matching or color oddity. Ap- 
parently pigeons are capable of learning the concepts same and/or different. 


In a now classic experiment Herrnstein 
and Loveland (1964) demonstrated that 
pigeons could learn to discriminate be- 
tween the presence of a human form 
(positive stimulus) and its absence (nega- 
tive stimulus) thus suggesting that pigeons 
are capable of learning complex concepts. 
Since a large number of quite different 
complex visual images (photographs of 
natural settings) were used in both the 
positive and negative set, the task could 
not have been mastered by learning a re- 
sponse to each of the individual stimuli. 
It is less clear, however, that there was 
not some common attribute, other than 
the concept "'person-present," which served 
to distinguish the positive from the nega- 
tive set. While there was no attempt to 
equate the positive and negative sets in 
all other respects, the authors were aware 
of this problem. For example, to show 
that color distribution was not the basis 
for the discrimination, they presented 
black-and-white photographs, with little 
disruption in discrimination. Siegel and 
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Honig (1970) and Poole and Lander (1971) 
extended these findings and ruled out a 
number of other artifactual cues upon 
which the discrimination might have been 
made. But, given the complexity of the 
stimuli used in all 3 experiments, the ade- 
quate test for, or control of, all factors 
other than person-present, may be virtually 
impossible. 

A more efficient means of measuring 
concept learning in pigeons is with the 
use of simple stimuli which vary along a 
single physical dimension and a concept 
which involves the relation among these 
stimuli. The matching-to-sample task po- 
tentially satisfies these conditions. 

In a matching-to-sample task an orga- 
nism is presented with a sample stimulus 
and 2 or more comparison stimuli. The 
task requires the selection of that com- 
parison stimulus which is the same as 
(matches) the sample stimulus. 

Cumming and Berryman (1961) trained 
pigeons on a 3-color matching-to-sample 
task (red, green, and blue). Responses to 
the sample stimulus which was presented 
on the center response key produced the 
2 comparison stimuli, one on each side 


key. A response to the comparison stim- 
ulus which matched the sample was 
reinforced. 


At least 3 different means of learning 
this task can be described. (a) Pigeons 
may learn to make a particular response 
to each different stimulus configuration 
(e.g., peck the left key when presented 
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with the stimulus configuration, red-left, 
red-center, green-right). With 3 sample 
stimuli the pigeons would learn to respond 
individually to the 12 different stimulus 
configurations. (b) Pigeons may learn to 
form a response chain, one for each of 
the 3 sample stimuli (e.g., peck red on the 
center then peck red on the side). (c) Pi- 
geons may learn a matching concept (i.e., 
peck that comparison stimulus which is 
the same as the center stimulus). 

In an effort to determine what was 
learned during matching training, Cum- 
ming and Berryman (1961) substituted 
yellow stimuli for the blue stimuli. They 
found that performance was disrupted 
(dropped to chance) only on trials in- 
volving a yellow sample stimulus, thus 
suggesting that the pigeons learned re- 
sponse chains. The pigeons responded ap- 
propriately to red and green sample stimuli 
but, since they had not learned a response 
chain for yellow, they performed at chance 
when yellow was the sample. Cumming 
and Berryman concluded that matching 
training did not result in the formation of 
a matching concept. However, an alter- 
native interpretation can be stated. If in 
fact there were competing response ten- 
dencies on yellow-sample trials, (a) to re- 
spond to that stimulus which had been 
reinforced in the past (ie., the non- 
matching red or green stimulus), and (b) 
to respond to the stimulus which matched 
the sample stimulus (ie, the matching 
yellow stimulus), then performance may 
have reverted to chance, either because 
the pigeons alternated between the 2 re- 
sponse tendencies or, more likely, because 
the pigeons, under conflict, adopted posi- 
tion-response tendencies. 

The purpose of the present experiments 
was to determine whether pigeons can 
acquire a matching (and/or oddity) con- 
cept if transfer involved only novel stimuli. 
Experiment I examined the extent to which 
color matching and oddity would transfer 
to novel colors. Since such transfer might 
be explained by simple stimulus general- 
ization, rather than complex concept learn- 
ing, Experiment II measured the extent 
to which matching and oddity training 
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would transfer to stimuli not fal on 
the dimension used during trainin To 
control for nonspecific transfer effe ialf 
of the Ss were initially trained « an 
oddity-from-sample task and shifi to 
the matching task with the novel sti uli, 
In the oddity task a response to the om- 
parison stimulus which is different m 
the sample stimulus, is reinforced. I8, 
learning an oddity concept should rd 
learning a matching task with new st li, 
and learning a matching concept si. ld 
facilitate learning a matching task with : w 


stimuli. Similarly, learning an oddity © a- 
cept should facilitate learning an od y 
task with new stimuli, and learnin à 
matching task should retard learning :n 
oddity task with new stimuli Exp: i- 
ment I involved 4 groups of Ss. I o 
groups were trained on a 2-color match g 
task, and one of these was transferred o 
another matching task, while the seco d 
was transferred to an oddity task, Sin i- 
larly, 2 groups were trained on 
oddity task; 1 group was transferre 
another oddity task and the second ws 
transferred to a matching task. The tra: 
fer task involved 2 novel colors and tran 
fer sessions were continued to 
learning. 


asst 


EXPERIMENT I 
Method 


Subjects. © Eight experimentally naive female 
white Carneaux pigeons approximately 1 yr. old, 
were maintained at 80% of their free-feeding 
weights throughout the experiment. 

Apparatus. A standard 3-key Lehigh Valley 
Electronics Pigeon Test Chamber (1519D) was used. 
In-line projectors behind each key (with G. E. 
No. 1820 lamps) projected red (Kodak Wratten 
filter No. 26), green (Kodak Wratten filter No. 60), 
blue (Kodak Wratten filter No. 38A), or yellow 
(Kodak Wratten filter. No. 9) fields onto each of 
the keys, Control equipment was located in the 
next room. Reinforcement consisted of 3-sec. 
access to Purina Pigeon Grains. 

Procedure, All Ss were magazine trained and 
shaped to peck a lighted side key on the first day. 
Shaping position (left or right) and color (red or 
green) were counterbalanced over Ss, On the 
second day, all Ss were given 48 continuous rein- 
forcements for responding to a single lighted key 
which varied in color (red or green) and/or position 
(left or right) following each reinforcement. On 
the third day, half of the Ss received matching 
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FIGURE 1. Comparison of matching and oddity 
performance during training: Experiment I. 


training while the other half received oddity train- 
ing. Each trial started with the onset of the 
sample stimulus on the center key. Five responses 
to the center key turned on the side keys. A single 
response to either side key terminated the trial 
and initiated a 5-sec. intertrial interval. Stimulus 
presentations for the 2 tasks were the same, only 
the "correct" response differed. Daily sessions 
consisted of 96 trials, counterbalanced for correct 
color and correct side. Twenty such training 
ns were given. 


sc 
sfer sessions began following the last day of 
training. All Ss were switched from red and green 


stimuli to yellow and blue stimuli. Half of the Ss 
continued with the same task as during training 
(matching or oddity) while the remaining Ss were 
shifted to the other task. Eleven transfer sess 
were run 


Results 


Training performance data are presented 
in Figure 1. The data suggest that the 
oddity task was learned faster than the 
matching task, though this difference was 
significant only on the first session, F (1, 6) 
= 6.40, p < .05. On the first session, 3 of 
the 4 oddity Ss performed above chance 
while 3 of the 4 matching Ss performed 
below chance. By the end of training no 
difference in performance between the 2 
groups was apparent. 

The transfer data are presented in 
Figure 2. The nonshifted Ss were those 
for which the training task and transfer 
task were the same; for the shifted Ss the 
training task and transfer task differed. 
A 2-way analysis of performance on the 
first transfer session indicated a nonsig- 
nificant training effect (matching vs. od- 
dity) (F <1), a nonsignificant transfer 
effect (matching vs. oddity) (F < 1), and 
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a nonsignificant Training X Transfer inter- 
action (shifted vs. nonshifted), F (1, 4) 
= 2.41, p > .05. 

The nonsignificant difference between 
shifted and nonshifted Ss on the first 
transfer session may be due to the fact 
that 4 of the 8 Ss were correct on exactly 
half of the trials. For the 4 birds whose 
performance differed from chance, how- 
ever (1 shifted S and 3 nonshifted Ss), 
performance of the shifted S was below 
that of the 3 nonshifted Ss. A second 
2-way analysis of performance pooled over 
the first 5 transfer sessions indicated a non- 
significant training effect (/ < 1) and a 
nonsignificant transfer effect, F (1, 4) 
= 1.80, p > .05, but a significant Train- 
ing X Transfer interaction, F (1, 4) = 7.89, 
p «.05. The interaction indicates that 
nonshifted Ss performed significantly better 
over the first 5 transfer sessions than did 
the shifted Ss. 


EXPERIMENT II 


Using the same design as in Experi- 
ment I, Ss were trained on either a match- 
ing or oddity task with 4 sample bright- 
nesses. Four brightnesses were used to 
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Ficure 2.’ Experiment I transfer data. (Shifted 


Ss are those transferred to a new concept—matching 
to oddity or oddity to matching; nonshifted Ss are 
those maintained on the same concept—matching 
to matching or oddity to oddity.) 
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FiGuRE 3. Comparison of matching and oddity 
performance during training: Experiment 11. 


maximize the likelihood that a matching 
and/or oddity concept would be formed. 
All 5s were then transferred to 2 colors 
with half of the Ss shifted in concept and 
the other half nonshifted. 


Method 


Subjects, Twelve experimentally naive female 
loft-reared pigeons approximately 1 yr. old were 
maintained at 80% of their free-feeding weights 
throughout the experiment, x 

Apparatus. "The apparatus was the same as 
that used in Experiment I. The training stimuli 
were obtained by lighting 10 unfiltered lamps 
(G. E. No. 1820, supplied by 24 v.), 1 unfiltered 
lamp, 1 lamp with a 1.0 neutral density Kodak 
Wratten filter, or 1 lamp with a 2.0 neutral density 
Kodak Wratten filter. The brightness values were 
13.0, 3.00, .30, and .03 ftc. as measured by a United 
Detector Technology Opto-meter (Model 40A) with 
à photome ric filter having transmission character- 
istics approximating the spectral sensitivity of the 
human eye. It should be noted that spectral 
sensitivity in pigeons is quite similar to that in 
humans (Blough, 1957), 

The transfer stimuli were the red and green 
values used in Experiment I equated for brightness 
at 1.0 ftc. All brightness values were checked at 
the end of the experiment and were found to be 
consistent with initial measurement. 

Procedure. On the first day, each S was shaped 
to peck either the right or left response key illu- 
minated with 1 of the 4 brightness values. Shaping 
stimuli were assigned to .Ss such that, when Ss 
were divided into 2 groups, 3 from each group had 
been shaped to the left and 3 to the right. Within 
each of the 2 groups 1 S had been shaped to the 
brightest stimulus value, 1 S to the dimmest 
stimulus value, and 2 Ss each to the 2 middle 
stimulus values. 

On the second day, all Ss were given 48 con- 
tinuous reinforcements for responding: to a single 
lighted key which varied in brightness and/or 
position (left or right) following each reinforcement. 
On the third day, half of the Ss received matching 
training while the other half received oddity train- 
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ing. With 4 sample brightness values an: dd 
brightness values (appearing either on 1 eft 
or right) for each sample, 24 different usd 
configurations could be presented. Each cx a- 
tion was presented 4 times in each 96-trial n. 
In all other respects training was the sam in 
Experiment I. All Ss received 30 training sı 18. 
"Transfer sessions began following the last of 
training. All Ss were transferred from bri ss 
stimuli to red and green stimuli match. r 
brightness. Half of the Ss continued wit! ir 
training task (matching or oddity) while + e 
maining Ss were shifted to the other task 3s 


received 6 transfer sessions. 


Results 


Training data for the matching an: 
dity tasks are presented in Figu 
There is little evidence for matc! 
oddity differences on Session 1 (F 
but over sessions matching perform: 
was superior to oddity, F (1, 10) = 9 
b < .05. 

Transfer data are presented in Figur 
A 2-way analysis of the data from 
first transfer session indicated a significa t 
transfer effect, F (1, 8) = 13.78, p<. 
and a significant Training X Transfer i 
teraction, F (1, 8) = 19.26, p< .01, bi 
a nonsignificant training effect (F < 1 
The transfer effect indicates that initis! 
performance on the oddity task was su 


Sessions 


FIGURE 4. Experiment II transfer data. 
breviations: M-M = matching to matching, M-O 
= matching to oddity, O-M = oddity to matching, 
and 0-0 = oddity to oddity.) 


(Ab- 
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perior to performance on the matching 
task. The Training X Transfer interaction 
indicates better performance for nonshifted 
Ss than for shifted Ss. A similar analysis 
of the Session 2 data indicated nonsig- 
nificant effects of training (F < 1), trans- 
fer, F (1,8) = 1.98, and Training X Trans- 
fer interaction (F <1). By the second 
transfer session, all but 1 S were performing 
above chance, and by Session 6 mean per- 
formance was above 95% correct. A 2-way 
analysis of the transfer data pooled over 
days indicated nonsignificant effects of 
training (F < 1), transfer, F (1, 8) = 3.67, 
p> .05, and Training X Transfer inter- 
action (F « 1). 

A 2-way analysis using sessions-to-cri- 
terion (90% correct) indicated similar re- 
sults, though the transfer effect was signifi- 
cant, F (1, 8) = 5.60, p < .05, with this 
measure. Thus, for the transfer task, 
oddity was apparently learned faster than 
matching. 


DISCUSSION 


The results of Experiment I indicate that 
matching and oddity training with pigeons 
will transfer better to new stimuli when the 
transfer task is the same as the training task 
than when the transfer and training tasks 
are different. The results of Experiment II 
indicate that these effects can be observed 
under conditions where generalization along 
a stimulus continuum (e.g, wavelength) 
cannot account for the transfer difference. 
The data suggest that the concepts same 
and/or different can be learned when pigeons 
master matching and oddity tasks, respec- 
tively. While the present design does not 
allow for the evaluation of these concepts 
separately, it does offer presumptive evidence 
for some form of “abstract” concept learning 
in pigeons. The word abstract is used here 
to identify transfer which cannot be explained 
by (a) stimulus generalization along physi 
cally definable dimensions or (b) nonspeci 
transfer effects. 

While Experiments I and ll are in basic 
agreement with regard to the transfer of con- 
cept learning, certain differences in the ob- 
tained results should be discussed. 

During original training in Experiment I 
oddity performance was superior to matching 
performance for the first training session. 
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In Experiment Il there was vitually no dif- 
ference between matching and oddity per- 
formance on the first training session, though 
over sessions superior performance was found 
on the matching task. These results are not 
inconsistent with the results of other studies 
which have reported comparisons of matching 
and oddity training. 

Using a very lax criterion for learning 
(16 out of 20 correct), Ginsburg (1957) found 
that pigeons learned an oddity task faster 
than a matching task. Berryman, Cumming, 
Cohen, and Johnson (1965) found that per- 
formance on an oddity task was initially 
superior to performance on a matching task, 
but after about 600 training trials a crossover 
occurred and matching performance continued 
to be superior from then on. Both studies 
used 3 different sample stimuli for both 
matching and oddity. While oddity perform- 
ance may be initially better than matching, 
the data suggest that matching may be learned 
faster when a moderate to stringent learning 
criterion is used. 

Initial superiority of birds trained on the 
oddity task (Experiment I in this study; see, 
also, Berryman et al., 1965; Ginsburg, 1957) 
may be due to an artifact of pretraining. 
Typically, pretraining consists of single-stimu- 
lus presentation of each of the 3 keys (only 
2 keys in the present studies). Then, at the 
start of training the center key is illuminated 
and responses to it illuminate the side keys, 
but for the first time these responses are not 
reinforced. It was observed in the present 
studies that during early training trials as 
many as 100 center key pecks occurred before 
a response was made to either side key (which 
terminated the trial). Undoubtedly, these 
nonreinforced pecks served to partially ex- 
tinguish responding to the center key, and 
it is reasonable to expect that this partial 
extinction. generalized to the same stimulus 
on the side key. Thus, better than chance 
performance on the oddity task can occur 
without any specific oddity training. 

During training in Experiment II the failure 
to observe initial superiority of oddity over 
matching may be due to the specific training 
stimuli used. It may be that the brightness 
differences in Experiment Il were not as 
salient as the color differences in Experiment I, 
and thus, initial extinction to the stimulus on 
the center key generalized nearly equally to 
the 2 comparison stimuli in Experiment IIl. 

The superiority of matching later in training 
suggests that in addition to whatever concept 
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learning occurs, pigeons master matching and 
oddity by learning response chains (e.g., peck 
red on the center then peck red on the side). 
The fewer response chains to be learned the 
easier the ta In general, with » different 
sample stimuli, matching requires learning 5: 
response chains (one for each sample stimu- 
lus) Oddity, however, requires learning 
n(n — 1) response chains (a separate chain 
for each sample stimulus combined with each 
possible odd stimulus). With 2 sample stim- 
uli, n = 2 and n = n (n — 1), matching and 
oddity should not differ in difficulty. Experi- 
ment I of the present study confirms this 
prediction. With 3 or more stimuli, n < n 
(n — 1) and matching should be easier than 
oddity. Berryman et al. (1965) and Experi- 
ment II of the present study confirm this 
prediction. Unfortunately the above model 
is insufficient to account for superior oddity 
learning (sessions to criterion) during transfer 
sessions in Experiment Il. Other factors 
apparently contribute to the relative speeds 
of matching and oddity learning. 

A second difference between the results of 
Experiments I and II was the occasion of the 
appearance of the concept-transfer effect. In 
Experiment I a significant transfer effect ap- 
peared only after the first transfer session; 
in periment II the same effect was 
nificant only om the first transfer session. 
The difference in transfer results between the 
2 experiments was probably due to the dif- 
ference in transfer stimuli (yellow and blue 
in Experiment I; red and green in Experi- 
ment IT), though it is possible that differences 
in either S pool (strain differences) or dif- 
ferences in original training, i.e., training 
stimuli (colors in Experiment I and bright- 
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ness values in Experiment II), iber of 
training stimuli (2 in Experiment ! and 4 in 
Experiment II), and number of t: ng ses- 
sions (20 in Experiment I and 30 in Experi- 
ment II) may have affected the transfer 
results as well. The fact that 2 Ss in Ex 


periment I responded to the yellow com 
parison stimulus on every trial of the first 
transfer session, while no S in Experiment II 
showed a complete color preference, suggests 
that differences in transfer stimuli may have 
been responsible for the observed differences 
in transfer results. 
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DYNAMIC PROCESSES IN STIMULUS INTEGRATION THEORY: 
EFFECTS OF FEEDBACK ON AVERAGING OF MOTOR MOVEMENTS? 


l'L. NORMAN* 


sity of California, San Diego 


A stimulus integration experiment was conducted which provided feedback to 
S ina task requiring the averaging of 2 fixed-movement lengths. Feedback 
indicated the relative position of each response to a weighted average of the 
lengths. Changes in the integration rule as a function of feedback were indi- 
cated by changes in the weighting parameters of a general weighted average 
model, Feedback appropriate to the numer average reduced an initial 
bias toward the longer distance, while no changes were noted for a group that 
received no feedback. When feedback placed greater weight on the longer or 
shorter movement in a pair, tematic chan were observed in Ss’ re- 
sponses, but not when feedback placed greater weight on the movement in the 
first or second ordinal position, These results indicate that in some cases the 
integration rule may be a function of dynamic processes such as feedback. 


A judgment may be viewed as a response 
dependent on a set of stimuli, and the 
observer, an integrator of stimulus informa- 
tion. Stimulus integration theory and func- 
tional measurement proposed by Anderson 
(1962, 1970) has studied integration be- 
havior in a number of tasks. The primary 
assumption of this theory is that the judg- 
ment is an algebraic function of the subjec- 
tive stimulus values, s, and weights, w, on 
the subjective stimulus values. 

While the observer may be viewed as an 
integrator of information, it must also be 
remembered that he is an organism capable 
of learning, and that the judgment response 
may also be a function of practice and feed- 
back. It is likely that such feedback affects 
the integration function. Slovic and Lich- 
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tenstein (1972) pointed out that little or 
no research has been done on learning to 
integrate stimuli and on the effect of feed- 
back in stimulus integration paradigms. 
However, experimentation on dynamic pro- 

ses has been conducted in Bayesian 
on-making and multiple-cue learning 
tasks. Methodologically, integration theory 
has an advantage over correlational para- 
digms in that it uses subjective scale values 
rather than physical values. Other advan- 
tages have been discussed elsewhere 
(Anderson, 1962, 1972). 

The present research is an attempt to 
extend the investigation of the effects of 
feedback and dynamic processes to an 
integration task involving the averaging 
of motor movements, The Ss were blind- 
folded and required to make several suc- 
cessive controlled motor movements and 
then make an additional linear movement 
rage of the previous 
fixed movements. This task has been 
studied by Levin, Craft, and Norman 
(1971) and by Levin, Norman, and Dolezal 
(1973). These authors noted that a possi- 
ble advantage that this task may have over 
other integration tasks is that stimulus 
magnitude is manipulated along a con- 
tinuous dimension and responses are exe- 
cuted along the same dimension. 

Levin et al. (1971) found that the data 
could be approximated by an ordinal- 


to represent the av 
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dependent weighted average model, 
Ra = È wisi, [1] 


where R4 is the averaging response, w; is 
the weight varying with ordinal position on 
the subjective value s;, such that Yw;,=1. 
However, there was a significant interac- 
tion between stimulus levels that could not 
be accounted for with an averaging model 
with constant weights. Levin et al, (1973) 
confirmed that the interaction between 
stimuli within a set was highly reliable. 
Since the response scale appeared reliable, 
a different averaging model was proposed 
that weighted stimuli differentially as a 
function of their value. This model can 
be expressed as 


Vis p Visi 


DEO [2] 


where R;; is the magnitude of the averaging 
response for stimuli i and j, and where vi 
and v; refer to weights on the subjective 
stimulus values varying with the values of 
the stimuli. Furthermore, the restriction 
is added that the weights sum to 1. The 
fact that both order-dependent weighting 
and value-dependent weighting may be 
present in this motor task make it an ideal 
one to study. 

The following 3 issues 
effect of feedback on the 
were investigated in the present study: 
(a) effects of "correct" numerical feedback 
vs. no feedback, (b) effects of feedback on 
order-dependent weighting, and (c) effects 
of feedback on value-dependent weighting. 
"These effects may be localized in the ac- 
curacy of the subjective scale values 
changes in weight parameters, 

One group was given correct feedback 
using the numerical average (NA) and 
another group was given no feedback (NF). 
A comparison of these groups should indi- 
cate fundamental differences between feed- 
back and no-feedback conditions. Further- 
more, these groups are necessary control 
conditions for other groups. 

In a second comparison, one group was 
given feedback with respect to a primacy 
weighted average (PWF) and another group 
was given feedback with respect to a re- 


Ri; = 


relating to the 
integration rule 


or in 
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cency weighted average (RWF). Since y 
averaged only pairs of distances, a primas 
effect is defined as greater weighting on th 
first stimulus and a recency effect is define 
as greater weighting on the second stimulus, 
The ratio of the order-dependent weights 
on the feedback trials was set at 2:1 for 
Group PWF and 1:2 for Group RWE; 
according to the model in Equation 1. The 
observed weights may be compared on both? 
feedback and test trials. 
In a third comparison, feedback from a 
value-dependent weighted average was pro- 
vided where longer distances received 
greater weight for one group (X oup LWE) 
and where shorter distances received greatel 
weight for another group (( oup SWF). 
Since the results of the 3 experimenta 
manipulations may follow either an order- 
dependent weighted average or a value= 
dependent weighted aver 


One procedure is to 
use a general model that includes the other 
models as special cases. Then the specific: 
models can be statistically compared with 
the general model. A general model which 
includes the order-dependent weighted av- 
erage model, the value-dependent weighted 
average model, and the simple numerical 
average model as special cases is 


Rye Vus (Ll = ws [3] 
wi + (1 — )v; 

where is a weight applied to the first 
stimulus in the pair, 1 — w is a weight 
applied to the second stimulus in the pair, 
and vi is the weight component for the scale 
value s; The sum of all value-dependent 
weight components is set equal to 1. If 
the order-dependent weights are equal, 
Equation 3 reduces to a value-dependent 
weighted average model; and if the value- 
dependent weights are equal, it reduces 
to an order-dependent weighted average 
model. If both sets of weights are equal, it 
reduces to a simple numerical average 


Marruop 
Design 


The Ss produced averages of pairs of fixed move- 
ments (stimuli, S}, S2), where each length was 20, 
40, or 60 cm. during pre- and postfeedback trials 
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TABLE 1 
VALUES OF AVERAGES USED FOR FEEDBACK GROUPS 


Group 


E! NA | 


SWE LWF 


S«-15 sess as lsemts P 


5 |S = 15 [S2 9 30 [S1 45. sisse [S1 245 


15 15.0 | 22.5 | 30.0 | 15.0 | 20.0 | 25.0 | 15.0 | 25.0 | 35.0 | 15.0 | 21.0 | 22.5 | 15.0 | 25.0 | 37.5 
30 22.5 | 30.0 | 37.5 | 25.0 | 30.0 | 35.0 | 20.0 | 30.0 | 40.0 | 21.0 | 30.0 | 35.0 | 25.0 | 30.0 | 39.0 
5.0 | 35. 45.0 | 25 “0: 35.0 | 45.0 | 22.5 | 35.0 | 45.0 | 37.5 | 39.0 | 45.0 


0 | 40.0 


t A 
= shorter distance: 
S: 


and 15, 30, or 45 cm. during feedback tria All 
9 pair combinations were repeated 3 times in a 
randomized blocks presentation to obtain prefeed- 
back baseline data. Four randomized blocks of 
the 9 pair combinations were presented for feedback 
trials and 3 blocks for postfeedback trials. 

During feedback trials, Ss were told the distance 
(rounded off to the nearest centimeter) that each 
response was over or under the average, according 
to the prescribed model. Feedback given was 
appropriate to the numerical average for Group 
NA, a primacy weighted average where w = 2/3 
for Group PWF, a recency weighted average where 

= 1/3 for Group RWF, and a value-dependent 
weighted average (see Equation 2) where ts = 1/2, 
vs = 1/3, and vs 1/6 for Group SWF, and 
where vis = 1/6, vs = 1/3, and va; = 1/2 for Group 
LWF.. The prescribed averages for each condition 
isted in Table 1. Group NF produced averages 
uns this stage, but received no feedback. 


Apparatus 


The apparatus was the same as that used by 
Levin et al. (1971), Two free-moving, 5-cm.-sq. 
were mounted on a wooden meter 


e by E when defining the length of the fixed 
movements and could be moved out of the way 
when § was making his averaging response. An 
additional sliding scale was added for the present 
experiment so that E could read directly the dis- 
tance that S's response was over or under the aver- 
age prescribed by the feedback requirements. This 
E zeroed in on the prescribed average before 
S made his averaging response. The apparatus was 
placed on a table parallel to the coronal plane, and a 
shield was placed over the meter stick. The .5 was 
blindfolded so that responding could not be based 
on visual cues, 


Procedure 


On each trial, S was required to complete the 
fixed movements Sı and $ and then move the 
sliding block a distance which he estimated to be 
the average of the 2 fixed movements. Details of 
this procedure are given by Levin et al. (1971). 


numerical average, PWF = primacy weighted average, RWF = recency weighted average, SWF 
eiving greater weight, LWF = longer distances receiving greater weight, Si = stimulus of first distance, 
stimulus of a second distance, and 15, 30, and 45 refer to distances in centimeters, 


The time to complete each fixed movement and 
return to the starting position was approximately 
8 sec. Five seconds were allowed for .5 to produce 
the average before he was to hold the block steady 
so that the distance could be read and the feedback 
given during the feedback trials. Responses were 
recorded to the nearest millimeter. The distance 
(rounded off to the nearest centimeter) over or 
under the prescribed average was then read off by Æ. 


For example, if the prescribed average was 37.5 cm, 
and the response was 40.1, S heard "over by 3 cm,” 
The interval between trials was 5 sec. 

During the feedback trials, S was told that he 


would learn a new type of average that might or 
d 


‘average. This 
ional average." 
feedback to in- 


might not differ from the “regula 
new average was called the “navi 
The S instructed to use th 
fluence his subsequent responses so that he could 
“home in on navigational averages." Two 5-min. 
breaks were inserted—one following prefeedback 
trials and one following feedback trials. ally, 
a short questionnaire was administered, The Ss 
were asked to explain in their own words what the 
navigational average was, how they made the 
averages, and if they used any special strategy. 

Subjects. h of the 6 groups was composed of 
10 Ss (5 female and 5 male) recruited from intro- 
ductory psychology classes at the University of 
Iowa. 


RESULTS 


The mean averaging responses on the pre- 
and postfeedback trials are graphed in the 
successive panels of Figure 1 for each group. 

All main effects of Sı and Sẹ were highly 
significant for all groups at all stages, The 
steepness of the lines reflects the effect or 
weighting of S; in determining the average, 
and the degree of separation of the lines 
reflects the effect or weighting of Sj. Devia- 
tions from parallelism indicate the extent 
of value-dependent weighting. Lines that 
converge with increased stimulus magni- 
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Prefeedbach Trials 

On prefeedback trials, a very slight con- 
verging interaction can be noted for most 
of the groups. The difference between the 
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pairs 60-20 cm. and 60-60 cm. is less than 
the difference between 20-20 cm. and 
20-60 cm. The form of this interaction is 
consistent with the interpretation that 
longer distances received greater weighting. 
When prefeedback trials were analyzed for 
initial group differences, all between-groups 
effects were nonsignificant. The Sı X S: 
interaction was highly significant, F (4, 216) 
= 22.52, p < .001. Furthermore, this in- 
teraction occurred in a large majority of 
the individual S analyses. 


Learning Curves on. Feedback Trials 


Learning curves were constructed for the 
4 feedback blocks on Stage 2. Mean abso- 
lute error with respect to the average de- 
signated as correct was used as an overall 
measure of learning for the feedback groups. 
For Group NF, mean absolute error was 
taken with respect to the numerical aver- 
age. Figure 2 shows these curves. No 
systematic changes occurred for Groups 
NF or NA, through NA tended to be more 
accurate than NF. No decrease occurred 
for Groups PWF or RWF; however, mean 
absolute error for Groups SWF and LWF 
showed trends to decrease with blocks of 
trials. With mean absolute error as the 
dependent variable, the blocks effect was 
significant for Group LWF, F (3:521) 
= 3.72, p < .05, but not for Group SWF. 
It should be noted that it is not at all clear 
that performance stabilized by the end of 
the feedback stage. This fact limits the 
interpretation of data on the postfeedback 
trials to the levels of learning evidenced on 
the feedback trials. 


Postfeedback Trials 


Responses for Group NF did not change 
from pre- to postfeedback trials. For 
Group NA, the converging Sı X Se interac- 
tion observed on prefeedback trials did not 
occur on postfeedback trials. This change 
would be expected due to feedback. 

No significant changes occurred on the 
postfeedback stage for Group PWF. Had 
Ss learned and used the primacy weighted 
average on postfeedback trials, the effect 
of S, should have increased from pre- to 
postfeedback trials and the effects of Se 
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E 
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Fi 
225; 
20 
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REPLICATION 
Ficure 2. Mean absolute error as a function 
of replications of control trials for the no-feedback 


(NF) group and replications of feedback trials for 
the following groups: numerical average (NA), 
primacy weighted average (PWE), recency weighted 
average (RWE), shorter distance receiving greater 
weight (SWF), and longer distance receiving greater 
weight (LWF). 


should have decreased. This did not hap- 
pen. Similarly, for Group RWF, if Ss used 
a recency weighted average on postfeed- 
back trials, the effect of Se would have 
increased and the effect of Sı would have 
decreased from pre- to postfeedback trials. 
Though there were trends in these direc- 
tions, they were not significant. A post hoc 
analysis for Group RWF compared the 
mean of the pairs 20-40 cm., 20-60 cm., 
and 40-60 cm. with the mean of the pairs 
40-20 cm., 60-20 cm., and 60-40 em. The 
mean of the former pairs was greater than 
the mean of the latter, indicating recency, 
t (9) = 2.31, p < .05. However, only 2 of 
the 10 Ss showed marked effects in the 
predicted direction. 

As seen in the Group SWF panel in 
Figure 1, a large interaction occurred on 
postfeedback trials. The form of this inter- 
action differs from the converging interac- 
tion noted on prefeedback trials. ‘The lines 
tend to diverge as stimulus magnitude in- 
creases. This interaction is consistent with 
an integration model that weights shorter 
distances more than longer distances. The 
3-way Si X S: X Stage interaction was 
highly significant, (4, 36) = 10.89,p < 
.001. Individual S$ analyses showed that 
this change in the direction of the interac- 
tion occurred for most Ss (8/10) and 
reached significance for 3 Ss (p < .05). 
The overall mean averaging response de- 
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creased from 38.8 cm. on prefeedback tria 
to 36.6 cm. on postfeedback trials, F (1, 
= 8.93, p «.05. The value-dependent 
weighted average model predicts that th 
averages of heterogeneous pairs shoi 


decrease while homogeneous pairs should 


y weighted average, 


pa 
g 
č 
* 
É 
$ 
H 
k 
Mi remain constant from pre- to postfeedba 
| FH trials. "Therefore, a post hoc comparison 
| Tag was made on the homogeneous pairs, Ni 
gee É z E 
t < | 5S change in homogeneous pairs occurred with 
| Su |a | 2*8 b 
| | ge |” gz stage, F (1, 9) < 1. 
on Sar `. ~ - * H 
Fig FE For Group LWF, the converging intei 
| | £55 action increased only slightly from pre- 
| 5 : postfeedback trials, but this trend did not 
| | §tz reach significance. The majority of Sg 
| | Bez showed no change with stage. Howe 
us PES! the overall mean increased from 40.0 c 
| St ice : 
| $z | RES on prefeedback trials to 42.8 cm. 
d S | ats 


feedback trials, F (1, 9) — 19.87. 
The value-dependent weighted 
model with greater weight for longer di 
tances predicts that responses to hetero? 
geneous pairs should increase while re poi 
ses to homogeneous pairs should remai 


Postfeedback trials 


oe | the same from pre- to postfeedback trials: 
u | Again a post hoc test was made to see if 
E | any changes occurred for the homogeneous: 


pairs alone. A significant increase was 
found of about the same magnitude as th 
overall increase (2.8 cm.), F (1, 9) = 35.69) 
b «001. Thus, it appeared that feedback 
did not increase the value-dependent weigh- 
ting of longer distances as much as it 
caused an increase in the absolute magni- 
tude of the averaging response, 


Model Analysis 


Group 
NF 


The general model (Equation 3) provides 
for both value-dependent and order-depen- 
dent weighting. As noted earlier, these 
parameters may be systematically re- 
stricted to simplify the model. Parameter 
estimates for these models were obtained 
by the method of maximum likelihood with 
the assumption that observations were 
normally distributed with heterogeneous 
variances. 

Given maximum likelihood estimates, 
the restricted models may be statistically 
compared to the general model by likeli- 
hood ratio tests (Kendall & Stuart, 1961). - 
The tests for each model are not indepen- 
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Postfeed hack trials 
Parameter ( au trials EUR SS 
NF NA PWF RWF 
n 19.15 20,79 20.89 19.79 20,31 
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Se 58.07 57,33 59.91 5187 5944 
to ELI 212 
L^] MT 350 
Teo 372 IS 
w A29 
P» Il-w S 


Note, The parameter f 91 — 
dived a! 
ving E mer ‘weight! s 9 subjective stimulus values, # = 
and 60 refer to distances In centimeters, 


Abbreviations 


- dent of each other, However, if all possible 
tests of restrictions are conducted to form 
a matrix of results, each restriction can be 
evaluated in combination with all other 
parameter restrictions, The likelihood 
function was maximized for the data col- 

- lapsed over replications and groups on pre- 

— feedback trials and for the data collapsed 

- over replications on postfeedback trials for 
ch group. 
— "Tn all cases, the general model fit the 
© data extremely well, Theoretical means 
did not deviate appreciably from the ob- 
served means, Results of the likelihood 
ratio tests are presented in Table 2. 

d Prefeedback trials. The restricted model 
with w = 1/2, implying a lack of order 
effects, does not fit significantly worse 

than models including an order-dependent 
weight. This is indicated in Table 2 by 
the fact that values in the column for re- 

7 stricted weights are not significant relative 

to Column 1. However, value-dependent 

d. weights are a necessary part of the model 

| as indicated by the large increase in cell 

entries from Row 1 to Row 2. Table 3 

lists the parameter estimates for the most 

parsimonious model not significantly worse 

" than the general model. The value-depen- 
dent weights for prefeedback trials covary 
with the scale values. This indicates that 
Ss weighted longer distances more than 
shorter distances. The theoretical means 
for this model did not deviate appreciably 
from the observed means. 


Postfeedback trials. On postfeedback 


fe, / : NF = no feedback, NA = numerical a 
verage, RWE,= recency weighted average, SWE = shorter distances receiving greater "s L 
value-dependent weig! 


, PWF 
= longer distances 


it, w = orderdependent weight, and 20, 40, 


trials for Group NE, the simplest model 
was the value-dependent weighted average 
model since entries in Row 2 of Table 2 
are much larger than Row 1. The value- 
dependent weights listed in Table 3 again 
covary with scale value, Thus, for Group 
NF the model did not change from pre- to 
postfeedback trials and the value-depen- 
dent weights remained approximately the 
same, 

Table 2 indicates that for Group NA, 
both the value-dependent and order-depen- 
dent weights are not necessary Assumpt ions 
for the model to fit the data, The general 
model does not fit significantly better than 
a simple numerical average model, Thus, 
in this group the feedback reduced value- 
dependent weighting and helped Ss to give 
responses appropriate to the correct numeri» 
cal average. ‘Though a change in the model 
from pre- to postfeedback trials occurred, 
the precednig analysis does not constitute 
a test of the significance of such a change. 

Table 2 indicates that for Group PWF, 
as for Group NA, the simple average model 
fts the data no worse than the general 
model. Feedback did not affect order-de- 
pendent weighting as expected; hence, it 
was not a necessary requirement in the 
model. For Group RWF, large and signifi- 
cant entries in Table 2 in Column 2 indicate 
that order-dependent weighting is a neces- 
sary requirement of the model. However, 
value-dependent weighting is not necessary. 
Table 3 gives the parameter estimates for 
this order-dependent weighted average 


aa e 


model. The value of w indicates that the 
second (most recent) stimulus received a 
greater weighting. Thus, the feedback ap- 
parently influenced S's responses in this 
group to weight the second distance more. 
It should be noted again that the group 
effect here is primarily localized in the data 
of a few Ss. 

For Groups SWF and LWF, large entries 
occur in Table 2 when the value-dependent 
weights are constrained to be equal, but not 
when the order-dependent weighting is 
restricted. This indicates that a value- 
dependent weighted average model is nec- 
essary to fit the data for these groups. The 
value-dependent weights listed in Table 3 
indicate that shorter distances received 
greater weighting than longer distances for 
Group SWF. This model is in line with the 
feedback given to this group and is con- 
sistent with the highly significant diverging 
interaction that was obtained on postfeed- 
back trials, Unlike Group SWF, the longer 
distances received the greater weighting in 
Table 3, as prescribed by the feedback. 
"These weights are more extreme than those 
on prefeedback trials, but, as noted earlier, 
the converging S; X S, interaction did not 
increase significantly, 


Subjective Scale Values 


The subjective scale values estimated for 
pre- and postfeedback trials for each group 
are also listed in Table 3. Overall, these 
values are close to the physical distances 
although underestimation occurred for 40- 
and 60-cm. distances. Very little change 
in the scale values occurred from pre- to 
postfeedback trials except for Group LWF. 
As noted earlier, this group tended to in- 
crease all responses on postfeedback trials. 
This is reflected in the increased scale values 
for Group LWF. 

All the stimulus scales, while differing 
slightly, should be at least linearly related 
for scale invariance to hold (Anderson, 
1970). Likelihood ratio tests were con- 
ducted for each group comparing the hy- 
pothesis that different scales were needed 
for pre- and postfeedback trials with the 
null hypothesis that the scales were linearly 
related. In all cases the null hypothesis of 
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scale invariance held as indi v thi 
small and nonsignificant chi-sq iue 
DISCUSSION 

Previous research has shown th inte. 
gration function often depends o: parti- 
cular task, In most experiment char 
acteristics have generally remained stant, 
and the integration rule that Ss u tatic. 
Feedback characteristics had a gre ct in 
the present study on averaging res; and 
complex changes in the response ; n res 
sulted. These results indicate tha inte 
gration rule may also be a functi: feed- 
back and practice, and that the con orial 
rule may be learned. The study of thi rning 
process may help to (a) explain wh rtain 
integration rules are used, and (b) eate 
the cognitive processes by which Ss 1ally: 


combine stimuli. 
As found in previous studies (Levi: al. 


1971; Levin etal., 1973), no order efi. — ob- 
tained with only 2 stimuli on prefe back 
trials. Similarly on postfeedback tri for 
each group except RWF, order-depc: dent 
weighting did not obtain. In Group ! WF, 
order-dependent weights were necessary ind 
indicated a greater weighting of the most 
recent stimulus as predicted by the feedback 


to this group. Individual § analyses, however, 
indicated that this effect was evidenced in 
only a few Ss. On prefeedback trials, value- 
dependent weights varied directly with stimu- 
lus magnitude. This is consistent with the 
form of the S, X Se interaction found on these 
trials. On postfeedback trials, it was found 
that in Groups NA, PWF, and RWF, value- 
dependent weights could be eliminated from 
the model. This was not true for Groups NF, 
SWF, and LWF. Value-dependent weighting 
did not change appreciably for Group NF 
from pre- to postfeedback trials, For Group 
LWP, the weights which varied directly with 
stimulus magnitude became a little more ex- 
treme, while for Group SWF, the value- 
dependent weights changed appreciably from 
directly varying with stimulus magnitude on 
prefeedback trials to indirectly varying on 
postfeedback trials. 

In the present experiment, one may con- 
sider the 2 distances, S; and Se, to be the only 
stimuli that Ss integrated on each trial. In 
other tasks to which integration theory has 
been applied, such as probability learning, 
feedback over a number of trials constitutes 
the set of stimuli to be combined (e.g., Freid- 
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man, Carterette, & Anderson, 1968; Levin, 
Dulberg, Dooley, & Hinrichs, 1972). In the 
present task, feedback on previous trials may 
be viewed as yet another set of stimuli to be 
integrated along with Sy and S». This process 
may take on several forms. ach averaging 
response is a function of Si, S», and feedback 
on the previous trials, One model may assume 
that S5 integrates Sı and S» as he did on the 
first trial and then adjusts or biases his re- 
sponse as a function of previous feedback. 
Models like this one bias the integrated re- 
sponse; they do not change the rule by which 
Sı and S» are integrated. 

Other mod may assume that feedback 
alters the integration rule itself or at least the 
parameters in that rule. For a value-depen- 
dent weighted average model like Equation 3, 
the weights may be a function of feedback. 
These 2 types of models represent the funda- 
mental psychological distinction between re- 
sponse bias and integration processes. While 
these 2 processes are conceptually distinct in 
operation, they are often confounded and 
inseparable. As far as the integration rule 
represents a cognitive process, it is encouraging 
when Ss report a specific model or process 
that they used. This, however, does not 
guarantee that they did indeed use that model. 
Research on the specific effects of feedback 
may help to separate the effects of bias from 
the integration process. 

In some cases, feedback appeared to change 
the form of the integration rule and not re- 
sponse bias when responses to different stimu- 
lus pairs changed in different ways. In Groups 
NA and SWF, the responses to heterogeneous 
pairs decreased from pre- to postfeedback trials 
while the responses to homogeneous pairs re- 
mained constant. On the other hand, for 
Group LWF, responses to both heterogeneous 
and homogeneous pairs increased in magnitude. 
Feedback affected responses to all the pairs 
in the same way, and the integration rule 
did not change substantially as a function of 
feedback. Feedback either tended to aug- 
ment the subjective scale values or cause a 
bias in the response function. Such a result 
may be due to a possible generalization of 
feedback to averages of homogeneous stimulus 
pairs. The Ss in Group LWF were told that 
they were under the prescribed average a large 
percentage of the time. Since initially homo- 
geneous pairs were underestimated, Ss may 
have tended to increase all averaging responses. 
This generalization effect did not occur for 
Group SWF. In this group, Ss were almost 


always told that responses to heterogeneous 
pairs were over the prescribed average while 
responses to homogeneous averages were usu- 
ally under because of the tendency toward 
underestimation. 

Feedback for Groups PWF and RWF had 
little or no effect on changing the integration 
rule on the feedback trials. The number of 
explanations for a null effect are of course 
unlimited. However, one plausible hypothesis 
again is that feedback may have generalized 
between distance pairs. For example, in 
Group PWF, Ss were usually told that their 
responses were over for the 20-60-cm. pair 
and under for the 60-20-cm. pair. Feedback 
generalization would functionally lead to the 
numerical average. Indeed, the pattern of 
responses for these 2 groups was most like 
that for Group NA. Several Ss in Groups 
PWF and RWF reported that they thought 
there was a random variation in the new 
average they were to learn but that it was 
otherwise the same as the numerical average. 
Thus, Ss generally did not discover that order 
was a relevant cue in the integration function. 

Integration rules may be viewed as con- 
ceptual rules (e.g., Adams, 1953; Summers & 
Hammond, 1966; Uhl, 1963). Such con- 
ceptual rules are different from those that 
deal only with the presence or absence of 
discrete attributes since the components are 
continuous. The conceptual rule used by S 
may be modeled by a mathematical function 
which adequately describes the relationship 
between the stimulus sets and the matrix of 
responses. While Ss in Group SWF clearly 
learned a new conceptual rule, and indicated 
this verbally following the experiment, Ss in 
other groups did not. Similar to studies on 
rule learning (e.g., Conant & Trabasso, 1964; 
Hunt & Hovland, 1960), this evidence may 
indicate that some conceptual rules are learned 
with greater ease than others. This may be 
because some aspects of the stimulus set are 
viewed as naturally contributing to the inte- 
gration rule while others are not. For example, 
it is very uncommon for sequential order to 
contribute to a measure of central tendency. 

In conclusion, the present results are evi- 
dence that dynamic processes occur in stimulus 
integration as a function of feedback. Feed- 
back may have a number of effects on the 
integration rule in terms of weighting param- 
eters or response bias. A careful distinction 
must be made between the dynamic processes 
of response bias and integration rule learning 
similar to the distinction in psychophysics 
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between decision and sensory processes. The 
present study supports the notion that integra- 
tion rules are concepts and as such they may 
be learned and modified. 
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VISUAL IMAGERY FOR WORDS: 
THE HEBB TEST! 


ROBERT J. WEBER? anp ROGER HARNISH 


Oklahoma State University 


Two experiments were conducted to examine Hebb's test of the “picture 
theory" of visually imagining words. Experiment I examined retrieval from 
visual image representations of words by using a method of probing for the 
spatial properties of the ith letter in an imagined word. Evidence was ob- 
tained for a limited-capacity visual image system that can represent 3- and 
5-letter words, in a spatially parallel manner, as efficiently as they can be 


represented in a visual percept sy: 
tion of visual image representation: 
generated simultaneously, and g 
to right. The results of the 2 e; 


em. 
of words. All parts of a word were not 
ati 


me 


mined the genera- 


Experiment II 


1 generally did not proceed from left 
nts were interpreted as indicating the 


existence of a visual image operating memory of limited letter capacity. The 
contents of the memory can be examined in much the same way as visual 


percepts. 


Hebb (1966, 1968, 1972) and Woodworth 
and Schlosberg (1954) describe at length 
what we might term the "picture theory" 
of visual imagery. According to this 
theory, a self-generated mental picture or 
image can be examined in much the same 
way as a physical picture is examined by 
perception. For example, many people 
report visual images of whole lines or 
stanzas of verse, or at least of single words. 
The presumption is that these images are 
spatially parallel, i.e., all words or letters 
in the composite image are simultaneously 
available. Hebb proposes a test of the 
picture theory. The test contains a general 
requirement that there exist analogs be- 
tween perception and imagery for the 
processing of spatial information, and a 
particular requirement that these analogs 
reveal themselves in a spelling test. To 
experimentally verify the picture theory, 
Hebb feels it is necessary to show that .Ss 
can spell imagined words in a backward 
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direction as fast as they can in a forward 
direction. Since he finds that the bac 
ward spelling of imagined words is much 
slower than the forward spelling, Hebb 
(1966) concludes that whatever the nature 
of a visual memory image, it is not like 
having a mental picture which can be 
examined by the ‘‘mind’s eye": 


the subjective impression that one can "look at” 
one's image freely own by objective test to be 
wrong [pp. 43-44]. The fact that the person with 
visual imagery can only "see" the letters of a word 
in left-to-right order shows clearly that the memory 
image... is a series of events in a particular order, 
not a picture whose parts could be looked at in any 
order (or simultaneously) [p. 46; italics added]. 


It is the contention of this article that, 
with qualifications, the picture theory of 
visual imagery is at least partly correct, 
and that Hebb's (1966) general requirement 
(that there exist analogs between percept 
and image processing) is a perfectly reason- 
able test of the theory. However, we feel 
that Hebb's particular requirement, a com- 
parison of forward and backward spelling 
times, is not appropriate, for the following 
reasons. First, a spelling test is not neces- 
sarily visual. The S may simply draw on 
a highly practiced and extensively trained 
verbal/speech representation of the word 
when spelling it. Second, the spelling test 
may involve an assymetrical use of verbal 
and visual image representations. The S 
may indeed have a visual image representa- 
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tion of the word but may still use his 
familiar verbal/speech representation for 
spelling in a forward direction, while 
possibly dropping back to his visual repre- 
sentation for the unfamiliar backward spell- 
ing. The more rapid forward spelling would 
then simply follow from the fact that, for 
sequential items, speech is a more rapid 
process than visual imagery (Weber & 
Castleman, 1970). Third, the visual per- 
ception system may have a substantially 
greater spatially parallel processing capa- 
city than the visual imagery system. A 
suggestion that the visual imagery system 
may be of very limited capacity has been 
put forth by Weber, Kelley, and Little 
(1972). If visual imagery is of more 
limited capacity than visual perception, a 
spelling test using long words, e.g., uni- 
versity, may be biased in favor of perceptual 
performance. Fourth, even if capacity 
limitations were equal for the image and 
percept systems, the way in which items 
(letters) sequenced in and out of the 2 
systems might well differ (Weber et al., 
1972). 

In the present study, an attempt was 
made to eliminate some of the difficulties 
associated with Hebb's (1966) spelling test. 
First, to make more certain that S, when 
requested to visually imagine a word, does 
indeed use a visual representation, these 
experiments required S to respond to visual 
spatial properties of the letters in a word. 
In particular, we asked S to imagine words 
with lowercase printed letters and to 
classify a given letter for the spatial 
property of vertical height. "Those letters 
that are vertically large (b, d, f, Risa Y) 
would fall in the yes category and all other 
letters would fall in the no category (Weber 
& Castleman, 1970; Weber & Kelley, 1972; 
Weber et al., 1972). This procedure allows 
for a set of converging factors which point 
to the conclusion that a visual image system 
rather than a verbal system is involved. 
The converging factors are: (a) the ability 
of Ss to correctly classify the imagined 
letters for vertical height, (b) frequent sub- 
jective reports from Ss that they do visual- 
ize during the task, and (c) the finding 
of highly distinctive rates of processing for 


ROBERT J. WEBER AND ROGER H 


visual image 
tional sets ( 
Second, to 
representation 
avoid building 
processing, a 
nique was used 
this technique, 
is indicated by 
position define: 
letter correspo: 
is then classifi« 
The probe te 
direct compari 
representation 
capacity effect 
different length 


speech instig 
Castleman, 1970) 
naximum parallel 
ual image and 
cessary sequential 
probe tec 
ent studies. T 
osition in a word 
robe, with probe 
to right. The 
probed position] 
ıl height by 
allows for 
ept and ima, 
xamination 
nulus words 
nted. 


ort 
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Method 


Subjects. Sixteen experimentally naive volunt 
Ss with normal or corrected vision were tested iW 
dividually, Each S was previously screened in @ 
manner similar to that of Weber and Castleman 
(1970) for ability to respond toa visua alphabe 

Stimuli and procedure. The test stimuli we 
nouns and adjectives with a "Thorndike-Lorge 
(1944) count of 5 or more per 100,000 occurrences: 
A total of 42 different words were used. Twelvé 
were test words of 5 letters in length and 12 were 
test words of 3 letters in length. In addition, ther& 
were 8 filler words 3 letters in length (for statistical 
balancing purposes) and, finally 5 practice words 
for each. word length. The 12 5-letter test word 
were selected such that vertically large lowercase 
letters (b, d, f, g, h, j, ... » Y) and vertically smali 
letters (a, c, e, +++, Z) appeared equally in all $ 
letter positions (first through fifth), The 12 3-lettef 
test words and 8 3-letter filler words were alsa 


* Strictly speaking, this was true only of the wordi 
stem. Occasionally, for example, a 4-letter word 
might have been used with a plural form (s added) 
in order to satisfy conditions and yield a satisfactory 
S-letter word. 


VISUAL IMAGERY 


0-—-o IMAGE 
@— PERCEPT 


RESPONSE TIME (SEC.), MEAN OF MEDIANS 


1 2 3 4 5 
PROBE POSITION 


Ficure 1, Response time as a function of image- 
percept representation and letter position probed in 
Experiment I. 


selected so that vertically large and small letters 
appeared equally in all 3 letter positions, 

Each S$ was given a preliminary spelling test for 
the words that were to be used, He was required 
o “spell” the words both with letters and with yes 
and no (indicating the vertical height of the letters). 
For example, the word cat would be spelled c, a, ¢ 
and no, no, yes, respectively. 

"The stimulus words were presented as images or as 
percepts. For the image conditions, Æ spoke the 
word and S was instructed to form a visual image of 
it with his ey closed. Imagining something 
visually was explained to be like “picturing it in the 
mind." For the percept condition, each word was 
rear projected with a Kodak Carousel projector 
equipped with a solenoid-operated shutter. Letters 
appeared in lowercase elite type on a 30,48 X 20,32 
cm. rear-projection screen, The vertically small 
letters were .63 cm. high. Since vertically large 
letters were slightly irregular in height, the j, 1.27 
cm, in projected height, was taken as the standard. 
The S viewed the letters from a distance of 53.34 em. 

Four seconds after presentation of a stimulus 
word, E orally presented a probe digit (1-5). The 
probe digit indicated a letter position in the word S 
was imagining or perceiving. In a word like cat, 
the digit / referred to the c, 2 referred to the a, and 
3 referred to the t. The 5-letter words were probed 
in the same fashion. The spoken digit activated a 
Lafayette voice relay and started a Standard clock. 
The S then responded yes or no to indicate whether 
the letter denoted by the probe digit was vertically 
large or not. The S's response stopped the clock, 
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and RT from onset of the probe to onset of the re- 
sponse was recorded. For the percept conditions, 
after S responded, the slide terminated and a blank 
adaptation field was displayed from another pro- 
jector with a solenoid shutter. All Ss were in- 
structed to respond as rapidly as possible, with 
100% accuracy. Any words which resulted in error 
were repeated at the end of the run. Hence, all 
time scores were for correct choices only. 

Design. There were 2 levels of word length (3- 
letter and 5-letter words) and 2 modes of representa- 
tion (percept and image). Word length was a be- 
tween-Ss variable and representation mode was 
within Ss, with half of the Ss proceeding in the order 
image-percept and the other half in the order 
percept-image. There were 60 trials within each of 
the between-Ss conditions (each test word appeared 
5 times in the 5-letter treatment and 3 times in the 
3-letter treatment). This allowed each letter posi- 
tion in each word to be tested once and only once; 
the additional 3-letter filler words made it possible 
to equate practice on 3- and 5-letter words. Probe 
position was also treated as a separate within-Ss 
variable for each word length. Two different ran- 
dom orders of words were used. 


Resulls and Discussion 


Each S's median RT for a given condi- 
tion was computed, and the mean of the 
medians was then obtained for Figure 1. 
"Thus, each point represents 96 events (12 
Words X 8 Ss). Medians were used be- 
cause of the substantial variability en- 
countered. The number to the right of 
each function is the mean RT obtained by 
collapsing across probe positions. 

Several statistical analyses were con- 
ducted, and RT was the dependent vari- 
able in each case. In the first analysis of 
variance, data were collapsed across probe 
positions. That is, each S's score was the 
mean of his separate median probe position 
RTs. The resulting 2-way classification 
of 2 word lengths (3 and 5 letters) by 2 
representation modes (image and percept) 
was analyzed, with the first factor between 
Ss and the second within Ss: for word 
length, F (1, 14) = 5.20, p <.05; for 
representation mode, F (1, 14) = .48, 
p > .05; and for the Word Length X Repre- 
sentation interaction, F (1, 14) = 2.82, 
p> .05. Thus, RT significantly increases 
with word length, but images and percepts 
are not reliably different in the speed with 
which spatial information is extracted from 
them. Indeed, processing time from the 


mu 


3-letter image condition is at least as fast 
as that from the 3-letter percept condition. 

The second series of analyses examined 
probe position effects. Consider the 3- 
letter words. Here, the analysis was based 
on a 2-way classification of 3 probe posi- 
tions (1, 2, and 3) by 2 representations 
(image and percept), with both factors 
within Ss: for probe position, F (2, 14) = 
7.64, p « .05; for representation mode, 
F (1, 7) = .94, p > .05; and for the Probe 
Position X Representation interaction, F (2, 
14) = .09, p > .05. Thus, the dip in RT 
for the middle letter is significant, suggest- 
ing that all parts of a word are not processed 
equally rapidly. 

To examine this serial position effect 
more closely, separate Newman-Keuls tests 
were performed for the 3-letter percept and 
3-letter image conditions. The results are 
indicated by the letters adjacent to each 
plotted position in Figure 1. Within the 
range of each function, the probe positions 
having the same letter do not differ signifi- 
cantly ; however, the probe positions having 
different letters do differ significantly 
(p < .05). Hence, the 3-letter percept 
condition may be regarded as statistically 
flat, a result consistent with a random 
access process. But the 3-letter image 
condition indicates a significantly faster 
processing time when the second-letter 
position is probed. However, it is appro- 
priate to reemphasize that the Probe Posi- 
tion X Representation interaction was not 
significant in the analysis of variance for 
probe position effects. 

In a comparable series of analyses for the 
5-letter word conditions, the following re- 
sults were obtained: for probe position, 
F (4, 28) = 4.97, p < .05; for representa- 
tion mode, F (1, 7) = 1.88, p > .05; and 
for the Position X Representation inter- 
action, F (4, 28) — 2.28, .05 < p < .10. 
Again, the probe position effect shows that 
not all parts of a word are processed at the 
same rate. 
| The Newman-Keuls test results are in- 
dicated in Figure 1 in the manner described 
previously. The 5-letter percept function 
is statistically flat, while there appears to 
be a serial position effect developing for the 
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processing time between 3- and 5-letter 
word conditions. There do not seem to be 
significant differences in processing time for 
overall image vs. percept representation, 


but there may be differences in probe posi- 
tion effects. In regard to probe position 
effects, percept condition processing is con- 
sistent with random access to serial posi- 
tions, but image condition processing may 
not be indicative of strict random access 
to serial positions. 


EXPERIMENT II: Prope Worpd 


Experiment I examined differences in the 
processing of information from available 
visual images vs. available visual percepts. 
The present experiment is concerned with 
examining differences in the processing of 
information from visual images vs. visual 
percepts, during the generation process. 
By presenting the probe before the word - 
that is to be imagined or perceived, the 
necessity for generating an image or percept 
is added to the task. The temporal growth 
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percept can 


Method 


Sixteen experimentally naive Ss with normal 
vision were paid for their participation and were 
tested individually. All stimulus materials and 
procedures were identical to those of Experiment I 
except that the probe digit was presented first and 
then was followed by the stimulus word. 


Resulls and. Discussion 


The treatment of the results, sum- 
marized in Figure 2, was identical to that 
of Experiment 1. When the probe posi- 
tion variable was collapsed, the resulting 
2-way analysis of word length by represen- 
tation mode indicated significant effects of 
word length, F (1, 14) = 7.69, p < .05; 
representation mode, F (1, 14) — 249.58, 
p < .05; and the Word Length X Repre- 


sentation interaction, / (1, 14) = 9.72, 
p «.05. In Experiment I, only the word 


length variable was significant. The pres- 
ent results indicate that it takes signifi- 
cantly longer to generate and process an 
image than it does a percept. The signifi- 
cant Word Length X Representation inter- 
action is indicative of some form of limited 
capacity in the visual image generating and 
processing system. As the word to be 
generated becomes longer and exceeds pro- 
cessing capacity, R'T becomes dispropor- 
tionally longer for the image conditions in 
comparison to the percept conditions. 
Separate serial position analyses were 
performed for each word length. For 3- 
letter words, an analysis of probe position 
by representation mode yielded significant 
effects of position, / (2, 14) = 14.56, 
p < .05; representation, F (1, 7) = 336.78, 
p <.05; and no significant interaction, 
F (2, 14) = 2.93, p > .05. The Newman- 
Keuls test for 3-letter word conditions 
indicated that the percept function is 
statistically flat and that for the image 
function each probe position is statistically 
different from the other 2. Thus, generat- 
ing percepts appears to be an all-at-once 
parallel process, and the generation appears 
to have no effect on the processing of in- 
formation from the percept. The increas- 
ing 3-letter image function is consistent 
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FiGU Response time as a function of image- 
percept representation and letter position probed in 
Experiment II. 


with a left-to-right generating and/or 
processing strategy. 


The serial position analyses for 5-letter 


words indicated the following. In the 
probe position by representation anal 

the effects of position, F (4, 28) = 26.76, 
p < .05; representation F (1, 7) = 99.06, 


p <.05; and the Position X Representa- 
tion interaction, F (4, 28) = 13.77, p < .05, 
were significant. The Newman Keuls test 
indicated that the percept function is 
statistically flat and that the image func- 
tion has an inverted-U form. These results 
are not consistent with a left-to-right 
generating and/or processing strategy. 

In summary, percept functions remain 
statistically flat even when the probe is 
presented prior to the word. Generation 
processes of the type studied here seem 
minimal in perception, and there still ap- 
pears to be random access to the different 
serial positions. The random-access pro- 
cess could occur in at least 2 different ways. 
First, the percept representation could be 
of a spatially parallel nature in which all 
the letters are simultaneously available for 
processing. Second, after receiving the 
probe digit, S could simply visually fixate 
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on the appropriate letter location on the 
viewing screen in anticipation of the forth- 
coming word. The latter process would 
not be very interesting theoretically. More- 
over, it would not explain the flat percept 
functions of Experiment I, where the locus 
of the probe position was uncertain. 


GENERAL DISCUSSION 


The picture theory of visual imagery was 
supported in Experiment I; Hebb's (1966) gen- 
eral requirement that performance analogues 
exist for percept and image representation was 
satisfied. Access to a 3-letter image repre- 
sentation was at least as rapid as to a 3-letter 
percept representation. Access to a 5-letter 
image representation was not significantly 
slower than to a 5-letter percept representa- 
tion. However, there was some suggestion of 
an emerging Percept-Image Representation X 
Word Length interaction. This interaction 
was strongly present in Experiment II, which 
dealt with image generation time. 

Our explanation for these findings is that 
there exists a visual image operating memory 
with a fixed letter capacity for parallel process- 
ing that is less than that of the visual percept 
system. When the image capacity of the 
operating memory is strained or exceeded, 
differences in processing time between percept 
and image systems become apparent. These 
differences reveal themselves in effects of word 
length and/or serial position. The effects were 
more apparent in Experiment II, where the 
image had to be both generated and scanned, 
than in Experiment I, where the image could 
have been generated and available some time 
before scanning began. As a partial inde- 
pendent check on this capacity interpretation, 
we have since performed several scaling studies 
of Ss! subjective 


impressions of how many 
letters they can visualize at one time, The 
median number of alphabetic letters simul- 
taneously visualizable was found to be 5, with 
an ascending method of limits procedure 
(1 = 31), and 6, with a numerical estimate 
procedure (n = 24). "These scaling values 
seem very supportive of the results from 
Experiments I and II. 

If the visual image system does have a 
capacity limitation, it is of interest to know 
how this limitation is handled. Weber and 
Kelley (1972) and Weber et al. (1972) have 
found evidence that when an array of letters 
such as an alphabetic list or long word (e.g., 
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"supercalifragilisticexpialidoccious") 
the spatially parallel capacity of the visu 
image system, a verbal sequencing comes into 
play. Briefly, what happens is that S says tj 
himself each successive letter or syllable name 
prior to visually imagining it. This verbal se 
quencing allows for the sequential ordering and 
generation of visual images and is in sharp con- 
trast to perceptual presentation, where S need) 
only visually scan or move his eyes from one 
letter or syllable locus to another (Weber & 
Kelley, 1972) in order to sequence through a 
list. Thus, the capacity of the visual percept 
system is essentially unlimited as long as the 
time requirements are not so stringent as to 
limit visual scanning and/or eye movements. 
In this view, the mode of sequencing betwee 
letters or syllables seems to be one fundamental 
distinction between the visual image and visual 
percept systems. 

In conclusion, the majority of the findings 
of the present study indicate that the picture 
theory of visual imagery may have much more 
to recommend itsthan has previously been sup- 
posed (Pylyshyn, 1973). This is not to sayy 
however, that there are no differences between 
percept and image systems. One important 
difference seems to be in the manner of sequenc- 
ing between items (letters) in a list that ex- 
ceeds capacity. There are, no doubt, other 
differences to be discovered. 
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VERBAL DISCRIMINATION 


AS A CONCEPT-ATTAINMENT 


TASK 


USING THE EVALUATIVE DIMENSION ! 
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In Experiment I, a verbal discrimination task was used in which correctness 
was determined by a relational rule based on the evaluative Senn 
Each pair consisted of a neutral word paired with an extreme ("good" 
“bad”) word. When the word closer to the positive pole of the dimension 
vas correct, performance was significantly better than when correctness was 
randomly determined. When the word closer to the negative pole was cor- 
rect, performance was not significantly different from the random control. 
These results, coupled with the results of a postlearning questionnaire, 
suggest that most Ss failed to conceptualize the evaluative dimension as 
a single bipolar continuum. In Experiment II, it proved harder to judge 
which member of a pair was closer to the negative end than to judge which 
was closer to the positive end of the dimension, indicating an asymmetry 


in rule utili 


Several recent experiments have found 
that the evaluative attributes of words can 
be a basis for concept attainment (Di Vesta 
& Ingersoll, 1969; Haygood, 1966; Rhine, 
1965; Rhine, Cole & Ogilvie, 1968; Taylor 
& Haygood, 1968) or for differential encod- 
ing in memory, as indexed by release from 
proactive inhibition (Wickens & Clark, 
1968). These studies used stimulus words 
drawn from the positive (good) and nega- 
tive (bad) halves of the evaluative dimen- 
sion of connotative meaning. The results 
have been interpreted (e.g., Haygood, 
1966; Wickens & Clark, 1968) as evidence 
that positive and negative words constitute 
psychologically distinct classes, and as pro- 
viding at least partial validation for the 
evaluative scale of the semantic differential. 

According to the bipolar model which 
underlies the semantic differential (e.g., 
Osgood, Suci, & Tannenbaum, 1957), each 
scale is assumed to represent a single bipolar 
dimension extending as a straight line in 
semantic space, with the midpoint of the 
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ition as well as in rule discovery. 


line passing through the origin of this space. 
As applied to the evaluative dimension, 
such a model implies not only that positive 
and negative words should constitute dif- 
ferent psychological classes, but that these 
two classes of words should be perceived 
(responded to) as belonging to the same 
dimension; and further, that evaluatively 
neutral words should be perceived as falling 
along this same dimension midway between 
the two extremes. 

The present study tested some of the im- 
plications of the bipolar model, as it applies 
to the evaluative dimension, by investigat- 
ing the attainment of concepts based on 
relational rules. In Experiment I, a verbal 
discrimination (VD) paradigm was used to 
present pairs of words in which one member 
was evaluatively neutral and the other v 
either positive or negative. Correctness 
was determined by one of two rules. Under 
the positive rule, the correct member of 
each pair was the one closer to the positive 
pole of the dimension; under the negative 
rule, the correct member was the one closer 
to the negative pole. It was expected that 
S should discover and utilize the appropri- 
ate relational rule if, in accordance with 
the bipolar model, S conceptualized posi- 
tive, neutral, and negative words as being 
monotonically ordered along the same di- 
mension. More specifically, if the bipolar 
model is applicable to performance in the 
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present task, the following outcomes would 
be expected : (a) Where correctness is deter- 
mined by a relational rule, VD performance 
should be significantly above the level of a 
control group in which correctness is ran- 
domly determined. (6) The positive and 
negative rules should be equally effective. 
(c) Assuming that Ss are able to verbalize 
the rules when questioned at the end of 
learning, they should tend to express the 
rules in relational rather than absolute 
terms. For example, the positive rule could 
be expressed either as a relative statement 
such as, “The more positive (closer to 
good) member of each pair was correct" 
or in absolute terms, such as, "Where 
a neutral and a good word were paired, 
the good word was correct; where a neu- 
tral and a bad word were paired, the 
neutral word was correct." The relational 
form, but not the absolute form, reflects 
the idea of a single bipolar dimension rang- 
ing from good through neutral to bad. 

Asecondary question concerned the effect 
of instructions upon discovery of the appro- 
priate rule. Some Ss were given an instruc- 
tional clue to the effect that correctness 
was rule-determined and that the rule 
involved evaluative attributes. The pres- 
ence or absence of this instructional clue 
was orthogonal to type of rule. 


EXPERIMENT I 
Method 


N Design. There were five independent groups: posi- 
tive, instructed-positive, negative, instructed-nega- 
tive, and random. The positive rule applied to the 
first two groups; the negative rule, to the next two 
groups. Both instructed groups received the same 
instructional clue, and this clue constituted the only 
difference in treatment between the two instructed 
groups and their uninstructed counterparts. In the 
random (control) group, correctness was randomly 
determined. The random group, therefore, could 
learn the list only by rote; the four rule-determined 
groups could learn either by rote, or by making use 
of an appropriate rule, or by some combination of 
rule and rote. 

Lists. All groups learned a list of 16 pairs in 
which 8 pairs were neutral-good (i.e., a word rated 
neutral paired with a word rated good) and 8 were 
neutral-bad. The construction of the lists might 
be called semi-double-function. "That is, each neutral 
word occurred in two pairs in the same list (e.g., the 
pairs AMOUNT, HOPE, and AMOUNT, CRIME), being 
correct in one pair and incorrect in the other. Sub- 
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jects cannot master such a list by learning whieh 
individual words are correct; they must take 
other member of the pair into account. The Mi 
was constructed in this way to increase the difficulty 
of rote learning and to encourage attention to both 
members of each pair, thereby maximizing 5^ 
portunities to discover the appropriate re 


rule. 

The occurrence of each neutral word in two pai 
while each extreme (good or bad) word occurred in 
only one pair should introduce a bias in favor of the 


neutral words, according to the frequency theory of 
VD learning (Ekstrand, Wallace, & | nderwood, 
1966). That is, all other things being equal, the 
accumulation of more frequency units for the neutral 
words should produce more correct responses for 
pairs in which neutral words are correct than for 
pairs in which neutral words are incorrect. A bias 
favoring neutral words, however, should not diffe 
entially affect performance as a function of group, 
as in all five groups the neutral member was the core 
rect one for half of the pairs. 
In selecting the words to be used, it was desired 
to equate the good, bad, and neutral sets on irreles 
vant variables, but this was not feasible using - 
published lists of words rated on the evaluative 
dimension. Accordingly, a set of 224 nouns which 
appeared by inspection to be unequivocal in theim 
goodness, badness, or neutrality were drawn from 
the Paivio, Yuille, and Madigan (1968) norms, As 
a check on E's judgments, these nouns were them 
rated on a 5-point evaluation scale by 16 students 
recruited from a course in introductory psychology. 
Three sets of nouns—8 good, 8 bad, and 8 neutral— 
were then chosen to meet the following criteri, 
(a) the three sets were equated as far as possible in 
word length, in number of first-letter duplications, 
and in the variables of frequency, imagery, and: 
meaningfulness as measured by the scales provided | 
in the Paivio et al. norms, and (b) the designation 
of words as good, bad, or neutral on the is of $ 
ratings should be as unequivocal as possible, "The 
128 ratings made by the 16 raters for the 8 words in 
each set break down as follows for the good set, 
97% of the ratings were either V ery Good or Some- 
what Good, and 3% were Neutral; for the bad set, 
97% were either Very Bad or Somewhat Bad, and 
3% were Neutral; for the neutral set, 88% of the 
ratings were Neutral, 6% Somewhat Good, and 6% 
Somewhat Bad. It is recognized that, because of 
the rough scaling procedure and small number of 3 
scalers, the sample of neutral words selected may 
not adequately represent the psychological midpoint 
between the samples of good and bad words selected. 
If an imbalance of this kind exists, however, it 
should not bias the comparison between groups, 
Since the same pairs were used for all groups. (This 
point is covered more fully in the discussion of 
the results.) The words selected for the good 
Set were FREEDOM, GIFT, HOPE, HUMOR, KINDNESS) ] 
MOTHER, TRUCE, VIGOR; for the bad set, ASSAULT, 
CRIME, DEATH, DECEIT, PYTHON, SLAVE, TROUBLE; ) 
Victim; and for the neutral Set, AMOUNT, ANKLE, - 
FORK, HAIRPIN, INSTANCE, LENGTH, PRODUCT, UNIT. — 
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Eight 16-pair lists were constructed. In each list, 
the 8 neutral words each occurred in two pairs and 
the remaining 16 words each occurred in one pair. 
The eight lists comprised two basic pairings with four 
versions of each pairing, The four versions differed 
only with respect to the designations of the correct 
pair members, In one version, the positive rule ap- 
plied; i.e., the correct words were the positive mem- 
bers of neutral-good pairs and the neutral members 
of neutral-bad pairs. In another version, the nega- 
tive rule applied. Whichever member was correct in 
the positive version was necessarily incorrect in the 
negative. There were two random versions of each 
pairing, which differed in that whichever word was 
correct in one version was incorrect in the other. 
In each random version, the neutral members were 
designated as correct for half the neutral-good 
pairs and for half the neutral-bad pairs. Within 
each list, the number of pairs with the shorter word 
correct equaled the number with the longer word 
correct. 

Four orders of presentation were used for each 
list. The orders were random subject to certain 
restrictions, such as that no more than three con- 
secutive pairs could be of the same type (i.e, 
neutral-good or neutral-bad). The assignment of 
the correct member to the upper or lower position 
within a pair was random subject to certain restric- 
tions designed to prevent confounding between 
intrapair position and any other variable. 

A practice VD list of eight pairs of numbers was 
constructed by pairing the numbers 1-16, with each 
number occurring in one pair, In the practice list, 
pairing and correctness were random subject to 
certain restrictions designed to minimize the possi- 
bility that any rule could be used as a cue to correct- 
ness. A single pairing, with four orders of presenta- 
tion, was used for the practice list. 

Procedure. Subjects were run individually. The 
practice list of numbers was presented until 5 
reached the criterion of two perfect trials, not neces- 
sarily consecutive. The word list was then presented 
for 12 trials. All lists were shown by the anticipa- 
tion method on a Stowe memory drum at a 2:2 rate, 
with an 8-sec, intertrial interval. The same starting 
order was used for all Ss, Since the standard VD 
instructions to call out the correct member of each 
pair might induce a set toward evaluatively positive 
words, the instructions to S avoided the terms correct 
and incorrect (or right and wrong) in referring to the 
two members. Instead, instructions for both the 
practice list and the word list indicated that pairs 
would appear on the left-hand side of the memory- 
drum window, and that the task was to anticipate 
which member of each pair would appear on the 
right-hand side. The word-list instructions also 
informed S that some words would occur in more 
than one pair, and that a word which appeared on 
the right-hand side when it was a member of one 
pair would not necessarily appear on the right when 
it was a member of another pair. The only reference 
to rules or to the evaluative dimension occurred in 
the following sentence, which was added to the 
word-list instructions for the instructed-positive 
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and instructed-negative groups: “There is a rule 
you can discover which will help you learn which 
words will appear on the right. I cannot tell you 
the exact rule, but it has to do with the extent to 
which words arouse reactions of goodness or bad- 
ness," (The order of the words “goodness” and 
“badness” in this sentence was reversed for half the 
Ss within each of these two groups.) 

At the conclusion of the word list, an untimed 
written questionnaire was administered. The ques- 
tionnaire instructions informed .S that different 
people received different word lists, and asked him 
to check whichever one of the following three state- 
ments seemed to apply to his list: 1. It seemed to be 
arbitrarily determined which words would appear 
on the right. 2. There seemed to be a rule or prin- 
ciple determining which words would appear on the 
right. 3. For some word pairs, it seemed to be 
arbitrary; for other word pairs, there seemed to be 
a rule or principle. If S checked Statement 2 or 3, 
he was asked to write the rule. 

Subjects. The Ss were 40 paid summer session 
students, 20 men and 20 women, none of whom had 
participated in the original rating of the words to 
be used. Eight Ss, 4 men and 4 women, were 
assigned to each of the five conditions, using a block 
randomization procedure. The two pairings of the 
word list were balanced over condition and sex of S, 
and the two versions of each random list were 
balanced over sex of S. Three additional Ss were 
eliminated ; 2 failed to reach criterion on the pra 
list within 18 trials, and the other had four failures to 
respond on the word list. 


Results and Discussion 


Practice list. Because all Ss were treated 
alike on the practice list, performance on 
that list should not differ among the five 
groups. Mean trials to criterion ranged 
from 6.62 to 10.75 for the five groups, and 
analysis of variance confirmed that these 
practice-list differences were not significant 
(p > 10). Nonsignificant effects (p > .10) 
were also found for sex of S and for the 
Group X Sex of S interaction. 

Word lists. Mean correct responses for 
the 12 word-list trials combined are shown 
in Table 1, and trial-by-trial data are shown 
in Figure 1, Panel A. An overall 5 X 2 X 2 
analysis of variance was performed on the 
total number of correct responses. ‘The 
independent variables were rule (positive, 
instructed-positive, negative, instructed- 
negative, and random), sex of §, and type of 
pair (neutral-good vs. neutral-bad). The 
variable of rule proved to be highly signifi- 
cant, F (4, 30) = 10.14, p < .001. Type 
of pair was also significant, F (1, 30) 


418 


"TABLE 1 


MEAN Correct RESPONSES ON WoRD List, BY 
CONDITION AND TYPE OF PAIR 


Type of pair 


| 
Condition SCR eaten ae j 
aji Neutral- | Neutral- 
All pairs good | bad 
Random | 
Neutral word correct 76,62 
me word correct 71,88 


Positive i 
Instructed—Positive 
All conditions combined 


| N 
148,50 3. 
Instructed- Negative | 
I 


= 4.95, p < .05, with performance on neu- 
tral-good pairs somewhat higher than on 
neutral-bad. The only other factor ap- 
proaching significance was the Rule X Type 
of Pair interaction, F (4, 30) = 2.27, 
05 « p < .10. 

As shown in Table 1 and in Figure 1, 
Panel A, the five rule groups in order from 
lowest to highest performance on the word 
list were random, negative, instructed— 
negative, positive, and instructed—positive. 
Dunnett's test was used to compare each 
of the four rule-determined groups with the 
random control. With k = 5 and df = 35, 
performance was significantly above the 
random-group level in the positive group, 
t = 3.84, p <..01, and in the instructed— 
positive group, ¢ = 5.54, p «€ .01, but not 
in the negative, / — 1.09, p > .05, or in- 
structed-negative, / = 2.18, p> .05. 

An additional analysis comparing posi- 
tive vs. instructed- positive groups revealed 
a highly significant effect of instructions, 
F (1, 14) = 22.75, b «.001. The anal- 


FIGURE 1. Mean correct responses on 
word list, by condition. 
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ogous negative vs. instructed-negatiy 
comparison did not approach significan 
(F « 1). 

The major results for the rule varia 
may be summarized as follows: Where tl 
positive rule governed correctness, perform 
ance was significantly better than in thi 
random group, where learning could occi 
only by rote; where the negative rule gov- 
erned correctness, performance was slighth 
but not significantly al the random 
group level. These resulis held both fi 
the instructed and for iLe uninstructed 
groups. 


The question might | 
the superiority of the pos 
tive rule could be accounted for in terms 
a simple response bias; i.^., a general ten 
dency to choose positive and or avoid nega: 
tive words (cf. Boucher & Osgood, 1969). 
Against this interpretation is the findin 
that, within the random group, correct ré 
sponses averaged almost the same for thos 
pairs with the “closer to negative” 
correct (74.13) as for those with the "'clos 
to positive" word correct (74.37). 
supplementary analyses were conducted in 
order to rule out the possibility that sucha 
response bias may have contributed to th 
comparisons between the random and rule- 
determined conditions. The random-group 
lists were divided into two subsets—th 
eight pairs in which the “closer to positive" 
word was correct, and those eight pairs i 
Which the "closer to negative" word was 
correct—and the number of correct re- 
Sponses in each subset was multiplied b: 
two. Then separate analyses of variance 
were used to compare each of the four rule- 
determined conditions with the appropriate 
subset of randoni-group pairs; e.g., the posi- 
tive group was compared with the “closer to 
positive” set. The results of these analyses 
mirrored the results of Dunnett's test re- 
ported above. 

It might also be pointed out that the 
obtained asymmetry between the positive 
and negative rules could not be accounted 
for by a failure of the three sets of words to 
mark off two psychologically equal intervals 
along the evaluative dimension. That is, 
assume that, either because of deficiencies 


raised whethe 
e to the nega: 
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in the sample of words selected or because 
of some peculiarity of the evaluative dimen- 
sion per se, it was harder to differentiate 
between neutral and negative words than 
between neutral and positive words. This 
should make it harder to apply both rules 
to the neutral-bad than to the neutral- 
good pairs. It should not make it harder 
to apply the negative rule than to apply the 
positive rule, however, since the specific 
pairs and, hence, the specific intrapair in- 
tervals would necessarily be the same re- 
gardless of rule. 

It appears, therefore, that the negative 
rule was harder to discover and/or harder 
to utilize than the positive rule, and that 
this outcome was neither an artifact of 
unequal intervals nor the consequence ofa 
general bias toward positive and against 
negative words. 

Turning to the type of pair effect, per- 
formance for the five groups combined was 
significantly better on the neutral-good 
pairs than on the neutral-bad. Although 
the Rule X Type of Pair interaction failed 
to reach conventional significance levels 
(05 < p < .10), Table 1 shows that the 
type of pair effect was not manifest in all 
groups, but was concentrated in the nega- 
tive and instructed-negative conditions. 
This marginal interaction suggests that the 
superiority of the neutral-good pairs should 
not be interpreted simply in terms of psy- 
chologically unequal intrapair intervals. 
Rather, the finding of better performance 
on neutral-good than on neutral-bad pairs 
in the two negative-rule conditions is con- 
sistent with a frequency bias, since fre- 
quency would favor pairs with the neutral 
word correct, and under the negative rule 
those would be the neutral-good pairs. 
Further evidence for a frequency bias is 
found in the random group. There, fre- 
quency would not affect the comparison 
between neutral-good and  neutral-bad 
pairs, but would favor those pairs with the 
neutral word correct as opposed to those 
with the extreme word correct. The figures 
in Table 1 (76.62 vs. 71.88) are consistent 
with this expectation. In the positive and 
instructed-positive groups, however, any 
frequency bias would work to the advantage 
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TABLE 2 


E CorrEcT RESPONSES ON Worp List, BY 
RULE CONDITION AND AWARENESS CATEGORY 
or S 


“Aware” Ss 


Rule con- | | Correct responses Correct responses 
Sai ae E MM —— 
| M Range M 
Negative 6 | 148,33 | 122-172 2 181.50 | 176-187 
am] 

| 5 |152.80 | 133-169 180-189 

5 174.60 | 163-184 175-186 
Instructed- | 
Positive 0 | | 


of the neutral-bad pairs, and there is no 
evidence of such a bias. It may be specu- 
lated that the tendency to choose the more 
frequently occurring word played a sub- 
stantial role in those conditions where 
learning was entirely or mainly by rote, but 
not in those conditions where learning was 
largely by rule. 

Questionnaire. The 32 Ss in the four 
rule-determined conditions were labeled 
“Aware” or “Unaware” on the basis of their 
responses on the postlearning questionnaire. 
A total of 16 Ss checked Statement 2 in- 
dicating that a rule seemed to apply. All 
these Ss then wrote rules which, while vari- 
ously phrased, may be considered correct or 
closely correlated with the correct rule. 
These 16 Ss were labeled “Aware.” The 9 
Ss who checked Statement 1 to indicate 
that no rule seemed to apply were labeled 
"Unaware." The remaining 7 Ss checked 
Statement 3 to indicate that a rule seemed 
to apply to some but not all pairs. Since 
none of them wrote a rule which could be 
considered correct or partly correct, these 7 
Ss were also labeled “Unaware.” (There 
was essentially no difference in learning be- 
tween these two subgroups within the Un- 
aware category.) 

Table 2 shows the number of Ss labeled 
Unaware and Aware according to the 
criteria specified above. Also shown is the 
number of correct responses (out of a maxi- 
mum possible of 192) for these two classes. 
All Ss in the instructed-positive condition 
fell in the Aware category and, in view of 
their near-perfect performance, they prob- 
ably discovered the positive rule on the first 
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trial. In the other three rule-determined 
conditions, only a minority of Ss had dis- 
covered the correct rule by the end of the 12 
learning trials. The range columns indi- 
cate that, within the two negative-rule 
groups, there was no performance overlap 
between Aware and Unaware Ss. Within 
the positive condition, VD performance was 
not so strongly related to verbalization of 
the rule. 

In view of the relatively high level at- 
tained by the five positive-condition Ss clas- 
sified as Unaware (see Table 2 and Figure 1, 
Panel B), their performance was compared 
with that of the eight Ss in the random con- 
dition, and the difference was found to be 
significant, F (1, 11) = 13.18, p < .01. It 
thus appears that awareness of the positive 
rule—defined in terms of verbalization of 
the rule on the questionnaire—was not a 
necessary condition for its effective opera- 
tion. (It might be noted that the mean 
correct of 174.60 among the five Ss in ques- 
tion reflected equally the performance of 
three Ss who checked Statement 1 and two 
Ss who checked Statement 3; mean correct 
responses in those two subgroups averaged 
174.33 and 175.00, respectively.) 

A surprising finding was that the instruc- 
tional clue, which had a marked effect both 
on the questionnaire results and on VD 
performance under the positive rule, had 
virtually no effect under the negative rule. 
A reasonable supposition is that an instruc- 
tional clue which calls attention to evalua- 
tive attributes but does not furnish a speci- 
fic rule has the effect of predisposing Ss to 
try the positive rule, or an approximation 
of it, as their first hypothesis. In the 
instructed-positive group, this led to im- 
mediate discovery of the correct rule and 
near-perfect VD performance. In the in- 
structed-negative group, when this first 
hypothesis proved wrong, most Ss ap- 
parently reverted to rote learning and, ac- 
cordingly, performed at the random-group 
level. 

In considering the applicability of the 
bipolar model to performance in the present 
task, the form in which Ss verbalized the 
rules is highly relevant. Of the 16 Aware 
Ss (i.e, those Ss who stated some version 
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of the correct rule), only 6 s 
tional rule applicable to both : 


a rela 
of pai 


(e.g., “The more positive of th: won 

appeared" or “Pick the less ap g word 
of the pair"). The remainin: s used 
absolute rather than relative te state, 
in effect, a separate rule for « vpe of 
pair (e.g., “A word associated v sitive 
emotions appeared on the rig! pairs 
with negative emotions, the : word 
appeared," or “Anytime one oi words: 
in the pairs had a positive valu: iated 
with it, the neutral word is sh ; the 
right. A negative value appe: n the 
right"). Assuming that . nt of 
the rule on the questionnaire cor: nded 
to his formulation of it during |: g it 
appears that most Ss who dis: da 
workable rule did not utilize the i fan 
underlying good-neutral-bad co ium, 
since they failed to conceptualize t! 10ice 
of a neutral word in some pairs ood am 
extreme word in other pairs as fol) wing 
from the same relational rule. (If > rel- 
evant dimension were size instead of eva- 
luation, an analogous outcome wo 1 be 
for most Ss to formulate a rule such as, 


“When small is paired with mediun , pick 
small; when large is paired with medium, 
pick medium" instead of the simpler rela- 
tional rule “Pick the smaller one.”’) 


EXPERIMENT I] 


A distinction can be made between the 
discovery of a rule and its utilization. The 
learning data combined with the question- 
naire results of Experiment I indicated that 
the negative rule was harder to discover 
than the positive rule. Experiment 1 left 
indeterminate the question of whether the 
negative rule may also be harder to utilize 
than the positive rule. The purpose of 
Experiment II was to make a simple and 
direct test of the relative difficulty of apply- 
ing these two rules. The task was one of 
judgment rather than learning. 


Method 


A positive group and a negative group received 
the Same treatment except for instructions. The 
16 pairs from Experiment I were presented on @ 
memory drum at a 2-sec. rate, without feedback. 
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The positive group received the following instruc- 
tions: “Decide which word would seem more good, 
or at least less bad, to most people, and call out that 
word. That is, of the two words, call out the one 
most people would say is closer to being good,” 
The negative group received the same instructions 
except that the words "good" and “bad” were inter- 


changed. (A normative rather than an idiosyncratic 
judgment was requested for consistency h Experi- 
ment I; ie, in Experiment I, those Ss who dis- 


covered a rule would presumably have applied it on 
the à mption that the rule had a normative basis.) 
Both pairings from Experiment I were used. Within 
each pairing, there were two versions which were 
reversed with respect to intrapair top-bottom posi- 
tion. Lists were balanced over groups. The Ss 
were run individually. Each S received four trials, 
with a switch from one rule to the other after Trial 2. 
Data are reported for Trial 1 only, however, in view 
of the small number of errors made on Trials 2-4 
and the difficulty of evaluating transfer effects. 
The Ss were 32 paid summer session students, 
16 men and 16 women, none of whom had partici- 
pated in Experiment | or in the original rating of 
the words to be used. Sixteen Ss, 8 men and 8 
women, were assigned to each of the two groups 
using a block randomization procedure. Five addi- 
tional Ss were eliminated, 3 because of E errors, 1 
because he refused to judge several pairs, and 1 
because he consistently selected all neutral words as 
“closer to good” and all extreme words as "'closer to 


bad." 


Results and Discussion 


Results are reported in terms of mean 
errors, an error being either a response 
which is the normatively “wrong” member 
of its pair, or a failure to respond. The 
number of errors made on the 16 pairs 
ranged from 0 to 3 in the positive group, 
with a mean of .94, and from 0 to 7 in the 
negative group, with a mean of 2.19. On 
the neutral-good and neutral-bad pairs, re- 
spectively, mean errors were .75 and .19 in 
the positive group, 1.44 and .75 in the nega- 
tive group. Because of apparent skewness 
and heterogeneity of variance, the analysis 
was performed on a Freeman—Tukey square 
root transformation of the error data 
(Mosteller & Bush, 1954). A2X2x2 
analysis of variance with rule, sex, and type 
of pair as the three variables yielded signifi- 
cant effects for rule, F (1, 28) = 5.01, p < 
.05, and for type of pair, F (1, 28) — 8.08, 
b «.01. No other main effects or inter- 
actions were significant. 

Considering first the main effect of type 
of pair, performance on the neutral-good 
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pairs was significantly worse than on the 
neutral-bad. This was in the opposite 
direction from the type of pair effect found 
in Experiment I. The different outcomes 
with respect to this variable reflect different 
tasks; i.e., Experiment I involved a mixture 
of rote VD learning, rule discovery, and 
rule utilization, while Experiment lI in- 
volved only rule utilization. The Experi- 
ment I outcome may be attributable mainly 
to a frequency bias affecting those pairs 
learned by rote. The Experiment II result 
could be interpreted to mean that the 
psychological distance between the neutral 
and positive words was less than that be- 
tween the neutral and negative words, but 
it is indeterminate whether this finding can 
be generalized beyond the specific pairs 
used. As indicated earlier, any deficiencies 
in the samples of words selected to represent 
the good, neutral, and bad sets would not 
bias the comparison between the positive 
and negative rules, but might produce a 
significant type of pair effect. 

The critical outcome of Experiment II 
was the rule effect. The negative rule 
proved to be harder to apply than the 
positive rule. That is, for the same pairs 
presented in the same way, it was signifi- 
cantly harder to judge which member of 
the pair was closer to the negative end of 
the evaluative dimension than to judge 
which member was closer to the positive 
end. This result held equally for neutral- 
good and for neutral-bad pairs; the Rule 
X Type of Pair interaction yielded F < 1. 

The combined results of Experiments I 
and II, therefore, indicate that the nega- 
tive rule is both harder to discover and 
harder to apply than is the positive rule. 

The superiority of the positive rule in 
the present study bears at least a super- 
ficial resemblance to anfoutcome which has 
been found consistently with certain reason- 
ing problems involving evaluative compari- 
sons. That is, linear syllogisms (three-term 
series problems) of the form, “A is better 
than B, B is better than C, which is best?" 
are solved more readily than logically 
equivalent problems of the form, "C is 
worse than B, B is worse than A, which is 
worst?" (e.g., Clark, 1969; De Soto, Lon- 


"ee a 
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don, & Handel, 1965). It is possible that 
there is a basic asymmetry, one of wide 
generality, in the ability to use relational 
rules based on evaluative attributes, It 
cannot be assumed, however, that the same 
cognitive operations underlie performance 
in these different tasks. 
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This study examined reaction times to 


memory, Matches were based on either phys 


AA) or 


ional identity (e.g., the 
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probes of active memory and inactive 
cal identity (e.g., the letters 
a). The results showed that 


letters 


inactive memory probes were processed more slowly than probes of active 


] matches w 


memory, and that phy 


ere faster than associational matches 


only in active memory. These results were interpreted in terms of the nature 


of the stored representation and its retrieval. 
ferences between active and inactive memory. 


It was concluded that dif- 
rches were due to differences 


in the storage location, not the nature of the stored representation. 


Verbal materials have been used to 
study memory since Ebbinghaus' (1954) 
pioneer experimental work in this area in 
1885. One hundred years of this type of 
research has promoted the notion that 
memory in verbal form is of much greater 
relative importance than other nonverbal 
forms (graphic, tactile, etc.). Many theo- 
ries of information storage and retrieval 
(e.g., Atkinson & Shiffrin, 1968) have been 
based entirely on the registration, transfer, 
and storage of linguistic or verbal 
information. 

There are, however, indications that not 
all storage is verbal. In an attempt to 
examine coding processes, Posner and Keele 
(1967) presented to their Ss 2 letters 
which, when viewed sequentially, were 
judged as same or different. These judg- 
ments were based on physical identity, 
which occurred when 2 letters (AA) were 
identical physically, or associational iden- 
tity, which occurred when 2 letters had 
only the same name (Aa). They have 
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shown that physical same matches are 
faster than associational same matches. 
The difference (name reaction time [RT] 
— physical RT = 80 msec.) was inter- 
preted as a difference in processing time 
for different "nodes" or levels. The as- 
sumption was made that a physical iden- 
tity match is based on a comparison of 
codes representing only the visual informa- 
tion, while associational matches are based 
only on a comparison of verbal labels 
or names. 

Posner and Mitchell (1967) gave 2 pos- 
sible explanations for the associational- 
physical match time difference. First, 
physical matches could be made prior to 
associational matches, or second, both com- 
parisons could occur simultaneously, with 
the associational match taking longer to 
complete. An alternative explanation is 
that in the associational match condition 
(Aa) one item is recoded into the format 
of the other, and physical matches are 
then made. Swanson, Johnsen, and Briggs 
(1972) supported the hypothesis that the 
memory item(s) is recoded, not the final 
item (probe). 

In short, it seems that with the com- 
parison of 2 successive stimuli, the iden- 
tity judgments are facilitated when physi- 
cal identity exists. Such facilitation lasts 
for at least 1.5 sec. and probably as long 
as 6-9 sec. or longer (Phillips & Baddeley, 
1971). So, it appears that visual informa- 
tion is used primarily in active or short- 
term memory, and such memory is not 
entirely verbal. 

A distinction that is frequently made 


e 
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involving storage of information is that 
between short- and long-term stores or 
active vs. inactive memory. Waugh and 
Norman (1965) were among the first to 
make such a distinction and provide em- 
pirical support. Their conclusion that the 
2 systems were independent was based on 
differential effects of number of intervening 
items. Subsequent research has supported 
this distinction in terms of serial position 
recall differences (Glanzer & Cunitz, 1966; 
Loftus & Wickens, 1970) as well as physio- 
logical evidence (Milner, 1970). Baddeley 
(1966) suggested that semantic variables 
were most important in inactive memory, 
while phonemic ones were important in 
active memory. Shulman (1970, 1971) 
showed that this may be due to temporal 
and strategical factors, and that both 
variables can be effective in both systems. 

Several recent studies are relevant to 
the question of information processing in 
active and inactive memory. Forrin and 
Morin (1969) combined the Sternberg 
(1966) fixed- and varied-set procedures 
(active and inactive memory) by present- 
ing A, AB, or ABC (memory load [M] 
= 1, 2, or 3) to be retained across a block 
of trials (fixed set). This set is presumably 
held in inactive memory. Each trial, then, 
consisted of presentation of a varied set 
(M = 1, 2, or 3), followed by a probe of 
either active or inactive memory, or 
neither, This classification task is assumed 
to load both memory systems at the same 
time. They found that the RT — memory- 
load function for active memory probes 
was steeper (slower processing) than that 
of the inactive memory probes. However, 
the inactive set in Forrin and Morin's 
study was both highly familiar and prac- 
ticed (relative to the different active sets) 
as well as overlapping in the formation 
of the memory groups. This overlap also 
could conceivably reduce the slope 
constant. 

Sternberg, Kroll, and Nasto (1969) com- 
pared performance on active set (only) 
with performance on inactive set (only). 
There was less difference in practice of 
inactive and active probes than in Forrin 
and Morin's (1969) study and, because 
the same items (digits) were used in both 
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tasks, there was no differential fami ty. 
Sternberg et al. found that the proc- -ing 
rate for the inactive memory prob as 
slower than for the active memory | S 
A greater intercept for the inactive n ry 
probes was interpreted as the ti: e- 
quired to find the proper ensem! in 
inactive memory, while the incre in 
slope constant reflected a serial re al 
process of the inactive items into an ve 
memory for comparison. 

The present study questions th X- 
planation of the slope difference an 0- 
poses an alternative explanation in ns 
of the nature of the coding used in bi iry 


classification tasks as a function of :he 
presumed location of the stored mer ory 


set—i.e., active or inactive memory. is 
predicted that visual or representat. al 
codes will be used in what is presum. ly 
active memory when verbal associate: re 


not required for appropriate respon: g, 
while coding in what is presumably in- 
active memory will be based on the val 
labels even when these labels are ot 
strictly required. These representational 
differences, then, might explain the poc- 
essing rate differences between active and 
inactive memory probes. It is important 
to note that the distinction between active 
and. inactive memory in this research is 
an operational one. A theoretical distinc- 
tion, however, may find support in the 
results. 


METHOD 


To carefully examine the question of representa- 
tional differences in different memory systems, 
acoustic similarity was manipulated in such a man- 
ner as to affect RT as a function of the presumed 
type of match. It was felt that acoustic similarity 
should alter the RT-memory-load functions only 
when the stimulus letters are in fact stored as 
associates, and these associational representations 
are used as the basis of comparison, That 
when physical representations are compared, acous- 
tic similarity is not expected to be a potent factor 
in the comparison process. 

. Design. A repeated measures design was used 
in which 48 conditions were defined by the factorial 
combination of the following variables; memory 
load (1, 2, or 4 letters), memory system (active 
or inactive), type of match (physical or associa- 
tional), acoustic similarity (similar or diss milar), 
and type of response (match or no match). During 
each session 2 blocks of 84 trials were given and 


pi 


the first 4 trials were excluded from analys 
Twenty of the remaining trials involved a no 
match of the probe and memory items and re- 
quired a negative response. Of the remaining 60 
trials, there were 15 trials each of the following 
types of match: (a) physical match of probe with 
active set, (b) physical match of probe with in- 
active set, (c) associational match of probe with 
active set, and (d) associational match of probe 
with inactive set. 

Ihe presentation of each type of material was 
randomized to minimize guessing. One level of 
n wry load for the inactive set was used each 

ind the levels were counterbalanced so that 

oss Ss each level was used equally often in 
cach ordinal position (first, second, or third) across 
consecutive days. The inactive memory load 
was constant within each session to minimize 
confusion as to the contents of this set. . All 3 levels 
of memory load for the active sets were used 
equally often in each block (10 trials/block). 
Given that restriction, the order of occurrence of 
the active memory load levels was randomized. 
Half of the Ss had memory sets and probes drawn 
from an acoustically similar list for the first ex- 
perimental sessions and a dissimilar list for the 
For the remaining Ss the order 


final 3 sessions. 
was reversed. 

»udus material. Nineteen letters of the 
dphabet were selected as stimuli and divided 


into 2 (overlapping) sets of 12 from which both 
memory items and probes were drawn. One set 
was composed of the following 12 acoustically 
similar letters: B, C, D, E, F, G, P, S, T, V, X, 
and Z. The second set was an acoustically dis- 
vivular list comprised of the following 12 lette 
A, C, D, H, K, M, Q, S, U, X, Y, and Z. Indices 
of acoustic confusion (Cónrad, 1964) were cal- 
culated separately for each set of letters, revealing 
much less acoustic similarity for the acou lly 
dissimilar list than for the acoustically similar list. 
The indices of visual confusion within each list 
(Townsend, 1970) were nearly identical. 

The presentation of the meniory sets was visual 
and simultaneous. For M = 1, the letter was in 
the upper left corner of a 2 X 2 cell matrix. The 


remaining cells were unfilled. For M = 2, the 
upper 2 cells contained the letters, and all 4 cells 
contained letters for the M = 4 condition. Both 


upper- and lowercase could be represented in one 
‘ory set. The occurrence was randomized; 

bus, across a block of trials, both cases were rep- 
resented equally often in the active memory sets. 
The probe letters that could be in either case 
appeared in the center of S's viewing screen and 
were selected from the same subset as the memory 
items. The probes were selected such that across 
the final 80 trials in each block there were 60 match 
and 20 no-match trials with an equal number of 
(a) upper- and lowercase probes (40 each), (6) as- 
sociational matches and physical matches (30 each), 
(c) match with active set item and match with 
inactive set item (30 each), and (d) active set 

* match probes of M = 1, 2, and 4 (10 each). 
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Procedure. In the initial session, instructions 
were given and Ss had 1 block of 84 trials. All 
3 levels of memory load were used for the active 
sets, but only M = 4 was used for the inactive set. 
The purpose of this was to develop a tendency 
for Ss not to group or combine the memory sets 
into one set which is then rehearsed as a unitary 
active set. This, of course, is most difficult to do 
when the largest sets are used. 

At the end of each session, S was shown his 
inactive set for the next session and asked to 
memorize it. At the beginning of the next session, 
S was asked to record his memory set to assure 
its storage in inactive memory. If the set or any 
part could not be recalled, he was again shown the 
entire set and given 5 min. to memorize it. Then, 
at the beginning of each block of trials, this set 
was displayed again and the regular sequence began. 
Following the inactive set, an active set was dis- 
played for 3 sec, followed by a 1-sec. interstimulus 
interval. The probe item was then displayed and 
terminated after 2 sec. The time from the onset 
of the probe item to the initiation of .S's response 
was recorded as the primary dependent variable. 
The response was made by pressing a microswitch 
with the left or right index finger. For half of 
the Ss, the left finger indicated a match while the 
right indicated no match. The reverse was true 
for the other . Following the probe there was 
a 3-sec, intertrial interval prior to the presentation 
of the next active set. Thus, the total duration 
of one trial was 10 sec. This sequence was repeated 
until the end of the block of tria After a brief 
rest, the inactive set was again displayed, and the 
sequence was repeated with different active mem- 
ory sets. 

Subjects. The 24 right-handed Ss were volunteers 
responding to an ad in the campus newspaper. 
Each was paid $1.25 for each session. There was 
1 practice session, 6 experimental sessions, and 
1 debriefing session on 8 successive work days. 

Apparatus. ‘The S sat in a dimly lighted, sound- 
damped room before a response panel and a ground- 
glass screen. On the response panel, which was 
at tabletop level, were 2 buttons marked "match" 
and “no-match.”” The marking, of course, cor- 
responded to the particular finger assignment of 
each S. The stimulus materials were rear projected 
at eye level on the screen and measured 20 X 20 
mm. for the 2 X 2 array. Each letter was ap- 
proximately 10 mm. high and was viewed at 60 cm. 
(less than 1° visual angle). A paper-punch-tape 
unit controlled the 2 Kodak Carousel RA950 
projectors which presented the proper memory 
sets and probe items. An electronic clock (ac- 
curate to 1 msec.) measured S's RT to the probe 
item. An Iconix timing system was used to control 
the several intervals and stimulus durations. 


RESULTS 


Each SS contributed 10 positive RTs 
(excepting errors and machine failures) to 
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TABLE 1 


Correct REACTION TIMES (IN SEC.) AND 


AVERAG 
(IN PERCENTAGES) 


RELATIVE ERRORS 


Memory load 
Type 1 letter 2letters | 4letters 
RT RE RT | RE | RT | RE 
3 Inactive 
Physical active | 
Eos 439 .41|454| .62 |469| .83 
2 465 80 | 497 | .62| 510 | 1.26 
4 535 | 2.78 | 554 | 3.64 | 566 | 8.63 
Associational 
active 
1 469| 1.04 |474| .83 | 489 | 1.46 
2 497 | 1.04 | 516 | 1.73 | 530 | 2.08 
4 558 | 10.29 | 570 | 3.78 | 580 | 2.50 
Active 
active 
480 -20 | 490| .20 | 547 | 2.55 
2 548 -83 | 562 | 1.69 | 608 | 3.13 
4 608 | 4.00 | 639 | 6.49 | 656 | 3.45 
Associational | 
inactive | 
1 476 Al} 500| .20 | 547 | 1.9: 
2 557| 1.26 | 577 | 1.93 | 622 | 1.46 
4 626| 2.74 | 639 | 5.57 | 682 | 3.57 
N Areas RT = Romie time; RE = ENS 
each of the cells in Table 1 for each simi- 


larity condition (acoustic similarity or dis- 
similarity). Medians of these scores were 
taken, while all further measures of central 
tendency were derived by use of arith- 
metic means. ‘Table 1 shows the average 
correct RTs and percent errors for each 
condition, collapsed across both Ss and 
similarity conditions. 

Error analyses. The attempt to main- 
tain a low error rate was shown to be 
successful by the 2.31% average error rate 
for match conditions and 8.74% for no- 
match conditions. The error rate data 
provide little additional information beyond 
that shown below by an analysis of RTs, 
It is important, however, to note the lack 
of significance of the acoustic similarity 
variable in terms of errors, F (1, 23) = 2.96, 
p > 10. This suggests that the comparison 
of RTs for the levels of acoustic similarity 
is valid. 
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earlier by Ster: 
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A detailed anal 


uU "ted 
natch grand 


itch response 


mory systemi 


x Type 
and Memory 
(2, 46) = 3.63 
3 main effects 


9 msec.) ové 
ec.) was fou 
ai. (1969) [ 
ov). This disting 


tion between active and inactive memon 
is supported by the fact that, in a post 


experimental questionn: 23 of 24 
reported not rehearsing the inactive 
items. 


Second, the main effect of type of mate 
shows that physical matches (535 msec 
were made faster on the average than 
associational matches (551 msec.). Posné 
and Keele (1967), Posner et al. (1969), ant 
others also found this difference. TH 
magnitude of the difference found heft 
(16 msec.), however, is smaller than thal 
found by Posner and Mitchell (1967) 
This is due to the use here of the mixed: 
list procedure (Posner et al., 1969), 
clusion of some analogue matches, t 
l-sec. interstimulus interval, and the i 
clusion of associational-physical differen 
from both active and inactive memory set 
probes. 

Finally, the characteristic increase inRT 
With an increase in memory load (Sterni 
berg, 1966) is evidenced in the significa 
main effect of memory load. 
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The significant Memory System X 
Memory Load interaction, illustrated in 
Figure 1, suggests that the slopes of the 
? functions are not parallel. The best- 
htting linear equations are 


RT = .440 + .068 (H.) Inactive Memory 
and 
RT 2.417 + .047 (H.) Active Memory. 


It is important to note that the H. scale 


of the number of items in memory is an 
equal-interval log transform scale. The 
H, metric is a Shannon statement (log) 


of central processing uncertainty which is 
determined primarily by memory load (see 
Briggs et al., 1971). Such a scale is not 
without precedent (Briggs et al, 1971; 
Nickerson & Feehrer, 1964) and more im- 
portant, the data overall are better fitted 
in this manner than by a linear scale. 

The prediction that the type of informa- 
tion used in the comparison process will 
be a function of the particular memory 
system involved (active vs. inactive mem- 
/) is supported by the significance of the 
ype of Match X Memory System inter- 
action. This interaction is plotted as a 
function of memory load in Figure 2, and 
indicates that the main effect of type of 
match was due to the difference between 
associational and physical matches in ac- 
tive memory. For this reason the physical 
and associative match functions for in- 


Match Responses. 


Inactive Set 
RT*.440+,068(H.) 


Active Set 
RT*.417+.047(H,) 


© Inactive Set 


Reaction Time (seconds) 


© Active Set 


1 2 3 
Central Processing Uncertainty (Hc) 


FIGURE 1. Average “match” reaction times 
(RTs) as a function of central processing un- 
certainty (H,) with memory system as the 
parameter. 
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FrGURE 2. Average “match” reaction times 
(RTs) as a function of central processing un- 
certainty (Ha) and memory system with type of 
match as the parameter. 


active memory, 


RT = .438 + .066 (H,) Physical 
and 
RT = .443 + .069 (H,) Associational, 


are plotted as a single regression equation 
in Figure 2. 

Thus, the data in Figure 2 support the 
major prediction that the information used 
to match stimuli in this task is a function 
of the memory system involved, Further 
support comes from the contribution of the 
significant type of match main effect and 
from the Type of Match X Memory Sys- 
tem interaction. These effects show that 
physical matches are faster than associa- 
tional matches, but that this difference 
occurs in the active memory system. 
"Thus, only associational matches are in- 
volved in the inactive memo system, 
while both types of matches occur in 
active memory. 

The fact that neither Type of Match 
X Memory Load nor any other higher 
order interactions involving both of these 
variables was significant indicates that the 
difference between associational and physi- 
cal matches in the active memory set is 
constant across memoryjload and thus the 
functions are plotted as’ best-fitted parallel 
lines in Figure 2. x 

Neither the main effect of acoustic simi- 
larity, F (1, 23) = .34, nor any of its 
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interactive patterns approached signifi- 
cance. ]t was expected that only the 


associational matches would be affected by 
acoustic confusability, and such an effect 
would serve to converge on the representa- 
tional differences presumably existing in 
the different memory systems. If it is 
assumed that acoustically mediated asso- 
ciations are stored in inactive memory, 
then it follows that these associations 
would effect RTs to associational matches 
in active memory more so than physical 
matches in active memory. ‘Table 2. re- 
vealed that this did not occur. In fact, 
these data show that that which mediates 
matches in inactive memory must be 
operating at least as much in active 
memory physical matches as in active 
memory associational matches. 


Discussion 


Previous studies have loaded what is pre- 
sumably active and inactive memory in char- 
acter classification tasks (Forrin & Morin, 
1969; Sternberg et al., 1969). It was sug- 
gested that differences between these studies 
might be artifacts of control procedures, such 
as practice, familiarity, overlap of memory 
sets, etc. The present study sought to achieve 
proper control over these variables and de- 
termine other and possibly more meaningful 
explanations for differences in active and in- 
active memory probe RTs. The following 
major predictions were made: (a) inactive 
probe RTs are slower than active probe RTs, 
(b) the rate of processing in inactive memory 
is slower than in active memory, (c) both 
associational and physical matches are made 
in active memory, and (d) only associational 
matches are made in inactive memory. Sup- 
port for these predictions is provided in 
Figure 2. These data suggest that the in- 
formation stored in active memory is such 
that both associational matches and physical 
matches are possible. This in turn implies 
that visual information, which is more or less 
isomorphic with the stimulus, is represented 
in what is presumed to be active memory and 
is more effcient for use in matching than the 
associational information. Further, the data 
suggest that visual information becomes rela- 
tively less efficient than the associational 
information in inactive memory or it must 
be retrieved in a recoding operation. Thus, 
Ss are able to select and to use visual in- 
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effect. Here, the memory sets were hetero- 
geneous, ie., each set could contain both 
upper- and lowercase letters. Thus, if the 


memory items are recoded (as Swanson et al. 
suggested) this must be done on an individual 
basis. That is, all memory items cannot 
automatically be recoded given the probe 
item; each recoding must be based on a com- 
parison of each memory letter with the probe. 

Figure 3 diagrams several possible physical 
and associational match conditions, Its pur- 
pose is to illustrate associational-physical 
match differences in terms of the number of 
recoding operations. Several points must be 
considered in examining this figure. First, 
the matching memory item is shown always 
in the "final" position. There is neither 
vidence in this experiment that a serial scan 
exists, making this the final position, nor that 
the scan is exhaustive. Also, appearance of 
the probe in other positions changes the 
numerical values in the figure but not the 
difference or conclusions. In sum, the figure 
involves several noncritical, simplifying as- 
sumptions. Second, by definition, associa- 
tional matches require recoding. This recoding 
occurs whenever a memory item and the 
probe item do not match in terms of physical 
identity. That is, both a (memory item) - A 
(probe) and D (memory item) — A (probe) 
require recoding. Thus, for example, in 
Figure 3 when ab is presented as the memory 
set and A is the probe, 2 recoding operations 
are necessary. The third important point is 
that the difference between associational and 
physical matches is represented as recoding 
operations and the number of such operations 
is constant (difference — 1) regardless of the 
number of items in the memory set. It is 
proposed that this constancy yields the inter- 
cept difference between associational and 
physical matches and that the magnitude of 
the difference (19 msec.) is due to the necessity 
of one additional recoding for the associa- 
tional match situation. 

The slope difference between active and 
inactive memory functions is also of concern. 
First, processing of items presumably stored 
in inactive memory is slower than processing 
of active memory probes. This precludes 
Forrin and Morin's (1969) concept of a match 
in inactive memory which does not involve 
search and is independent of the number of 
items in that memory system. A more reason- 
able explanation is that items stored in 
inactive memory must be retrieved prior to 
comparison with the probe. A parallel re- 
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trieval would result in an intercept-only dif- 
ference between active and inactive memory 
functions. The slope difference does suggest 
a serial retrieval of the inactive memory 
items. The notion that this difference is due 
to comparison of different codes is rejected 
because the slope of the associational matches 
in active memory was not the same as the 
slope in inactive memory. This introduces 
the second important point. If recoding of 
physical into associational form is assumed 
to involve some interaction with inactive 
memory (e.g. the association between A 
and a, be it verbal or visual, is stored in 
inactive memory), then it would be expected 
that the estimate of recoding time should be 
the same as the retrieval of information 
stored in inactive memory. Figure 2 shows 
that the recoding time for active memory 
items is approximately 20 msec. The dif- 
ference in slopes between active and inactive 
memory functions is interpreted as an esti- 
mate of the serial retrieval time for the 
inactive memory items. This difference is 
also approximately 20 msec. Thus, recoding 
time and retrieval time, which have logical 
commonalities, are estimated to be of ap- 
proximately equal duration. This is not to 
say that recoding and retrieval can replace 
one another. If this were so, then retrieval 
of the inactive set items would eliminate the 
necessity for subsequent recoding, which 
again, would produce equal slopes in Figure 1. 
The third predominant feature in Figures 1 
and 2 is the intercept difference between the 
active memory and inactive memory func- 
tions (23 msec.). That this is due to parallel 
retrieval processes was rejected above (in favor 
of serial retrieval). Sternberg et al. (1969) 
suggested that it is due to the additional time 
required to locate the appropriate ensemble 
of items in inactive memory. A third pos- 
sible explanation is that this intercept dif- 
ference reflects time taken to "identify" or 
recode the probe item into its associate form. 
This, however, assumes that active memory 
is examined first and such identification fol- 
lows the examination and is prior to search 
of inactive memory. Such an assumption 
finds some support in the present study 
(Table 1); however, earlier it was shown that 
any recoding of the probe, in connection with 
active memory, is unlikely due to the mixing 
of cases in the memory set. Because the 
same logic applies here, the most parsimo- 
nious explanation for the difference in inter- 
cept between active and inactive memory 
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functions is simply the "time to locate" 
(Sternberg et al, 1969) the inactive set 
ensemble. Dd 
It was predicted that acoustic similarity 
would increase RTs when associational 


matches were made. If so, then the associa- 
tion would be presumed to be a verbal one 
based on an acoustically represented name. 
The total lack of a significant effect of this 
variable, coupled with a similar lack of effect 
of "nameability" on associational matching 
in a study by Taylor (1969), casts doubt on 
the hypothesis. There are 2 reasonable alter- 
native hypotheses concerning the nature of 
the associational match. First, the associa- 
tion may be an abstract verbal code which 
involves the name of the item but not in 
acoustical form (Taylor called it a "plan"). 
Second, the association may be a recoded 
physical representation. That is, when S is 
given A to memorize followed by the probe a, 
he recodes A into a. This recoding is fol- 
lowed by a physical match of a with a. An 
appropriate manipulation of the visual simi- 
larity would address this issue. 
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Words masked by upright random letters in a wordlike array or by unrelated 


words are more difficult to read tha 
cluding strings of complete letters in 


was not observed when targets were figures to be ident 
s that a wordlike mask made of letters sets up a reading response 


suggest 
that interferes with the processing 
reading. 


It has been reported (Jacobson, 1973) 
that w« that are backward masked by 
unrelated words or by zero-order pseudo- 
words are much more difficult to read than 
the same words masked by a random-dot 
Julesz (1971) field. Letters have a higher 
contour density than the random pattern 
used in the Jacobson study (squares half 
as high as the letters), and it was suggested 
that this localized di in "grain" 
produced the aforementioned result. 

However, a subsequent study called the 
contour-density explanation into question. 
Masks were made by systematically print- 
ing one letter on top of another and then 
erasing a randomly chosen half of each of 
the 2 letters (see Figure 1B). Strings of 
such “chimeric letter re found to be 
much less effective as masks of words than 
unrelated words used as masks.? It could 


rence 


not be argued that the chimeric-letter 
masks had less contour density than the 
words used as masks; if anything, the 
chimeric letters (which most Ss said re- 


haracters) had a greater 
nan the ordinary letters 


sembled Chinese c 
contour density tl 
that comprised the words. 

to find out 
t, a study was done to 


In an attempt more about 
this puzzling resu 
compare the maski 
of stimuli purposely 
larity to printed English. 
were; Julesz fields, randomly 
pieces of letters, randomly oriented whole 


ng of words by a number 
varied in their simi- 
These masks 
oriented 


1 Requests for reprints should be sent to 4 EAS 
Jacobson, Psychology Department, Queen's Uni- 
versity, Kingston, Ontario, Canada. 

? Unpublished pilot data collected by the author. 


n words masked by other patterns, in- 
random orientation, Such a difference 


d. This result 


of a target when the target requires 


letters, upright whole random letters (zero- 
order pseudowords), unrelated words, and 
words that sociates of the target 


The last condition was included 


were as 


words. 
as an internal check for the validity of the 
study. Words masked by associates are 


more readily reported than words masked 
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E 
F 
6 
Ficure 1, Stimuli used in a preliminary study 


and in Experiment I. (Rows labeled A and B — 
comparison of word and chimeric-letter stimuli used 
in the preliminary study; C-G stimuli used in 
Experiment I: C = word, D 
word, jumbled letters, F = letter pieces, and 
G = Julesz field.) 
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zero-order pseudo- 
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by unrelated words (Jacobson, 1973); if 
this finding could not be replicatéd in the 
present study, the remainder of the data 
would be suspect. Furthermore, an at- 
tempt was being made to continuously vary 
the similarity of the mask to the target; 
including associate masks in the study 
formed a logical extension of that con- 


tinuum in the direction of increased 
similarity. 
EXPERIMENT I 
Method. Twenty-five introductory psychology 


students (20 women and 5 men) served as Ss. 

Letter, word, and letter-piece stimuli were made 
by photographing typewritten materials with black- 
and-white film and mounting the negatives as slides 
for the projection tachistoscope. Thus, S saw white 
capital letters or pieces of letters on a black back- 
ground. Sixteen associated word pairs were chosen 
from Arthur's (1969) list of free associates (Figure 
1C); unrelated words were the same words mis- 
paired. Zero-order pseudowords (wordlike arrange- 
ments of randomly chosen letters) were also used as 
masks (Figure 1D). Randomly chosen letters in 
random orientation but arrayed on a line (“jumbled 
letters") formed another class of mask (Figure 1E). 
Pieces of letters in random order arranged on a line 
were yet another sort of mask (Figure 1F). 

Strips of Julesz fields from Gregory (1966) were 
photographed to form the random-dot masks 
(Figure 1G), Targets for the nonword masks were 
chosen randomly from the same set of words as 
gets for word masks. 

Each mask was arranged to cover its particular 
target when projected. A Gerbrands 2-field pro- 
jection tachistoscope was used to present the 
stimuli in a dimly lit room. The letters projected 
on the screen were 4 cm. high with a luminance of 
8.0 + .5 ftL. The background was 2.0 + .3 ftL.. 
There was no fixation point. The Ss were seated 
3 m. from the screen. 

In a pretest procedure, each S was shown 1 word 
for 8 msec. one or very few times (it is not difficult 
to'read unmasked words at this interval under these 
conditions). Then, as a demonstration of masking, 
5 was shown the same word followed 2 msec, later by 
another word presented for 120 msec. He wa asked 
if he saw the first (target) word, and when he said he 
did not, he was assured that a word was indeed 
shown. The masking sequence was slowed down and 
shown to him again to assure him of that fact. He 
was told that people for some reason find it difficult 
or impossible to read a word when it is followed so 
quickly by a m It was further explained that 


5A, Z. Arthur. Queen's norms for responses to 
100 words from the Kent-Rosanoff Word Association 
Test. (National Research Council of Canada Pro- 
ject APA 0269). Unpublished manuscript, Queen's 
University, Kingston, Ontario, Canada, 1969, 
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ı target word would) 
^s of masks, and 


this was to be a study in \ 
be followed by any of se 
task would be to read the : i 

Following the familiarization, 24 randomly ordered 
trials were given. Targets always exposed foi 
8 msec., masks for 120 mss: 

The mask delay needed 
correctly was determined 
of limits. Each trial w 
elapsing from the onset : 
of the mask (Kahnema: calls this interval 
stimulus onset asynchron \). Tf S could not 
read the target correctly, ral was increased) 
by 10 msec. and the laved again. To 
warn S, E said "ready" ! ) sequence. The 
Ss used this warning timc hat part of tl 
screen where the stimuli nt 
cedure was repeated, inc: 
msec. each time, until the 
The delay of the mask 
target word correctly wa 
variable, 

Results. An ana! 
conducted on the da 
slightly more facile 
of each mask condition, í 
of 43 msec. on the fi 
mask condition to 36 : 
F.(3, 72) — 3.64, 5 - 
apparent interaction ! 1 mask condi 
tion and repetition. Mean SOAs when 
the targets were read under the variou 
conditions were, in or less to more 
similarity between mask and target: J F 
field mask, 36 msec. ; ; of letters, 32 
msec.; jumbled-letter mask, 36 msec 
O-order letter sequences, 53 msec.; 
related words, 48 msec.; and associated 
words, 33 msec. (See ligure 2). 
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FIGURE 2. Mean stimulus onset asynchrony 


(SOA) when the target was read correctly: Expert- 
ment I. 
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INTERACTION OF LINGUISTIC CONTENT OF MASKS AND 


Individual comparisons with regard to 
mask treatments were done on the data. 
They showed that there were 2 separate 
kinds of masks: those that appeared like 
or actually were words, and those that 
did not appear like words, regardless of 
whether they were made of letters or not. 
Targets masked by unrelated words and 
by zero-order pseudowords were more diffi- 
cult to read than those masked by jumbled 
letters, letter pieces, and Julesz fields, 
F (1, 24) = 60.0, p < 10-*. There were 
no reliable differences among words masked 
by jumbled letters, letter pieces, or random 
dots. There was no reliable difference 
between words masked by unrelated words 
and those masked by zero-order pseudo- 
words. There was a reliable difference 
between the associated and nenassociated 
mask conditions, F (1, 24) = 16.04, p < 
.0008. This comparison is not orthogonal 
with the rest and is only presented to 
demonstrate that a previously noted effect 
was also present for these Ss. 


Discussion. Again, differences in grain can- 
not explain the results. Contour density was 
controlled in the jumbled-letter and letter- 
piece masks by, in the first case, centering each 
randomly oriented letter physically where it 
would have appeared if it had been a zero- 
order pseudoword, A similar procedure was 
used for the letter pieces except that 2 letters 
were centered in each position and a random 
half of each was erased, Thus, contour density 
was held constant across word, zero-order, 
jumbled-letter, and letter-piece masks. 

The crucial variable seemed to be the ar- 
rangement of upright letters in a wordlike 
array, regardless of whether the letters actu- 
ally formed a word. An after the fact explana- 
tion of this result might be that a reading set 
is triggered by a mask that is a word or by 
letters arrayed as a word, and that reading or 
attempted reading of the mask interferes with 
the reading of an unrelated target. 
` If this were true, the relative differences in 
interference with the target should be limited 
to reading the target word (or identifying its 
letters, perhaps), but not to recognizing some 
other property of it. To test this, another 
study was done in which the same masks were 
used, but the targets were rows of shapes for 
Ss to identify. The sharp distinction between 
zero-order masks and the rest of the nonword 
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masks might not occur under the foregoing 
suggestion. 


EXPERIMENT II 


Method. Sixteen students in introductory psy- 
chology, 12 women and 4 men, served as Ss. 

New targets were made of rows of squares, tri- 
angles, circles, diamonds, or arrowheads the same 
size as letters and arrayed in a line; each target 
contained only 1 sort of shape. These were masked 
by the same zero-order pseudowords, jumbled letters, 
letter pieces, and Julesz fields used in the last study. 
As before, trials in which words were masked by 
related and unrelated words were included as an 
internal check of the results. 

Again, each $ was pre: 
times, The procedure w 
in Experiment I. 


Results. As before, Ss improved steadily 
from the first trial in each condition to the 
last. Mean SOA was 70 msec. on the first 
trial and 54 msec. on the last, F (3, 45) 

7.35, p < .001. There was no interac- 
tion of repetition and mask type. 

Mean SOAs when the shapes were cor- 
rectly identified were; Julesz-field mask, 47 
msec, ; letter-piece mask, 51 msec. ; jumbled 
letter mask, 59 msec. ; and zero-order ma 
55 msec. ‘There was no apparent differ- 
ence between zero-order pseudowords on 
the one hand and jumbled letters, letter 


ted each sort of trial 4 
as the same as that used 


pieces, and Julesz fields on the other, 
F (1, 15) = .5342, p = .52. As before, 
associated targets were easier than un- 


associated targets, 42 msec. vs. 79 msec., 
F (1,15) = 30.42, p = .0002, 

There was a reliable tendency for shapes 
masked by letter pieces to be more readily 
reported than those masked by jumbled 
letters, F (1, 15) = 11.5, p < .004. 


Discussion. The Ss experienced more diffi- 
culty identifying word targets in the second 
study than in the first. It is likely that this 
was due to the uncertainty present in the 
second but not in the first study as to whether 
a particular target w word or a row of 
shapes. 

It is obviously not possible to state from the 
observed absence of a difference in identifying 
shapes when masked by zero-order pseudo- 
words relative to all other masks that no such 
difference exists. Nevertheless, it seems rea- 
sonable to say that such a difference, if it 
does exist, is much less marked than when the 
targets are words. This tends to confirm the 
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shapes when masked by zero-order pseudowords 
and jumbled letters. (SOA = stimulus onset 
asynchrony.) 


hypothesis that the wordlike arrangement of 
the mask sets up a process or response that 
interferes with the identification of a target 
when that target requires reading, but not 
otherwise. 

There is no inconsistency between this ex- 
planation and the fact that words masked by 
associates were more readily reported than 
words masked by nonassociates and by zero- 
order pseudowords. This result cannot be 
ascribed to guessing, and the explanation that 
has been suggested is that it is a perceptual 
effect involying memory traces of the target 
and mask words (Jacobson, 1973). The 
memory traces of 2 associated words involve 
large networks that include many of the same 
circuits or units. The sets of circuits 
(“lattices”) representing 2 unassociated words 
in memory would not have as high a degree 
of mutual inclusion among their elements. 
Thus, while masking a word by another word 
interferes with the processing of the target, 
the activity set up by the mask—if it is an 
associate of the target—tends to partially re- 
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stimulate the target’s trace, A word masked 
by an associate, then, needs less of an input of 
energy containing the target information” 
before it can be identified than the same target 
word would need if it were masked by an 
unassociated word. 

In the present study, this facilitation by as- 
sociation occurred and effectively countered 
the interference created by the wordlike ap- 
pearance of the associated mask. As a result 
(when the target is a word) there are 2 series 
of masking stimuli—a wordlike series in which 
association is an effective aid to recognition, 
and a nonwordlike series of masks in which 
word association obviously cannot operate. 

Turvey (1973) has suggested that there are 
2 kinds of visual masking, one taking place 
peripherally and one centrally, with markedly 
different properties. Except for the Julesz 
fields, all of the masking stimuli used in the 
present study seem to be the sort that Turvey 
found to act centrally as well as peripherally. 
The interaction reported here between mask 
and target material with regard to their lin- 
guistic content (see Figure 3) indicates that, 
if Turvey is correct, there must be different; 
processes acting even within the central sort 
of masking. Of course, this assumes that the” 
difference between zero-order and jumbled- 
letter masks that was observed in Experiment 
I was effected centrally and not peripherally, 
but it is difficult to imagine how that could 
be otherwise. 
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ORTHOGRAPHIC DISTINCTIVENESS OF CONSONANTS 
AND RECOGNITION LEARNING 
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The effects of varying the distinctiveness of uppercase and lowercase forms 
of consonants were determined for both single-item (i.e., individual study, 
or target, items that are later tested for recognition with paired distractors) 
and multiple-item (i.e., paired study items, one of which is right, that are 
also tested in pairs for recognition of right items) recognition learning. Both 


tasks re 


A feature-analytic hypothesis of single- 
item recognition learning (e.g., Anisfeld & 
Knapp, 1968) attributes the false-recogni- 
tion effect (Underwood, 1965) to a feature- 
tagging principle. Briefly, sensory and/or 
semantic features of individually presented 
study items are presumed to be tagged in 
some manner during their initial exposure. 
Recognition of later test items as o/d is 
then based on their possession of these 
previously tagged features. Accordingly, 
the false recognition of new items as old 
is likely to be greater when they share 
sensory and/or semantic features with 
prior study items than when they are un- 
related to prior study items. 

A seemingly potent source of sensory 
features for mediating the false-recogni- 
tion effect is that of orthography. In a 
recent experiment by Raser (1972), a 
number of study items were homophones 
that were later paired for testing purposes 
with their homophonic counterparts as 
distractors. The pairs were either high 
or low in orthographic distinctiveness (e.g., 
CLAWS-CLAUSE and FOUL-FOWL, respec- 
tively). Also included in the experiment 
were nonhomophonic study items that 
were later paired with distractors of either 
high or low orthographic distinctiveness 
(e.g., CLOUD-CLEAVE and SOUR-SLUR, re- 
spectively). The false-recognition rate 
(ie., selecting new intrapair members as 
old study items) was contrasted for test 


1 Requests for reprints should be sent to Donald 
H, Kausler, Department of Psychology, University 
of Missouri, 209 McAlester Hall, Columbia, Missouri 
65201. 


iled a greater error rate for consonants of low distinctiveness than 
for consonants of high distinctiveness. 


pairs of varying disinctiveness. Sharing of 
tagged orthographic features should, of 
course, be greater when the old-new mem- 
bers of test pairs are low in distinctiveness 
than when they are high. In agreement 
with this hypothesis, Raser found that 
orthography had a significant effect on the 
rate of false recognitions. In fact, the 
effect was more pronounced than that found 
for shared phonemic features. 

Similar findings have been reported for 
the multiple-item, or verbal discrimina- 
tion, recognition learning task. Kausler 
(1973b) discovered slower acquisition for 
paired homophones (i.e., one member of 
each pair functioned as the right item, the 
other as the wrong item) of low ortho- 
graphic distinctiveness than for paired 
homophones of high orthographic distinc- 
tiveness. Schulz and Lovelace (1972) 
found a parallel effect when either homo- 
phones or nonhomophones of high and low 
orthographic distinctiveness functioned as 
interpair right and wrong items. These 
results support the general hypothesis that 
multiple-item recognition learning, like 
single-item recognition learning, is medi- 
ated by a feature-analyticfeature-tagging 
process (Kausler, 1973a, 1973b, in press). 

The present experiments provided a 
further test of the effects of orthographic 
distinctiveness on recognition learning. 
Specifically, the test involved distinctive 
features at the level of graphemic content, 
rather than distinctive features at the more 
molar level of distinctive orthography (i.e., 
letter content, word length, etc.—as is the 
case when words serve as items). The 
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present study items were consonants scaled 
for the distinctiveness of their upper- and 
lowercase forms. Upper- and lowercase 
forms of the same letter have identical 
phonemic features, and, presumably, highly 
overlapping semantic features as well. 
Consequently, when one form of each letter 
is a study item and the other form a paired 
test distractor, intrapair discriminations 
are likely to be accomplished through what- 
ever distinctive graphemic features were 
tagged during the original study phase. 
Since consonants vary widely in the dis- 
tinctiveness of their upper- and lowercase 
forms (e.g., G, g vs. Z, z), their hit rates 
in single-item recognition learning should 
reflect this variation. A similar effect 
should occur when the upper- and lowercase 
forms function as intrapair right-wrong (or 
wrong-right) members of a multiple-item 
recognition learning task. Experiments I 
and II provided tests of this hypothesis for 
the single-item and multiple-item tasks, 
respectively, 


EXPERIMENT I 
Method 


Subjects and design. Forty students in general 
psychology classes at the University of Missouri, 
all of whom were naive with respect to prior par- 
ticipation in recognition learning experiments, 
served as Ss. They were assigned alternately to 2 
groups (n = 20) that differed only in the instruc- 
tions (standard vs. experimental) given prior to 
the first study phase. Distinctiveness was a within- 
groups variable, with half of the study list composed 
of consonants previously rated (by a separate 
group) as being high in upper- and lowercase dis- 
tinctiveness and the other half of consonants 
previously rated as being low. Thus, the design 
was a 2 X 2 factorial, with one between-groups and 
one within-groups variable. 

Materials and lists. Upper- and lowercase forms 
of the 20 consonants had been rated earlier by 112 Ss 
(general psychology students) for their physical 
distinctiveness on a 9-point scale (1 = very dis- 
tinctive, 9 = very similar), Mean ratings ranged 
from 3.16 (G vs. g) to 845 (Z vs. z). For purposes 
of this study, the consonants were divided at the 
median into a high distinctiveness (HD) set (G, Q, 
D, R, N, H, B, L, T, and M—in order of decreasing 
distinctiveness) and a low distinctiveness (LD) set 
(Z, V, P, X, W, S, C, K, J, and F—in order of in- 
creasing distinctiveness). The uppercase forms 
served as study items for 5 HD and 5 LD con- 
sonants; the lowercase forms served as study items 
for the remaining 10 consonants. For greater 
generalization of results, 2 variants of the study list 
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were employed. In the first, the uppercas m of 
the odd-numbered consonants (as ranke front 
most to least distinctive) and the lowercase form of 
the even-numbered consonants were study items. 
In the second variant, the lowercase form of the 
odd-numbered consonants and the uppercase form 
of the even-numbered consonants became the study 
items. The Ss in the 2 instructional groups were 

ned alternately to the 2 list variant In 
addition, 2 different random orders for presenting 
the study items were employed per trial in cach list 
variant. Neither list variation nor order of pre- 


sentation were later found to influence hit rate; 
Consequently, these factors were pooled for the 
analysesreported later. Study slides and subsequent 
test items were typed on the same IBM Selectric 
typewriter. 

List length was extended by including 30 2-digit 
numbers as additional study items, The first 5 


and the last 5 study items were always numbers; 
the remaining 40 items alternated between cone 
sonants and numbers. 

Procedure, - AM Ss were fully informed as to the 
composition of the study list. That is, they knew 


that uppercase consonants, lowercase consonants, 
and numbers would be exposed, and that they were 
to remember the items for a subsequent memory 
test. They were further informed that uppercas 
and lowercase consonants would always appear om 
the right and left sides of the screen, respectivelys 
This procedure was followed in order to assur 
perception of the appropriate case form of the les 
distinctive consonants. The standard instructional 
group received only the above information. The 
experimental instructional group received additional) 
information suggesting that a useful device fi 
memorizing the consonants would be to form afl 
image of each consonant that emphasized its 
distinctive features. It was anticipated that these) 
instructions would reduce the disparity in hit rates 
attributable to graphic distinctiveness. 

There were 2 study-test trials. On the study 
phase of each trial the individual items w 
posed, via a Kodak Carousel projector, at a 
rate, with a .9-sec. interitem interval. Test items 
were presented in a test booklet, with a separate 
page for each trial. Each old consonant item was 
paired with its upper- or lowercase counterpart asi 
a distractor, and each old number with a new number 
as a distractor. Correct items appeared equally 
often on the right and on the left of the test pairs. 
Two different random orders of pairs were use 
per trial with each of the list variants described 
earlier. Order was again found to be unrelated to 
hit rate, and the diflerent orders were pooled for. 
the main analyses. For each of the 50 pairs, S 
encircled the old item, and rated the confidence of 
his judgment on a 5-point scale. The test phases - 
were self-paced, 


Results and Discussion 


Means and standard deviations for total 
errors on the 2 trials are given in Table 1. 


ORTHOGRAPHIC DISTINCTIVENESS OF CONSONANTS 


The main effect for distinctiveness was 
statistically significant, F (1, 38) = 4.75, 
p «4.05. As hypothesized, the error rate 
was greater for LD consonants than for HD 
consonants. Neither the main effect for 
instructions nor the Instructions X Dis- 
tinctiveness interaction effect approached 
significance, Fs (1, 38) <1. It is con- 
ceivable that imaginal encoding is the dom- 
inant means of processing graphic features 
for tagging purposes. If true, further in- 
struction to use this strategy would be 
redundant. 

An additional question concerns the dis- 
tribution of errors across the full con- 
tinuum of the distinctiveness variable. 
The overall correlation between distinc- 
tiveness ratings and errors per consonant 


approached statistical significance, p (20) 
42, p < 10, The failure to find a more 


sooi correlation stems from the 
seemingly nonmonotonic covariation be- 
tween distinctiveness and error rates. 
Grouping the consonants into successive 
sets of 4 consonants each, the total errors 
per set (ranging from most to least dis- 
tinctive) were 42, 61, 43, 67, and 77. Thus, 
the departure from an otherwise regular 
increase in error rate with decreasing 
distinctiveness occurred for consonants of 
intermediate distinctiveness. 

i The confidence ratings revealed a pattern 
that paralleled that found for error rates, 
and they offered no additional insights 
into the processes underlying the effects of 


TABLE 1 
SUMMARY Statist i FOR TOTAL ERRORS IN SI 


Irem (E 
(Exp 


| Instructions 


Imagery Standard 


Bes 
| 


Distinctiveness 


Single-item recognition 


High 3.00 2.63 

Low 4.40 2.28 
Multiple-item recognition 

High 8.25 5.33 

Low 13.15 8.16 
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distinctivenes Consequently, these data 
will not be reported here. 


EXPERIMENT I] 
Method 


The design and general procedure followed that 
of Experiment I, but with appropriate modifications 
for the shift from a single-item to a multiple-item 
task. There were again 40 Ss, 2 instructional condi- 
tions as a between-groups variable, and 2 levels of 
distinctiveness as a within-groups variable. Each 
experimental list contained 20 pairs of upper- and 
lowercase consonants (without additional filler 
pairs). One member of each pair s designated 
the right item. Half of the right items were upper- 
case forms (5 HD and 5 LD), the other half, lower- 
case forms (5 HD and 5 LD). ‘Two variants of the 
list were employed, with the right and wrong mem- 
bers of pairs reversing functions across the variants. 
There were 8 study-test trials. On the study phase 
of each trial the individual pairs were exposed, via 
a Kodak Carousel projector, at a 2-sec. rate and 
.9-sec. interpair interval. The right member of 
each pair was underlined. Test pairs were presented 
in a test booklet, with a separate page for 
trial. The Ss indicated right items by enc 
one member of each pair (self-paced), Different 
random orders were employed across study and test 
ses, and standard control procedures for verbal 
rimination research were applied (eg, right 
items appeared equally often on the spa right 
and left for each study and test phase). Finally, to 
assure familiarity with the procedure, practice on 
the experimental list was preceded by 2 trials on à 
practice list composed of upper- and lowercase 
pairs of vowels. 


Results and Discussion 


Means and standard deviations for total 
errors on the 8 trials are given in Table 1. 
The main effect for distinctiveness was 
again statistically significant, F (1, 38) = 
21.16, p < .001, with the error rate once 
more being greater for LD consonants than 
for HD consonants. As in Experiment 1, 
neither the main effect for instructions nor 
the Instructions X Distinctiveness inter- 
action effect approached significance, Fs (1, 
38) — 1.60 and «1, respectively. If any- 
thing, there was a slight trend toward more 
errors under imagery instructions than 
under standard instructions. 

The distribution of errors across the full 
continuum of the distinctiveness variable 
was examined further, this time in the 
context of multiple-item learning. The 
overall correlation between distinctiveness 
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ratings and errors per consonant attained 
statistical significance, p (20) —.50, 5 «.05. 
The total errors per set of 4 consonants 
(ranging from most to least distinctive) 
were 105, 157, 122, 170, and 199. Thus, 
the locus for the departure from a mono- 
tonic relationship coincided with that found 
for single-item recognition learning, i.e., 
with consonants of intermediate distinc- 
tiveness. 

The extraction and concomitant tagging 
of distinctive graphemic features appear to 
be important components of both single- 
item and multiple-item recognition learn- 
ing. It is this commonality of processes 
that presumably underlies the commonality 
of the functional relationships demon- 
strated in the present experiments. Thus, 
the present results add further support to 
the basic continuity between variants of 
recognition learning. In both variants, 
distinctive features of items are tagged, 
most likely by frequency-of-response cues 
(Kausler, 1973a, 1973b), during study 
phases. The tagged features are com- 
ponents of target items in the single-item 
task and components of right items in the 
multiple-item, or verbal discrimination, 
task. Subsequent discriminations are based 
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Four groups of Ss learned a list of p 


AL INTERVAL EFFECTS 
CIATE LEARNING 

ER! anD DORIS KISTLER 

of Alabama 


yaired associates (consonant-consonant- 


consonant: digit) under 4 combinations of study and retrieval times (2 vs. 


6 sec.). 
of the task. 
trigram integration. 


Transfer tests were cai 
The major resu 


Other Ss were allowed to pace themselves during the study phase 


rried out to measure cue selection and 
ilts were as follows: (a) study time up 


to 6 sec. influences rate of learning, (b) study time and retrieval time produce 
interactive effects on learning, (c) longer retrieval times are associated with 
a greater amount of single-letter selection and less stimulus integration, 
and (d) study time has little effect upon cue selection. 


Studies of variations of item exposure 
duration have focused for the most part 
upon the rate rather than upon the process 
of acquisition. Much of the research per- 
taining to presentation interval has been 
generated by Bugelski’s (1970) folal-time 
hypothesis which states that the time re- 
quired to learn a given amount of material 
is a constant regardless of how that time is 
spread across trials. The amount learned is 
a function of the total time committed to 
learning, within limits. 

Actually, the total-time hypothesis is a 
generalized expression of a relationship 
between time available for learning and 
performance. It does little by way of 
elucidating what is learned and the way 
in which learning proceeds, Clearly, the 
manner in which time is distributed during 
a learning trial may differentially affect 
encoding, storage, and retrieval processes. 
Accordingly, the major focuses of the 
present investigation are upon the effects 
of the distribution of time in relation to cue 
selection and stimulus integration and the 
reciprocal interaction of acquisition and 
retrieval processes. 

Much of the support for the total-time 
hypothesis has come from paired-associate 
(PA) learning studies, primarily of the 
anticipation variety. Cooper and Pantle 
(1967) noted that the functional time, in- 


1 Requests for reprints should be sent to Alfred 
A. Baumeister, who is now at the John F. Kennedy 
Center for Research on Education and Human 
Development, George Peabody College for Teachers, 
Nashville, Tenne: 


sofar as the hypothesis is concerned, is 
probably the study interval. An implica- 
tion of this conclusion is that rehearsal is 
the critical process affected by variations 
in study time. 

Recently, a number of experiments have 
been reported that raise serious questions 
concerning the validity of the total-time 
hypothesis. Izawa (1971) has argued that 
the anticipation procedure of PA learning 
confounds the effects of study and test in- 
tervals. There is good reason to assume 


that acquisition and retrieval are qualita- 
S. 


tively different proces Izawa found 
that the way in which time is distributed 
in the study-test method produces effects 
that are different from those observed when 
anticipation procedures are employed. Stu- 
bin, Heimer, and Tatz (1970) also raised 
a number of conceptual and methodological 
criticisms of experiments that purportedly 
provide support for the total-time hy- 
pothesis. Using the study-test method, 
they reported that shorter presentation 
times are more efficient than longer presen- 
tation times. That is, doubling the number 
of exposures of a pair produced more 
effective performance than doubling ex- 
posure time per se. A similar conclusion 
was reached earlier by Johnson (1964). 
The results of these and other investiga- 
tions suggest the possibility that variations 
in study time may lead S to adopt different 
acquisition strategies. Furthermore, al- 
though acquisition and retrieval phases 
can be experimentally separated in the 
study-test method of PA learning, there is 
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the possibility that time constraints placed 
on S during retrieval may affect his en- 
coding of the stimuli. 

In the present experiment, study time 
and retrieval time were varied factorially. 
The stimulus terms were consonant tri- 
grams, the responses were digits. Transfer 
tests were made to determine degree of 
letter selection and trigram integration 
that occurred during learning. 


METHOD 


Subjects. The Ss were 75 undergraduate volun- 
teers from introductory psychology courses at the 
University of Alabama. None had been previously in- 
volved in PA learning experiments., 

Materials. The learning materials consisted of 
6 pairs with consonant trigrams as stimuli and single 
digits as responses. The pairs were as follows: 
PMD:7, KHB:5, VLT:6, GWS:4, JYC:8, and FRN:9. 

Apparatus. The learning and transfer materials 
were presented to S on an inclined viewing console, 
53.3 cm. long and 27.9 cm. high, in which 4 one-plane 
display projectors were mounted. The viewing 
surface for each projector was approximately 2.5 
X 5 cm. Three projectors, containing letters, were 
mounted side by side, The fourth projector, con- 
taining the digits, was placed 5 cm. to the right of 
the other 3. A series of electronic timers and 
stepping relays were used to program presentation of 
the materials, All intervals and sequences were 
automatically controlled. 

Procedure, The study-test (retrieval) method of 
PA learning was employed, Each of 60 Ss was 
randomly assigned to 1 of 4 combinations of dura- 
tion of study and retrieval intervals. The study and 
retrieval durations were either 2 or 6 sec., with the 
4 combinations designated as 2-2, 2-6, 6-2, and 6-6. 
The number on the left refers to study interval, the 
number on the right to retrieval interval. 

The remaining 15 Ss were assigned to a self-paced 
study condition. The S held a switch that enabled 
him to change study items, These Ss were given 2 
sec. to retrieve the digit. Latencies were recorded 
for each study item. 

Each S was read standard instructions for study— 
test PA learning. A complete trial consisted of one 
presentation of all 6 pairs and the 6 stimulus terms. 
A buzzer was used to cue S prior to the test phase 
within a trial. A constant 5-sec. interval separated 
study and test pha: Four different orders of 
items in both study and test were administered to 
prevent serial learning. 

The criterion of learning was 2 successive perfect 
recitations of the list. Immediately upon reaching 
criterion, transfer tests were given. In the first 
transfer test, S was shown the 18 letters, 1 at a time, 
for 10 sec. each, and asked to recall the appropriate 
digit. Following digit recall, Ss were again shown 
the 18 consonants and asked to identify the “miss- 
ing" letters, Each was presented for 10 sec. Four 
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different orders were used for individual le: pre- 
sentation. The letters appeared on the console in 
the same position as in original learning. 
RESULTS 

Original learning. Learning performance 
was expressed in terms of trials to criterion. 
Mean trials were 8.6, 6.1, 4.9, and 5.3 for the 
2-2, 2-6, 6-2, and 6-6 groups, respe ely. 
A 2 X 2 analysis of variance revealed a 


significant interaction effect, F (1, 56) = 
5.2, p < .05. The Ss learning under the 
short study and retrieval interv (2-2) 
required more trials to reach criterion than 
Ss in other groups. There was no signifi- 
cant difference between any of the other 
conditions. 

The self-paced Ss required a mean of 4.6 
trials to attain the performance criterion, 
Analysis of variance involving just the 
groups given the 2-sec. retrieval interval 
revealed a significant effect, F (2, 42) — 
10.8, p < .01. The 6-sec. and the self- 
paced groups did not differ and both were 
superior to the 2-sec. study group. The 
mean latency to respond during the study 
interval for the self-paced group was 15.2 
sec/item. However, these Ss were ap- 
parently able to differentiate learned from 
unlearned items in the study phase. They 
selectively took more time with items that 
they had failed on the previous trial, 


TABLE 1 


MEAN NUMBER OF SiNGI 
SrUDY TIME AND I 


ET SoLuTIONS BY 
TRIEVAL TIME 


Letter position 
Retrieval —- Ll = 
| 1 2 3 
2-sec. study 
2 sec 1.3 A 
6 sec. 1.7 2 0 
6-sec. study 
2 sec. 1.0 e 2 
6 sec. 2.2 A 3 
Self-paced 
2 sec. 1:5 2| 5 


zi 


EFFECTS OF STUDY AND RETRIEVAL TIME ON PA LEARNING 


F (1, 14) = 10.2, p < .05. However, total 
time was not significantly correlated with 
trials to criterion (r = .08). 

Cue selection. The double criterion 
method of scoring single-cue selection de- 
scribed by Berry and Baumeister (1971) 
was employed: A correct digit recall to 
one and only one of the letters of the tri- 
gram was scored as single-letter selection. 
Table 1 presents the mean number of single- 
letter solutions for various groups. A 
2 X 2 X 3 analysis of variance including 
study time, retrieval time, and letter posi- 
tion was applied to these data (excluding 
the data from the self-paced condition). 
As expected, letter positions were differ- 
entially effective recall cues, F (2, 112) 
= 46.0, p < .01. More correct recalls 
were made to the first letter than to either 
of the other 2 letter positions. The main 
effect of retrieval time, F (1, 56) = 5.1, 
p < .05, and the Retrieval Time X Letter 
Position interaction, F (2, 112) = 3.6, 
p < .05, were significant. More single- 
first-letter solutions occurred under the 
6-sec. retrieval interval than under the 
2-sec. condition. 

A separate 3 X 3 mixed analysis of 
variance comparing the self-paced, 2-sec., 
and 6-s study conditions across letter 
positions yielded a significant effect of 
position, F (2, 84) = 178, p < 01. 
Again, it was the first letter of the trigram 
that was the most effective recall cue. 

Trigram integration. In the final phase 
of the experiment, Ss were presented one 
letter of the trigram and requested to 
reproduce the remaining letters. The 
number of letters recalled was totaled for 
each individual. Means for the 2-2, 2-6, 
6-2, and 6-6 groups were, respectively, 
22.6, 17.1, 24.9, and 17.3 letters correctly 
recalled. An analysis of variance yielded 
a significant effect for retrieval time, 
F (1,56) 2.7.9, p < .01. This result can 
be interpreted to mean that significantly 
greater integration of the trigram occurred 
under the shorter retrieval interval. 

The mean number of letters reproduced 
in the self-paced study condition was 21.5. 
An analysis of variance comparing the self- 
paced with the 2-2 and 6-2 groups failed 


441 


to yield a significant effect, F (2, 42) — 
1.49, Length of study time by itself seems 
to have little effect on trigram integration. 


DISCUSSION 


The results demonstrate that variations of 
study time and retrieval time may produce 
interactive effects upon PA performance. The 
Ss given 6 sec. to study each pair reached 
criterion faster than those who were given 
2 sec. of study time. However, in the case 
of those individuals who viewed the pair for 
only 2 sec. but who also were provided rela- 
tively long (6 vs. 2 sec.) retrieval time, the 
detrimental effects of the brief study interval 
were mitigated considerably. This finding 
implies the existence of some sort of informa- 
tion feedback mechanism from retrieval to 
storage process in PA learning. As time con- 
straints are relaxed in the retrieval phase, 
different strategies appear to be utilized during 
storage. Loftus and Wickens (1970) have 
reached a similar conclusion based upon their 
study of incentive cuing in the retrieval phase 
of PA learning. 

Tests for letter selection and letter repro- 
duction indicate that variations in retrieval 
interval influence encoding strategies. — In 
particular, it appears that, given greater time 
to retrieve the response term, college students 
are more inclined to select the first letter of a 
consonant trigram as the functional cue, Not 
only was there a greater amount of gle- 
letter selection in the 6-sec. retrieval condition, 
relative to the 2-sec. condition, but there was 
also less integration of the stimulus compound 
under the former condition. 

Although these results may, at first, seem 
to be at variance with what one might expect 
on a commonsense basis, we believe that when 
the test or retrieval phase is brief, Ss do not 
have sufficient time to actively adopt a cue- 
selection strategy. What is learned during the 
acquisition phase in the PA task is influenced 
by time constraints placed on recall. 

The performance of the self-paced Ss is 
instructive in this regard. Although they were 
allowed all the time they wished to study each 
pair, they were permitted only 2 sec. for re- 
trieval of the response. Transfer tests for 
digit recall to single letters and for letter repro- 
duction revealed that the self-paced group 
performed in much the same ways as the other 
E-paced Ss in the 2-sec. test condition. _Love- 
lace and Greenberg (1969) have likewise re- 
ported that the length of the study interval 
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(2 vs. 6 sec.) does not affect cue selection in 
PA learning with consonantal trigrams as 
stimuli. 

Any hypothesis stating that the amount 
learned is a function of the time given to study 
is an oversimplification on at least 2 counts. 
First, study time and retrieval processes are 
notindependent. Secondly, even if overall per- 
formance is the same when time is equated, 
different learning strategies may be employed. 
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Previous research suggests that stimulus loudness 


affects simple reaction 


time to a greater extent than it affects subjective simultaneity judgments. A 
question has recently arisen as to whether these results can be attributed to 


an attention bias like that of “prior entry.” 
possibility of this artifact replicated previous results. 


An experiment precluding the 
Some alternative in- 


terpretations of the results are briefly outlined. 


There is evidence to suggest that a 
simple phenomenological model is adequate 
to account for the relationship between 
visual intensity and simple reaction time 
(RT). This view supposes that perceptual 
lag, i.e., the time to perceive a stimulus 
from its onset, is influenced by stimulus 
brightness, and that a reaction is initiated 
as soon as perception of the stimulus has 
occurred (Roufs, 1963, 1966). That is to 
say, the effect of stimulus intensity changes 
on difference in the time of perception is 
the same as the effect of RT differences. 
In studies supporting this contention, 
Roufs (1963) compared the size of the 
effect of intensity on behavioral RT and 
phenomenal simultaneity and found that 
the size of the intensity effect was the 
same in both cases. Others have suggested 
that not all of the behavioral RT effect 
is accounted for by perception lag (Hohle, 
1967). This is particularly evident in the 
case of auditory intensi (loudness). 
There is evidence that RT is affected to 
a greater extent by loudness changes than 
is perceptual latency, as revealed by a 
perceptual simultaneity task (Sanford, 
1971). Indeed, Roufs (1963) has reported 
a failure to find any effect of auditory 
intensity on the point of subjective simul- 
taneity for click-flash pairs. 


1 This research was undertaken in partial ful- 
fillment of the PhD degree requirements at the 
University of Cambridge while the author held 
a Medical Research Council studentship at the 
Medical Research Council Applied Psychology 
Unit. 
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The purpose of the present article is to 
examine the apparent discrepancy between 
the effect of loudness on perceptual lag 
and simple RT using the same paradigm 
as that of Sanford (1971). Specifically, 
the design used is an attempt to ensure 
that the conclusions that have been drawn 
are not based upon data resulting from an 
attentional artifact. The task used in the 
previous study required S to watch a 
quickly rotating pointer move across a cir- 
cular dial. While S was watching the dis- 
play, an auditory signal was presented, 
and S’s task was to say where the pointer 
was positioned at the onset of the sound. 
By manipulating the intensity of the 
sound, some changes in the reported posi- 
tion of the pointer at onset were obtained, 
and the results suggested that perceptual 
lag increased with decreases in stimulus 
intensity. The increase , however, 
much smaller than the increase in behav- 
ioral RT measured for the same loudness 
range. It was argued that, with such a 
design, behavioral RT and perception lag 
could be measured under almost identical 
conditions, thereby minimizing the pos- 
sibility of differences in intensity effect 
magnitude resulting from different response 
criteria. (See Murray, 1970, or Grice, 1968, 
for a discussion of the response criterion 
concept.) 

The design actually employed entailed 
the presentation of the different intensi- 
ties in blocks. However, this may have 
had a biasing effect on the distribution of 
S's attention to the two components (visual 
and auditory) of the task. In particular, 
it may be argued that the trial blocks in 
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TABLE 1 
RELATION oF MEAN Reaction Time AND MEAN L 
VALUES (IN MSEC.) TO STIMULUS-INTENSITY 
INCREASE 
T Samia ke 
(db, above masker 
Measure JE 2 Li AS DOE Roi 
2 3 7 18 
Reaction time 
M 314 | 231 | 198 | 184 130 
SD 25 28| 35]| 30 
L 
M ol 37 29 | 31 32 
SD 21 13 15 15 
Nota. The L value 1s derived from the ER pointer position 


— the observed pointer position, 


which extremely close-to-threshold stimuli 
were employed would cause S to attend 
more closely to the auditory display than 
he would for higher signal-to-noise ratios. 
Although we are only just beginning to 
construct models of attention bias of this 
type (cf. Sternberg & Knoll, 1973), the 
idea that attention bias influences the per- 
ceptual lag has long been in existence as 
the law of prior entry (Titchener, 1908), 
which states: “The stimulus for which we 
are predisposed requires less time than a 
like stimulus, for which we are unprepared 
to produce its full conscious effect [p. 
251]" Stone (1926) and Sternberg and 
Knoll have demonstrated that simultaneity 
judgments are influenced by attentional 
bias in the manner predicted by the prior- 
entry law. 

In the case of Sanford's (1971) pointer 
test it may be argued that the prior-entry 
effect could produce the result that the 
intensity effect under the pointer test was 
less than that under RT conditions. When 
blocks of quiet stimuli are being presented, 
additional attention may be paid to the 
auditory stimulus, which would, by prior 
entry, cause the stimulus to be detected 
earlier than it would were less concentra- 
tion being devoted to it. Thus, atten- 
tional bias runs against the naturally 
greater detection lag that could be asso- 
ciated with quiet stimuli. When blocks 
of loud stimuli are used, the converse 
argument appl Although detection 
lags are smaller for loud stimuli, this 
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effect may be offset by less «tention 
being paid to the loud stimuli. result 
of these two biases would be to i-»sen the 


magnitude of the intensity efile. in the. 
clock task. Such a possible bias «on only 
be eliminated by using a design which 
the stimuli of differing intensi ^s are 


presented randomly. 


METHOD 

Apparatus. The S (N = 8 Royal Nav; enlisted 
men) was required to do two tasks, th« l task 
and pointer task. In the pointer tas iad to 
watch a pointer revolving around a k face’ 
and say just where the pointer was at ‘ic time 
when the onset of the stimulus occur: The 
"clock" was a chronoscope graduated centi- 


h-of-a- 

The 
e face 
whole 


seconds, with a 1-sec. sweep. Every t 
second was numbered 0, 10, 20... ^v 
sweep-face was 8-cm. in diameter, and 
was set centrally into a 14.5-cm.* box. 
chronoscope was set into a 50-cm.? boi: The 
board was mounted on a table in front of S at 
a viewing distance of 60 cm. A comfortab!, bright 
level of illumination was used and lamps were 
arranged so that variations over the surface of the 
clock face were minimal. Stimuli consisted of 
increments in a background of random noise— 
intensities, measured by a Dawe sound-level meter 
coupled to the earphones, were 60 db, for the back; 
ground rising to 62, 63, 67, and 78 db. for Stimuli 
1, 2, 3, and 4. Stimuli were effectively continuous, 
being terminated by Æ after Shad responded. They 
were presented binaurally, and had an ellectively 
zero rise time. 

On each trial, S watched the pointer, which 
swept over the clock face once, and a stimulus 
was then introduced on the second sweep. Tem- 
poral uncertainty was thus in the range 1-2 sec 
The distribution of stimulus onset points was 
roughly equally dispersed across the range 1 C 

The RT apparatus consisted of a microswiteh 
arranged on the release-to-react principle for S's 
nonpreferred hand. Stimulus onset activated à 
system of decatron timers which ceased counting 
when S reacted. 

Procedure, There were three sessions, In the first 
session, 5s were acquainted with the stimuli an 
given some practice at the RT task. In the fol- 
lowing two sessions, Ss heard 30 stimuli at each 
intensity (120 stimuli in all) in a random order. 
In addition, 24 blank trials were introduced in 
which no stimulus was presented. The first © 
these sessions was considered practice. All analyses 
are therefore based on final-session data. The $$ 
were instructed to perform the pointer test and 
the RT test simultaneously, and were told that 
the pointer test was concerned with visual acuity 
and that they were to say where the pointer was 
when the stimulus first came on, Instructions fot 
RT were standard. 


i 
; 
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‘The pointer test scores were made in the 
following way. A value of L (actual pointer 
position minus the observed pointer posi- 
tion), was obtained for each judgment. 
The mean of these values was then ob- 
tained. The RT and L scores are given 
in Figure 1 and Table 1 and demonstrate 
the attenuated intensity function obtained 
in the judgment task. 

A simple nonparametric test, Fried- 
man's analysis of variance (Siegel, 1956), 
was applied to each set of scores and 
revealed an effect of intensity on RT 
(p < .001) and an effect of intensity on L 
(b < .001). The interaction effect was 
tested by comparing RT-L values as a 
function of intensity and was significant 
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FiGuxE |. Mean values of reaction time (RT) 


and L (actual pointer position — observed pointer 
position) plotted on a log scale against stimulus 
intensity, (It is apparent from this that the 
change in RT is greater than the change in L. 
Statistical analysis was carried out on the untrans- 
formed data.) 
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(a) Neural Counter 


Neural Impulses 


Detect Criterion 


a v b b 
Time since onset 


(b) Filter Analogy 


Interna! Response 


Peak Response 
Time since onset 


Ficuke 2, (a) Graphical representation of the 
counter model (eg., Grice, 1968). (The effect of 
intensity on the detect-for-judgment [DJ] state, 
fea, is seen to be les than that on the detect- 
to-react [DR] state, b-b, because the DJ state 
criterion requires less evidence than the DR state 
criterion.) (b) The internal response to a stimulus 
can also be considered as analogous to the output 
of a simple filter (Sternberg & Knoll, 1973). (If 
an event such as the peak internal response is 
used to trigger DJ states, then if there is any 
effect of intensity at all it should be less than 
that obtained in reaction time [RT] tasks, since 
DR states would be initiated by a variance setting 
amplitude detector analogous to the criterion of 
the counter model.) 


(p < 001). Hence, the results show that 
although intensity influenced L and RT, 
RT was influenced to a much greater 
extent. (Parametric tests were not used 
since use of a transform to satisfy paras 
metric assumptions would be incom- 
patible with considering a specified scale 
of measurement, namely, linear time. The 
primary interest in this experiment is in 
the interaction of RT and L, and this 
would be modified by any nonlinear 
transformation.) 
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Discussion 

The results obtained in the present experi- 
ment are substantially similar to those ob- 
tained by Sanford (1971). They support the 
view that the differential effects of stimulus 
intensity on RT and simultaneity cannot be 
attributed to an attentional bias interacting 
with the intensity effect, since this was pre- 
vented by the use of random stimulus 
schedules, 

The most general statement which can be 
made on the basis of the results is that dif- 
ferent amounts of stimulus information have 
to be sampled in order to cause a detect-to- 
react (DR) and detect-for-judgment (DJ) 
event to occur. Sanford (1972) suggested 
that the DR state may well occur at an 
earlier date than the DJ state. This is based 
on the argument used in favor of the simple 
counter model of RT, i.e., that smaller inten- 
sity effects reflect less stimulus sampling than 
larger ones. This is consistent with the view 
of Murray (1970) and others that the mag- 
nitude of an intensity effect is directly related 
to the lag in stimulus detection, since it 
reflects criterion level. This is clear from the 
model shown in Figure 2a where neural im- 
pulses are assumed to be counted until a 
criterion number have been accumulated. 
Quiet signals are assumed to cause slower 
neural impulse rates than loud ones. This 
interpretation is consistent with the findings 
that Ss are aware of some of the intensity- 
dependent delay in RT, when measures are 
made of subjective ratings of RTs (Sanford, 
1970). 

Other experimental work comparing inten- 
sity functions for RT and simultaneity has 
recently been discussed by Sternberg and 
Knoll (1973), who have developed an alter- 
native suggestion that DJ states may be 
triggered at a later point on the stimulus 
continuum than DR states. They suggest 
that the internal representation of a stimulus 
may be likened to the response of a filter to 
a step input, Figure 2b. The occurrence of 
a particular level of input mediates the DRs, 
while DJs are based on the peak response 
of the system. Since peak responses occur at 
the same time for stimuli of different am- 
plitudes, an absence of intensity effect on 
tasks tapping the DJ system might be ex- 
pected. However, Sternberg and Knoll do 
suggest this model for pulsed stimuli only, 
presumably with a duration less than the 
time constant of the filter. Whether it can 
be generalized to stimuli longer than the time 
constant (as used in the present study) is an 
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If it could be so 
it would reflect a detection of 
rather than peak. It is, of cou 
that the asymptote of the filter ou 
be associated with some neural event whi 
could act as a cue for DJ states if 
model is correct. At this stage, it see 
impossible to distinguish between these t 


open question. generalized, 


different aspects of the internal representa: 
tion of the stimulus in tasks that differ only 
slightly. The present results merely show 
that the discrepancy between RT and simul 
taneity measures of detection latency cannot 
be accounted for by an explanation based 
upon attentional bias. 3 
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ASYMMETRIES IN JUDG 


MENTS OF VERTICALITY +! 


PHILIP H. K. SEYMOUR? 


University of Dundee, Dundee, Scotland 


Two experiments are reported in wh 
above or below correctly specified the 


ich Ss judged whether or not the word 
location of a dot relative to a picture 


of a face. In Experiment I, the face was horizontally oriented with its top to 
the left or right and the dot to the left or right. An analysis of the latency 
data showed a substantial same-different effect and a marginally significant 


advantage for above vs. below. In 
oriented or inverted. The normal 
above vs. below difference and a sami 


Experiment II, the face was normally 
face was associated with a significant 
ne—different effect. When the face was 


inverted, both of these effects were eliminated. These results are discussed 


about thi 


in relation to recent theories 


Seymour (1969) reported a study in 
which Ss made same-different judgments 
of displays consisting of the words above or 
below and pictures of a dot above or below 
a reference square. The main findings were 
that reaction times (RTs) were faster for 
above displays than for below displays, and 
faster for yes reports than for no reports. 
These findings were subsequently confirmed 
by Chase and Clark (1971). 

Chase and Clark (1971) argued that the 
above-below asymmetry must depend on 
differences in encoding time for the words 
above and below and not on a tendency to 
scan the display in a top-down direction 
as had been proposed by Seymour (1969). 
However, the evidence cited by Chase and 
Clark was not sufficient to determine this 
issue. In their Experiment 2 they showed 
that theabove-below asymmetry disappeared 
when the words were replaced by arrows; 
this alters the task to one of 2-choice re- 
sponding to the proximity of a dot and 
arrow/head and appears not to involve a 
proper comparison of locations. In Experi- 
ment 3 they replaced the words above and 
below by present and absent, which were to 
be matched against a dot that might or 
might not appear at a constant location 
within a block of trials; this experiment 
would bear on the question of focused at- 
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analyzing the data. 
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H. K. Seymour, Department of Psychology, Uni- 
versity of Dundee, Dundee DD1 4HN, Scotland. 


e verification of spatial locations. 


tention on the upper region of the figure 
only if the dot signifying present were 
allowed to occur randomly in the upper or 
lower locations. 

Chase and Clark's (1971) Experiment 4 
involved figures in which the dot occurred 
only in the above location for one group 
and only in the below location for the other. 
The Ss were instructed to infer the below 
location from absent-above in the one case, 
and above from absent-below in the other, 
In the top-visible condition, the pattern 
of latencies was similar to-that obtained 
under the normal condition, but substantial 
alterations occurred in the bottom-visible 
condition, which Chase and Clark inter- 
preted as an addition to the time required 
to encode the location above. On the other 
hand, the top-visible condition did not 
result in an increase in the time to encode 
the location below, so that Chase and 
Clark commented that, "Apparently, sub- 
jects normally attend to the upper location 
of the circle, hence they normally (or often) 
only infer (circle below square) in the 
BELOW displays [p. 322].” 

Given this latter comment, it appears 
that Seymour (1969) and Chase and Clark 
(1971) are in agreement in arguing that 
attention is often initially focused on the 
upper region of the figure; a tendency of 
this kind might delay encoding of below 
displays because of top-down scanning or 
because of the occurrence of a time-con- 
suming inference. Chase and Clark have 
added the assumption of a difference in 
encoding time for the words above and below, 
and have argued that this occurs because 
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(a) Horizontal (b) Vertical 


acureg 1. Examples of face-dot displays used 
KEEN orizontal conditions of Experiment I and 
for (a) ical conditions of Experiment II. (All 4 
(b) Ve da the response yes when presented 


pictur word above, and no when presented with 


ith the 
ue word below.) 


, word below refers to a negative pole on 
e ension of verticality (Clark & Chase, 
JE ‘Thus, the word above may be repre- 
d as + vert/ + polar and the word 

as + vert/ — polar, indicating that 
Ve words are semantically represented as 
ROG > values on a verticality feature, but 
DEEST to positive and negative poles 
ad m vertical dimension (Leech, 1969). 
or : e and Clark (1971) argued that the 
S aie case involves some addition to the 
e tence-encoding stage of the comparison. 
Eike introduction of the notion that below 
is an inherently negative concept suggests 
a further line of theorizing which may be 
relevant to interpretation of the above- 
below asymmetry. Schaeffer and Wallace 
(1970) described a model for comparisons 
of word meanings in which semantic simi- 
larity, as determined by a preliminary 
holistic comparison of 2 word concepts, 
affects the thresholds of same and different 
z inters used to test for equivalence of the 
2. concepts with respect to some specified 
à tribute. Semantic similarity tends to 
al ver thresholds for same judgments and to 
ine thresholds for different judgments; 
ee ersely, semantic dissimilarity raises 
conva eskol for same judgments, but 
a rs the threshold for different judgments. 
Owe our (1973) proposed that the inherent 
oct or negativity of the concepts to 
e yared might have effects analogous 
be Pe semantic similarity. Thus, if the 
eu those GEH of an above-above word- 
represe display is inherently positive, as 
PESE ede ‘this display may result in a 

ark 2 


sented 


greater reduction in the threshold for a yes 
response than a below-below display. As 
a preliminary test of this assumption, 
Seymour instructed Ss to report mo t 
above-above and below-below displays, and 
yes for above-below and below-above. dis- 
plays. This reversal of the congruence- 
report mapping eliminated both the above 
below and the true-false effects, It was 
argued that this result was incompatibl 
with the assumptions of the Chase and 
Clark (1971) model, but might arise if th: 
advantage of above-above displays depended 
on the possibility of mapping the display 
onto a semantically affirmative report. 

It seems possible, therefore, to identify 3 
general theories about the nature of th: 
asymmetry observed in verticality judg- 
ments, (a) A perceptual theory states that 
S scans the picture display from top to 
bottom, or attends initially to the upper 
region of the figure, so that encoding time 
is less for above pictures than for below 
pictures. This is the theory which was 
attacked by Chase and Clark (1971), al 
though, as mentioned above, they restated 
an attentional variant of it in the discussion 
of their Experiment 4. (b) A linguistic 
theory states that the concepts above and 
below differ in semantic complexity, and 
that the marked term, below, requires more 
time at the word-encoding stage than the 
unmarked term, above. Pictures are en- 
coded in the same abstract or propositional 
format as sentences, so that a comparable 
difference in encoding time is postulated 
for above vs. below pictures. (c) A re- 
sponse availability theory states that posi- 
tive features in the semantic representation 
of the word-picture display tend to reduce 
the threshold for selection of a yes response. 
Responses to above-above displays are 
faster than responses to below-below dis- 
plays because there is greater inherent posi- 
tivity in the representation of above-above. 

This article describes 2 experiments 
which were designed to discriminate among 
these 3 theories. 


EXPERIMENT I 


Experiment | aimed to distinguish be- 
tween the perceptual and linguistic theories. 
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The Ss matched the words above or below 
against the location of a dot which was 
positioned relative to a horizontally ori- 
ented reference face. Figure 1(a) shows 
the face-dot configurations in which the 
dot was located above the face. In the 
below displays, the dot was located close 
to the chin of the face. Thus, the above and 
below locations might be to right or left 
of the display, depending on the position 
of the top of the face. This type of display 
eliminates any possible advantage for focus- 
ing attention on the upper part of the figure, 
and controls for tendencies to focus atten- 
tion to the left or right. If the above-below 
asymmetry disappears under these condi- 
tions, it will follow that the effect is largely 
attributable to a distinction between the 
above and below locations, or spatial/ 
perceptual factors. If the asymmetry per- 
sists, this will count as support for Clark's 
contention that above concepts may be 
established more rapidly than below con- 
cepts, but will also be compatible with the 
assumptions of the response availability 
theory. 


Method 


Subjects. Eight volunteers from final year classes 
at Dundee, Scotland, senior secondary schools 
served as unpaid 5s. 

Apparatus, On each trial a warning tone was 
sounded for 500 msec. followed by a 1-see. delay 
after which the word and picture displays were rear- 
projected onto a screen for 2 sec. The light sources 
of 2 projectors were obscured by a leaf shutter, which 
could be deflected by pulsing a galvanometer 
mounted on top of the projector. A Venner clock 
started when the projector shutters were opened, 
and stopped when S's vocal yes or no response closed 
the relay of a voice key. 

Displays. The words above and below were printed 
in lowercase Letraset (Univers 55) on cards and then 
photographed and mounted as s ides. The pictures 
were faces similar to those shown in Figure 1, with a 
dot above the head or below the chin. The ellipse 
outline of the face was 5 em. in length, and the center 
of the dot was 1 cm. from the circumference of the 
ellipse. These pictures were also photographed, and 
2 slides were made for each picture, so that the face 
could be presented in horizontal orientation, with 
the top of the head to the left or right of S, and the 
dot located at the top of the head or at the chin. 
On each trial, S saw the word above or below displayed 
above 1 of the 4 face pictures. Under the conditions 
of the experiment, the visual angles subtended by the 
word and picture were 2° and 3°, res tively. 
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TABLE 1 


PONSE LATENCIES (IN MSEC.) OF yes AND 
CTIONS TO above AND below DISPLAYS 
THE REFERENCE FACE was Hort 


ZONTALLY ORIENTED: E: 
Head above— below- above- below- 
direction above below below above 
Right 
M 976 1,055 1,122 1,114 
SEmu 55 70 74 55 
Left 
M 985 1,039 1,081 1,134 
SEm 63 17 78 91 
M 981 1,047 1,102 1,124 
Procedure, All Ss started with some practice in 


the use of the voice key and in classification of the 
displays. They were instructed that they should 
respond yes as rapidly as possible when the word 
display correctly specified the position of the dot 
relative to the face, and no when it did not, The 
main test session was organized as 3 blocks of 32 ob- 
servations h of 8 word-picture combinations 
occurred 4 times per block in a random order inde- 
pendently determined for each S at each block. 
Error responses were discarded and replaced, Each 
30-45-min. testing session was carried out in a 
quiet darkened room. 


Results and Discussion 

Error rates in this experiment were quite 
low, amounting to only 2.5% of all trials, 
and were not related in any consistent way 
to the main conditions of the experiment. 

A summary of the latency data is given 
in Table 1, classified by word-picture com- 
bination (above-above, below-below, above 
below, and below-above) and location of the 
dot (left or right). Each mean is based on 
12 observations on each of 8 Ss. A pre- 
liminary analysis of variance established 
that the effects for dot locations were not 
significant, either for yes or for no responses, 
Fs (1,7) < 1 and 1.6, and that the inter- 
action of this factor with effects for the 
words, above or below, also was not signifi- 
cant, F. (1,7) <1. 

The RTs were collapsed across dot 
position, and an analysis of the type em- 
ployed by Chase and Clark (1971) was 
run to test the effects of response (yes vs. 
no) and word (ubove vs. below). This 
analysis indicated that same combinations 
were verified about 100 msec. faster than 
different combinations, / (1, 7) = 59:89, 
p < .001. Neither the above-below differ- 
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TABLE 2 
PossiBLE SEMANTIC REPRESENTATION OF above-below DisPLAYs WHEN THE 
REFERENCE FACE Was HORIZONTALLY ORIENTED 
Item above-above below-below above-below below-above 

Word +vert/+ polar +vert/—polar +vert/-+polar /— polar 
Dot vert/— right vert / — left — vert / — left — right 
Face —vert/— right —vert/— right —vert/— right > right 
Dot/face-top T match — match —match 

Word/Dot/face-top T match +match —match 

Decision +true +true —true t 
Summed positivity 5 3 2 2 
Summed negativity 2 4 5 5 
Neutral components 2 z 2 2 


ence of 44 msec., F (1, 7) — 4.14, nor the 
Response X Word interaction, F (9 ya 
1.67, was significant. In a similar analysis, 
in which response and above vs. below 
pictures were factors, neither the pictures 
effect, F (1, 7) — 1.67, nor the Response 
X Pictures interaction, F (1, 7) = 4.14, 
was significant. 

These results contrast with those ob- 
tained for vertically oriented displays by 
Seymour (1969) and Chase and Clark 
(1971). Both previous studies showed 
highly significant effects for above vs. below 
and for same vs. different. In the present 
experiment, there was a large same- 
different effect, but the above-below effect 
was not significant. Hence, if one adopts 
the assumptions about independence and 
additivity of component encoding and 
comparison stages stated by Chase and 
Clark (1971, 1972) and Clark and Chase 
(1972), an appropriate conclusion might be 
that the above-below effect is dependent on 
the vertical arrangement of the picture 
display, but that the same-different effect 
is independent of this factor. This con- 
clusion is contrary to the predictions of the 
linguistic theory, which states that en- 
coding time is shorter for above sentences 
and above locations than for below sentences 
and below locations. Use of a horizontal 
reference face might be expected to add a 
small constant to the duration of the 
picture-encoding stage. Itshould not, how- 
ever, affect the difference in encoding time 
as between above and below sentences, or 
above and below pictures, which, in the 
theory, are supposed to reflect the time 
required to set up abstract propositional 
represen tations. 


The main prediction of the perceptual 


theory appears to be that there should be 
no difference between the same displays, 
above-above and below-below, or between 
the different displays, above-be! and 
below-above, although there may i» an 


overall same-different effect. This would 
follow if the above-below effect depended 
on location of the critical dot in the upper 
region of the figure. In the case of no re- 
sponses, the differences between above and 
below displays were not significant, F (1, 7) 
<1. However, in the case of yes responses, 
the difference of 66 msec. between ibove- 
above and below-below was Significant, 
F (1, 7) = 6.46, p <.05. The utcome 
for no responses is consistent with the per- 
ceptual theory, therefore, but the outcome 
for yes responses is not. 

In order to consider the predictions of the 
response availability model, it is nc cessary 
to make a number of assumptions about 
the nature of the semantic representation 
of the word-picture display. A possible 
formulation has been outlined in Table 2. 
It has been assumed that the encoding of 
the display involves establishment of 
featural descriptions of (a) the word above 
or below, (b) the absolute location of the 
dot (left or right), (c) the absolute location 
of the top of the face (left or right), (d) the 
outcome of a comparison of dot location 
and face-top location, (e) a comparison 
of word polarity and match or mismatch 
of the dot and face-top locations, and Cf) 
an indication of truth or falsity of the dis- 
play. The form of the featural description 
for word meaning, and dot and face-top 
location, is based on semantic analyses of 
locatives proposed by Leech (1969). The 
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TABLE 3 
POSSIBLE SEMANTIC REPRESENTATION OF NORMAL AND INVERTED FACE above-below DISPLAYS 
Normal Inverted 
Item => 
above— below — above- below- above- below- above— below- 
above below below above above below. below above 
Word -Fvert/ | -Fvert/ | +vert/ | +vert/ | +vert/ | +vert/ +vert/ | +vert/ 
+polar | —polar| +polar | —polar | +polar | —polar +polar | — polar 
Dot +vert/ | +vert/ | +vert/ | +vert/ | +vert/ +vert/ | +vert/ | +vert/ 
+polar | —polar | —polar | +polar | —polar +polar | + polar | —polar 
Face +vert/ | +vert/ |-Fvert/ | +vert/ | +vert/ | +vert/ +vert/ | +vert/ 
+polar| -Fpolar| +polar| +polar | —polar —polar | —polar | —polar 
Dot/face-top | -Ematch | —match | — match | J-match | +match | —match | —match | -- match 
Word/Dot/face-top | +match | +match | —match | — match | +match | -- match | —match | —match 
Decision -+ true -+true —true —true +true -+true —true —true 
Summed positivity 9 6 5 6 7 6 5 4 
Summed negativity 0 3 4 3 2 3 4 5 


vertical dimension is characterized by the 
feature +vert, and the horizontal dimen- 
sion by the feature —vert. The upper 
region of the vertical dimension is defined 
as positive and has the feature +polar. 
The lower region is negative and is assigned 
the feature —polar. Since the horizontal 
dimension is probably not bipolar in this 
sense, the directions left and right have 
been represented by the neutral features 
— left and — right. The outcomes of the 
various comparison operations are repre- 
sented by the features +match and 
— match. 

It is assumed that as S encodes the dis- 
play a featural representation of the type 
suggested in each column of Table 2 is con- 
structed. The response to be made depends 
ultimately on inspection of the value for the 
+match or — match feature, which defines 
the outcome of the comparison of word 
polarity and dot/face-top congruity. This 
corresponds to the test for sameness or 
difference of value in relation to a specified 
attribute in the model of Schaeffer and 
Wallace (1970). It is further assumed that, 
as the semantic representation of the dis- 
play is constructed, positive feature values 
tend to reduce the threshold for initiation 
of a yes response, whereas negative feature 
values will reduce the threshold for a no 
response. 

The last 3 rows of Table 2 show the 
values obtained by making a simple count 
of positive, negative, and neutral features 
for the 4 displays. Summed positivity for 
an above-above display has a value of 5 vs. 


a value of 3 for a below-below display 
Hence, the reduction in threshold for the 
yes response is expected to be somewhat 
greater for above-above than for below- 
below, and this might give rise to the small 
but significant difference in RT that was 
observed. In the case of no responses, 
summed negativity has a value of 5 for 
both above-below and below-above displays. 
Thus, these 2 displays will be expected to 
reduce the threshold for a no response by an 
equivalent amount, and no difference in 
RT is expected. 


EXPERIMENT II 


Experiment II was carried out with the 
aim of determining the effects on the above- 
below and true-false differences of inversion 
of the reference face. The words above and 
below were matched against the location of 
a dot relative to a reference face which 
might be normally oriented, or inverted, 
as shown in Figure 1(b). The normal 
orientation of the face corresponds to the 
condition tested in the studies by Seymour 
(1969) and Chase and Clark (1971), and 
may be expected to reproduce the signifi- 
cant above-below and same-different effects 
obtained in those studies. However, the 
3 theoretical positions discussed earlier lead 
to different predictions for the case where 
the reference face is inverted. 

If, as is implied by the perceptual theory, 
the above-below asymmetry depends on a 
tendency to focus attention on the top of 
the figure or to scan the figure in a top— 
down direction, inversion of the face should 


below. above below. 
YS ‘No’ 

Figure 2, Summary of mean reaction times (in 
msec.) for classification of vertical face displays 
(Experiment 11), (The means are for yes and no 
responses to above and below displays when the face 

might be normally oriented or inverted.) 


result in a symmetric reversal of the above- 
below effect. This assumes that S may take 
| some time to note the inversion of the 
face, but that, once this has occurred, he 
- will encounter the evidence critical for 
verification of the displav sooner for a 
below-below combination than for an above- 
above combination. Since these effects are 
arising at the stage of picture encoding, 
the same-different effect should not be 
altered. 
An assumption of the linguistic theory 
is that an abstract representation of “X 
above Y" may be set up more rapidly than 
a representation of “X below Y," for both 
“sentences and pictures. If sentence en- 
coding and picture encoding are genuinely 
independent mental operations, inversion 
of the reference face will not alter the 
difference in encoding time as between the 
words above and below. The effect of in- 
version should be on the duration of the 
picture-encoding stage, and should also be 
à constant increment which will leave the 
- difference in encoding time for above vs. 
| below pictures unaltered. Further, if the 
1 encoding stages are independent of the 
- comparison stage, inversion of the face 
- should not influence the magnitude of the 
| same-different effect. In general, therefore, 
- the linguistic theory predicts a constant 
increment in RT when the face is inverted, 
- but no change in the basic above-below and 
i true-false effects. 
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The predictions of the response avail- 
ability model must again depend on an 
assessment of the balance of positive and 
negative components in the semantic repre- 
sentation of the displays. Table 3 gives 
the outcome of an analysis which is similar 
in form to that proposed for the horizontal 
faces. Word meaning, dot location, and 
face orientation are represented by a +vert 
feature and a +-polar or —polar feature, 
and the outcomes of comparisons of dot 
location and face-top location, or word 
polarity and congruence of dot and face- 
top, by +match or — match features. The 
last 2 rows of the table show the values for 
summed positivity and summed negativity 
obtained for each display. For normal 
orientation of the face, there are 9 positive 
components in the representation of an 
above-above display, but only 6 in the 
representation of a below-below display. 
However, when the face is inverted, the 
ratio of positive components becomes 7:6. 
The prediction of the response availability 
model for yes responses, therefore, is that 
there will be a significant difference betweer 
above-above and below-below displays when 
the face is normal, but that this difference 
will be much attenuated when the face is 
inverted. In the case of no responses, tix 
tatio of negative components is 4:3 in 
favor of above-below for norma! orientation 
of the fact, but 5:4 in favor of below above 
when the face is inverted. The boi pre- 
diction, therefore, is that above-below will 
be classified faster than below-above when 
the face is normal, but that the direction of 
this difference will be reversed when the 
face is inverted. 


Method 


Subjects. Eight volunteers, all University of 

Dundee students, served as unpaid Ss. 
„Apparatus and displays. These were identical to 
Experiment 1 except that the faces were now dis- 
played in a normal or inverted orientation, and the 
word display appeared to the left of the picture on 
each trial. 

Procedure. The procedure was similar to that 
followed in Experiment I. After some preliminary 
Practice, each S recorded a total of 96 observations 
in which each of 8 word-picture combinations oc- 
curred 12 times in random orders inde; ently 
determined for each S. Error responses were dis- 
carded and replaced, 
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Results and Discussion 


Error rates were again low in this experi- 
ment, being limited to 2.8% of all trials. 

Figure 2 gives a summary of the latency 
data for the 4 word-picture combinations 
and for normal or inverted orientation of 
the reference face, A preliminary analysis 
was carried out to assess the effects of in- 
version of the face in relation to differences 
among the word-picture combinations. 
This analysis indicated that inverted face 
displays were classified about 58 msec. 
slower than normal face displays, F (t, 7) 
= 71.73, p < .001, and also that there were 
significant differences among the word- 
picture combinations, F (3, 21) = 10.39, 
p < 001. The Orientation X Displays 
interaction was also significant, F (3, 21) 
= 20.50, p < .001. 

The interaction between face orientation 
and differences among word-picture dis- 
plays appears contrary to the predictions 
of the linguistic model of Chase and Clark 
(1971). If inversion of the face added a 
constant to the duration of the picture- 
encoding stage, and picture encoding, word 
encoding, and comparison were truly in- 
dependent operations, the effects for dis- 
plays and face orientation would be inde- 
pendent. As it is, the results for the normal 
face appear similar to those obtained in the 
previous studies, in that they show quite 
large above-below and Irue-false differences, 
whereas these differences are absent, or in 
the opposite direction, when the face is 
inverted. 

As a check on these conclusions, an 
analysis was carried out on the data for 
the normal face condition in which response 
(yes vs, no) and word (above vs. below) were 
factors. The differences of 163 msec. 
between the means for yes and no, and of 
91 msec. between above and below displays, 
were significant, with Fs (1, 7) = 35.74, 
p < 001, in both cases, although the 
Response X Word interaction was not 
significant, F (1, 7 < 1. A similar 
analysis of RTs for the inverted face dis- 
plays failed to show a significant response 
effect, F (1, 7) < 1. Displays with the 
word below were classified 60 msec. faster 
than displays with the word above, and this 
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reversal of the usual effect was significant, 
F (1, 7) = 6.93, p < .05. The Response 
x Word interaction was not significant. 

In the case of yes responses, inversion 
of the face resulted in a delay of response of 
about 200 msec. for above-above combina- 
tions, and about 90 msec. for below-below: 
combinations. The difference between the 
2 same displays was significant when the 
face was normally oriented, F (1, 7T) 
= 13.41, p < .025, but not when the face | 
was inverted, F (1, 7) = 1.46. Thus, for - 
congruent combinations, inversion of the 
face resulted in an overall delay in RT, and -— 
also in elimination of the above-below differ- 
ence. This outcome appears consistent 
with the predictions of the response avail- 
ability theory but not those of the linguistic 
or perceptual theories. The linguistic 
theory predicts maintenance of an above - 
below difference when the face is inverted, 
whereas the perceptual theory predicts a 
reversal of this difference, The count of “4 
positive features shown in Table 3 indicates — 
that the above-below effect should be at- 
tenuated when the face is inverted, and 
it was this result that was obtained. 

Further analyses of no response times - 
indicated that inverted face displays were 
classified 33 msec. faster than normal face — 
displays, but that this difference was not 
significant, P (1, 7) = 3.13. The effect for 
words (above vs. below) also was not signifi- 
cant, but there was à significant Orienta- 
tion X Words interaction, F (1, 7) * 24.67, — 
p < 01. The absence of an effect for face — 
orientation and the occurrence of the inter- | 
action appear counter to the predictions - 
derived from the linguistic theory. The 
result is also in the wrong direction for the ^ 
perceptual theory, since displays having | 
the dot in the lower region of the figure 
were classified significantly faster than dis- 
plays having the dot in the upper region, - 
F (4,7) = 24.07, p < .01. However, the 
interaction is consistent with the pred z 
tions of the response availability model. — 
Summed negativity has a value of 4 for 
above-below combinations for both normal 
and inverted orientations of the face. 
When the face is normal, the representation 
of the below-above display contains only 3 
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negative components. Thus, under this 
condition, no responses should be faster to 
above-below displays than to below-above 
displays, and this is indeed the case, F (1, 7) 
= 20.93, p « .01. When the face is in- 
verted, the sum of negative components in 
the below-above representation increases to 
5. This increment in negativity is expected 
to lower the threshold for selection of a 
no response. The data indicate that, 
under this condition, belo:w-above displays 
were classified significantly faster than 
above-below displays, F (1, 7) — 12.63, 
p< .025. 

A further point of difficulty for both the 
perceptual and the linguistic theory is raised 
by the failure to obtain a true-false effect 
under the inverted face condition. If, as 
both theories imply, inversion of the face 
has its main effect on the duration of a 
picture-encoding stage, and this stage is 
independent of a comparison stage, there 
should be no effect on the magnitude of the 
Irue-false difference. In the model of Chase 
and Clark (1972), and Clark and Chase 
(1972), the true-false effect is defined as the 
time required to alter a “truth index" from 
"true" to "false." If this is correct, a 
true-false effect would be obtained in both 
the normal and the inverted face conditions. 
Thus, the absence of an effect in the present 
Situation counts as evidence against Chase 
and Clark’s (1971, 1972) assumptions about 
the independence and additivity of the en- 
coding and comparison stages. The re- 
sponse availability model is sufficiently 
flexible to accommodate variations in the 
size of the /rue-false effect. For example, 
presence of a negative feature for face 
orientation might raise thresholds for yes 
responses and lower thresholds for no re- 
Sponses, thus overriding a general bias in 
favor of yes responses. 


GENERAL Discusston 


The analyses of these 2 experiments appear 
sufficient to eliminate the Perceptual theory 
of Seymour (1969) as an explanation of the 
above-below asymmetry observed in word- 
picture comparison tasks. The theory must 
have difficulty in accommodating (a) the 
difference between above-above and below- 


below observed for horizontal í Lys, 


(b) the absence of a difference b E 
2 displays observed when the 

verted, (c) disappearance of the 

effect when the face was inverted, and 


tendency is Experiment II for ;; nses to 


occur faster to displays having t 
lower region than to displays ha dot 
in the upper region. 

The experimental results also if 
cient to disconfirm Chase and Cia 1 
assumption that the RT in this ty; con 
parison task is the sum of the ns of 
independent and successive w 
picture encoding, comparison and i 
stages, which differ in duration for above y 
below concepts and for true vs. falsi displays 
If word encoding and picture encoding were 
truly independent operations, an 4: n in 
the orientation of the reference face would 
affect only the duration of the pic ding 
stage. Further, if the compar nded 
on establishment of abstract asse: f rela 
tive location, and above assertious were 
established quicker than below assertions, 
picture encoding should be faster for above 
displays than for below displays even when 
the face is inverted or rotated to the horizontal. 
The difference in time to encode the words 
above and below should also remain unaffected, 
as should the comparison time difference be- 
tween true and false displays. The data are 
inconsistent with this position in the fol! 
respects: (a) The overall effects 
(above vs. below) and picture locati 
vs. below) were not significant whes ihi 
was horizontally oriented or when 
inverted, (b) The true-false effc 
nated when the face was invert 
version of the face did not result | rall 
delay of reaction for no response Experi- 
ment II, but did result in a reversal in the 
direction of the difference between above-below 
and below-above combinations, 

It has been argued that the obtained results 
are consistent with a response availability 
model of the type which has been described. 
The main assumption of the model is that 
variations in RT in the sentence-picture com- 
parison task reflect momentary adjustments 
in thresholds for selection of yes or no responses 
Which occur as the encoding the display pro- 
Ceeds. Positive features in the representations 
of word and picture meaning, or positive out- 
comes of matching operations, tend to lower 
the threshold for a yes response. [t is a 
facilitation effect of this sort th 


at underlies 


| 
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the above-below difference, rather than an 
absolute inequality in encoding time. This 
explanation may be extended to other pairs 
which involve an unmarked/marked, or 4- polar 
vs. — polar distinction, such as large vs. small 
(Seymour, 1971). A formulation of this type 
is sufficiently flexible to accommodate cases 
where the effect does not occur. For example, 
both the above-below and the Jarge-small effects 
may be eliminated by instructing Ss to report 
no to same displays, and yes to different dis- 
plays. In the present experiments, an altera- 
tion in the orientation of the reference shape 
was sufficient to attenuate or eliminate the 
above-below effect. It was argued that this 
might occur if a shift from normal vertical 
orientation of the reference face had the effect 
of introducing negative components into the 
semantic representation of the picture. As 
shown in Tables 2 and 3, this tends to decrease 
the difference in summed positivity scores as 
between the above-above and below—below 
combinations, and this in turn will reduce the 
difference in yes response times. 

An important feature of the response avail- 
ability model is that it predicts that facilita- 
tion of no responses may occur when the 
number of negative components in the repre- 
sentation of the display is increased. The 
summed negativity scores in Tables 2 and 3 
correctly predict (a) the absence of a difference 
between above-below and below-above dis- 
plays when the face is horizontal, (b) similar 
RTs for above-below for normal and inverted 
face displays, (c) faster times for below-above 
when the face is inverted than when it is 
normally oriented, and (d) a reversal in the 
direction of the difference between above-below 
and below-above for normal vs. inverted face 
displays. The model can also accommodate 
interactions of display characteristics with the 
size of the yes—no difference if negativity in the 
semantic representation tends to equalize the 
thresholds for the yes and no responses. 

The main contention of this article, there- 
fore, is that RTs obtained in a simple word— 
picture comparison task may be quite well 
represented in terms of effects of semantic 
positivity/negativity on thresholds for yes and 
no responses. Some question arises about the 
generality of these conclusions, and in par- 
ticular whether the difficulty encountered by 
the additive stages model in relation to the 


455 


present, relatively simple, experimental situa- 
tion necessarily implies a weakness in the more 
complex situations examined by Chase and 
Clark (1972) and Clark and Chase (1972). 
Krueger (1972) has presented evidence that 
the encoding and comparison stages are not 
additive in their effects on RT in the sentence- 
picture comparison task, in that effects at- 
tributable to sentence form (affirmative vs. 
negative) interact with effects supposedly 
arising during encoding (axis match effect). 
In general, the consequences of a negative in 
the sentence should be quite similar to those 
of inverting the face (negating the picture) in 
Experiment II of the present study, and should 
tend to eliminate or reverse the /rue-false 
effect. Thus, the Affirmative-Negative X 
True-False interaction obtained by Clark and 
Chase (1972) and other investigators may be 
handled by the response availability model, if 
it is assumed that a negative in the sentence 
causes a general rise in both response thresh- 
olds, and also tends to lower the zo threshold 
relative to the yes threshold. 
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Three studies investigated the effect of presenting a given test 
more frequently than the other test stimuli on generalization, 


training on 4 
identify the s 


single stimulus, either 100, 150, or 200 gm., Ss were 
mulus in a test series of 5 weights, ranging in value 10( 


in 25-gm. steps. Systematically overrepresenting each of the 5 to 
produced an ordered set of generalization gradients. A tendenc 
generalization responses to shift toward the overrepresented test stim 
observed. The data were interpreted in terms of adaptation leve} ti 
Direct evidence was found for the link between generalization respon nd 
adaptation level operationally defined by the category-rating method. 


In human voluntary stimulus general- 
ization, the context in which the original 
training stimulus (S+) is placed in testing 
has been shown to greatly alter the shape 
of the gradient of stimulus generalization. 
In an asymmetrical context (e.g. S+ is 
either the largest or smallest of the test 
values), Ss tend to choose central values of 
the test range as the original training 
stimulus more often than the S+ value 
itself (Thomas & Jones, 1962). This cen- 
tral-tendency effect has since been found 
with the following stimulus dimensions: 
size-area (Helson & Avant, 1967), weight 
(Hébert, Origlio, & McGuirk, 1972; Hé- 
bert & Capehart, 1969), brightness 
(Thomas, Svinicki, & Vogt, 1973), 
and line angle (Giurintano, 1972). How- 
ever, Thomas and Jones, using hue as a 
dimension, also found that Ss respond ac- 
curately to S+ as the original training 
stimulus if S4- is placed symmetrically in 
the middle of the test values. That Ss 
tend to respond to central values of the 
test range regardless of the position of S4- 
in that test range was confirmed by Helson 
and Avant, 

Related to this context effect is the find- 
ing that presenting S+ more frequently 
than the other test stimuli negates the 
central-tendency effect in an asymmetrical 
test context (Hébert & Capehart, 1969; 

' Requests for reprints should be sent to John A, 


Hébert, Department of Psychology, Colorado 
State University, Fort Collins, Colorado 80521. 


Hébert et al, 1972). i 
following training on gl eight, onc 
group of Ss was test h an 
representation” procedi 
was presented 5 times n 
other test stimuli. A; group was 
given an “equal-presentation” procedure 
in which S+ was presented equally as 
often as the other test stimuli. Since S+ 
was the smallest value in the test series, 
creating an asymmetrical context, a central- 
tendency effect would be expected. How- 
ever, the  overrepresentation-procedure 
group did not tend to choose central values 
as S+, as did the equal-presentation group, 
but rather, correctly chose the S+ value 
more often than any other test stimulus, 
yielding a downward, linear oradi 
Thus, a strong frequency 

with a context effect, < 12) 
has also demonstrated 

with a line angle dimensi: 

Capehart, Tempone, ar: Hébert (1969) 
interpreted these data in terms of an 
adaptation level (AL) approach to a stim- 
ulus generalization. According to AL 
theory, when S is presented with a single 
stimulus value in training, a training AL 
is established at that value on à subjective 
dimension. What S learns is not “stimulus 
of value x is correct" but “the stimulus 
whose value corresponds to AL is correct.” 
Thus, if in testing AL shifts away from S4- 
Íor some reason such as an asymmetrical 
context, then a concomitant shift in S’s 


"over- 
ch S+ 
quently than 
ier 


ienomenon 
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judgment of which stimulus is S+ will be 
observed. The AL by which S makes a 
judgment can be represented as a weighted 
average of the training and test ALs: 


AL = wiALtrain + WeALtest, [1] 
where w; and w, are subjective weights 
which sum to 1 and reflect the relative 
contributions of training and testing to 
the prevailing AL. With this model, 
varying the amount of training or of testing 
would alter the values of the subjective 
weights and thus result in different pre- 
vailing ALs. Similarly, the value of the 
training AL can be manipulated by choos- 
ing different values for S+, while the value 
of the test AL can be manipulated by choos- 
ing different test stimuli. In addition, the 
value of the test AL can be altered by 
manipulating the frequency of presentation 
of test stimuli. For example, when S+ 
is overrepresented in testing, greater weight 
is given to it in averaging the stimuli to 
obtain AL (Capehart et al., 1969; Helson, 
1964). The greater weight given the over- 
represented stimulus biases or "pulls" AL 
toward that value. 

However, the frequencies of stimuli 
other than S+ can also be manipulated. 
Suppose, for example, that S+ is the small- 
est of 5 test stimuli, and that the stimulus 
value most distant from S+, say Ss, is 
presented with much greater frequency 
than the other 4 test stimuli, including S+. 
Under these circumstances, S would have 
greater weight, pulling AL towards it, thus 
producing a larger discrepancy between 
training and test ALs. This would pro- 
duce the greatest amount of shift in the 
gradient, possibly beyond the center of 
the test values. A design that systemati- 
cally manipulates the frequencies of pre- 
sentation of each test stimulus indepen- 
dently could provide a more exacting test 
of the AL approach. An ordered set of 
gradients should be obtained if the model 
is at least correct in an ordinal sense. 

Anderson’s (1971) work in integration 
theory and its relation to AL (Anderson, 
1973) also suggests a rigorous test of the 
averaging hypothesis of Equation 1. A 
factorial design in which both the training 
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and test ALs are manipulated should yield 
test data which must meet the key pre- 
diction of a simple averaging model; A 
graphical representation of the interaction 
between training and testing manipulation 
should yield parallel curves. That is, the 
Training X Testing interaction should be 
"zero in principle and nonsignificant in 
practice [ Anderson, 1970, p. 155]." What 
is needed to make this test of the averaging 
model is a response measure directly re- 
lated to the prevailing AL. 

The present experiments were designed 
to test the effects of systematically manip- 
ulating the frequencies of presentation of 
the test stimuli in both symmetrical and 
asymmetrical test contexts. In addition, 
an experiment to tie S’s responses directly 
to Helson's (1947, 1964) operationally 
defined AL concept was conducted. 


METHOD 


Experiments I and II were designed to test the 
effect of manipulating test stimulus frequencies 
following training on either the lightest or heaviest 
of the test stimuli (Experiment I) or following 
training on the intermediate value of the test 
stimuli (Experiment I). 

Subjects and apparatus, A total of 120 students 
served as Ss in Experiment 1; 60 students served as 
i The Ss in both experiments 
à course requirement. The 
stimuli consisted of a set of 5 weights ranging in 
value 100-200 mg. in 25-gm. steps. The weights 
were made from aluminum cans 10,8 cm, tall and 
5.5 cm. diameter, and were filled with wax and shot 
and painted to appear identical, When the blind- 
folded .S lifted a given weight with the preferred 
hand, a microswitch was released which started a 
timer, When S reached a decision (same or different), 
he pressed 1 of 2 response keys with the index finger 
of his free hand, which rested above and between 2 
closely s sponse keys, labeled same and dif- 
ferent. ig of 1 of the 2 keys stopped the 
timer and activated a response light, and Æ then 
recorded choice and latency of response. 

Procedure. were told that they were 
participating in a weight perception experiment. 
In Experiment 1, they were instructed to lift the 
training weight or S+ (either 100 or 200 gm.) 5 
times and to remember it because they were to 
identify it in a test series of weights immediately 
following training. They were told to press a same 
key if a test stimulus was the same as S+, or to 
press a different key if they felt it was different 
(either heavier or lighter). In addition, Ss were 
asked to respond "as quickly and accurately as 
possible." Following instructions, all Ss were 


458 ]. 


mA 100-100 al 100-100 
ED 42 
.20 1.0) 
.00 os) 
+ 


+ 

= 100-125 7 100-125 
40) ? 

.20 Ci 

00 oa 

+ + 

ta 100-150 
40 

20 

E 


* 


Proportion of Same Responses 
Mean Decision Time (Secs.) 


80 
100-175 (31 100-175 
60 
42 
40 
1.0) 
EDI 
0.8 
oo| 
+ + 
à 100-200 m 100-200 
E 
1.0] 
o 
o.e] 
ao 
0.5 
00 
100 125 178 200 100123160 175 200 


+ + 


Stimuli (Grams) 


FiGURE 1. Generalization gradients of same 
responses and decision time for the 100-gm. train- 
ing group. 


blindfolded for the rest of the session, The training 
procedure for Experiment II, conducted after the 
completion of Experiment I, differed in only one 
respect, namely, Ss were trained on 150 gm., the 
intermediate value of the test series. Testing pro- 
cedures were identical in both experiments. 
Following training, which consisted of lifting 
S+ 5 times in a slow, paced fashion, test trials 
were immediately begun. A test trial consisted of 
S's grasping the weight from the top with 4 fingers 
and a thumb, lifting the weight up with elbow 
resting on table, making the response, bringing 
the weight down, and releasing it. Shaking or 
swinging the weight was not permitted. 
In Experiment I, half of the Ss were trained on 
100 gm. and half on 200 gm. Following this, Ss in 
both groups were assigned randomly to 1 of 6 test 
groups, 10 Ss per group, yielding a 2 X 6 factorial 
design. The 6 test groups for both the 100- and 
200-gm. training groups consisted of a control 
group in which the 5 test stimuli were presented 
equally often (5 times each) and 5 other groups 
which differed in terms of which test stimulus was 
overrepresented. There were 25 presentations of 
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RESULTS AND DISCUSSION 


The generalization gradients for each of 
the 10 experimental groups are depicted 
in Figures 1 and 2 for the 100- and 200-gm. 
training groups, respectively. The left- 
hand columns of both figures depict 
gradients of same responses for each group, 
while the right-hand columns represent the 
latency data for the same groups. In- 
spection of the choice gradients in the left- 
hand columns of Figures 1 and 2 indicates 
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CONTEXT AND FREQUENCY EFFECTS IN RESPONSE GENERALIZATION 


a systematically increasing tendency to 
choose stimuli farther from S+ as the over- 
represented stimulus becomes farther from 
S+, as predicted from the AL model. 
Statistically, this is represented by a sig- 
nificant Test Procedure X Stimuli inter- 
action, F (16, 360) = 5.13, p < .001. How- 
ever, the amount of shift appears greater 
for the 200-gm. training groups, resulting 
in a significant Training X Test Procedure 
X Stimuli interaction, F (16, 360) = 4.31, 
p «.001. The most dramatic shift oc- 
curred with Group 200-100, whose data 
are presented in the bottom left-hand panel 
of Figure 2. This group shifted past the 
center (150 gm.) to some point between 
100 and 125 gm. In contrast, Group 100- 
200, the counterpart of Group 200-100, 
failed to shift even quite to the center (see 
lower left-hand panel of Figure 1), if shift 
is defined by the modal response value or 
peak. This asymmetry may reflect the 
fact that the perceived midpoint of these 
stimulus values is closer to 125 gm. than 
to 150 gm., the arithmetic midpoint, when 
perceived midpoint is defined by AL. The 
AL of this test series is calculated to be 
approximately 128 gm. according to Hel- 
son's (1947, 1964) formula for lifted 
weights, and this value is confirmed by 
category-rating data from our laboratory. 
The notion of "central-tendency effect” 
(Thomas & Jones, 1962) might best be 
conceptualized as a tendency to respond to 
the psychological center, not the physical 
center. 

Inspection of Figures 1 and 2 indicates a 
correspondence between choice and la- 
tency data. The stimuli most often as- 
sociated with same responses are also as- 
sociated with the longer latencies than those 
of stimuli rarely chosen same, though this 
correspondence is far from exact. Capehart 
et al. (1969) offered the explanation that 
stimuli close to AL are less "distinctive" 
than those far from AL, and therefore, 
take longer to respond to than the more 
"distinctive" stimuli far from AL. Since 
same responses are associated with AL, 
then same responses take longer. 

The gradients of same responses of the 5 
experimental groups from Experiment II 
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Ficure 2. Generalization gradients of same re- 
sponses and decision time for the 200-gm. training 
group. 


are depicted in Figure 3A. It is clear from 
the inspection of these gradients that the 
same systematic effects of the frequency 
manipulation occurred here as in Experi- 
ment I, resulting in a significant Test 
Procedure X Stimuli interaction, F (16, 
180) = 3.56, p < .001. The most notable 
aspect of Figure 3A is the asymmetry: 
The modal response value is 125 gm., not 
150 gm., and the 100-gm. stimulus receives 
many more responses than its counterpart 
200 gm. This again may be due to the 
fact that perceived midpoint of this series 
(defined by AL) is close to the 125-gm. 
stimulus, not 150 gm. Figure 3B depicts 
the gradients of the 100-E, 150-E, and 
200-E control groups from both experi- 
ments, and again, the tendency to a shift 
in response toward the 125-gm. stimuli is 
apparent. The term central tendency should 
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Ficuxe 3, Generalization gradients for the experi- 
mental groups trained on the 150-gm, weight are 
plotted in the upper panel; generalization gradients 
for the equal-presentation control groups for 
Experiments | and || are plotted jn the lower 
panel, 


be used with caution since the 150-E group 
shifted away from the center of the series, 
although one could say foward AL, the 
psychological center. Note that the 100-E 
group did not actually peak at 125 gm., 
as in the Hébert and Capehart (1969) 
study, but at 100 gm. However, compared 
to the 100-100 group, there was a much 
greater tendency to respond to central 
values. An analysis comparing just these 
2 groups still yields a significant Test Pro- 
cedure X Stimuli interaction, F (4, 72) 
= 3.22, p < .05. 

While the above data are consistent with 
the AL approach, a rigorous test of the 
model as expressed in Equation 1 is also 
possible, Normally, AL is measured by 


means of a category-rating method (Helson, 
1947), but in this situation no ratings were 
taken. Capehart et al. (1969) proposed 
that the modal or peak response value is 
closest to AL in generalization experiments, 
and as such, is a crude indicator of AL. 
Unfortunately, this measure ignores re- 
sponses to all other stimuli, and is, there- 
fore, biased. A better index is the mean 
response value (Thomas, 1962) which takes 
into account responses to all stimuli by 
weighting each stimulus value by the 
frequency. with which it was called same. 
The mean response value for each S in the 
15 experimental groups in Experiments | 
and II was calculated and analyzed by 
means of a 3 X 5 completely randomized 
factorial design, with 3 training conditions 
(100, 150, and 200 gm.) and S test pro- 
cedures (overrepresentation of 100 through 
200 gm.). In.terms of the additive model 
of Equation 1, 3 training ALs are factorially 
combined with 5 test ALs, and, further, 
à plot of the curves of these groups should 
be parallel, or noninteractive. The Train- 
ing X Test Procedure interaction depicted 
in Figure 4 was not statistically signif- 
icant, F (8, 135) = .65, p > .05, indicating 
that no meaningful interaction is present. 
However, Anderson (1973) has indicated 
that nonparallelism should be present in 
the form of a bilinear interaction. Since 
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FIGURE 4. The nonsignificant Training X Test 
Procedure interaction for the 15 independent groups 
from Experiments I and IT, 
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this interaction is concentrated in only 
1 degree of freedom, a specific bilinear 
F test was made, but no significant inter- 
action was found, F (1, 135) = 2.27, 
p> .05. Thus, the additive model of 
Equation 1 is not disconfirmed and re- 
mains a tenable hypothesis. 

The question still remains whether the 
concept of AL as used in this study is 
substantially the same as the concept as 
operationally defined by Helson (1947, 
1964) ; neither is there any direct evidence 
that Ss were actually linking same re- 
sponses to AL. A third experiment was 
conducted to answer these questions. Nine 
introductory psychology students served 
as Ss, Training and testing of these stu- 
dents was identical to the 150-E control 
group. That is, Ss were asked to heft 
the 150-gm. training weight 5 times, and 
then were required to identify S+ in a 
subsequent test series of 5 presentations 
each of the 5 test stimuli, However, the 
testing procedure differed in one important 
respect: Following each same or different 
response each S was asked to rate the weight 
just lifted on a 7-point category-rating 
scale (1 = very light, 2 = light, 3 = fairly 
light, 4 = neither light nor heavy, 5 = 
fairly heavy, 6 = heavy, 7 = very heavy). 
The 150-gm. training stimulus was not 
identified at any point on the rating scale 
by Æ, and no instructions to do so were 
given to S. 

With this rating procedure, the stimulus 
which is closest to an average 4 rating can 
be assumed to be closest to AL, which 
would be operationally equivalent to a 
stimulus rated exactly 4, Figure 5 de- 
picts the patterns of choice and latency of 
response for this third experiment. In- 
spection of the gradient of same responses 
(solid line shows that 125 gm. elicited the 
greatest proportion of same responses, not 
the 150-gm. S+). This is in close agree- 
ment with the gradient from Group 150-E 
shown in Figure 3B. As predicted, the 
125-gm. stimulus also had the mean rating 
closest to 4.00 (100 gm. = 2.62, 125 gm. 
= 3.87, 150 gm. = 4.51, 175 gm. = 5.40, 
and 200 gm. = 6.20) The most com- 
pelling evidence that AL and same responses 
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are linked is the finding that 50 of the 54 
same responses in this study were to stimuli 
which were subsequently rated 4. Ad- 
ditionally, none of the differen! responses 
were given a 4 rating. 

These 3 studies taken together cast 
doubt on the assumption that physical 
similarity is the primary basis of stimulus 
generalization of voluntary responses in 
human Ss. The powerful effects of fre- 
quency and context must be accounted for 
in any theory of voluntary stimulus general- 
ization. The Capehart et al, (1969) theory 
of stimulus equivalence provides such an 
account, and the data in these studies 
offer strong support for the theory, As a 
final note, the reader should be reminded 
of the possibility of interpreting these data 
with Parducci's (1965) range-frequency 
model. Since both the AL and range- 
frequency approaches have generally the 
same qualitative predictions, range-fre- 
quency theory, with additional assumptions 
relating to a learning context (e.g., What 
is learned 2), may be useful for the inter- 
pretation of these data. 


REFERENCES 


Anperson, N. HL 
al judgment. 
70. 
ANDERSON, N. H. Integration theory and attitude 
change. Psychological Review, 1971, 78, 171-206. 
ANDERSON, N. H. Algebraic models in perception. 
In E. C. Carterette & M. P. Friedman (Eds.), 


Functional measurement. and 
Psychological Review, 


b 


462 J. A. HÉBERT, M. BULLOCK, L. LEVITT, K. WOODWARD, AND F. D. McGUIRK 


Handbook of perception. Vol. 2. New York: 
Academic Press, 1973. 

CAPEHART, J., TEMPONE, V. J., & HÉBERT, J. A 
theory of stimulus equivalence. Psychological 
Review, 1969, 76, 405—418. 

GIURINTANO, L. P. Stimulus generalization as a 
function of training and test stimulus parameters. 
Unpublished doctoral dissertation, University of 
Colorado, 1972. 

HÉBERT, J. A., & Capenart, J. E. Generalization 
of a voluntary response as a function of presenta- 
tion frequency of the training stimulus in testing. 
Psychonomic Science, 1969, 16, 315-316. 

HÉBERT, J. A., OrIGLIO, D. P., & McGuirk, F. D. 
Training and testing effects in the generaliza- 
tion of a voluntary response. Psychonomic 
Science, 1972, 26, 209—210. 

HrLsoN, H. Adaptation-level as a frame of refer- 
ence for prediction of psychophysical data. 
American Journal of Psychology, 1947, 60, 1-29, 


Hetson, H. Adaptation level theory. New York: 
Harper & Row, 1964. 

HELSON, H., & Avant, L. L. Stimulus gen 
tion as a function of contextual stimuli. 
of Experimental Psychology, 1967, 73, 565 


Parpucci, A. Category judgment: 


frequency model. Psychological Review, 1905, 
72, 407418. 

Tuomas, D. R. The effects of drive and discrimina- 
tion training on stimulus generalization. urnal 
of Experimental Psychology, 1962, 64, 24 

Tuomas, D. R., & Jones, C. G, Stimulus eral- 
ization as a function of the frame of relerence, 
Journal of Experimental Psychology, 1962, 64, 
77-80. 

Tuomas, D. R., Svinick1, M. D., & Vour, J. 
Adaptation level as a factor in human discrimina- 
tion learning and stimulus generalization. Journal 


of Experimental Psychology, 1973, 97, 210-219. 
(Received June 2, 1973) 


Journal of Experimental Psychology 
1974, Vol. 102, No. 3, 463-472 


RELATIONSHIPS AMONG HIGHER ORDER ORGANIZATIONAL 


MEASURES AND FREE RECALL! 


JAMES W. PELLEGRINO? anb WILLIAM F. BATTIG 
University of Colorado 


Higher order subjective organization units were measured in categorical 
and unrelated lists under fixed and random presentation schedules. Such 
units occurred frequently and also varied in sequential consistency across 
recalls thereby demonstrating the inadequacy of pair-wise sequential indices 
for measuring total organization. Further data confirmed the necessity for 
examining such units since, when included as additional predictors, they 
accounted for 96% of the between-groups recall variance. Several other 
organization measures indicated interactive relationships between organi- 
zational processes and changes in such relationships with practice which 
may partially explain inconsistencies in the organization-recall literature. 
Additionally, the present organizational data raise questions about the 
adequacy of current theories of storage and retrieval processes in free recall. 


Recent research in free recall has focused 
on the role of organizational processes in 
memory and questions concerning the 
storage and retrieval of information as 
related to such processes (e.g., Postman, 
1972; Slamecka, 1968). Postman has 
noted 2 major current problems related 
to the study of organizational processes 
in free recall. 

The first problem concerns the adequacy 
of certain organizational measures as valid 
indices of the total organization that has 
occurred at any point in learning, parti- 
cularly with respect to measures of sub- 
jective organization (SO). The 2 major 
indices of SO—the intertrial repetition 
measure (Bousfield & Bousfield, 1966) and 
the SO measure (Tulving, 1962)—have 
been criticized as having very serious 
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shortcomings both in the size of the se- 
quential units that are examined and the 
strict sequential criteria that are employed. 
Shuell (1969) and others (e.g., Postman, 
1972) maintain that such simple pair-wise 
sequential indices would seriously under- 
estimate the actual organization that oc- 
curred if Ss formed units containing more 
than 2 items with free-varying internal 
sequential constraints, i.e., the items within 
a unit always appear together but not 
necessarily in the exact same sequential 
order each time. Specifically, a unit of 
Size 5 could be randomly reproduced 5! 
possible ways which yield an average of 
only 1.6 pair-wise repetition as compared 
to a maximum of 4. Empirical justifica- 
tion for such criticism of the current mea- 
sures therefore requires demonstrating (a) 
the existence of such higher order units 
and (b) that such units have internal 
sequential variability across successive 
recalls. 

Pellegrino (1971) has presented a tech- 
nique for measuring organization which is 
capable of dealing with both issues. Under 
this procedure, the degree of sequential 
consistency can be examined for units of 
any size rather than being limited only to 
units of Size 2. This technique also per- 
mits measurement of organization based 
on any of the following 3 different se- 
quential criteria: (a) unidirectional, re- 
quiring exact reproduction of the sequen- 
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tial order, as characterizes most previous 
measures; (b) bidirectional, counting also 
a mirrorimage reversal as an organiza- 
tional unit; and (c) unordered, requiring 
only that the items be recalled together 
in any sequential order. For example, if 
Items A, B, and C were recalled in that 
order on Trial £ on Trial ¢+1 only 
A-B-C would. represent a unidirectional 
unit, either A-B-C or C-B-A would be 
bidirectional units, whereas any of the 3! 
possible orderings of these 3 items would 
constitute unordered units. In the present 
measure the actual number of repetitions 
of each type is separately determined for 
Unit Sizes 2-5 and compared to an ap- 
propriate chance expected value. The 
actual chance deviation —(observed-ex- 
pected) is divided by the maximum possible 
deviation (maximum-expected) to provide 
a ratio organization score (adjusted ratio 
clustering, ARC’) whereby 0 represents 
chance and 1.0 perfect organization. 

The initial purpose of this experiment 
was to employ this technique to examine 
how extensively higher order units occur, 
their sequential properties, and the effects 
of certain free-recall variables on the de- 
velopment of such units. To demonstrate 
the inadequacy of pair-wise measures re- 
quires not only the existence of higher 
order units, but more importantly they 
must be shown to vary in internal struc- 
ture from trial to trial. If there are no 
differences between the 3 sequential order- 
ing criteria, then pair-wise sequential mea- 
sures would appear adequately sensitive. 

A second major problem concerns the 
relationship of free-recall acquisition per- 
formance (mean recall) to the amount of 
organization, Several different types of 
organizational processes have been shown 
to function in free recall, eg. primacy 
and recency effects (Tulving, 1968), pri- 
ority of recall of newly learned items 
(Battig, Allen, & Jensen, 1965), taxonomic 
clustering (Bousfield, 1953), SO (Tulving, 
1962), and correspondence of recall order 
with list presentation order (input-output 
organization or seriation, Mandler, 1969). 
Investigations of the nature of any rela- 

tionships between organization and recall 
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have been severely handicapped by their 
reliance upon only one type of organiza- 
tional measure and the failure to consider 
other viable types of organization 
changes in organization occurring with 
practice. Thus the inconsistencies in this 
literature may be a function of a complex 
interactive relationship between several 
different modes of organization occurring 
simultaneously, any one of which is capa- 
ble of producing recall differences. It 
might also be the case that increases in 
one type of organization occur at the ex 
pense of other types of organization. Thus 
the present study also provides a detailed 
examination and comparison of all thi 
aforementioned types of organization, in 


or 


| order to evaluate their interactive natur: 


their relative importance as a function of 
different experimental variables, and anv 
changes over trials in the usage of various 
processes. Also examined is the relation- 
ship(s) of these various types of organiza- 
tion to recall performance. 
Two of the 3 variables in this stu: 
(categorical vs. unrelated word lists a: 
fixed vs. random presentation orders) have 
been previously examined with respect to 
SO. Puff (1970) failed to find any dif 
ferences between categorical and unrelat 
word list when measuring pair-wise S< 
The list-order variable has been examined 
in lists of related and unrelated materials 
(e.g., Winograd, Conn, & Rand, 1971). 
Both the acquisition and organizational 
data indicate a facilitative effect produced 
by a fixed serial order for unrelated words, 
but little or no effect for related materials. 
It was hypothesized that both the list 
structure and the list-order variable should 
produce large effects on SO and therefore 
provide the necessary test of the existence 
and sequential consistency of higher order 
units. The third variable, not previously 
examined in free-recall tasks, involved a 
categorization pretest procedure given to 
half of the Ss. Developed by Loftus and 
Scheff (1971), this pretest provided Ss with 
words later used in the actual free-recall 
list to which Ss produced the labels of 
categories which each word represented. 


HIGHER ORDER MEASURES OF ORGANIZATION IN FREE RECALL 


METHOD 


Design and materials, The design of this experi- 
ment was a 2 X 2 X 2 factorial manipulating list 
structure (taxonomic or unrelated), presentation 
order (fixed or random), and categorization pretest 
(pretest or no pretest). The 25-item taxonomic 
list consisted of 5 items from each of 5 taxonomic 
categories (animals, clergy, metals, instruments, 
parts of the body), with all items high in taxonomic 
frequency (Battig & Montague, 1969). The 25-item 
unrelated list consisted of high taxonomic fre- 
quency items chosen from 25 different categories. 
In both lists there were the same 16 initial letters, 
and the Kucera and Francis (1967) word frequency 
was also equal. There were 6 different orders of 
each list with 5 used for the learning task and 
the other used for the pretest and posttest. In 
the taxonomic list, no 2 items from the same 
taxonomic category ever appeared in adjacent se- 
quential positions, and in both lists each word 
appeared in each fifth of the list across the 5 random 
orders used for learning. For the random order 
conditions, each of the 5 different list orders was 
used equally often as the first-trial order, thus 
counterbalancing the random list order sequences 
ina 5 X 5 Latin square. Within the fixed-order 
conditions, each of the 5 different list orders was 
also used equally often for different Ss, thus equa- 
ting fixed and random conditions with respect to 
frequency of occurrence of each of the 5 list orders. 

Apparatus and procedure, All Ss were seated at 
a table and presented a test booklet containing 
12 blank pages. They were instructed that they 
would see a list of words, one at a time at a 3-sec. 
te, and that when they saw the words START 
RECALL they were to turn to the next blank page 
in their booklet and write as many words from 
the list as they could in any order they wished. 
They were further instructed that they had 13 min. 
to do so and this procedure would continue for 
several trials. The list acquisition lasted for 12 
trials. Under pretest conditions, Ss were first 
taken to a different room and instructed that they 
would hear a list of words one at a time at a 10-sec. 
rate. They were asked to write on each new 
page of the small booklet provided as many cate- 
gory labels as they thought were appropriate to 
that word, Immediately following the pretest, 
these Ss were taken into the experimental room 
and participated in the experimental task. Fol- 
lowing the 12 free-recall trials, all Ss were given 
a posttest identical to that given in the pretest 
conditions. 

Subjects. There were 120 Ss with 15 in each 
of the 8 conditions. All Ss were University of 
Colorado students fulfilling a course requirement 
for experimental participation. Each S was tested 
for 50-60 min. in groups of 1-3. These Ss were 
assigned to conditions using a block randomization 
procedure which assured that within each block 
of 8 test sessi 


ions each condition appeared once, 
and that within the first 40 test sessions each of 
the 5 counterbalancing orders for each condition 
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MEAN RECALL 


TRIALS 


Mean recall as a function of list struc- 
ture, list order, and trials. 


FiGURE 1. 


appeared once. Following the first 40 test ses- 
sions, Ss were assigned to conditions to complete 
the counterbalancing of list orders for those con- 
ditions where less than 3 Ss had served in the 
initial session. 

There were 4 Es, each testing approximately 
the same number of Ss in each condition, There 
was a loss of 10 Ss due to equipment problems or 
failure to follow instructions. 


RESULTS 


The minimum significance level for the 
mean recall and various organizational 
analyses that follow was .01. All main 
effects or interactions not specifically dis- 
cussed in any of these sections did not 
reach this criterion. In each section that 
follows, the results are both described and 
related to previous empirical findings. 

Acquisition. The basic recall results are 
presented in Figure 1. Mean recall was 
higher for categorical than unrelated list 
conditions, F (1, 112) = 9.86, and for fixed- 
than random-order conditions, F (1, 112) 
= 8.62. These differences decreased and 
increased, respectively, with trials, Fs (11, 
1232) = 6.90 and 3.31. No other inter- 
actions involving either variable ap- 
proached significance. A large Pretest 
X Trials interaction, F (11, 1232) = 10.00, 
was primarily due to superiority of pretest 
over no-pretest conditions on the first 2 
trials (15.76 vs. 13.25) and no effect there- 
after (21.43 vs. 21.56), with this com- 
parison accounting for 80% of the inter- 
action variance. 
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TABLE 1 


Mean Ourrvr-OurPUT AND INrUT-OvrPUT ARC',x SCORES AS A FUNCTION OF UNIDIRECTIONAL 
AND UNORDERED CRITERIA, UNIT Size, AND POINT IN LEARNING 


Output-output Input-output 
Trials 
Unidirectional | Unordered. | Difference Unidirectional | Unordered Difference 
Unit Size 2 
‘iret half H8 265 087 .069 079 010 
Second half 267 372 105 105 A18 013 
Unit Size 3 
iret half 073 +159 086 038 449 011 
Second hatt A32 258 126 085 105 020 
Unit Size 4 
First half 038 105 067 026 031 4005 
Second half Bi) 208 BI 073 090 O17 
Unit Size 5 
t half 026 A7 O51 020 O24 004 
Sadr O84 168 | | 069 | 081 O12 


Seeond half 


These recall results are in general agree- 
ment with previous investigations (e.g., 
Mandler, 1969; Winograd et al, 1971). 
"The one discrepant result was the facilita- 
tion by a fixed serial order within cate- 
gorical lists. Mandler failed to find such 
facilitation, but obvious ceiling effects oc- 
curred in his ló-item lists. The failure 


of Winograd et al. to find such an effect 


may be due to the relatively short number 
of trials (4) since the present data indicate 
that this facilitation only occurs in later 
Mages of learning. Thus the present results 
appear reconcilable with previous findings. 

Ordering criterion effects, Initial analyses 
of both output-output and input-output 
organization ARC’sx scores (Pellegrino, 
1971, 1972) included all 3 experimental 
variables, trials, and the 3 ordering cri- 
teria, and were performed separately for 
Unit Sizes 2-53 This section is concerned 


* Unit sizes were examined separa! 
differences in the range of ARC’ scores that actu- 
ally could be obtained. Although the theoretical 
range for ARC’ scores is identical for all unit 
sizes, any deviation from perfect sequential order- 
ing (upon which the maximum observed values 
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with differences between the unidirectional 
and unordered criteria in sensitivity as 
organizational measures. 

Table 1 summarizes ordering criterion 
differences for both types of organization 
separately for each unit size. The values 
in Table 1 are pooled across all experi- 
mental conditions since all showed a simi- 
lar pattern of results and any interactions 
aE ataca i Ug co AP al 1) E 9 dii 


are based) produces larger reductions in the range 
of actual possible observable repetitions for the 
larger unit sizes, thereby reducing the functional 
range of ARC’ scores more severely for higher 
order unit sizes. For example, a single departure 
from perfect sequential consistency of 11 recalled 
items can produce ARC’ scores for units of Size 5 
ranging 86-29, while for units of Size 2 this would 
range only .90-.80 (depending on how close the 
discrepant item is to either end of the sequence). 
Any comparison across unit sizes involving the 
absolute numeric ARC’ score would therefore be 
misleading and to avoid such problems in inter- 
pretation separate analyses were performed on 
each unit size. 

‘Only unidirectional and unordered values are 
compared in this analysis due to the necessary 

ntity of bidirectional and unordered scores for 
Unit Size 2, and data clearly showing that uni- 
directional and bidirectional values did not differ 
for Unit Sizes 3-5. 
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involving experimental conditions only re- 
flected differences in the magnitude of the 
ordering criterion effect. The present dis- 
cussion focuses only on the overall ordering 
criterion differences and a subsequent 
analysis will deal with the experimental 
conditions. ‘The output-output analysis 
showed highly significant differences be- 
tween criteria for all 4 unit sizes, Fs (1, 112) 
> 206.65. All 4 unit sizes also produced 
Ordering Criterion X Trials interactions, 
Fs (10, 1120) > 3.56. A subsequent break- 
down of these interactions into first and 
second half of learning (as shown in Ta- 
ble 1) showed that the unidirectional-un- 
ordered difference was greatest in the 
second half of learning, particularly for 
the larger unit sizes, yielding Fs (1, 1120) 
of 5.87, 8.98, 29.30, and 3842 for Unit 
Sizes 2-5, respectively. In the input- 
output organization analyses, significant 
ordering criterion differences also occurred, 
Fs (1, 112) > 2342, but these did not 
interact significantly with trials. 

These ordering criterion effects indicate 
quite strongly that the unordered criterion 
is the most sensitive for determining the 
total amount of organization based upon 
either the previous recall order or the 
present list order, They further indicate 
that when S is using prior recall as a basis 
for determining present recall order, such 
organizational units vary considerably in 
sequential consistency, particularly for 
larger unit sizes late in learning. Thus 
simple pair-wise sequential measures neces- 
sarily underestimate the total amount of 
organization, so units larger than Size 2 
must be examined to determine the effects 
of experimental variables upon SO. When 
S is basing recall order on the list presen- 
tation order, however, organizational units 
vary less in internal sequential consistency 
and this does not change over trials. 

As noted previously, the ordering cri- 
terion difference also interacted with ex- 
perimental conditions and these interac- 
tions generally showed that any differences 
between experimental conditions were pri- 
marily a function of the unordered values, 
particularly in the larger unit-size analyses. 
Since the unordered criterion more ade- 
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Froukk 2, Mean unordered output-output ore 
ganization for Unit Sizes 2-5 a» a function of list 
structure, list order, and trials, 


quately represents and is more sensitive 
to the total amount of organization that 
has occurred, the analyses that follow 
based upon only the unordered criterion 
scores. 

Output-output, The major results of 
these analyses are shown in Figure 2. 
The pretest variable had no effect in this 
or any other organizational analyses that 
follow, and will not be discussed further 
in those sections. Figure 2 shows overall 
significant increases in organization over 
trials for all 4 unit sizes, Fs (10, 1120) 
> 2435, Differences between categorical 
and unrelated list conditions were found 
only in the analyses of Unit Sizes 3, 4, 
and 5, Fs (1, 112) > 8.91. Interactions 
between this list structure variable and 
trials also appeared in the analyses for 
Unit Sizes 2, 4, and 5, Fs (10, 1120) > 2.60, 
reflecting maximal differences between cat- 
egorical and unrelated conditions on Trials 
4-10 with little difference on the first and 
last 2 trials. Figure 2 also shows large 
differences between fixed- and random-list- 
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order conditions for all 4 unit sizes, Fs 
(1, 112) > 13.45. However, List Order 
X Trials interactions only appeared in 
analyses for Unit Sizes 3, 4, and 5, Fs 
(10, 1120) > 3.57, indicating increasing 
differences favoring fixed- over random- 
order conditions as learning progressed. 
The present results for Unit Size 2 are 
consistent with previous data (e.g., Man- 
dler, 1969; Puff, 1970). However, Puff’s 
conclusion that the degree of SO is iden- 


TABLE 2 


SIMPLE AND MULTIPLE CORRELATIONS BETWI 
MEAN RECALL AND ORGANIZ TION ÜNrr-SizE 
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| 
Condition | 
| 


Categoric-random | .45* | 45* -70** | .g2** 
Categoric-fixed 21 25 26 34 
Unrelated-random | .65** | ,65** | '65** .66** 


Unrelated-fixed | 47" | 47* | 48 | “age 
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tical for randomized categorical and un- 
related lists is contradicted by larger unit- 
size data. This result further indicates 
that pair-wise indices are relatively insen- 
sitive to the total amount of SO, particu- 
larly if that organization has a free-varving 
structure. Moreover, higher order unit-size 
measures produce differences more closely 
paralleling those for list acquisition, notably 
superiority of categoric over unrelated 
and fixed over random orders especially on 
later trials. The latter results further indi- 
cate that higher order measures should be 
more closely related to between-groups re- 
call differences, which would provide a ong 
argument for their usage. "This was ex- 
amined by computing separate correlations 
for recall with Unit Sizes 2 and 2-5. 
Pair-wise and multiple correlations based 
upon group means yielded respective val 
ues of .67 (p > .05) and .98 (p < .001). 
These correlations differed significantly, 
F (3, 3) = 12.92, p< .05 (McNemar, 
1962, p. 284) thus demonstrating tha‘ 
higher order unit sizes contribute i: 
tantly to recall differences between g 
Similar correlation comparisons 

also made between Ss within each « 
present 4 major groups, as shown i: 
ble 2. Higher order units clearly accu 
for more of the within-groups vari 
only for the categorical list, partic 
in the categorical-random condition, F (3 


lists, 


25) = 11.95, p < .001. By contrast, higher 
order units contributed nothing within un 
related-list conditions. 

Input-output. 
these analyses are shown in Figure 3. 
Organization increased over trials in all 4 


The major results of 


unit-size analyses, Fs (11, 1232) > 6.63. 
Only Unit Size 2 yielded an overall effect 
of list structure, F (1, 112) = 28.45. How- 
ever, all 4 unit-size analyses produced a 
significant List Structure X Trials inter- 
action, Fs (11, 1232) > 4.59, reflecting the 
increasing superiority of unrelated condi- 
tions with practice. Figure 3 also shows 
large differences favoring fixed- over ran- 
dom-order conditions which were signifi- 
cant for all unit sizes, Fs (1, 112) > 14.43. 
There was a significant List Order X Trials 
interaction in all analyses, Fs (11, 1232) 
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> 12.48. As shown in Figure 3, this in- 
creasing fixed-random difference over trials 
was limited to unrelated-list conditions, 
yielding Fs (11, 1232) > 3.98. 

The input-output data for the unrelated 
conditions are consistent with Mandler and 
Dean's (1969) results. However, the 
present level of input-output organization 
for categorical-fixed conditions is well 
above that reported by Mandler (1969), 
who concluded that no Ss in this group 
showed a seriation tendency. The present 
data indicated that at least 2 of 30 Ss in 
the categorical-fixed group showed perfect 
seriation, although this in itself could not 
account for the mean level of performance 
exhibited. Another difference from Man- 
dler was the present failure to find any 
taxonomic clustering superiority of the 
categorical-fixed condition relative to the 
categorical-random condition which indi- 
cates that seriation may be used in lieu 
of or in addition to taxonomic organization 
in typical categorized lists. — Mandler's 
failure to find a seriation tendency in this 
group is probably because his Ss were 
completely instructed about the taxonomic 
list structure before learning. 

Primacy-recency. The input-output 
analyses assess the role of serial input 
order as a general determinant of recall 
order, but provide no information about 
the relative position of these items in the 
recall order or recency and primacy de- 
terminants of recall order. To determine 
any such effects, an adaptation of the 
standarized recall rank measure (Battig 
et al, 1965) was used, whereby positive 
and negative values, respectively, indicate 
mean recall positions before or after the 
median on that trial. Although obtained 
for each fifth of the list on each trial, 
only the primacy and recency components 
showed significant effects, so these alone 
will be discussed here. 

As shown in Figure 4, primacy and 
recency components showed no significant 
overall differences with respect to position 
in recall, but there was a significant 
Primacy-Recency X Trials interaction, 
F (11, 1232) = 11.03. All 4 panels of 
Figure 4 show this interaction to reflect 
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Ficure 4. Mean standardized recall rank values 
for primacy and recency components of list order 
as a function of list structure, list order, and trials. 


the large primacy-recency difference on 
Trial 1 which is reduced and reversed on 
Trial 2. Figure 4 additionally shows that 
this Primacy-Recency X Trials interaction 
depended upon list presentation order, 
F (11, 1232) — 2.52. Significant also was 
the opposite direction of this primacy- 
recency difference for fixed- (.43) and 
random-order conditions (—.15). The 
final significant effect was the overall dif- 
ference between categorical and unrelated 
lists, F (1, 112) = 24.63, indicating that 
the pooled primacy and recency com- 
ponents occurred earlier in recall in the 
unrelated- (.18) than categorical-list (.07) 
conditions. 

The results of this analysis are in gen- 
eral agreement with those studies (e.g., 
Deese & Kaufman, 1957) using correla- 
tional techniques to demonstrate a recency 
strategy. However, the present technique 
indicates that there are 2 factors deter- 
mining the occurrence of such a strategy. 
The first factor is experience in the ex- 
perimental situation, such that a naive S 
does not demonstrate any recency effect 
until Trial 2. The second factor concerns 
the maintenance of such a strategy which 
depends on both the list structure and list 
order, since consistent recency only occurs 
in unrelated-random conditions where no 
semantic or serial position cues are pro- 
vided. The primacy component results 
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are in agreement with Mandler and Dean 
(1969) who found that unrelated constant 
serial order lists contain a constant pri- 
macy component, and also extend this 
result to categorical-constant serial order 
lists. The latter results also indicate the 
sensitivity of this organizational technique 
for studying primacy and recency effects. 

Priority. The Battig et al. (1965) 
procedures were also applied to the anal- 
ysis of priority of recall of newly learned 
items (PRNI). Items were considered as 
new items only if never recalled previously. 
This analysis was performed only on 
"Trials 2-6 inclusive since no new items 
occurred following Trial 6. Included in 
this analysis were the variables of item 
type (old vs. new) and trials in addition 
to the experimental variables. Much 
higher priority was given to new items 
(.19) than to old items (—.19), F (1, 112) 
7 178.8. An Item Type X Trials inter- 
action, F (4, 448) = 3.97, reflected in- 
creasing priority scores for new items 
across Trials 2-4 (.20, .25, and .34, re- 
spectively) which decreased on Trials 5 
and 6 (.08 for both), while the priority 
scores for old items remained stable across 
all trials (—.22, —.20, —18, —17, —.18, 
respectively). There was a List Order 
X Item Type interaction F (1, 112) 
7 10.06, reflecting no difference in the 
rank of old items for either type of list 
order, but a much higher priority for new 
items under the random- (.27) than fixed- 
order (10) conditions. A similar List 
Structure X Item Type interaction, £ (1, 
112) — 6.79, resulted from a much higher 
priority given to new items under the un- 
related (.26) than the categorized (.12) 


conditions but no such difference for old 
items. 
oou ELE ACT do 

* In the standardized recall rank technique, sets 
of items comprising the total recall on that trial 
are compared with respect to position in recall. 
Only when the sets are nearly equal in size does 
the mean of the standardized recall rank values 
obtained across all sets necessarily approach 0. 
However, as the sets become increasingly dis- 
proportionate in size (as when old exceed. new 
items after Trial 2), the mean standardized recall 


rank values across all sets can differ substantially 
from 0, 


JAMES W. PELLEGRINO AND WILLIAM F. BATTIG 


The present PRNI effect suppo 
strongly several previous studies demo 
strating such an effect (e.g., Battig « 
Slaybaugh, 1969). Of particular interest 
however, is the limitation of this PRNI 
effect both with categorical structure and 
fixed serial orders. As in the presen: 
analysis of recency, this mode of orga 
nization seems to suffer when semantic o: 
serial list cues are present. 


DISCUSSION 

The present analyses of higher order SO 
units have demonstrated that such units are 
formed in typical free-recall learning, and 
more importantly, that they vary in their 
sequential reproduction across trials, "These 
data therefore support the contentions that 
pair-wise sequential indices are insensitive and 
underestimate total SO. The insensitivi 
problem was further illustrated by the 96% 
45% difference in the amount of between- 
groups recall variance that could be accounted 
for by using all unit sizes as opposed to 
Size 2 alone. 

The failure to observe similar correlation 
increases within groups, with the exception 
of the categorical-random condition, provides 
further information about the structure of 
recall and the relationships among unit-size 
measures. Since Ss in fixed-order conditions 
relied on input order as a basis of organiza- 
tion, thereby reducing the internal sequential 
variability in organization units (see Table 1), 
any higher order organization should be almost 
completely measured by Size 2 units. Cor- 
relations between pair-wise and higher order 
unit sizes were above .925 thus supporting 
this interpretation. In the unrelated-random 
conditions it is probably the delayed develop- 
ment of higher order units rather th 
strong relationships between unit-size mea- 
sures (all rs < .690) that caused the correla- 
tions to remain constant. Support for this 
interpretation comes from similar analyses 
based on the last 2 trials whereby the cor- 
relation increased from .601 to 721 by the 
addition of Size 3 units, producing a margin- 
ally significant difference (p < .10). Stronger 
evidence that higher order units enhance be- 
tween-5s  organization-recall relationships 
within unrelated-random lists comes from a 
similar analysis of previous data (Rogers & 
Battig, 1972). This showed a correlation of 
—-001 for pair-wise units which increased 
significantly to .454 when units of Size 3 were 
included, 
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The overall correlational data indicate that 
fairly strong recall-organization relationships 
may operate for the higher order units, 
although substantial between-Ss variability in 
recall within groups remains unaccounted for 
by such relationships. Part of this variability 
can be attributed to the existence of several 
different types of organizational processes 
which occur simultaneously, often in recip- 
rocal relationship to one another, and which 
can affect recall. Evidence of such reciprocal 
relationships is provided by the findings of 
(a) more SO but less overall primacy, recency, 
and priority-of-new-item effects within cate- 
gorical than unrelated lists, and (b) more SO for 
fixed- than random-order conditions but with 
less priority of new items and also a reduced 


SS 
recency strategy. 

Besides showing an overall reciprocal rela- 
tionship between processes, the data also 
indicate that such relationships change with 
practice. A hierarchy of organizational strate- 
gies or series of stages ìs evident such that 
Ss begin with a strong seriation strategy on 
the first trial followed by recency and priority 
strategies, and culminating in higher order 
SO processes. The main difference between 
conditions appears to be in the second and 
third stages of this organizational develop- 
ment. In those conditions where there is an 
interitem semantic relationship, or a fixed 
serial order, or both, the degree to which 
priority and recency strategies are developed 
is limited and these Ss appear to proceed 
more rapidly to an organizational stage in- 
volving the formation of higher order S units. 
Only in the unrelated-random condition do Ss 
maintain a recency strategy and show little 
development of higher order SO units, thus 
contradicting Puff's (1972) conclusion that 
recency might facilitate SO in unrelated lists. 
It appears that higher order units per se are 
not sufficient to explain recall differences, and 
that overall speed or ease of transition to 
this stage is equally important. Without 
semantic or sequential cues, 5s may be forced 
to operate with simpler organizational 
strategies. 

Based on the foregoing evidence of the 
interactive nature of organizational processes 
and changes in such processes over trials, 
previous inconsistencies in the recall-organiza- 
tion literature appear due to at least 2 factors, 
i.e, (a) inability of any one index to account 
for the complete pattern of organization and 
(b) differences in this pattern depending on 
degree of practice. 

An important additional consideration with 
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respect to the present results concerns their 
relationship to storage and retrieval processes. 
One apparently unresolved issue is whether 
the organization observed at output is a func- 
tion of storage, retrieval, or both. Slamecka's 
(1968, 1969) tests of the independent storage 
hypothesis were based upon list organization 
of items by pairs, i.e, the organizational units 
could be represented by direct associative 
relationships between pairs of list items. The 
present data question the validity of this as- 
sumption and his tests of the independence 
hypothesis. Additionally, given that higher 
order units are formed, one must question 
what constitutes the appropriate accessibility 
cue for any individual list item. The general 
lack of success of studies which have attempted 
to use individual list item cuing may be a 
function of this problem. 

Given that higher order free-varying units 
are formed, one can also wonder which current 
theories of storage and retrieval would predict 
such structures. The development of such 
units is consistent with the hierarchical model 
of Mandler (1967) although he did not state 
how recall should occur within or between 
units. A random selection process at all levels 
of the hierarchy would produce the desired 
results and would be similar to Shiffrin's 
(1970) restricted search hypothesis. An alter- 
native theory based upon an associative net- 
work (Anderson, 1972) might also produce 
organization results similar to those observed. 
In addition, Anderson's model would predict 
that the formation of higher order units is a 
direct result of storage and the elaboration 
of the associative graph. Since this graph 
changes from trial to trial, organizational 
units could be maintained although the order 
within such units might vary. The output 
protocols produced by Anderson's model 
would have to be analyzed for units larger 
than 2 to more fully determine the adequacy 
of the model. 

Also, while it may be relatively easy to 
predict the sequential organization properties 
of recall protocols via random selection within 
and across units, it is more difficult to hy- 
pothesize how such random units are formed. 
Wallace (1970) has speculated that the pair- 
wise organization typically measured is a 
function of implicit and explicit contiguity 
during learning. Such a theory must ulti- 
mately be able to explain the free-varying 
internal structure of higher order units in 
random-list-order conditions and the reduced 
variability under constant-order conditions. 
A comparison of Anderson's (1972) and 


Wallace's associative theories seems to be 
required to help resolve how such units are 
formed, what circumstances dictate their for- 
mation, and whether any associative theory 
is capable of explaining them. Irrespective 
of specific issues, the present organization 
results certainly must be dealt with in any 
complete theory of storage and retrieval 
processes, 
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Five experiments investigated both acquisition and extinction effects in 
4 discrete-trials (runway) escape paradigm. The escape procedure con- 
sisted of shocking S in the start and run sections of the runway, and reducing 
shock level in the goal box on reinforced (R) trials but not on nonreinforced 
(N) trials, The results of these experiments were: (a) greater resistance 
to extinction was exhibited by partially reinforced. Ss than continuously 
reinforced. Ss, (b) presence and number of N-R transitions influenced re 
sistance to extinction, (c) number of successively nonreinforced trials 
(N length) was also a determinant of resistance to extinction, and (d) alter- 
nating N and R trials resulted in patterned running, but a double-alternation 
schedule did not. The results of all $ experiments were interpreted as sup. 


port for the hypothesis that Capaldi's 1967 sequential theory can be extended 
to include instrumental escape conditioning " 


Capaldi's (1967) sequential hypothesis 
of instrumental learning has received con- 
siderable support when appetitive instru- 
mental conditioning procedures are used 
(e.g., Robbins, 1971). Relatively little 
work has been reported, however, inves- 
tigating sequential variables in aversive 
conditioning paradigms, although there is 
some evidence to indicate that sequential 
factors are important in pete re 
sponse persistence in punishment situa- 
tions (e.g., Campbell, Wroten, & Cleveland, 
in press; Capaldi & Levy, 1972; Dyck, Mell- 
gren, & Nation, 1974). It was the purpose 
of the present experiments to investigate 
the operation of sequential variables in 
instrumental escape conditioning and to 
determine if it i» possible to extend the 
boundary conditions of sequential theory 
to an area of research where they previ- 
ously have not been applied. 

Recently, Franchina and his co-workers 
(e.g, Franchina & Snyder, 1970) have 
reported several escape conditioning ex- 
periments in which the sequence of shock 
and nonshock trials was manipulated in 
a hurdle-box apparatus. A shock trial 
consisted of shock present in 1 side of the 
apparatus, and a nonshock trial consisted 
of the absence of shock in both sides of 

1 Requests for reprints should be sent to Roger 
L. Mellgren, rtment of Psychology, University 
of Lon dinde Oklahoma 73069. 


the apparatus, These authors and others 
(e.g., Woods, Markman, Lynch, & Stokely, 
1972) have noted, however, that these data 
cannot be explained in terms of the se- 
quential effects of reinforcement and non- 
reinforcement because the shock-nonshock 
procedure is not analogous to reward 

nonreward trials in the appetitive situa- 
tion, since there is a lack of primary 
motivation on the nonshock trials, Thus, 
in order to test the effects of sequential 
variables in instrumental escape condi- 
tioning, it would be necessary to utilize 
procedures analogous to rewarded and non- 
rewarded trials in appetitive instrumental 
conditioning. For example, in à runway, 
à "rewarded" escape trial would consist of 
the application of some aversive stimulus 
to S in the start and run sections of the 
straight runway and the absence (or re- 
duced level) of that aversive stimulus in 
the goal box. A "nonrewarded" escape 
trial would consist of the application of 
the same aversive stimulus in all sections 
of the runway. It should be noted that 
the duration of nonreward must neces 


sarily be limited in this procedure because — 


at some time after S enters the goal box 
it must be removed. Thus, the above 


i 


procedure may be more analogous to delay — 


of reward, but since both nonreward and 


delay have been postulated to involve — 


similar theoretical mechanisms (Capaldi, 
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1967) the terminology of nonreward will 
be maintained. 
One prediction of sequential theory that 
has been confirmed in appetitive studies 
involves the Schedule of Reward X Num- 
ber of Acquisition Trials interaction. Fol- 
lowing limited acquisition, reward sched- 
ules with N lengths—number of consecutive 
nonreinforced (N) trials followed by a 
reinforced (R) trial—of 1 produce greater re- 
sistance to extinction (Rn) than do sched- 
ules with longer N lengths; but after 
extended acquisition training, reward sched- 
ules with long N lengths produce greater. 
Rn than do schedules with N lengths of 1 
(see Capaldi, 1967) when percentage of 
reinforcement is equated for the 2 groups. 
Experiments I and II represent an attempt 
to demonstrate this interaction using an 
escape conditioning procedure. 


EXPERIMENT I 


Following limited acquisition training, 
à group receiving N lengths of 1 should 
show greater Rn than a group receiving 
N lengths of 3, because there should be 
greater strength of the association between 
the memory of nonreward and the in- 
strumental response (SN-R; in sequential 
terminology) for the N length 1 group due 
to the greater number of transitions from 
nonreinforcement to reinforcement (N-R 
transitions) that occur when percentage of 
reinforcement is equated. Both partial 
reinforcement (PRE) groups should show 
more Rn than the continuous reinforce- 
ment (CRE) control group. 


Method 


Subjects. The Ss were 36 experimentally naive 
male albino rats of the Sprague-Dawley strain 
purchased from the Holtzman Co. They rere 
approximately 100 days old at the start of training 
and were randomly assigned to 1 of 3 groups 
a= 12/group). 

Apparatus. The apparatus consisted of a straight- 
alley runway manufactured by the Hunter Co, 
The alley was constructed of clear Plexiglas with 
a grid floor and was 150 x 15 X9 cm. It 
was divided into a 30-cm, start section, a 90-cm, 
run section, and a 30-cm. goal section, with all 
sections separated by guillotine doors, The Ss’ 
progress in the alley was measured by 3 .01-sec, 
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Standard timers: the first, which measured start 
time, was started by a microswitch he start-box 
door and stopped by a photocell located 11 cm. 
into the alley; the second, which measured run 
time, was started by the first photocell and stopped 
by a second photocell located 11 cm. in front of 


the goal box; the third, which measured goal time, 
was started by the second photocc!! 1 stopped 


by a third photocell located 9 cm ide the goal 
box. Start, run, and goal speeds obtained 
by converting the start, run, and imes to 
reciprocals. A .3-ma. scrambled shock was auto- 
matically delivered to the start an in sections 
of the alley by a Model 700 Gras idler shock 


generator, which was activated by ch 


at the start-box door. Shock of t trength 
could be administered manually to il section 
of the alley by means of a l on the 
apparatus control panel. us was also 
used in Experiment II. 

Procedure. Upon arrival in the laboratory, Ss 
were individually housed and placed on an ad-lib 
schedule of food and water which contiucd ihre yh- 
out the experiment. During the 3 : ctore the 
start of the experiment, each § w d for 
5-10 min. daily. One day of pretr: ded 
the actual start of the experimeni, 1» ring this 


period, all Ss received 2-R trials in the apparatus. 
The following procedure was employed throughou 
the experiment on R trials, The S 
in the start box, and after a peri 
start-box door was raised, activati 
timer and the shock generator. d 
traversed the runway and entere 
trified goal box, the goal-box door y nd 
S was confined for 15 sec., after . vas 
removed and placed in a carryin. the 
procedure on N trials was similar 

trials except that as soon as S left il: 

E depressed the switch which electrified 
floor of the goal box with a +3-ma. shock. \ 
5 entered the goal box, the door was < 
it was confined to the electrified 
15 sec., after which it was removed tc 
cage. Thus, reinforcement consis Bd ofa 15 
confinement in the nonelectrified goal box followir 
shock in the start and run sections, while non- 
reinforcement consisted of a 1 confinement 
in the electrified goal box following shock in the 
start and run sections. 

Acquisition training consisted of 8 . is per day 
for 2 days, a total of 16 acquisition trials, Two 
groups received a 62% PRF schedule in the escape 
apparatus, while the third group (Group CRF) 
was reinforced on all trials during acquisition. 
Group 1N had 3 N trials in each daily session, 
each N trial being followed by at least 1 R trial. 
The acquisition. reward schedule for Group 1N 
was as follows: Day 1, RRNRNRNR; and Day 2, 
RNRNRRNR, Group 3N also received 3 N trials 
in each daily session, but the N trials occurred 
consecutively. The acquisition reward schedule 
for Group 3N was as follows: Day 1, RRNNNRRR; 
and Day 2, RRRNNNRR. The Ss were run in 


was nl 


carrying 
C. 
16 
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squads of 3 with an intertrial interval (ITI) of 
3-4 min. 

After acquisition training, all Ss were given 4 
extinction trials per day for 3 days, a total of 12 
extinction trials. The procedure for extinction 
trials was the same as for the N trials during 
acquisition. 


Results and Discussion 


In the 5 experiments reported here, data 
from all runway sections were examined. 
However, the most consistent and reliable 
effects were observed in the start-speed 
measure, and therefore all analyses reported 
and discussed in this and subsequent ex- 
periments will be in reference to start 
speeds. 

An analysis performed on the last block 
of 8 acquisition trials indicated no terminal 
acquisition differences, F (2, 33) <1. 
A 3X4 (Groups X Days) analysis of 
variance performed on the start-speed data 
for the last day of acquisition and the 
3 days of extinction revealed a significant 
days effect, F (3, 99) = 3.96, p < .025; 
a marginally significant groups effect, 
F (2, 33) — 2.82, p — .07; and a nonsig- 
nificant Groups X Days interaction, F (6, 
99) < 1. Sequential theory makes a speci- 
fic prediction concerning the rank ordering 
of the groups following limited acquisition : 
Group 1N should demonstrate greater Rn 
than Group 3N, and Group 3N should be 
more resistant than Group CRF. This 
prediction was tested by a Jonckheere 
(1954) test. The Jonckheere procedure 
tests the null hypothesis that groups are 
not significantly ranked in a specific pre- 
dicted order, and thus it is ideally suited 
for examining the predictions of sequential 
theory in the present experiment. Results 
of the Jonckheere test performed on the 
sums of start speeds for the 3 days of 
extinction indicated than the groups dem- 
onstrated the predicted rank ordering, 
z = 2.20, p = .01. The data from Experi- 
ment I may be seen in Figure 1. 

The results of Experiment I revealed 
that one portion of the Schedule of Reward 
X Number of Acquisition Trials inter- 
action found in appetitive conditioning 
also occurs in instrumental escape condi- 
tioning. Specifically, following a limited 
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Days 


FIGURE 1. Mean start speeds for the last day of 
acquisition (A) and 3 days of extinction, 


number of acquisition trials, a group re- 
ceiving short N lengths in acquisition was 
more resistant to extinction than a group 
receiving longer N lengths during acquisi- 
tion. This finding supports the prediction 
made from sequential theory. 


EXPERIMENT II 


According to sequential theory, after 
extended acquisition, it would be predicted 
that a group receiving N lengths of 3 in 
the instrumental escape procedure should 
show more Rn than a group receiving N 
lengths of 1, because the strength of the 
hypothetical SN—R; association would be 
asymptotic for both groups, and the N 
Length 3 group would experience less 
generalization decrement in extinction. 


Method. 


Subjects. The Ss were 36 experimentally naive 
male albino rats of the Sprague-Dawley strain 
purchased from the Holtzman Co. They were 
approximately 85 days old at the start of the 
experiment and were randomly assigned to 1 of 3 
groups ( = 12/group). 

Procedure. After arrival in the laboratory, Ss 
were individually housed and placed on ad-lib food 
and water. As in Experiment I, Ss were handled 
individually for 3 days prior to the start of the 
experiment and were given 2 reinforced trials in 


Days 


Ficure 2, Mean start speeds for the last day of 
acquisition (A) and 3 days of extinction. 


the runway as pretraining. The procedure for R 
and N trials was exactly the same as in Experiment I. 

Acquisition training consisted of 8 trials per day 
for 5 days, a total of 40 acquisition trials. Two 
groups had a 62% PRF schedule, while the third 
group (Group CRF) was reinforced on all trials in ac- 
quisition. Group IN had 3 N trials in each daily 
session, each N trial being followed by an R trial 
(3 N lengths of 1). Group 3N also had 3 N trials 
in each daily ion, but the N trials occurred 
consecutively (1 N length of 3). The actual se- 
quence of trials in acquisition for all groups may 
be seen in Table 1. Throughout the experiment, 
5s were run in squads of 3 (1 from each group), 
with the ITI being approximately 3-4 min. for 
each S. 

Upon completion of acquisition training, Ss were 
given 12 extinction trials, 4 per day for 3 days. 
The procedure for all extinction trials was the 
same as for the N trials during acquisition. 


Results and Discussion 


The analysis performed on start speeds 
for the last block of 8 trials in acquisition 


TABLE 1 


SEQUENCE or REINFORCED (R) AND NONREINFORCED 
(N) TRIALS FOR ALL GROUPS DURING ACQUISITION 
| Group 
Day |—— — — 
1N 3N CRF 

15 |RRNRNRNR | RRNNNRRR | RRRR. 

2 RNRNRRNR | RRRNNNRR RRRRRRRR 
3 RNRRNRNR | RNNNRRRR | RRRRRRRR 
4 RNRNRNRR | RRRRNNNR | RRRRRRRR 
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indicated no terminal acquisition differ- 


ences 42202; 39) « 1. A 3 L (Groups 
X Days) analysis of variance performed 
on the start speeds for the last day of 
acquisition and the 3 days of extinction 
revealed a significant main effect for days 
F (3, 99) = 3.51, p < .02, as well as a sige 
nificant groups main effect, (2,83) 
= 3.89, p < .03. The Grou; Trials 
interaction failed to reach icceptable 
level of significance, F (6 =. 1:983 


b> 10. The Jonckheere t performed 
on the sums of the start speeds for 3 days 
of extinction indicated that the «roups were 
rank ordered as predicted, 3.00, 5 


= 001. The data for Experiment II may 
be seen in Figure 2. 


The results of Experiment li icated 
that the second part of the ule of 
Reward X Number of Acquis als 
interaction, commonly found in appetitive 
conditioning and predicted by sequential 


theory, is also observed in instrument?! 
escape conditioning. Following exte 
acquisition, the group receivio- ^ s 
of 3 showed greater Rn ip 
receiving N lengths of 1. 


EXPERIMENT lll 


Although the results of |; 
and II were statistically rı 
might be seen as somewhat : sive, 
and it was deemed desirable t these 
effects be systematically replicated and 
extended, as there is a paucity of data 
concerning extinction effects in c: ape con- 
ditioning. Therefore, Experiment III rep- 
resents an attempt to again demonstrate 
the effects of differential N lengths follow- 
ing limited acquisition, 
procedural parameters. 

The procedural changes in the present 
experiment were made on the basis of un- 
published work in the authors’ laboratory 
which indicated that such changes might 
produce more clearly delineated results. 
These changes were: (a) an increase in the 
level of shock, (b) an increase in the rein- 
forcing amount of shock reduction, (c) the 
presence of a very low level of shock in 
the goal box on R trials, and (d) an in- 
crease in the goal-box confinement time 


ents I 
they 


using different 


. on both R and N trials. 


SEQUENTIAL EFFECTS IN INSTRUMENTAL ESCAPE LEARNING 


Finally, the con- 
tinuously reinforced control group was 
omitted, since the PRF extinction effect 
in escape conditioning has been reliably 
demonstrated, both in Experiments I and 
II reported here and elsewhere (e.g., Bower, 
1960; Woods et al., 1972). 


Method 


Subjects. The Ss were 16 experimentally naive 
male albino rats of the Sprague-Dawley strain, 
purchased from the Holtzman Co. They were 
approximately 110 days old at the start of the 
experiment and were randomly assigned to 1 of 2 
groups (n = 8/group). 

Apparatus. The apparatus was identical to that 
used' in Experiment I with 1 exception. An auto- 
matic timer was included in the control panel to 
regulate shock and confinement time in the goal 
box. This apparatus was also used in Experi- 
ments IV and V. 

Procedure. Upon arrival in the laboratory, Ss 
were housed 2 to a cage and placed on an ad-lib 
schedule of food and water which continued 
throughout the experiment. One day prior to the 
start of the experiment, each S was given 1 rein- 
forced pretraining trial in the apparatus. The 
procedure on R trials was similar to that in the 
first 2 experiments except that the shock level in 
the start and run sections was .5 ma., the shock 
level in the goal box was .1 ma., and the goal-box 
confinement time was 30 sec. On N trials the 
shock level was the same (.5 ma.) in all sections, 
and goal-box confinement time was 30 sec. Thus, 
reinforcement consisted of a 30-sec. confinement 
in the .1-ma. electrified goal box following .5-ma. 
shock in the start and run sections (a .4-ma. shock 
reduction), while nonreinforcement consisted of a 
30-sec. confinement in the .5-ma. electrified goal 
box following .5 ma. in the start and run sections 
(no shock reduction). 

As in Experiment I, acquisition training con- 
sisted of 8 trials per day for 2 days, a total of 
16 trials. The 2 groups (Group 1N and Group 3N) 
received 62% PRF in the apparatus, with the 
same schedules used for Group 1N and Group 3N 
as in Experiment I, The Ss were run in squads 
of 4 (2 Ss from each group per squad), with the 
ITI for each S being approximately 4 min. Fol- 
lowing acquisition training, all Ss received 8 ex- 
tinction trials per day for 2 days, a total of 16 
extinction trials. The procedure for extinction 
trials was exactly the same as that for N trials 
during acquisition, 


Results and Discussion 


An analysis performed on start speeds 
for the last block of 8 acquisition trials 
indicated that there were no terminal ac- 
quisition differences, ¢ (14) = .70, p > .05. 
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Ficure 3. Mean start speeds for 2 days of ex- 
tinction (8 trials/day, plotted trial by trial). 


It can be seen from Figure 3 that the 
extinction results of this experiment closely 
parallel those of Experiment I and are also 
comparable to appetitive studies using 
similar procedures (see Capaldi, 1967): 
Group 1N showed greater response per- 
sistence than did Group 3N on both days 
of extinction. These conclusions were con- 
firmed by separate 2X8 (Groups X 
Trials) analyses of variance performed on 
the start-speed data for each day of ex- 
tinction. On Day 1 the groups main effect 
was significant, F (1, 14) = 5.24, p < .05; 
as well as the trials main effect, F (7, 98) 
= 8.44, p < .001; but the Groups X Trials 
interaction failed to reach an acceptable 
level of significance, F (7, 98) = 1.74, 
p > 10. On Day 2 the groups main effect 
was again significant, F (1, 14) = 15.74, 
p «.005, as was the trials main effect, 
F (7, 98) = 17.50, p < .001. Since there 
was a large difference between the groups 
on Trial 1 of Day 2, a significant Groups 
X Trials interaction would be necessary 
in order to demonstrate differential rates 
of extinction as opposed to simply dif- 
ferential amounts of spontaneous recovery. 
This interaction was significant, F (7, 98) 
= 7.92, p « .001. 

Thus the finding that, following limited 
acquisition, short N lengths produce 
greater Rn than long N lengths is reliable 
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in instrumental escape conditioning. It 
should also be noted that the changes in 
procedure from Experiment I to the present 
experiment, i.e., increasing shock level, 
amount of shock reduction, and goal-box 
confinement time, seem to increase the 
magnitude of these effects. 


EXPERIMENT IV 


It has been demonstrated by a number 
of investigators using appetitive procedures 
(see Capaldi, 1967) that rats, if provided 
with a regular sequence of R and N trials, 
will learn to respond appropriately. That 
is, they will learn to run slowly on N 
trials and fast on R trials. It was the 
purpose of this experiment to investigate 
this phenomenon, known as patterned run- 
ning or patterning, in an instrumental 
escape conditioning paradigm. Patterning 
has been demonstrated in appetitive con- 
ditioning when single-alternation schedules 
are used, i.e., alternation of N and R trials; 
however, patterning is not usually observed 
when double-alternation schedules are used, 
ie. 2 N trials alternated with 2 R trials 
(e.g., Bloom & Capaldi, 1961). The pres- 
ent experiment utilized both single- and 
double-alternation schedules in order to 
determine if similar results hold using the 
instrumental escape procedure. 


Method. 


Subjects. The Ss were 35 male albino rats of 
the Sprague-Dawley strain, obtained from the 
Holtzman Co. They were approximately 180 days 
old at the start of the experiment and were ran- 
domly assigned to 1 of 5 groups (n = 7/group). 

Procedure, Throughout the experiment Ss were 
housed individually and maintained on an ad-lib 
schedule of food and water, One day prior to 
the start of the experiment, all Ss were given 
2 reinforced pretraining trials in the apparatus. 
Reinforced and nonreinforced trials were conducted 
exactly as in Experiment III. 

Acquisition training consisted of 4 trials per da 
for 30 days, à total of 120 acquisition Sie Fou: 
groups received a 50% PRE schedule during ac- 
quisition, while a fifth group (Group CRF) was 
reinforced on all trials during acquisition. The 4 
PRF groups formed the cells of a 2 X 2 factorial 
with the factors being N length (1 or 2) and type 
of ordinal trial (N trial first or R trial first), Thus, 
the 4 PRF groups were labeled NRNR, RNRN, 
NNRR, or RRNN to denote their schedule of R 
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and N trials on each day of acquisi 
were run in squads of 5, 1 S from 
a squad, with an ITI of 5-6 min. for « 

Following acquisition training, all 
4 extinction trials per day for 10 da: 
40 extinction trials. The procedure { 
trials was exactly the same as that 
during acquisition. 


trials 


Results and Discussion 


Acquisition. An analysis on nean 
of all 4 trials of the last day in a« ition 
indicated no differences between th g: 1ps 


at the end of acquisition, F (4, 0) -<1 


The start-speed data from the RE 
groups for all days of acquisitio ver? 
analyzed in order to determine if ; pat 


terning occurred. So that the data would 
not be contaminated by any possibl. first- 
trial warmup effect, the first trial o: each 
day was not included in the ana’ vsis.| 
The fourth trial of each day was also 
deleted in order to equate the number of 
N and R trials in the analysis. "^us, 
only the 2 middle trials for each day vere 
analyzed. These data were combined into! 
3-day blocks and analyzed via a ; (2 
X 2X 10 (N Length x Type of O:-'nal 
Trial—N_ First or R First X Type of 
Trial—N or R X Trial Blocks) analysis of 
variance. The type of ordinal trial nain 
effect was nonsignificant, F (1, 24) < 1, 
and it did not interact with any other 
variables. The N length main effect also 
failed to reach an acceptable level of sig- 
nificance, F (1, 24) <1, although the N 
Length X Type of Trial interaction was. 
significant, F (1, 24) = 447, p < .05, as 
well as the N Length X Type of Tria 
X Blocks interaction, F (97216) = 3.52 
b «.001. The main effect of type of tria 
was significant, F (1, 24) — 15.03, 

= 001, as was the blocks main effect’ 
F (9, 216) = 10.30, p < .001. Post hors 
analysis (using the Scheffé correction pro 
cedure) of the 3-way interaction indicated 
patterning in the single-alternation groups 
(R trials faster than N trials) on Blocks 
and 9, ps — .05. The Ss in the doubl 
alternation groups did not differ in spe | 
on N and R trials except on Block) ` 
(R trials faster than N trials, p < .0 
Thus, in the present experiment the c 


ate 
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denc for patterning must be considered 
Harg nal, since faster running on R trials 
"or he single-alternation groups was ob- 
ed on only 2 trial blocks. There seem 
t be 3 possible explanations for the failure 
o find reliable patterning behavior: (a) It 
is possible that an insufficient number of 
acquisition training trials were run, since 
the patterning that was observed occurred 
late in acquisition. (b) The ITI during 
acquisition may have been too long (5-6 
min.) for patterning to occur. In the 
appetitive situation, some investigators 
have observed patterning at long ITIs 
(e.g., Capaldi & Lynch, 1966), but others 
have failed to do so (e.g. Surridge & 
Amsel, 1966). Although a 5-6-min. ITI 
cannot be considered extremely long, it is 
nonetheless, substantially longer than the 
15-30-sec. ITIs that are often used in 
sequential experiments. (c) Acquisition 
training may have consisted of too few 
trials in each daily session (4 per day) 
for patterning to develop. Bloom and 
Capaldi (1961), for example, demonstrated 
patterning by single-, but not double- 
alternation Ss; they used a procedure that 
employed 12 trials per day, considerably 
more than in the present experiment. 
Thus, it is possible that any of these 
factors, or a combination of them, were 
responsible for the lack of patterning in 
the present data; or that patterning is 
difficult to obtain in escape conditioning. 
Extinction. The start-speed data from 
extinction were blocked in 4-trial blocks 
(days) and analyzed via a 5 X 10 (Group 
,X Days) analysis of variance. The ex- 
itinction data may be seen in Figure 4. 
2The analysis revealed significant main 
jeflects for groups, F (4, 30) — 60.58, 
ye < .001, and days, F (9, 270) = 52.24, 
ip < .001, as well as a significant Groups 
-X Days interaction, F (36, 270) = 15.02, 
p «.001. Post hoc analysis (using the 
Scheffé procedure) indicated that all PRF 
groups were more resistant to extinction 
han Group CRF on all days (all ps < :01). 
^here was no difference in the 2 single- 
Íclternation groups (NRNR and RNRN) 
inhroughout extinction. Group NNRR was 
quore resistant to extinction than all other 


se: 


15 
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Ficure 4. Mean start speeds for the last day 


of acquisition (A) and 10 days of extinction (in 
2-day blocks). 


all days of extinction (all 

The RRNN group was inferior 
to Groups NRNR and RNRN on all days 
of extinction (all ps < .01) except Day 3, 
where no differences in the 3 groups were 
observed. Groups RRNN and CRF did 
not differ in Rn except on Days 1 and 3, 
where RRNN was faster than CRF 
(p < .01 in both cases). 

The extinction results of the present 
experiment are consistent with those of 
Experiment ll, i.e., following extended ac- 
quisition, a long-N-length group (Group 
NNRR) was more resistant to extinction 
than a short-N-length group (Groups 
NRNR and RNRN). The present ex- 
periment also graphically illustrates the 
importance of another sequential variable, 
N-R transitions, in determining response 
persistence. Group RRNN received an N 
length of 2 in the present experiment, but 
received no N-R transition in its daily 
trial schedule (compared with Group 
NNRR, which received an N length of 2 
plus an N-R transition on each day of 
acquisition) and showed only slightly more 
Rn than Group CRF. This comparison 
demonstrates that in order for N length 
to be a factor in increased Rn, it must be 
followed by at least 1 R trial. This finding 


groups for 
ps < .01). 


Speed (ft/sec) 


Days 


Fiure 5. Mean start speeds on reinforced 
(R) and nonreinforced (N) trials for 12 days of 
acquisition, 


is similar to results from appetitive experi- 
ments using similar reward schedules 
(Spivey & Hess, 1968; Spivey, Hess, & 
Black, 1968). 


ZXPERIMENT V 


This experiment was designed to further 
investigate patterning behavior in instru- 
mental escape conditioning. In an attempt 
to provide more favorable conditions for 
patterning to occur, a number of changes 
were made in the procedure. Specifically, 
more trials were run per day for each S 
(8 instead of 4), and the ITI was shortened 
considerably (45 sec. rather than 5-6 min.). 


Method 


Subjects. The Ss were 24 male albino rats of 
the Sprague-Dawley strain, purchased from the 
Holtzman Co. They were approximately 125 days 
old at the start of the experiment and were ran- 
domly assigned to 1 of 3 groups (n = 8/group). 

Procedure. Throughout the experiment, Ss were 
housed individually and maintained on an ad-lib 
schedule of food and water. One day prior to the 
start of the experiment, all Ss were given 2 rein- 
forced pretraining trials in the apparatus. Rein- 
forced and nonreinforced trials were conducted 
exactly as in Experiment III. 

Acquisition training consisted of 8 trials per day 
for 12 days, a total of 96 acquisition trials. Two 
groups received a 50% PRF schedule during ac- 
quisition, while a third group (Group CRF) was 
reinforced on all trials during acquisition. The PRF 
groups differed only in the schedule of N and R 
trials received during acquisition. The single- 
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alternation (SA) group received an NRNRNR 
trial schedule on all 12 days of acquisition, wh 
the double alternation (DA) group received 
NNRRNNRR trial schedule on all 12 days. 
Ss were run in squads of 6 (2 Ss from each group” 
in each squad) with an ITI of 45 sec. for each 5, 

Following acquisition training, all Ss were given 
8 extinction trials per day for 5 days, a total of 
40 extinction trials. The procedure on extinction 
trials was exactly the same as that for N trials 
during acquisition. 


Results and. Discussion 


Acquisition. An analysis of variance 
performed on the mean of all 8 trials {or 
the last day of acquisition indicated no 
terminal acquisition differences, F (2, 21) 
<1. In order to examine possible pat- 
terning behavior the start-speed data frown 
the 2 PRF groups were combined into 3- 
trial blocks (days) and analyzed b; a 
2X 2X 12 (Type of Trial—N or R X N 
Length X Days) analysis of variance. 
The main effect of type of trial was not 
significant, F (1, 14) <1, although '': 
Type of Trial X N Length intera 
F (1, 14) = 81.00, p < .001, and the 
of Trial X N Length X Days interac : 
F (1i, 154) = 6.71, p < .001, were sig- 
nificant. The N length main effect was 
significant, F (1, 14) = 37.12, p < .001, 
as was the days main effect, F (11, 154) 
= 3.82, p < .001. The N Length X Days 
interaction was significant, F (11, 154) 
— 243, p « .01, while the Type of Trial 
X Days interaction failed to reach an ac- 
ceptable level of significance, F (11, 154) 
<1. Post hoc analysis (using the Scheffé 
procedure) indicated that patterning began” 
to appear on Day 3 (R trials faster than” 
N trials, p < .05) and was reliable there- 
after (Days 4-12, ps < .01) for Group SA 
The only differences observed in Group D 
were on Day 1 (R trials faster than 
trials, p < .01) and Day 2 (N trials faste 
than R trials, p < .01); thereafter, ther 
were no statistically reliable differences in 
speeds on N and R trials for Group DA 
The acquisition data from Experiment 
may be seen in Figure 5. 

Thus, the present results demonstr: 
that reliable patterning behavior can occ 
in instrumental escape conditioning, 
though it seems to appear much earlier 
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training than in similar appetitive proce- 
dures, and that massing of trials and/or 
fairly short ITIs are sufficient conditions 
for its occurrence. 

Extinction. A 3 X 5 (Groups X Days) 
analysis of variance performed on start 
speeds revealed a significant groups main 
effect, F (2, 21) = 58.94, p < .001, and a 
significant days main effect, F (4, 84) 
= 2.71, p < .05, while the Groups X Days 
interaction failed to reach an acceptable 
level of significance, F (8, 84) <1. Post 
hoc analysis (again using the Scheffé cor- 
rection procedure) indicated that the 
groups were ordered in terms of Rn as 
follows: DA > SA > CRF (all ps < .01). 
The extinction data for Experiment V may 
be seen in Figure 6. The extinction results 
of the present experiment are consistent 
with those of Experiments II and IV in 
demonstrating that, following extended ac- 
quisition training, Ss receiving longer N 
lengths are more resistant to extinction 
than Ss receiving shorter N lengths. Even 
though Ss in Group SA patterned, they 
were more resistant to extinction than 
Group CRF; this result is similar to ex- 
tinction data in a number of appetitive 
patterning studies (e.g., Rudy, 1971). 


GENERAL DISCUSSION 


Two general conclusions may be drawn 
from the data presented herein: (a) A reliable 
partial reinforcement extinction effect (PRE) 
was observed in all 4 experiments where a CRF 
control group was employed (Experiments I, 
II, IV, and V). The PRE is, of course, also 
reliably observed in appetitive instrumental 
conditioning, and has been previously dem- 
onstrated in escape conditioning (e.g., Bower, 
1960; Woods et al, 1972). (b) In all 4 ex- 
periments where a CRF control group was 
used (Experiments I, II, IV, and V), there 
were no differences between the PRF and 
CRF groups at the end of acquisition. This 
result is in disagreement with some studies 
(e.g., Bower, 1960; Woods et al., 1972, with 
large magnitude of reinforcement) which have 
shown superiority for a CRF group at the 
end of acquisition. The reasons for this 
discrepancy are not clear at the present time. 
The present data are also discrepant in this 
respect with much of the appetitive condi- 
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FiGuRE 6. Mean start speeds for the last day of 
acquisition (A) and 5 days of extinction. 


tioning literature in that PRF Ss are often 
superior to CRF Ss at the end of acquisition, 

Several statements can also be made in 
relation to the effects of specific sequential 
variables in instrumental escape conditioning. 


1. All 5 of the present experiments demon- 
strate N length effects very similar to those 
found in sequential appetitive conditioning 
studies. After limited acquisition, Ss re- 
ceiving short N lengths show more Rn than 
Ss receiving long N lengths (Experiments I 
and III); and the opposite is true following 
extended acquisition (E periments II, IV, 
and V). Thus the Schedule of Reward 
X Number of Acquisition Trials interaction 
seems to be a reliable finding in instrumental 
escape conditioning. 

2. The occurrence of N-R transitions was 
shown to be important in escape conditioning, 
much as it has been in appetitive procedures. 
The effects of N-R transitions were most 
pronounced in Experiment IV where 1 double- 
alternation group (Group NNRR), which 
received 1 N-R transition per day, demon- 
trated the greatest resistance to extinction, 
while the second double-alternation group 
(RRNN), which received no N-R transitions, 
was only slightly more resistant to extinction 
than the CRF control group. The slight 
superiority of Group RRNN over Group CRF 
can be accounted for by the fact that on 
each day of acquisition Ss in Group RRNN 
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received an N-R transition with an ITI of 
24 hr. Mellgren and Seybert (1973) have 
shown that N-R transitions with a long ITI 
in acquisition can increase massed-trials ex- 
tinction performance, According to sequential 
theory, the number of N-R transitions is also 
postulated as the theoretical determinant of 
Rn after limited acquisition in appetitive 
situations (the number of N-R transitions 
determines the habit strength of the SN-Ry 
association that determines Rn) and since the 
empirical data resulting from appetitive pro- 
cedures and the present escape experiments 
are so similar, there is no reason to assume 
any different mechanism in escape condi- 
tioning. Thus, the results of Experiments I 
- and IIT also implicate the importance of N-R 
, transitions in determining Rn in escape con- 
ditioning. A more complete theoretical ac- 
count of the sequential mechanisms that 
determine Rn after both limited and extended 
acquisition may be found elsewhere (eg., 
; Capaldi, 1967). 
-  & Pattern of N and R trials was shown to 


- be an important variable in escape condi- 
k tioning in Experiment V, where Ss which 

received a regular single-alternation schedule 
- (Group SA) learned to run fast on R trials 
and slowly on N trials. These data are 
) similar to those obtained when single-alter- 
- mation schedules are used in appetitive in- 
1 strumental conditioning, except that pattern- 
- Áng occurred earlier in escape training (after 
- approximately 30 trials) than is usual with 
- appetitive procedures. 


Thus, the results of the present experiments 
demonstrate the operation of a number of 
b sequential variables in instrumental escape 
conditioning. These sequential variables were 
found to operate in escape conditioning in 
much the same way as they do in corre- 
ing appetitive procedures. Therefore, 
the results may be seen as support for the 
_ hypothesis that Capaldi's (1967) sequential 
, — can be extended to include instru- 
mental escape conditioning phenomena. The 
present. experiments join several others (egn 
Campbell et al, 1973; Capaldi & Levy, 1972: 
Dyck et al, 1974) in demonstrating that the 
boundary conditions of sequential theory may 
be expanded to include a number of aversive 
conditioning situations. 

It should be noted, however, that some 
differences do exist between the present data 
and results of similar appetitive conditioning 
experiments. In appetitive experiments, the 
effects of sequential manipulations are usually 


observed in all runway sections (ie, start, 
run, and goal), In the present escape con- 
ditioning experiments, the differences between 
groups as a result of the manipulation of 
sequential variables occurred most reliably in 
the start section. Although in some of the 
experiments differences did occur in other 
runway sections (e.g. the goal section in 
Experiment I, the run section in Experimen 
II, and the run and goal sections in Experi 
ment III), these effects were most apparent 
in the start-speed measure, In Experiments 
IV and V, however, differences were observed 
in only the start section, Capaldi and Levy 
(1972), using punishment superimposed on a 
partial food reward schedule, also observed. 
sequential effects predominantly in the start 
section of a straight runway. They attributed. 
their results to a heightened avoidance gra- 
dient resulting from the highly aversive goal 
situation. The present data may be due to 
similar factors, although the punishment and 
escape procedures are somewhat different, 
particularly in relation to the locus and dura- 
tion of the shock received by S, A second 
possible explanation for the present find 

may lie in the number of acquisition tra 

trials (120 in Experiment IV and 96 in 

periment V). After large numbers of trials, 
the running response, as it occurs in the 
presence of shock, may become rather insen- 
sitive to other variables, Therefore, after 
relatively. extended. acquisition training, the 
main effect of acquisition variables may be 
restricted to the likelihood of the initiation 
of the running response, rather than speed of 
running after the response is initiated. This 
difference may be of considerable theoretical 


Leland and should be investigated 
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ROTARY ACCELERATION OF A SUBJECT INI CE 
REACTION TIME TO MOTION IN PERIPHI 


JAMES M. BORKENHAGEN * 


California State University, San Jose 


The effect of S's rotary acceleration upon choice re: 
accelerating target in peripheral vision was the central issue in t 

with the level of acceleration and the viewing angle of the v 
Twelve pilots were tested in a rotation device unde: 
visual stimulation alone and visual-plus-rotary stimulation. 
sponded to the direction of the visual motion by moving a hanc 
plus-rotary stimulation produced long 
Choice RT was inversely + 
level of acceleration and directly proportional to the viewing oi. 
findings are discussed in connection with theories of double stimula 


variables. 


the right or left. Visual 
than the visual stimulation alone. 


intersensory effects. 


A number of experiments have demon- 
strated that reaction time (RT) to a signal 
can be modified by the presentation of a 
secondary stimulus in conjunction with the 
primary signal. Some investigators have 
found a longer RT to double stimulation, 
even though the secondary stimulus re- 
quired no response from the S (Koster & 
Bekker, 1967 ; Nickerson, 1967). Presenta- 
tion of stimuli to 2 sensory modalities has, 
however, produced contrasting results, 
since intersensory experiments commonly 
show a facilitation of RT by the accessory 
stimulus (Bernstein, 1970; Swink, 1966; 
Symons, 1963). It is clear that either 
facilitation or inhibition of RT may result 
from double stimulation, depending upon 
a number of experimental variables. 

The inhibition of RT by accessory stim- 
ulation has often been interpreted as evi- 
dence for single-channel processing of 
perceptual information. Theoretical ap- 
proaches such as those reviewed by Broad- 
bent (1958) emphasize the limited capacity 
of the central nervous system (CNS) to 


! This research was conducted at Ames Research 
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NGL-05-046-002 to California State Universit 
San Jose. The author thanks Brant Clark, prin- 
cipal investigator, and John Stewart, technical 
monitor, for their support in this experiment. 

? Requests for reprints should be sent to James 
M. Borkenhagen, who is now at the Department 
of Psychology, University of Southern California, 
Los Angeles, California 90007. 
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i IS, 
which were presented to tai ly. 
An accelerating visual ta: he 
visual stimulus, and the « i tim- 
ulus was a rotary ac n. A direc- 
tional response was required to the visual 
motion only, so the rotary acceleration 
represented an accessory stimulus. This 


experiment can therefore be characterized 
as intersensory, with the 2 stimuli being 
presented simultaneously and the response 
required to the visual motion only. The 
specific purpose of thi S 


i to in- 
vestigate the effects of 5 levels of rotary 
acceleration of the S on ch RT to the 


acceleration of a luminou 
and peripheral vision. 


line in central 


METHOD 


Subjects. Twelve commercial airline pilots were 
paid for their participation in the experiment. All 
showed. normal response to angular acceleration 
ìn previous testing. 

Apparatus. The Man-Carrying Rotation Device 
(MCRD) produced the rotary accelerations in the 
experiment. The MCRD is a 1-degree-of-freedom 
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simulator which rotates the S about an earth- 
vertical axis, providing angular accelerations that 
have a rise time of about 100 msec. and may be 
regulated from .01° to 30?/sec?. The MCRD has 
been described in detail by Clark and Stewart 
(1968). The S sat in an aircraft seat at the center 
of rotation of the MCRD cab. He wore an avia- 
tor's helmet, lap belt, and shoulder harness through- 
out all testing, and he communicated with Æ 
through a voice communication system. A helmet 
headrest was provided to hold S's head stationary. 

A perimeter arc extended across the horizontal 
meridian of .S’s right temporal field of vision. The 
are was made of plastic and had a radius of 66 cm. 
Fixation points consisting of white numerals on a 
black background were located on the are at 10° 
intervals up to 60° from the axis of direct vision. 
Each numeral (6 X 8 mm.) corresponded to the 
angular distance from direct viewing, e.g., 0-0°, 
1-109, 2-209. 

The visual stimulus was a movable vertical line 
(18 cm. long) centered on the screen of a cathode 
ray tube (CRT). The CRT was located 66 cm. 
in front of S, and its screen measured 36 X 28 cm. 
The line could be accelerated on the screen to the 
right or left with a rise time equivalent to that 
of the rotary acceleration. The S responded to 
the acceleration of the line by deflecting a vertical 
response stick (8 cm. long) located on the right 
armrest of the aircraft seat. A force of 110 gm. 
to the right or 140 gm. to the left was necessary 
to register a response. The choice RT was regis- 
tered in milliseconds on a digital voltmeter as the 
time from the onset of acceleration to the moment 
S responded. An ink-writing recorder collected a 
record of the accelerations of the line and the 
MCRD, and the signal from the response stick 
could also be monitored on this device. 

Procedure, The S was instructed to respond to 
the movement of the line by deflecting the response 
stick in the same direction as the line motion, 
and he was asked to react as quickly as possible 
without making an error. When an error was 
made, the trial was repeated at the end of the 
series, The S sat in the illuminated MCRD cab 
and viewed the line on the CRT screen. By 
fixating on the appropriate numeral on the perime- 
ter arc, he observed the line at various points in 
the left periphery. Binocular vision was used 
throughout the experiment, but 5 Ss could not 
see the line binocularly at 60° because the nasal 
ridge obscured the vision of the right eye. 

There were 2 primary experimental conditions: 
(a) choice reaction to the movement of the line 
alone, and (b) choice reaction to the line motion 
with simultaneous rotary motion. The rotary 
acceleration, when present, was in the direction 
opposite from the acceleration of the line. The 2 
accelerations were, however, of equal magnitudes. 
All Ss received both stimulation conditions, half 
of them receiving Condition a first and half re- 
ceiving Condition b first. Five levels of accelera- 
tion were given in each condition: 1°, 2°, 3°, 6°, 
and 12°/sec. These levels of angular acceleration 
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are well above the thresholds determined for these 
pilots by psychophysical methods. Each level of 
acceleration was tested at the following 7 points 
in the visual field: 0°, 10°, 20°, 30°, 40°, 50°, 
and 60° to the left of fixation. The order of pre- 
sentation of the levels of acceleration and the 
peripheral angles was randomly determined, as was 
the direction of acceleration on each trial. 

At the start of each trial, the line was centered 
on the CRT screen. The number of the fixation 
point for the trial was announced over the voice 
communication system, and S acknowledged the 
number. Following a standby-ready exchange, the 
acceleration was applied. The foreperiod from S's 
reply to the moment of acceleration varied be- 
tween 1 and 3 sec., so it was not possible for him 
to anticipate the acceleration with much accuracy. 
The S terminated both the acceleration of the line 
and the MCRD when he responded, and a mini- 
mum of 20 sec. elapsed before the next acceleration. 

Three experimental trials were given at each of 
the 70 combinations of stimulation condition (2), 
level of acceleration (5), and position in the visual 
field (7). The testing was organized in blocks, 
all the trials within a block being at one level of 
acceleration. Practice trials preceded the experi- 
mental trials at all levels of acceleration. Fourteen 
practice trials were given before the first block of 
experimental t in both stimulation conditions: 
The first 7 trials included all levels of acceleration, 
whereas the second 7 trials included only the level 
of acceleration of the experimental trials that fol- 
lowed. The blocks of experimental trials following 
the first block in each condition were preceded by 
only 7 practice trials, all of them at the level of 
acceleration of the subsequent experimental trials. 
During a testing session, S completed 1 or 2 blocks 
of experimental trials together with the appro- 
priate practice trials, He completed 3 testing 
sessions in a day: 2 sessions of 2 blocks each and 
1 ion of 1 block. Two mornings of testing 
were required of each 5 to complete the experiment. 


RESULTS 


Choice RT was the dependent variable 
used in the data analysis. The mean of 
the 3 experimental choice RTs at each 
of the 70 treatment combinations was cal- 
culated for each S, and an analysis of 
variance was carried out to determine the 
effects of stimulation condition, level of 
acceleration, and peripheral angle on 
choice RT. The analysis followed a 2 X 5 
X 7 repeated measures design, with the 
Ss factor treated as a random variable. 
Errors in responding to the direction of 
the visual acceleration were infrequent, 
the overall error rate being 1.6%. There 
were, however, marked differences between 
Ss in the number of errors committed. 
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Ficure 1, Mean choice reaction time at different 
levels of acceleration for 12 pilots, 


Effect of rotary acceleration. The main 
effect of stimulation condition was signifi- 
cant, F (1, 11) = 11.37, p < .01, indica- 
ting a significant difference between the 
choice RT for unisensory stimulation and 
that for bisensory (see Figure 1). A com- 
parison of the 2 stimulation conditions for 
each level of acceleration was performed, 
and the difference between the means of 
the 2 conditions proved significant in 4 out 
of 5 levels (p < .05), At the lowest level 
of acceleration (1°/sec?), the difference 
between the visual-only and visual-plus- 
rotary motion conditions was not sig- 
nificant (p > .05). 

Effect of level of acceleration. The sig- 
nificant main effect of level of accelera- 
tion, P (4, 44) = 109.35, p < .001, dem- 
onstrated that the level of acceleration 
resulted in a decrease in choice RT (Fig- 
ure 1). It should be noted that in the 
bisensory condition, the rotary accelera- 
tion and visual acceleration levels were 
both changed together, thus confounding 
these 2 variables. In spite of this am- 
biguity, the hypothesis that higher levels 
of visual acceleration produced lower 
choice RT is supported by the matching 
profiles of the bisensory and unisensory 
data (Figure 1). 

Effect of peripheral angle. The main 
effect of viewing angle was significant, 
F (6, 66) = 43.13, p < .001, and indicated 
that there was a significant increase in 
choice RT as the eccentricity increased 
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(see Figure 2). A subseque id anal- 
ysis confirmed that a signi linear 
component characterized the | across 
the various angles (p < .01) | no evi- 
dence of a quadratic or highe order com- 
ponent was found (p > .05) he signifi- 
cance of the difference betw: eference 
mean (0°) and each treatme: ean (109; 
20°, etc.) was tested. All d ces were 
significant (p < .01) excep! he com- 
parison with the 10? positio: 

Interaction effects. A sig Levels 
of Acceleration X Peripherai les inter- 
action was found, F (24, = 12.43; 
p < .001, which is visible re 2 as 
the change in the overall of the 
plots with increased accel: magni- 
tude. The flattening of t! cs with 
increased level of acceler. ndicates 


that the effect of periph« 
diminished at higher levels « 


iewing is 
icceleration. 
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FIGURE 2. Mean choice reaction time at various 


peripheral angles for 12 pilots, 


CHOICE RT TO VISUALLY PERCEIVED MOTION DURING ROTATION. 


All other interactions were nonsignificant 
(b > .10). 


Discussion 


The larger choice RT for bisensory stimula- 
tion demonstrates an inhibitory effect of the 
accessory stimulus upon choice RT. This 
inhibition of response agrees with the findings 
of Clark and Stewart? for central vision, The 
absence of a significant inhibitory effect at 
the 1?/sec* level of acceleration was unex- 
pected, since the low levels of acceleration 
produced the greatest in ion in the Clark 
and Stewart study. 

Theories of intersensory facilitation are not 
applicable to the results of the present ex- 
periment; however, the inhibition of respons 
does suggest that some theory of perceptual 
interference. would apply. Response conflict 
theory (Herman & Kantowitz, 1969) ap- 
proaches the issue of double stimulation 
through the relationships of the responses to 
each stimulus, so an inhibition of response is 
predicted when there is an opposing response 
relationship. Because there is a response 
required to only one of the stimuli in this 
experiment, the response conflict theory is 
not properly applicable, Another approach 
has been concerned with the expectancy of 
the S and the time specification of the stim- 
ulus event. This analysis does not appear 
useful either, since the stimuli were presented 
simultaneously in this experiment. A popular 
concept in dealing with the inhibition of re- 
sponse in the double stimulation experiment 
has been the single-channel mechanism (Wel- 
ford, 1967). The delay in response to double 
stimulation is explained by a limitation of 
the capacity of the CNS to transmit and 
process information. It may be that the 
vestibular stimuli are processed to some degree 
before the visual stimuli are allowed to 


3B. Clark and J. D. Stewart, E 
acceleration on man: Choice reaction time using 
visual and rotary motion information. Paper 
presented at the meeting of the Aerospace Medical 
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Ames Research Center, Moffett Field, California 
94035. 


fects of angular 


proceed along the informational pathways. 


There is also the possibility, however, that - 


a communication channel accepts both stim- 
uli, and handles them simultaneously at a 
reduced rate of processing. 

The choice RT in this experiment was 
greatly influenced by the viewing angle of the 
visual stimulus. Sensitivity of the peripheral 
retina to an accelerating visual target is 
clearly inferior to the foveal sensitivity, but 
the difference between peripheral and central 
sensitivity is diminished by increasing the 
magnitude of the acceleration. The overall 
linearity in the pattern of choice RT to an 
accelerating visual target. is consistent with 
the linearity of sensitivity to a constant 
velocity stimulus (Leibowitz, Johnson, & Isa- 
belle, 1972). 
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The phenomenon of negative recency in secondary memory is usually at- 
tributed to the reduced amount of rehearsal associated with recency item 
On the other hand, negative recency can be explained by the adoption of 
different type of processing for recency items. An experiment is reported i 
which the recall of recency items was reduced in an immediate test, but ir 
creased in a subsequent test, under conditions in which the recency iten 
could not be identified as such during their presentation. It wa 
that the mode of processing items for an immediate free-recall test is normally 


modified for recency items. 


It is widely held that the serial position 
curve obtained in single-trial free recall 
reflects the operation of 2 memory sys- 
tems. The recency effect is attributed to 
a fixed-capacity, highly accessible primary 
memory, while prerecency items are con- 
sidered to be recalled from a more com- 
modious and permanent secondary mem- 
ory, with a probability that depends on 
a wide variety of experimental variables. 
Upon perception, an item is assumed to 
enter primary memory, where it remains 
until displaced. During its stay in pri- 
mary memory, information concerning the 
item is transferred to secondary memory. 
In brief, these are the principal features 
of the modal model of short-term memory 
(Atkinson & Shiffrin, 1968; Waugh & 
Norman, 1965). 

This 2-process view of short-term mem- 
ory was given considerable support by 
Craik's (1970) finding of a negative re- 
cency effect in secondary memory. Craik 
presented a series of word lists for im- 
mediate free recall, and then asked Ss to 
recall as many words as possible from all 
lists. It was found that in this final free- 
recall test, words presented in recency 
serial positions were the least well recalled, 
despite having been recalled well initially. 
This pattern of results has been replicated 
in several studies, and has usually been 
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conclude 


attributed to recency items ning less 
rehearsal and therefore less r« ration in 
secondary memory than prere y items; 
In other words, to the extent an item 
is likely to be represented primary 
memory at the end of list sentation, 
the transfer of information ab that item 
to secondary memory will not have run 
its full course. It follows that items re- 
called from primary memory in an im- 
mediate recall test should be recalled least 
well on a subsequent recall test, in which 
only secondary memory is operative. 

This quantity-of-rehearsal interpretation 
of negative recency has been well sup- 
ported by studies using the overt-rehearsal 
technique, in which S is required to vo- 
calize his rehearsal activity (Rundus & 
Atkinson, 1970). When the frequency of 
item rehearsal is plotted against serial 
position, the last few list iter found 
to be given progressively less rehearsal, 
so that the serial position function for 
rehearsal is very similar to t for the 
final recall phase of Craik's (1970) proce- 
dure (Rundus, 1971). "These findings in- 
dicate that the number of times an item 
is repeated correlates strongly with its 
registration in secondary memory. 

However, the notion that secondary 
memory registration is a simple function 
of quantity of rehearsal has not gone un- 
questioned. Even if we accept that the 
pattern of rehearsal yielded by the overt- 
repetition technique gives a reasonable 
indication of rehearsal in more conven- 
tional list-presentation procedures, a high 
correlation between number of rehearsals 
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and probability of recall from secondary 
memory does not prove that repetition is 
the agent of secondary memory registra- 
tion. The quantity-of-rehearsal approach 
has been challenged in a number of recent 
studies which describe procedures yielding 
very low or even zero correlations between 
the amount of rehearsal and secondary 
memory registration (e.g., Craik & Wat- 
kins, 1973; Meunier, Ritz, & Meunier, 
1972). Of particular relevance here is an 
experiment by Craik and Watkins (1973, 
Experiment 2), in which an initial free- 
recall test was given immediately after list 
presentation or after an interval of 20 sec., 
during which the last 4 items were repeated 
continuously. A final free-recall test re- 
vealed that the last 4 items in the re- 
hearsal condition were no better recalled 
than those in the control condition. These 
results were discussed within a depth-of- 
processing framework (Craik & Lockhart, 
1972) with a simple “maintaining” type 
of rehearsal being assumed during the post- 
presentation rehearsal interval. A main- 
taining mode of rehearsal serves to keep 
items in primary memory, but is inef- 
fective for secondary memory registration. 

This approach suggests a quality-of- 
rehearsal interpretation of the negative 
recency effect. The negative recency para- 
digm involves the presentation of a series 
of lists, with list length typically 12-20 
items. It seems reasonable to suppose that 
Ss could quickly learn to anticipate the 
end of the list more or less accurately and 
so could adopt a maintaining rather than 
an "elaborative" mode of rehearsal for the 
last few list items. "Thus, the negative 
recency effect would be explained by re- 
cency items being given a qualitatively 
different type of rehearsal, rather than a 
reduced amount of rehearsal. 

Such a view derives good support from 
the general lack of negative recency in 
those studies in which primary memory is 
eliminated by a distractor task given im- 
mediately following list presentation (e.g., 
Glanzer & Cunitz, 1966). When S knows 
that a distractor task is to follow list 
presentation, it is to be expected that he 
will encode the recency items in the same 
comparatively durable form in which he en- 
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codes prerecency items. Jacoby and Bartz 
(1972) have reported an experiment which 
clearly favors this interpretation. In brief, 
they found that the interpolation of 
15-sec. arithmetic task between presenta- 
tion and initial free recall reduced recency 
in the initial recall test but, at the same 
time, eliminated the negative recency effect 
in final recall. For those subscribing to 
the quantity-of-rehearsal interpretation of 
the negative recency effect, however, the 
fact that the effect is found with Craik's 
(1970) procedure but not with the dis- 
tractor paradigm has been something of a 
paradox. When primary memory is elimi- 
nated with a distractor task, there can be 
no beneficial effects of initial retrieval, so 
that negative recency should be at least 
as pronounced as with Craik's procedure. 

The aim of the present experiment was 
to seek further evidence relevant to de- 
ciding whether negative recency is better 
interpreted in terms of type or quantity 
of rehearsal. The experiment was de- 
signed to test a prediction of the quality- 
of-rehearsal view, a prediction not made 
by the quantity-of-rehearsal view. The 
procedure involved presenting word lists 
of variable length under conditions in 
which half of the Ss were given informa- 
tion allowing them to anticipate the end 
of each list, while list length was un- 
predictable for the other half of the Ss 
There followed an unexpected final rebel 
test. The quality-of-rehearsal view pre- 
dicts that, while the final recall of pre- 
recency items will remain unaffected by 
knowledge of list length, the final recall 
of recency items will be lower when list 
length is known. If it is not possible to 
identify during presentation which are to 
be recency items, then these items will be 
afforded the same relatively elaborate 
processing as prerecency items. On the 
other hand, when the end of the list can 
be anticipated, a simple maintenance re- 
hearsal will be adopted for recency items; 
hence, these items will be recalled with a 
relatively low probability in the final recall 
test. A quantity-of-rehearsal view, by 
contrast, does not predict any difference 
between the 2 conditions. 

Apart from the final recall test, the im- 


490 ^ 


mediate recall data could also prove of 
interest. One shortcoming of the quality- 
of-rehearsal interpretation is that it is 
post hoc. Recency items are assumed to 
be given a maintenance type of rehearsal 
because they are poorly recalled from 
secondary memory. Why Ss should change 


their rehearsal strategy for processing 
recency items remains an unanswered 
question. One possibility is that mainte- 


nance rehearsal may be better suited to 
meet the task demands. That is, more 
recency items may be recalled in the im- 
mediate test when a maintenance rather 
than an elaborative mode of rehearsal is 
adopted. It should be noted that the 
evidence currently available lends little 
support to this possibility. Crowder 
(1969), using a serial recall procedure, 
found that knowledge of list length en- 
hanced recall of primacy but not recency 
items. In addition, a number of studies 
have shown that the manner in which list 
items are encoded has little or no effect 
on the immediate free recall of recency 
items (e.g., Smith, Barresi, & Gross, 1971). 
However, the effect of being able to anti- 
cipate the end of the list on immediate 
free recall has so far not been investigated 
directly. 

In sum, the main purpose of the present 
experiment is to test the hypothesis that 
recency items will be better registered in 
secondary memory when they cannot be 
identified as recency items during their 
presentation. The experiment also serves 
to test the possibility that being able to 
anticipate the end of the list will serve 
to enhance the recency effect in immediate 
free recall—a finding which would explain 
why the mode of rehearsal should be 
shifted for recency items. 


METHOD 


Materials. All Ss were presented with 7 lists 
drawn from the same word pool of 98 2-syllable 
concrete nouns. These lists were of 8, 10, "a 14, 
16, 18, and 20 words. For each group of .Ss, 
a separate allocation of words to lists and positions 
was made. This allocation was essentially random, 
but with restrictions that (a) obvious within-lists 
acoustic and semantic associations were avoided, 
and (b) when measured from the end of the list, 
no word occupied the same list position for any 
2 S groups. 
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The Ss were given booklets for immediate free 
recall (IFR). Each page was headed with the 
appropriate list number, and contained a column 
of 1-in. squares, drawn down the right-hand edge. 
For half of the Ss, the number of cells always 
equaled the number of words in the ; hence, 
in this (informed) condition, Ss knew during 
presentation which words were to be recency ones, 
For the other (uninformed) half of the Ss, the 
number of cells was always 30. 

Subjects and design. A total of 122 under- 
graduate students took part in the experiment, 
They were tested in 7 groups, with group size 
ranging from 12-30 Ss. An even number of Ss within 
each group allowed equal, but otherwise random, 
S assignment to the informed and uninformed 
conditions. The order of presentation of the 7 list 
lengths was balanced between groups according to 
a Latin square arrangement. 

Procedure. "The Ss were told that thev would 
be presented with a series of word lists for IFR, 
and that the number of words per list would vary 
considerably. The nature of the booklets was ex- 
plained. As the words of a given list were presente 
check marks were to be made in successive cell 
of the column. The Ss were informed that if t 
column lengths varied from page to page, thel 
they were arranged such that list length woul 
always equal the number of column cells. The 
checking procedure was adopted in order to ensure 
that Ss in the informed condition remained fully 
aware of the stage of list presentation. 

The free-recall instructions were slightly modi- 
fied, in that it s suggested that performance 
could be maximized by recalling the last few 
presented items first. This procedure was adopted 
to ensure that any effect of presentation condition 
on the IFR of recency items was due to a dif- 
ference in the availability of primary memory 
items, and was not merely the result of a difference 
in retrieval strategy. Presentation auditory 
and at a 2-sec. rate. The end of a list was marked 
by a pencil tap given 2 sec. after the last word. 
The Ss in the informed condition were told not 
to begin recalling between presentation of the last 
word and the recall signal, but to remain ite 
still during this interval, in order to avoid ing 
unintentional cues to the uninformed Ss. The time 
allowed for IFR varied between 60 and 90 sec. 
according to list length. After presentation of the 
last list a 1-min. arithmetic task was given. The 
5s were then allowed 6 min. for an unexpected 
final free-recall (FFR) test, in which as many of 
the presented words as possible were to be recalled. 


RESULTS 


The main focus of interest will be the 
effects of knowing list length during pre- 
sentation on the IFR and FFR of recency 
items. First, however, the results will be 
examined at a more general level. To this 
end, the data were subjected to an analysis 
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TABLE 1 
PERCENT RECALL OF RECENCY ITEMS AS A FUNCTION OF RECALL TEST, 
PRESENTATION CONDITION, AND List LENGTH 
List length 
Recall test and M 
presentation condition 1 
8 10 12 14 16 18 20 
Immediate 
Informed 65 64 63 66 63 66 67 64.9 
Uninformed 60 61 60 61 55 55 57 58.6 
ferens 5 20] 3 5 8 ihi 10 6.3 
Fina 
Informed 12 10 | 12 12 14 13 17 13.4 
Uninformed 16 20 16 20 13 15 18 16.8 
difference -4 | —10 | —4 —8 1 -2 —1 —34 


of variance, with recall test (IFR or FFR), 
presentation condition (whether Ss were 
informed or uninformed of list length), 
position within the list (recency or pre- 
recency) and list length as main effects. 
The data were collapsed across serial posi- 
tions within the recency and prerecency 
components and across individual Ss within 
S groups, leaving the S groups to serve 
as the random variable. The dependent 
variable was probability of recall. Re- 
cency items were defined as those pre- 
sented at the last 6 serial positions. The 
decision to include 6 items in the recency 
span reflects the fact that this span en- 
compasses the full extent of almost all 
reports of negative recency, and is also 
sufficient to capture the greater part of 
the recency effect in IFR. The statistical 
inferences to be drawn from the results 
remain essentially unchanged when 5, 7, 
or 8 items are used to define the recency 
span. 

The main effects of recall test, position, 
and list length were all statistically sig- 
nificant, reflecting the higher probability 
of recall for IFR, recency items, and 
shorter list lengths. For recall test, F (1, 6) 
= 1,802.26, p < .01; for position, F (1, 6) 
= 95.62, p < .01; and for list length, 
F (6, 36) = 7.75, p<.01. Recall was 
higher in the informed than uninformed 
condition, but the difference was small and 
not significant, F (1, 6) = 1.07, p > .05. 

Recall test interacted with presentation 
condition, F (1, 6) = 14.73, p < .01, po- 
sition, F (1, 6) = 924.72, p < .01, and list 
length, F (6, 36) = 3.42, p < .01. These 


effects may be interpreted in terms of the 
IFR over FFR advantage being greater 
for the informed than the uninformed con- 
dition, for the recency than the prerecency 
items, and for the shorter list lengths. 
There was a significant Position X List 
Length interaction, / (6, 36) = 5.89, p 
« .01, which was very largely due to the 
recency over prerecency advantage in- 
creasing with list length. Presentation 
condition did not interact with either 
position, (1, 6) <1, or list length, 
F (6, 36) = 1.43, p > .05. 

Two second-order interactions proved 
significant. A Recall Test X Position X 
List Length effect, F (6, 36) = 2.45, p 
< .05, was essentially attributable to the 
decreasing effect of recall test with in- 
creasing list length being more pronounced 
for prerecency than recency items. A Re- 
call Test X Presentation Condition X Po- 
sition effect, F (1, 6) = 9.83, p < .025, 
results from opposite effects of presenta- 
tion condition on the recency items in the 
2 tests. Specifically, while knowledge of 
list length had virtually no effect on pre- 
recency items, it enhanced IFR and de- 
pressed FFR of recency items. This inter- 
action is, of course, central to the present 
study. There is no evidence to suggest 
that this interaction should be qualified 
with respect to list length, insofar as the 
third-order interaction (Recall Test X 
Presentation Condition X Position X List 
Length) was not significant, F (6, 36) 
= 1.56, p > .05. 

It will be recalled that the aim of the 
experiment was to test the hypotheses 
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Ficure 1. Immediate and final free-recall (IFR 
and FFR) serial position curves for informed and 
uninformed conditions. 


that IFR would be depressed and FFR 
enhanced for recency items when the end 
of the list could not be predicted during 
presentation. While these hypotheses are 
certainly in good accord with the results 
of the overall analysis of variance, 2 further 
analyses were performed in order that they 
could be tested more directly. The data 
were collapsed across S groups in the first 
of these tests and across list length in 
the second. 

Table 1 shows recall performance for 
the informed and uninformed presentation 
conditions as a function of list length and 
recall test. In IFR, performance in the 
uninformed condition was on average only 
90.3% of that in the informed condition, 
whereas in FFR, the difference was in the 
opposite direction, with recall in the in- 
formed condition averaging 80.2% of that 
in the uninformed condition. Both of these 
differences were significant: for IFR, ¢ (6) 
= 494, p «.01, and for FFR, ¢ (6) 
= 2.58, p < .05. 

Since Table 1 gives no suggestion of a 
systematic effect of list length on presen- 
tation condition, and since the analysis of 
variance showed that these 2 variables did 
not significantly interact with each other, 
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either directly or by entering into a sig- 
nificant higher order interaction, it is 
meaningful to consider the effects of 
presentation condition with data collapsed 
across the list length variable. These 
effects are shown in Figure 1. Each point 
in the primacy and recency positions is 
based on 427 observations, while the 
"asymptotes" of intermediate serial posi- 
tions are each derived from 2,562 ob 
tions. Since the number of observé ions 
contributing to the asymptotes varied 
with list length (from{0\for List Length 8 
to 732 for List Length 20), the level of 
performance for prerecency items should 
not be taken too seriously. However, 
Figure 1 does allow a meaningful com- 
parison of the effects of presentation con- 
dition on the IFR and FFR within both 
the recency and prerecency components of 
the serial position function. It is clear 
that for prerecency items there is virtually 
no difference between the presentation 
conditions. In IFR, the mean probabili- 
ties of recalling an item from prerecency 
positions were .289 and .282 in the in- 
formed and uninformed conditions, respec- 
tively; in FFR, these probabilities were 
.190 and .197. In contrast to the pre- 
recency items, the recency items were ap- 
preciably affected by presentation condi- 
tion. With scores collapsed over serial 
position, the superior IFR for Ss in the 
informed condition was significant, ¢ (120) 
= 4.27, p < .01, as was the superior FFR 
of the uninformed condition, £ (120) 
= 3.01, p « .01. 


DISCUSSION 

The principal aim of the experiment was 
to test the prediction that in a delayed 
(FFR) test, recall of recency items would be 
enhanced when the end of the list could not 
be anticipated during presentation. This 
prediction was confirmed, and hence the no- 
tion that the negative recency effect results 
from switching to a different mode of pro- 
cessing for recency items was supported. It is 
not immediately apparent how the effect of 
presentation condition on FFR can be re- 
conciled with the view that negative recency 
is due to the last few list items being given 
a progressively decreasing amount of a single 
rehearsal process (Rundus, 1971). 
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The quantity-of-rehearsal view has similar 
difficulties in accounting for the effect of 
presentation condition on the IFR of recency 
items. On the other hand, the superior IFR 
recency effect shown in the informed condi- 
tion is clearly in line with the notion that 
mode of processing is normally switched for 
recency items. In addition, the IFR results 
have more general theoretical implications. 
There is an extensive literature documenting 
the effects of a wide range of variables to be 
confined to prerecency serial positions (for a 
summary, see Glanzer, 1972). To date, only 
presentation modality has been unequivocally 
shown to have the converse effect, with an 
auditory over visual advantage restricted to 
recency items (Murdock & Walker, 1969). 
"Moreover, it is by no means agreed that this 
modality effect qualifies the primary memory 
property of invariant capacity, in that the 
auditory advantage is often attributed to a 
separate memory system (an interpretation 
that is perhaps less readily applied to the 
present findings). In this connection it should 
be noted that the effects of modality and of 
being able to anticipate the end of the list 
are not identical. Unlike the modality effect, 
the effect of knowing when to expect the end 
of the list is not apparent at the last 1 or 2 
serial positions, but is observed only over the 
earlier portion of the recency effect. Although 
it is not entirely clear why the effect of 
presentation condition should be so specific 
with respect to serial position, it may be 
speculated that when S believes that the end 
of the list is imminent, he switches to a mode 
processing somewhat similar to that used in 
a memory span task. 

We have noted that the quantity-of-rehearsal 
account of secondary memory registration 
does not predict the effects of presentation 
condition in either recall test. A far greater 
difficulty for this account lies in the fact 
that the effects were in opposite directions 
in the 2 recall tests. On the other hand, the 
interaction between recall test and presenta- 
tion condition is not difficult to understand 
when qualitatively different types of rehearsal 
are allowed. Specifically, when a list of verbal 
items is presented for immediate free recall, 
a certain amount of processing capacity will 
be reserved for the elaborative and organizing 
operations which are collectively referred to 
as secondary memory registration. When the 
end of the list is thought imminent, secondary 
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memory registration is reduced in order to 
increase primary memory capacity. The 
result is that recall of recency items is maxi- 
mized in an immediate test, but at the ex- 
pense of being reduced in a subsequent test 
in which only secondary memory is operative. 
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REACTIONS TO FRUSTRATIVE NONREWARD AS A FUNCTION OF 
PERCEIVED LOCUS OF CONTROL OF REINFORCEMENT 
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Based on Rotter’s definition of the personality variable-locus of control of 
reinforcement—it was predicted that internal Ss in an ambiguous experi- 
would be responsive to frustrative 


mental situation 


external Ss might not be similarly frustrated. 
following nonreward and, early in learning, 
a subsequent complex task. Consistent 
with Spence's theory of emotionally based drive, these same 
following frustration at a later stage in learning when 
No similar experimenta 
suggesting that only the internal Ss were significantly 
frustrated by the experimental procedures. 


responded with reduced latencies 
made significantly more errors on 


improved performance 
the task was largely mastered, 
for the external Ss 


The hypothesis that people with differing 
personality characteristics react differently 
to frustrating situations is not a unique 
or novel hypothesis. Neither is the postula- 
tion that differing experiences may be 
differentially frustrating to various in- 
dividuals. The possibility, however, of 
being able to predict in what situations à 
certain individual might be frustrated and 
the effects of his being frustrated upon his 
behavior in various situations would seem 
a meaningful contribution to attempts to 
relate learning theory to human behavior. 

According to Amsel's (1958) theory of 
frustrative nonreward, failure of reward 
following the establishment of an ex- 
pectancy for reward elicits a primary 
frustration reaction, Ry. Feedback from 
this reaction leads to enhanced motiva- 
tional effects evident in subsequent in- 
strumental behavior. The alleged motiva- 
tional effects are typically evident in the 
invigoration of responses following non- 
reward. The term frustration effect (FE) 
refers to this invigoration of performance. 
Numerous studies attest to the empirical 
validity of frustration theory in regard to 
FE with both animals and children (Ryan 
& Watson, 1968; Wagner, 1959). The 
data for adult human Ss are lacking, with 
one recent exception (Libb, 1972), per- 


1 Requests for reprints should be sent to jc 
Wesley Libb, Ridgecrest Children's Center, Uni- 
versity of Alabama, Box 2232, University, Alabama 
35486. 
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nonreward whereas 
As predicted, internal Ss 


Ss showed 


1 effects emerged 


haps due to difficulties encountered in pro- 
viding material incentives outweighing Ss’ 
self-induced symbolic motivations. 

The FE has generally been demonst rated 
in a task where there are few obvious 
competing responses and with a response 
well established by previous training. In 
general, frustration has resulted in increased 
running speeds and decreased latencies, 
which may be said to "enhance" perform- 
ance. On more complex tasks, in which 
there are more obvious competing re- 
sponses, effects differ. Both Schmeck and 
Bruning (1968) and Libb (1972) lend sup- 
port to the hypothesis that frustration 
produces impaired performance on a com- 
plex task. The relationship between frus- 
tration and performance is not a simple, 
positive linear function. Based on Spence's 
(1960) theory of emotionally based drive, 
Libb predicted that during the early stages 
of learning a complex task, more errors 
would occur following a frustrating condi- 
tion than following a neutral or rewarding 
situation, In later stages of learning, it 
was predicted that frustration would be 
less detrimental and might be associated 
with fewer errors, Le., an interaction over 
trial blocks was predicted. "These hypoth- 
eses were largely supported. 

While FE is a reliable experimental 
phenomenon, it has been noted by several 
authors that individual differences may 
occur in reaction to frustrative nonreward 
in human research. Ryan and Moffitt 


(1966) found that only approximately half 
of their Ss demonstrated FE. They argued 
that this differential reaction may not be 
an isolated chance occurrence but rather 
an individual difference variable generally 
obscured by the use of averages. Little 
experimental attention has been devoted 
to systematic analysis of individual dif- 
ferences in regard to frustration theory. 

- Ryan and Watson (1968) suggested that 
the individual difference variable, locus of 
control of reinforcement (Rotter, 1966), 
' may account for differential responding to 
nonreward. Internal-external locus of 
control of reinforcement refers to the ex- 
tent to which a person perceives reinforce- 
ment as under his own control (internal), 
or under the control of powerful others, 
fate, or chance (external). 

In the research that follows, a design 
similar to that of Libb (1972) was em- 
ployed. Human Ss were given ambiguous 
instructions with regard to locus of con- 
trol of reinforcement. Based on previous 
research findings (Rotter, Liverant, & 
` Crowne, 1961), it was assumed that inter- 
nal Ss would develop a higher expectancy 
for reward than external Ss and would be 
more responsive to nonreward than ex- 
ternally oriented Ss. Specifically, it was 
predicted that internal Ss would show a 
significant reduction in response latency 
-following nonrewarded trials. Should ex- 
ternal Ss manifest this effect at all, it was 
hypothesized to be of lesser degree than for 
internal Ss, With regard to errors for the 
internal group, it was predicted that in the 
"early stages of learning frustration would 
be evident in a decrement in performance 
or as increased errors following nonreward. 
Later in learning this effect was anticipated 
to disappear or a decrease in errors following 
nonrewarded trials actually would be evi- 
dent. Since the external group was pre- 
dicted to.be less frustrated than the inter- 
nal group, these effects were predicted to 
be of lesser magnitude, if present at all, 
for the external group. 


METHOD 


s, . 
Subjects. Thirty-two Ss from introductory 
psychology classes at the University of Alabama 
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participated as part of a course requirement. All 
32 Ss volunteered to participate with no prior 
understanding of the precise nature of the experi- 
ment. Rotter's (1966) Internal-External Control 
Scale was administered to all 5s who were divided 
into 2 groups based on a median score of 10.5. The 
internal group consisted of 16 internal Ss (8 male 
and 8 female) and the external group, 16 external 
Ss (11 male and 5 female). Little effort was made 
to match sexes within groups inasmuch as previous 
literature reveals no sex differences. Also, a £ test 
for sex differences within this experiment was not 
significant. 

Apparatus. The experimental chamber was a 
sound-dampened 10 X 10 ft. (3 X 3 m.) cubicle. 
Electromechanical programming equipment was 
located in a separate room. Manipulanda and the 
reward dispenser were mounted in the experimental 
chamber. Directly above a small table a Gerbrands 
Universal Feeder was mounted from which marbles 
were dispensed for appropriate responses in a 2-task 
sequence. The first manipulandum in position on 
the table was a single button (1 in. [2.54 cm.] in 
diameter) mounted on an 8 X 10 in. (2 X 2.5 dm. in 
diameter) panel. Immediately above the button on 
the panel a small 28-w. stimulus light was mounted. 
The second panel, 8 X 14 in. (2 X 3.5 dm.) in size, 
separately mounted to the right of the first, con- 
sisted of 4 buttons centrally positioned below a 
multiple-stimulus projector (Grason-Stadler, No. 
4580-159), also mounted on the panel. Complexity 
on the second task, as part of a 2-ti sequence 
paralleling Amsel's (1958) apparatus, was introduced 
by varying the stimuli presented on the multiple- 
stimulus projector which cued S as to the correct 
button on that trial. 

Procedure. The Ss participated to earn a pre- 
determined number of marbles to qualify for extra 
credit in course work. They were escorted to the 
experimental chamber and seated before the 
apparatus. The E read the following directions: 


You have fifteen minutes to earn 46 marbles 
to qualify for course credit in this experiment. 
Marbles will be earned by pressing the button 
when the light comes on. Press the button on 
the first panel when the light comes on and the 
first button on the second panel when the colored 
light comes on. After you earn 10 marbles, there 
will be a slight delay and change in the program. 


Instructions were repeated if procedure clarification 
was requested by S. All Ss performed 5 training 
trials and 24 experimental s. A trial consisted 
of a sequence of responses on both panels. The 
sequence started with illumination of the stimulus 
on the first panel. Five seconds after S responded 
on this panel, the multiple-stimulus projector on 
the second panel was illuminated permitting S to 
choose 1 of 4 buttons. The sequence recycled 30 
sec. after the appropriate response on the second 
panel. Reinforcement consisted of the delivery of 
one marble on rewarded trials. On training trials 
for the second task only one stimulus was presented 
and the first button was always the correct response. 
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TABLE 1 
MEAN ERRORS AND LATENCIES 
PER TRIAL BLOCK 
Errors Latency 
Group and - : 
condition. : - z 
Block | Block | Toray Block PESE rotat 
1 I 
Internal 
"Nonrewarded 9.44 | 3.60 | 6.57 | 1.41 | 1.26 | 1.34 
Rewarded 6.56 | 5.13 | 5.85 | 1.82 | 1.48 | 1.65 
External 2 
Nonrewarded | 7.50 | 3.69 | 5.60 | 1.74 | 1.72 | 1.73 
Rewarded 8.00 | 2.75 | 5.38 | 1.81 | 1.68 | 1.75 


Following training trials, stimuli on the projector 
were scrambled and S was required to learn the ap- 
propriate button to be paired with red, blue, yellow, 
or green lights. On this panel, S could press as 
many buttons as he wished as many times as he 
wished, but reward was forthcoming only when the 
correct button was pressed. Marble reinforcement 
for correct responses was continuous on the second 
panel for all Ss. Both groups received randomized 
50% reinforcement on Task 1 during experimental 
trials, with the one limitation that no more than 2 
trials of either reward or nonreward occurred in 
succession, In addition to the color presented on 
the multiple-stimulus projector, irrelevant cues (an 
X, a square, a circle, or a dash) were also presented 
simultaneously so as to enhance the complexity 
of the task. 

For each rewarded and nonrewarded response on 
Task 1, number of incorrect responses and latency 
to initial response on Task 2 were recorded. La- 
tency was defined as the time from illumination of 
the multiple-stimulus projector to S's first button 
press on Panel 2, whether correct or incorrect. 


RESULTS 


The latency scores were calculated for 
each S following both rewarded and non- 
rewarded trials in Blocks 1 and 2 (see 
Table 1). Since the homogeneity of vari- 
ance assumptions could not be established 
for these 2 groups, Fmax = 1.977, b < .01, 
latency scores for each of the 2 groups 
were analyzed separately with a Treatment 
X Treatment X S analysis of variance, 
The internal group demonstrated a sig- 
nificant main effect, F (1, 15) = 4.282 
p < .05, while the external group did dt 
exhibit this effect. The internal group 
thus demonstrated significantly reduced 
latencies following nonrewarded trials when 
compared with rewarded trials. 

Four separate error scores were calculated 
for each S: total error scores following 
nonreward and following reward in each of 
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2 blocks of experimental trials (see Table 
1) A 3-factor analysis of variance on 
these data yielded a significant triple 
interaction, F (1, 30) = 6.38, p < .02, and 
a significant trial blocks effect, F (1, 30) 
= 50.962, p < .001. The latter indicates 
that Ss performed with significantly fewer 
errors in the second block of trials than the 
first block. A Treatment X Treatment XS 
analysis of variance was performed sep- 
arately on each of the 2 groups. A signif- 
icant A X B interaction with the internal 
group indicated the appropriateness of 
individual ¢ tests, F (1, 15) = 5.684, p < 
.025. Internal group Ss made significantly 
more errors following nonreward than 
following reward in Block 1, £ (15) = 4.496, 
b «.01. Although the F test was sta- 
tistically significant for the interaction, 
the £ test for the second block of trials on 
error scores was not statistically significant, 
t (15) = 1.118, p <.3. Again the trial 
blocks effect was statistically significant, 
F.(1, 15) & 15.788. p «.005. The Treat- 
ment X Treatment X S analysis of vari- 
ance for external group Ss yielded only a 
trial blocks effect, F (1, 15) = 43.01, 
p < .001. 


Discussion 


These data, both error and latency scores, 
suggest that the internal group was frustrated 
by the experimental manipulation whereas 
the external group was not similarly frus- 
trated. Hypotheses were thus supported in 
that the internal group showed a significant 
latency effect consistent with the animal 
literature and a significant interaction with 
regard to error scores, consistent with Libb 
(1972). The interaction was clearer than in 
Libb's study, perhaps due to the fact that Ss 
who had little tendency to be frustrated by 
experimental manipulations were classified as 
externals. This experiment suggests that much 
of the difficulty in demonstrating FEs with 
human s may be due to the fact that indi- 
vidual differences obscure results when such 
factors are undetected and uncontrolled. If, 
in fact, an S is not frustrated, it hardly seems 
vp use to predict that he would manifest 

is 

It is also noteworthy that the latency effect 
was observed in this experiment and not ob- 
Served in the Libb (1972) experiment in which 


* 


» 
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both internal and external Ss were likely 
participants. These data are thus unique in 
that the FE was evident in 2 dependent vari- 
ables: latency and error scores. Such findings 


` lend further support to Spence's (1960) theory 


of emotionally based drive and its effect on 
performance of tasks of varying complexity. 

Difficulties in interpretation may be en- 
countered in the lack of a significant reduction 
in errors from Block 1 to Block 2 following 
reward in the internal group. Since trials 
introducing both nonreward and reward were 
randomly but evenly distributed within each 
block, one would expect errors to decrease 
following both reward and nonreward as 
learning progressed. The internal S may 
have become bored with the task such that 
his performance deteriorated unless frustra- 
tion was added to the motivational complex. 

Alternative explanations of these data are 
possible. For example, internal Ss more than 
external Ss may initially be searching for a 
combination Task 1-Task 2 strategy which 
will permit them to be correct 100% of the 
time. Such an approach to the experimental 
situation could account for not only the higher 
number of errors following nonreward but 
also the equivalent reward performance on 
Blocks 1 and 2; ie., occasionally trying out 
more complex strategies intrudes on perform- 
ance following rewarded trials. Alternatively 
internal Ss may have experienced some 
anxiety with regard to Task 1 since the nature 
of the task may have precluded their pre- 
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ferred perception of skill control. Increased 
anxiety may lead to enhanced drive and in- 
creased errors in parallel to the predicted FEs, 
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Fehrer and Raab and subsequent investigators have reported that simple 
reaction time (RT) to a metacontrast-inducing combination of test and mask 
stimuli is more rapid than to either component (test or mask) alone. Thus, 
they concluded that simple RT is not affected by metacontrast. We note 
that their conclusion is not warranted. A test-mask combination normally 
should evoke faster RT than either test or mask alone because of the greater 
area and, hence, total energy of the combination. A more appropriate com- 
parison is between the metacontrast-inducing combination and a combination 
of equal area and energy that does not induce metacontrast. The results of 
this comparison were that 4 Ss gave equivocal results; RT to the 2 com- 
binations and to its components were all equal, but 4 S; 
the non-metacontrast-inducing combination. 


showed faster RT to 
The overall results indicated 


that metacontrast does, in fact, affect si 


Metacontrast refers to a decrement in 
the clarity or visibility of a stimulus pro- 
duced by adjacent, nonoverlapping stimuli 
that occur in close temporal proximity. 
One typical configuration used in studies of 
metacontrast is a centrally located square 
(test stimulus) and a pair of flanking 
squares (mask stimuli) which share adja- 
cent contours with the test stimulus. The 
test, mask, and combination of test and 
mask occur in random order, over trials. 
The temporal spacing between test and 
mask—stimulus onset asynchronies (SOAs) 
—are generally varied over blocks of trials. 
When the test and mask energies are equal, 
the phenomenal brightness of the test 
square is a U-shaped function of SOA. 
With test energy less than mask energy, 
phenomenal brightness is an increasing 
function of SOA. In either case, the phe- 
nomenal brightness of a test stimulus is 
reduced by the presence of nonoverlapping 
mask stimulus. (See Kahneman, 1968, for 
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mple RT. 


a general substantiative and procedural 
iew.) 


Despite the phenomenal dimming of the 
test stimulus, however, simple reaction 
time (RT) to a test-mask combination is 
typically no slower than simple RT to the 
test alone. This finding has led several 
investigators to conclude that metacontrast 
does not affect simple RT (Fehrer & Raab, 
1962: Harrison & Fox, 1966; Schiller & 
Smith, 1966). Fehrer and Raab further 
concluded that ‘‘It was therefore the physi- 
cal dimensions of the [test] stimulus rather 
than its phenomenal characteristics that 
determined RT [p. 147 ]." 

However, the support for the hypothesi: 
that simple RT and verbal report obey 
different functional relationships may sim- 
ply be based upon an improper compari- 
son. The RT to the combination should 
not be compared to the test alone. A more 
proper comparison is with a second combi- 
nation of equal energy and area which does 
not produce metacontrast. Such a combi- 
nation could consist of squares of the same 
individual dimensions, durations, and lumi- 
nances as the metacontrast combination 
but which are spatially arrayed so as not to 
induce metacontrast (i.e., are nonadjacent). 
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Comparing simple RT to a metacontrast- 
inducing (experimental) combination with 
a non-metacontrast-inducing (control) com- 
bination rather than the mask alone is 
necessary to equate for stimulus energy. 
The RT to both combinations can reflect 
spatial integration and, when SOA is not 
zero, temporal integration as well (Bern- 
stein, Amundson, & Schurman, 1973; Kietz- 
man & Gillam, 1972). Spatial integration 
was reported by Fehrer and Raab (1962) 
among many others; the 2 mask squares, 
being of greater combined area, produced 
faster RT than the single test square. The 
crucial question is whether metacontrast 
does or does not affect integrative processes. 

The preceding implies that 2 additional 
findings must converge upon the observa- 
tion that RT to an experimental combina- 
tion is faster than RT to the mask alone 
(and, consequently, the test alone as the 
greater mask area normally yields the 
shorter RT of the 2 conditions) before one 
can conclude that metacontrast does not 
affect simple RT. One should demonstrate 
that the experimental configuration did, 
in fact, produce metacontrast as inferred 
from verbal report. In addition, RT to the 
experimental and control combinations 
should be equal if the extent of integration 
is the same. Conversely, if RT to the con- 
trol combination is faster than RT to the 
experimental combination, one should con- 
clude that metacontrast does affect RT 
even if RT to both combinations is faster 
than RT to the mask alone. 

A third possible result is that RT to the 
mask and the 2 combinations are all equal 
. to one another. This finding would be 
uninformative, since it would indicate that 
the mask alone was the effective stimulus, 
ie., the additional energy provided by the 
test was not sufficiently great to affect RT. 

The present data were gathered from 
the third study of a series. The first 2 
studies employed a less optimal design and 
will be discussed briefly. As with other 
metacontrast studies in which the mask 
energy exceeded the test energy, pilot data 
revealed that the verbally inferred masking 
function for the experimental combination 


was monotonic. Since the metacontrast 
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effect was greatest with concurrent test- 
mask onset, 0-msec. SOA was best suited 
to explore possible effects of metacontrast 
upon simple RT. Hence, SOA was not 
varied in the experiment reported here 
although some relevant data were gathered 
in the earlier experiments. 


METHOD 


Subjects. The Ss were 8 paid volunteers, 4 males 
(RF, JSA, GB, and KS) and 4 females (VA, JS, SF, 
JC). All were graduate students in psychology at 
the University of Texas at Arlington. All had nor- 
mal or corrected-to-normal visual acuity. 

Apparatus and stimuli. Stimuli were presented 
by transillumination in a 3-channel tachistoscope 
(Scientific Prototype, Model GB) using standard 
Sylvania bulbs (F8T5/D). Because imperfections 
in mirror alignment might have produced binocular 
parallax, viewing was monocular through a 2-mm. 
artificial pupil. Luminances were measured with a 
McBeth Illuminometer. The tachistoscope's timing 
accuracy was verified with an oscilloscope (Tek- 
tronix, Model 564). The RT responses were right- 
hand telegraph key depressions timed with a Hunter 
Klockounter (Model 120G). A foot-pedal was 
used for S to self-initiate trials with the left foot. 
The E and the S were seated in adjacent rooms. 

The stimulus display consisted of a dim fixation 
circle that remained on continuously. The circle 
was located in the blank field of the tachistoscope 
and was illuminated by an external source so that 
the blank field could be used as a separate exposure 
field. The fixation point had very low luminance 
and became clearly visible only after fixation. Ex- 
posure Field 1 contained a centrally located square, 
the base of which was .33° of arc above fixation 
(experimental test. Exposure Field 2 also con- 
tained a centrally located square, the top of which 
was .33° of arc below fixation (control test). Ex- 
posure Field 3 (nominally the blank field) contained 
a pair of squares (the mask), whose inner sides were 
contiguous to the experimental test, The individual 
squares were all transilluminated. The dimensions 
(.67° of arc/side) and luminances (63 cd/m?) were 
identical to those used by Fehrer and Raab (1962). 

The test and mask durations were 2 and 100 msec., 
respectively. This typically produced chance ac- 
curacy in discriminating experimental test + mask 
(TM-E) from mask (M) alone at an SOA of 0, the 
SOA used throughout the experiment, This dura- 
tion was adequate to allow perfect discrimination 
of control test + mask ((TM-C) from M alone in 
pilot work. The entire display fell within a 2° arc 
of fixation. 

Procedure. All but 2 Ss had extensive practice at 
reacting to the various stimulus conditions in either 
or both of 2 pilot studies and were run without addi- 
tional practice. The 2 experimentally naive Ss 
were run for a 1-hr practice period. All Ss were 
then run in a single session of approximately 1 hr. 
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The session began with 5 min. of dark adaptation. 
Half of the RT trials were then run. A 3-min. 
break was provided following which a series of verbal 
report trials, described below, were run. A second 
3-min. break was provided, and the remaining RT 
trials were run, 

Both verbal report and RT trials began with a 
signal from E that the necessary adjustments had 
been made. The S then fixated, blinked if necessary, 
and when the circle appeared as sharp and clear as 
possible, initiated the trial with the foot switch, 
This activated a timer to initiate 1 of 3 randomly 
chosen foreperiods (1.9, 2.2, or 2.5 sec.). 

For RT trials, 1 of 5 stimulus conditions was 
possible: (a) TM-E, (b) TM-C, (c) M alone, (d) ex- 
perimental test (T-E) alone, and (e) control test 
(T-C) alone. These occurred randomly, subject to 
the condition that TM-E, TM-C, and M occur 
20 times each and T-E and T-C occur 10 times each 
per half session. A complete factorial design com- 
bining the 2 test positions with masks above and 
below fixation was not possible since there was no 
fourth tachistoscopic field available for mask 
below fixation. Thus, one purpose of the 1 
T-C comparison was to insure that the retinal 
regions stimulated were equally sensitive. 

All responses were timed from the end of the fore- 
period. All RTs longer than 350 msec. or shorter 
than 100 msec. were rerun (less than 1% of all trials). 

The verbal report trials consisted of the random 
presentation of M and TM-E 50 times each. The 
Ss responded on each of these trials along a 6-point 
confidence rating scale (Green & Swets, 1966), with 
1 = very sure that TM-E had been presented, 
2 = moderately sure that TM-E had been pre- 
sented, . and 6 — very sure that M had been 
presented. As noted above, pilot data indicated 
that the TM-C and M could always be discriminated 
Írom one another. Likewise, T-C and T-E were 
never confused with each other or with the other 
alternatives. 


s. 


RESULTS AND DISCUSSION 


'The major finding was that metacon- 
trast did affect simple RT. The RT to 
'T M-C (201 msec. over Ss) was faster than 
RT to TM-E and RT to M which, in turn, 
did not differ (205 msec.). The RT to T-E 
and T-C, likewise, did not differ, and due to 
their smaller area were the slowest (216 
msec.). Analysis of variance (Stimulus 
Conditions X Ss) indicated that the 5 
means differed, F (4, 28) = 24.13, p « .01. 
Also, RT to TM-C was significantly faster 
than RT to TM-E, F (1, 7) = 5.67, 
p < .05, but RT to M and TM-E did not 
differ (F < 1.0) likewise, RT to T-E and 
RT to T-C did not differ (F < 1.0). 

Receiver operating characteristic (ROC) 


curves were obtained from each S's con- 
fidence ratings, treating TM-E presenta- 
tions as "signal plus noise" and M presenta- 
tions as "noise." The mean area under 
these ROC curves was .524. This is well 
within chance limits, indicating that con- 
current presentation of the experimental 
test (T-E) and the mask (M) produced 
complete masking. Only 1.5 (KS) exceeded 
chance. 

Thus, the data show that more integra- 
tion of test and mask energy occurred 
without rather than with metacontrast. 
In fact, with the present high level of 
metacontrast, no integration at all occurred ; 
RT to the experimental combination and 
mask alone were equal. Fehrer and Raab's 
(1962) conclusion that metacontrast does 
not affect RT seems to be an artifact of 


TABLE 1 


INDIVIDUAL S MEAN (X) Reaction Time (RT) AND 
rANDARD D. TIONS (SDs) AS A FUNCTION 


or SrIMULUS CONDITION 
Stimulus condition 
Subject ——— 
TM-E TM-C M T-E T-C 
GB. 
X 221 222 222 | 237 | 240 
SD 18 20 16 23 20 
JS. 
€ | 177 177 177 | 196 185 
SD 13 11 16 20 13 
RJS 
X 211 211 208 | 218 | 217 
SD 20 18 18 21 15 
SE. 
X 220 | 219 218 | 225 | 236 
SD 24 21 27 19 22 
KS. 
X 187 180%* 189 | 209 | 203 
SD 22 25 25 27 33 
JSA 
Ü 12224 2T ns ces Pale ea O eal 1231 
SD 1-49 24 23 32 29 
JC. | 
X 209 | 201»* 203 | 212 | 215 
SD 21 18 25 24 25 
VA. 
X 191 186*** | 195 | 203 | 200 
SD 20 19 21 14 21 
Comp. 
X 205 | 201 ~ | 205 | 216 | 216 


Note, Abbreviations: 
TM-C = control test + masi 
perimental test alone, and T-C 
are 1-tailed. 

«RT to TM-C <RT to M. 

è RT to TM-C « RT to TM-E. 

*p <.05. 

p <.01. 


perimental test + mask, 
mask alone, 
trol test alone. 


A À—— ately 


improperly comparing RT to the experi- 
mental combination with RT to the mask 
instead of RT to the control combination. 
Our results are not due to differential sensi- 
tivity of the retinal loci stimulated by the 2 
test squares, since RT to each of them was 
identical. 

The individual S data are presented in 
Table 1. The results of / tests comparing 
TM-C, TM-E, and M among themselves 
are included. Only 4 Ss (KS, JSA, JC, and 
VA) had faster RTs to the control combina- 
tion than to the experimental combination 
and mask alone. The remaining Ss’ RTs 
to these 3 stimulus combinations are all 
equal; hence, their data are inconclusive. 
The test energy was apparently too low 
for the latter Ss to demonstrate any integra- 
tion. No S fulfilled the criteria discussed 
earlier to support Fehrer and Raab's (1962) 
conclusion by manifesting equal amounts 
of integration to the 2 combinations. Since 
enough Ss were run in the present study to 
demonstrate the metacontrast impairment 
in RT statistically over all Ss, there seemed 
to be no reason to run additional trials or 
Ss. We did look for factors separating the 
2 groups of Ss who, respectively, did and 
did not demonstrate integration with T-C. 
These analyses failed to show that verbal 
report performance, familiarity with the 
metacontrast phenomenon, RT experience, 
or S’s own expectancies were pertinent. 

The results of our 2 prior studies (Bern- 
stein, Amundson, & Schurman, 1973; 
Bernstein, Futch, & Schurman, 1973) were 
that RT to TM-E fell between the RT to 
TM-C and RT to M, unlike the present 
results. This was because the test energy 
was sufficiently high to allow partial verbal 
report detection and, hence, integration. 
The stimulus parameters in the Fehrer and 
Raab (1962) and related studies suggest 
p partial detection. This would explain why 
RT to TM-E would be faster than RT to 
M. Despite the energy differences among 
the studies, the difference at 0-msec. SOA 
between RT to TM-C and RT to TM-E 
was about the same 4 msec. as shown here. 
These earlier studies also found that not 
all Ss demonstrated integration effects. 

The failure to obtain integration effects 
with the present high levels of metacontrast 
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is contrary to findings reported by Fehrer 
and Biederman (1962) when (unlike the 
Fehrer & Raab 1962 study) verbal report 
performance was near chance due to a 
slightly different method of presenting the 
test. Fehrer and Biederman reported 
integration did occur with chance verbal 
report accuracy, a form of ‘‘discrimination 
without awareness.” Recently, Bernstein, 
Admunson, and Schurman (1973) reported 
an artifact underlying Fehrer and Bieder- 
man’s conclusion. The artifact was a dif- 
ference of statistical power between the 
verbal report comparison and the RT com- 
parison. Bernstein, Amundson, and Schur- 
man 1973 used signal detection methods to 
compare verbal report and RT directly, 
using inferential tests of similar power. 
They found verbal report to be the superior 
performance index. 

The results presented in this report and 
those of Bernstein, Amundson, and Schur- 
man (1973) jointly suggest that verbal 
report and RT do not differ as much in the 
type of perceptual mechanism but, rather, 
in the duration of processing that occurs 
prior to a response. That is, a verbal dis- 
crimination, or more generally, any re- 
sponse requiring a choice, involves a longer 
period of temporal integration within the 
system. Additional evidence for this differ- 
ence in processing duration is the shorter 
critical duration for simple RT when com- 
pared to verbal report (Bernstein, Futch, 
& Schurman, 1973; Bruder & Kietzman, 
1973; Fehrer & Raab, 1962; Kietzman & 
Gillam, 1972). It is with this observation, 
first made in the Fehrer-Raab series of 
experiments and their most important con- 
clusion, that we strongly agree. The 
shorter period of temporal integration would 
explain why Fehrer and Raab and others, 
including us in our pilot studies, found no 
difference between RT to the combination 
and RT to the mask alone at longer SOA 
even though verbal report was impaired. 

Special note needs to be made about 
Fehrer and Raab's (1962) 25-75 msec. 
range of SOA, where there was partial to 
total phenomenal test suppression. The 
RT to the combination was essentially 
equal to the RT to the test (but slower than 
RT to the mask). As the authors noted, 
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Ss easily learned to discriminate the combi- 
nation from the mask by use of apparent 
movement cues. They also noted that 
these cues could have constituted part of 
the effective stimulus for RT to the combi- 
nation. Thus, the near equality of RT to 
the combination and test alone seems to 
reflect opposing processes; apparent move- 
ment decreased RT to the combination 
whereas phenomenal dimming increased it. 
The surprising point of their experiment 
thus does not seem to be that these RTs 
were essentially equal. Rather, it is why 
the second event (mask) could influence 
simple RT at SOAs beyond the critical 
duration for the first event (test). There 
are abundant data to suggest that length- 
ening the duration of a target of moderate 
to high luninance beyond 10-20 msec. does 
not reduce and may even increase simple 
RT (Bernstein, Futch, & Schurman, 1973). 
Bernstein, Futch, & Schurman argued for 
the importance of “on” and “off” responses 
as RT determinants. These effects also 
seem appropriate to consider in the light 
of the present paradox. 

"Thus, one need not assume that the phys- 
ical stimulus intensity is processed at one 
level, and that metacontrast affects a 
higher, more conscious level. Mechanisms, 
such as lateral inhibition and luminance 
summation, which collectively seem to ac- 
count for much of the metacontrast phe- 
nomenon, can occur at the earliest stages 


of information processing and thus affect 
both verbal report and RT. 
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RELATIVE EFFECTIVENESS OF RHYMES AND SYNONYMS 


AS RETRIEVAL CUES! 


DOUGLAS L. NELSON * anb DAVID H. BROOKS 
University of South Florida 


An experiment was performed to compare the relative effectiveness of rhymes 
and synonyms as extralist retrieval cues, The a priori relation between 
synonym cues and their targets and rhyme cues and their targets was equated, 
at both weak and strong levels, by the.use of normative data, An attempt 
also was made to hold "set size" constant. The results indicated that recall 
was highest with strong c intermediate with weak cues, and lowest when 
no cues were provided. This relative ordering was obtained for both rhyme 
and synonym cues, suggesting that when a priori strength of relationship has 
been equated, at either weak or strong levels, both types of cues are equivalent 


as aids for retrieval. Implications for the encoding specificity principle and 
for generation-recognition models of free recall were discussed, 


Words are represented in memory as 
complexes of distinctive features. This 
conjecture represents the principal assump- 
tion of several recent conceptualizations of 
word coding processes (e.g., Anderson & 
Bower, 1972; Nelson, 1972; Tulving & 
Thompson, 1973). These features are 
empirically identifiable as graphic, pho- 
netic, associative, semantic, and imaginal, 
but they could plausibly consist of any 
discriminable attributes. According to one 
version of the multifeature hypothesis, the 
process of coding a word that is presumably 
already within S's vocabulary consists of 
priming one or more of its distinctive 
features during the study portion of the 
current learning task (Nelson, 1972). 
"Thus, when a word is presented along with 
àn instruction to remember it, one or more 
of its attributes are either selectively or 
automatically primed, i.e., they become 
functionally effective in the task. Another 
assumption of this conception is that the 
priming of specific attributes can be in- 
fluenced by task characteristics, contextual 
cues, and instructional sets, as well as by 

1 This research was supported by Grant MH-16360 
to the first author from the National Institute of 
Mental Health and by Stipend GZ-2341 from the 
National Science Foundation to the second author. 
The authors are indebted to Eileen Edmunson and 
Kris Emden for their aid in collecting the data for 
this experiment. 

2 Requests for reprints should be sent to Douglas 
L. Nelson, Department of Psychology, University 
of South Florida, Tampa, Florida 33620. 


item attributes themselves, and that these 
features, once effective, function as re- 
dintegrative cues for subsequent stimulus 
retrieval. 

One implication of this conceptualiza- 
tion is that items that cannot be recalled 
under noncued conditions might be made 
accessible in the presence of extralist cues 
that prime features shared by specific list 
items (cf. Tulving & Pearlstone, 1966). 
Strong extralist associates of list items (e.g., 
Bahrick, 1969), names of conceptual cate- 
gories (eg, Tulving & Psotka, 1971), 
hononyms and synonyms (Light, 1972) as 
well as rhymes and graphic cues (Bregman, 
1968; Nelson & Brooks, 1974), have all 
been used as effective recall aids. However, 
with some exceptions (e.g., Bregman, 1968; 
Light, 1972), very little effort in this area 
has been directed toward assessing the 
relative efficiency of various types of cues. 
Although the apparent differential empha- 
sis given semantic factors by psychological 
theorists leaves the impression that mean- 
ing is the sine qua non of memory, there is 
ample evidence suggesting that sensory 
features, both graphic and phonetic, play 
a prominant role in word processing (e.g., 
Gibson, 1971; Nelson, 1972), These con- 
siderations suggest that comparisons of the 
effectiveness of meaning and sensory fea- 
tures as extralist retrieval aids might shed 
some light on the relative importance of 
these attributes within the memory repre- 
sentation of a given word. Therefore, this 
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experiment directly compared rhymes and 
synonyms as retrieval cues for rendering 
access to words stored during a rapidly 
paced single presentation. In order to 
ensure that this comparison was on a sound 
basis, two potential problems had to be re- 
solved. First, the a priori similarity be- 
tween the synonym cues and their targets 
had to be equated to the a priori similarity 
between the rhyme cues and their targets. 
If not so equated, one type of cue may 
prove to be superior to or not different 
from the other because of an initial con- 
founding of the cue-to-target relationship. 
Second, "set size" had to be equalized for 
both types of cues. If cued recall is anal- 
ogous to a search task, then the cue that 
provides the greatest reduction of the 
number of relevant items will be the most 
efficient. Providing that the procedures 
used to eliminate these potential con- 
foundings are effective, equivalent facilita- 
tion would suggest that the two types of 
cues were equally effective, while differ- 
ential facilitation would imply that one type 
of cue provided superior retrieval access 
relative to the other. This experiment, 
therefore, was explicitly designed to ma- 
nipulate type of cue—synonym or rhyme— 
at two levels of cue-to-target relationship, 
while, at the same time, the a priori rela- 
tions between cue and target and set size 
were carefully equated for both types of 
cues. 


METHOD 


Design. The experimental design was a 25 
between-within factorial. Type of cue, type of 
recall, lists, and Es were all between-Ss factors and 
strength relation of cue to target was a within-Ss 
factor. Cues were rhymes or synonyms, recall was 
cued or free, and strength was either strong or weak. 
Separate free-recall groups were used for rhyme and 
synonym conditions, since the targets for each of 
these lists had to be composed of dilferent sets of 
words. 

Maierials. All words used in this experiment were 
chosen from norms obtained from a separate intact 
group of 74 Ss in an ongoing psychology course. 
Each 5 was given a booklet containing 6 pages. 
The initial page contained rhyme instructions for 
half of the Ss and synonym instructions for the 
remaining half of the Ss, and the next 5 pages each 
contained 1 column of 20 words with a blank to the 
right of each item, The rhyme instructions indi- 
cated that on the blank beside each word they 
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should write the first word that they could think 
of that rhymes with the word on the left and, 
similarly, the synonym instructions indicated that 
they should write down the first word that they could. 
think of that means the same thing he word on 
the left. The same 100 words were used with both 
instructional sets, and these words had been chosen 
by E in advance so that each item had several 
rhymes and synonyms. The order of the pages 
containing the sample was independently random- 
ized for each S. 


The normative data provided by these Ss was 
then used to construct 4 lists, 2 for the rhyme con- 
dition and 2 for the synonym condition. Each list 


contained 24 pairs, the cue words having served as 
stimuli in the controlled association task and the 
targets having been provided by various propor- 
tions of Ss in that task. For half of the pairs, the 
cue words were strongly associated to their respec- 
tive targets. An average of 52% of the Ss provided 
the target in response to the cue word and, for all 
of these pairs, the target was the primary response 
to the cue, For the remaining half of the pairs 
within a given list, the cue-to-target relation was 
weak, with an average of 7% of the Ss responding 
with that particular target. The average numbers 
of responses given more frequently than response 
words selected as targets were 2.83 and 2.63, respec- 
tively, for synonyms and rhymes. Furthermore, of 
all the responses given during the recall phase of 
the experiment, 22.195 were given more frequently 
in the controlled association task than the targets 
used in the weak-synonym-cue condition, 2 
were given more frequently than the targets used in 
the weak-rhyme-cue condition. Thus, strength of 
relationship was carefully equated at each level for 
rhyme-related and synonym-related pair: In 
addition, to preclude the possibility that the effects 
of type or strength of cue might be attributed to 
confoundings associated with differences i: 

an attempt was made to equate this va j 
conditions. Potential set size was e« 1 
the number of different responses giv: ue 
in the controlled association task. 7 v the 
rhyme-related and synonym-related pa were 
an average of 6.83 (SD =1.74) and 6.42 ) = 2.36), 
respectively, different targets provided for each cue. 
To control for potential set size effects within the 
strength variable, the cues having a strong relation 
to their respective targets in one list had a weak 
relation to their targets in the other list. For ex- 
ample, in the rhyme condition, KEG was used as a 
strong cue for the target LEG in one of the lists and 
as a weak cue for the target PEG in the second list. 
In the synonym condition, FABLE was used as a 
strong cue for the target STORY in one of the lists 
and as a weak cue for the target LEGEND in its 
complementary list. i ulted in 
in the 
rhyme condition and for each of the 2 lists in the 
synonym condition. The targets were, however, 
different for all 4 lists and, although it was possible 
to equate word frequency within types of cues, it 
was impossible to equate relative frequency within 
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the strength variable. Thus, for each of the 2 lists 
in the rhyme and synonym conditions, the targets 
having a strong relation to their cues had an average 
frequency of occurrence of approximately 80 times 
per million (Kučera & Francis, 1967). The targets 
having a weak relation to their cues had an average 
frequency of occurrence of approximately 17 times 
per million. This confounding of strength and 
frequency of occurrence was unavoidable since, in 
the controlled association ta words provided by 
the smallest percentage of the Ss were also less 
frequent than words provided by the largest per- 
centage of the Ss. Because of this confounding, 
separate free-recall control groups were used for 
ch list of targets. In addition to providing a direct 
assessment of the frequency variable, this procedure 
permitted evaluations of the effectiveness of strong 
and weak rhyme and synonym cues relative toa free- 
recall baseline appropriate for each frequency level. 
Procedure; AN Ss participated in individual 
sions. On study trials, each list word was shown 
by a Kodak Carousel ide protector driven by an 
auxiliary timer at a 2-sec. rate. All words were 
typed in uppercase. The Ss were instructed to 
pronounce the items aloud as each one appeared 
and, since the list would be shown only once, to 
concentrate in order to remember as many items as 
possible, Immediately following the presentation 
of the last word, instructions for recall were read. 
In the cued-recall groups, these instructions speci- 
fied the nature of the relationship of the cue words to 
the targets and indicated that using the cues would 
aid their recall. Thus, the information concerning 
the nature of the retrieval cues was provided after 
list presentation. In the free all control groups, 
these instructions indicated th row of asterisks 
would be repeatedly flashed on the screen, and that 
each time the asterisks appeared, a word—any 
word from the list—should be recalled. Both sets 
of instructions were carefully equated for length. 
In all conditions, recall was oral and was response 
paced, i.e., as soon as the target was recalled or S 
indicated that he could not remember the appropri- 
ate word, the next cue or row of asterisks was shown. 
The order of presentation of the targets on the study 
trial and the order of presentation of the cues on 
the test trial, if any were independently randomized 
for every 5. 


se 


TABLE 1 


Mean NUMBER OF List Worps 
CORRECTLY RECALLED 


Type of cue 


Strength Rhyme Synonym 
of cue ia E 

| | 
Cued Free Cued | Em 
recall | recall recall recall 
Weak 9.20 4.70 9.00 4.50 
Strong 6.65 4.15 5.80 4.35 
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Subjects. Ten Ss were assigned to each list within 
each condition, and each Æ collected data from 5 of 
these Ss. This procedure resulted in 20 Ss within 
each of the major conditions, and a total of 80 in 
the entire experiment. For each E, Ss were assigned 
to conditions in blocks of 8, with 1 S from each 
condition and list per block. Assignment within 
blocks was determined by a table of random num- 
bers, All Ss were selected from introductory psy- 
chology courses and received. points toward their 
grades for participation. 


RESULTS 


Table 1 presents the mean number of 
correct recalls as a function of type of cue, 
type of recall, and strength of relation of 
cue to target. An analysis of variance of 
these data indicated that type of recall, 
F (1,76) = 94.61, 2 < .01; strength of cue, 
F (1, 76) = 22.93, p < .01; and the Type 
of Recall X Strength of Cue interaction, 
F(1, 76) = 21.40, p < .01, were the only 
significant sources of variance. Type of 
cue and all other interactions were not 
reliable. When recall was free, an average 
of 4.60 items were recalled in the strong-cue 
(high frequency) condition, and an average 
of 4.55 items were recalled in the weak-cue 
(low frequency) condition. When recall 
was cued, an average of 9.10 items were 
recalled when cues were strong, and an 
average of 6.22 items were recalled when 
cues were weak. Fisher’s least significant 
difference for this interaction was .87. 
'Thus, when recall was free, there were no 
differences between the strong- and weak- 
cue conditions, i.e., word frequency had no 
reliable effects. When recall was cued, 
significantly more items were recalled when 
strong as compared to when weak cues were 
provided. Furthermore, comparison of 
recall performance in the weak-cue con- 
dition to the free-recall conditions indicated 
that weak cues significantly facilitated 
recall compared with no cues at all. Thus, 
recall was highest when strong cues were 
provided, intermediate when weak cues 
were provided, and poorest when no 
retrieval aids were presented. The facili- 
tating effects obtained with weak cues 
contrast with the inhibiting effects ob- 
tained with comparable cues used by 
Thompson and Tulving (1970). 
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Discussion 


When set size and a priori strength of rela- 
tion of cue to target have been equated, 
synonyms and rhymes are apparently equally 
effective as extralist retrieval aids. This 
equivalency is obtained at both strong and 
weak levels of associative strength. There- 
fore, cuing with either semantic or sensory 
attributes can provide equally effective access 
to the coded representations of target words 
primed in the context of a rapidly presented 
list of unrelated items. Presumably, the two 
types of cues are effective either because they 
provide independent access routes to two dif- 
ferent representations of the target word, 
one based on sensory attributes and one based 
on semantic attributes, or because they provide 
access to a multifeatured representation that 
contains both types of information. lrrespec- 
tive of the correctness of either of these alter- 
natives, the primary implication of the results 
of this experiment is that the sensory attributes 
of a word may be as functionally important 
in its representation as are its semantic 
attributes. 

"These results also carry implications for two 
theories concerning extralist cuing effects, the 
encoding specificity principle (Tulving & 
"Thompson, 1973) and the generation-recogni- 
tion models (Anderson & Bower, 1972; 
Bahrick, 1970). According to the encoding 
specificity principle, the effectiveness of an 
extralist cue is completely determined by what 
is encoded while the target items are being 
studied. A retrieval cue cannot be effective 
unless the target is specifically encoded with 
respect to that cue at the time of its storage. 
The generation-recognition models assume 
that the extralist cue serves to generate rele- 
vant response alternatives and that recognition 
tests are applied to each item, Items meeting 
the criteria of acceptability are produced as 
overt responses. These models make no 
explicit assumptions concerning the relation 
between the effectiveness of the extralist cue 
and the manner of storage of the target item. 
However, most of them assume that words are 
represented in memory as lists or collections 
of features that specify their sound, meaning, 
and other attributes and, presumably, most 
might easily incorporate the notion that the 
representation of a word might be modified 
when presented with a context word that 
primes a specific attribute, Thus, at least in 
general orientation, the theories are similar. 
in many respects, 
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To account for the facilitating 
cuing in this study, the encoding spe 
principle would have to assume that 


both 
semantic and sensory features of the target 
words were primed during the encoding phase 


of the task. False recognitions obtained by 
using synonym distractors in a recognition 
memory paradigm attest to the reasonablen 
of the assumption that meaning attributes are 
primed during the presentation of a list of 
unrelated words (e.g, Anisfeld & Knapp, 
1968). Furthermore, false recognition and 
transfer data using homophones are consistent 
with the assumption that sensory attributes 
also are primed in this situation (Nelson & 
Davis, 1972), and therefore, extralist cuing 
effects obtained with these stimuli (Light, 
1972) can be interpreted as being compatible 
with this principle. Thus, synonyms and 
rhymes were presumably effective cues be- 
cause they provided sufficient redintegrative 
information about their respective targets. 
However, although this interpretation accounts 
for the cuing effects per se, it fails to explain 
why differences were obtained between strong 
and weak rhyme cues. Words at either level 
provide the same degree of overlap of sensory 
features and, therefore, in contrast to synonym 
cues, there should have been no strength effect 
associated with these cues. This strength 
effect might be explained by the assumption 
that, given the cue, Ss simply guessed the 
first rhyme that came to mind. This possi- 
bility would account for some portion of the 
cuing effect associated with 


strong 


(Bahrick, 1969). However, simple ing 
would appear inadequate as an exp! on for 
the cuing effect associated with vesk cues 
since, if Ss were Saying the first word that 


occurred to them, nearly all of thei: responses 
would have been intrusions, Alternatively, the 
strength effect might be explained by assuming 
that, in cued recall, the retrieval aid acts to 
initiate a search through items sharing fea- 
tures with the cue word and, as items are 
generated, recognition checks are performed 
(Anderson & Bower, 1972; Bahrick, 1970). 
If it can be assumed that words associated with 
each type of cue are generated hierarchically, 
then the chances of producing a list word are 
greater when extralist cues serve as cues for 
responses that are high in the hierarchy (strong 
cues) than for those low in the hierarchy (weak 
cues). This explanation for the facilitating 
effects of cuing incorporates the major assump- 
tions of both the encoding specificity hypoth- 
esis and the generation-recognition models. 
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RHYTHM IS PROCESSED BY THE SPEECH HEMISPHERE ! 


GEORGE M. ROBINSON ?* anb DEBORAH JO SOLOMON 


Duke University 


Following Lashley and Neisser, models of speech cognition frequently stress 
a rhythmic organizing principle and would predict that speech processing is 
intimately related to the processing of rhythmic patterns. However, in agree- 
ment with clinical data, dichotic listening studies establish that, while speech 
stimuli are processed by the left hemisphere, other nonspeech auditory stimuli 
are processed by the right. Challenging this distinction, the present study 
shows that nonspeech rhythmic patterns carrying no phonetic information 
are processed by the same hemisphere as speech. Twenty-four Ss listened to 
30 dichotic pairs of rhythmic pure-tone patterns. In a counterbalanced, 
forced-choice recognition task, they were able to correctly identify the patterns 


heard in the right ear significantly more often than the left. 


suggest that since rhythmic patterns 


Lashley's (1951) classic paper focused 
attention on the problems of organization 
in speech cognition. He proposed that the 
temporal order of the elements of speech 
results from an underlying hierarchical 
structure, The regular appearance of a 
series of elements with respect to time has 
recently been described as a rhythmic 
phenomenon. 


One way to think of the effect of rhythm is that 
it may provide a set of reference points to which 
digits and words can be attached. ... A rhythmic 
pattern iy a structure which serves as a support, 
an integrator and a series of cues for the words 
to be remembered ( Neisser, 1967, pp. 223, 235]. 


In a recent review, Martin (1972) pre- 
sented “a good deal of evidence that 
rhythmic patterning carries a heavy in- 
formation load in ordinary connected 
speech [p. 5007." 

In the past decade, several converging 
lines of research have supported and refined 
the classical notion that language func- 

f tions are primarily mediated by the left 
cerebral hemisphere. Such hemispheric 
specialization is supported by studies on 
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are the only nonspeech auditory stimuli 
to share the processing of the left hemisphe 
rhythmic organization in speech cognition a 
speech and rhythm require hierarchical organi: 
hemisphere is better able to process hierarchically. 


The results 


re with speech, models involving 
to be encouraged. Since both 
tion, it is likely that the left 


patients with unilateral lesions and ex- 
cisions (e.g, Milner, 1971) and cerebral 
commisurotomy (e.g., Milner, Taylor, & 
Sperry, 1968). For normal Ss, the asym- 
metries found using the dichotic listening 
technique, in which different stimuli are 
presented simultaneously, one to each ear, 
have closely paralleled the clinical studies. 
Speech material entering the right ear 
is processed more effectively, while the 
opposite is true for nonspeech mate- 
rial (Kimura, 1973; Studdert-Kennedy & 
Shankweiler, 1970). The resulting con- 
clusion that contralateral auditory path- 
ways are stronger than ipsilateral ones is 
independently supported by electrophysio- 
logical evidence (e.g., Rosenzweig, 1951). 
Thus, the dichotic listening technique has 
become a tool for investigating hemi- 
spheric specialization. 

Given the well-established laterality of 
speech functions, theoretical models that 
involve rhythmic organization in speech 
processing would predict that the hemi- 
sphere that is more effective in processing 
speech will also be the more effective 
hemisphere for rhythm. Since a wide 
variety of nonspeech auditory stimuli, in- 
cluding music, were demonstrated to be 
processed better by the right hemisphere 
(Dee, 1971; Gordon, 1970; Kimura, 1964; 
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Milner, 1962; Spellacy, 1970), such a pre- 
diction is far from obvious, In fact, both 
Milner and Spellacy, expecting the right 
hemisphere to be superior, tested for lat- 
erality of rhythm but found no significant 
effect. 

Halperin, Nachshon, and Carmon (1973) 
pointed out, however, that the speech vs. 
nonspeech distinction is no longer adequate 
to describe hemispheric specialization for 
auditory stimuli. Their Ss were able to 
report the order of 3 broad-band noises 
heard in the right ear more accurately 
than in the left when the frequency and 
bandwidth or pulse durations included 1 
or 2 changes. For noises without these 
changes, the left ear was superior. A re- 
lated finding is the one by Carmon and 
Nachshon (1971) that left-hemisphere le- 
sions impair the ability to correctly report 
the order of a series of sounds and colored 
lights. Halperin et al. concluded that 
these findings suggest 
that the left hemisphere specializes in the pro- 
cessing of temporal patterns of stimuli and that 

ption of verbal stimuli might be associated 
with sequential analysis of acoustic stimuli (p. 49). 


Some cautions should be observed in 
interpreting the results of Halperin et al. 
(1973), The frequency shifts and band- 
width changes in the noises used in their 
first study may have directly involved the 
phonetic analyzing ability of the speech 
hemisphere. Thus, instead of attributing 
the increasing dominance of the right ear 
to increasing complexity of the temporal 
pattern, the shift may have occurred be- 
cause changes in the spectrum were ac- 
tually encoded in a way similar to speech 
encoding. Their second experiment, using 
changes in the durations of 3 noise bursts, 
comes much closer to isolating temporal 
complexity as the factor responsible for 
shifting dominance from the right to the 
left hemisphere, It may be, however, that 
the abrupt onset and offset of the broad- 
band noises were not so unlike speech 
sounds. In fact, their data do not show 
that the right hemisphere was dominant 
when there were zero duration changes. 

The following experiment attempted to 
directly investigate the hypothesis that 


n 
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complex nonspeech temporal patterns cons — — 


taining no phonetic information (rhythms) 
are processed by the speech hemisphere. 


Memon 


Subjects. Twenty-four Duke University students 
with normal hearing were unobtrusively screened 
to eliminate left-handers by noting which hand 
they used to sign their names. After the listening 
task, S» were further examined for handedness, 
None were ambiguous cases, The Ss were volun- 
teers without any knowledge of the purpose of 
the study, None were familiar with Morse code, 

Stimuli and procedure. Each rhythm consisted 
of a “refrain” repeated 3 times with a 50-msec. 
interval. Each refrain consisted of a random series 
of 4-7 short (50-msec.) and long (150-msec.) 
900-Hz. sinusoidal pulses separated by 50 msec. 
The durations were chosen so that both the short 
and long pulses with their interpulse intervals 
would fall well within the range of spoken syllable 
durations. Each of the 30 trials consisted of à 
dichotic pair of rhythms followed by a random 
series of 4 individual recognition teat rhythms, 
binaurally presented: the 2 components of the 
dichotic pair and 2 new ones, These 4 recognition 
test rhythms were separated from the dichotic 
rhythm and from each other by 4 sec, with a 
l0-sec. intertrial interval, Within each trial, all 
6 rhythms had identical durations, The com- 
ponents of the dichotic pair began and ended 
simultaneously. 

"The test materials were generated by a PDP-8L 
computer, recorded on a Revox A77 stereo tape 
recorder and played through Koss PROAA ear- 
phones at 00-db. SPL. 

Each S first balanced the 2 channels for a bin- 
aural series of regular pulses so that the sound 
appeared to be centered in his head, The Se then 
listened to the text materials in 1 of 4 conditions: 
counterbalanced by interchanging left and right 
channels of the tape and interchanging the 2 half 
blocks of 15 trials. The Ss were required in a 
forced-choice paradigm to mark 2 of a row of 
4 boxes to indicate which 2 of the 4 recognition 
choices they believed to be part of the dichotic 
pair, No letters or numerals appeared anywhere 
on the coding sheets. 


ResuLTSs 


Table 1 shows the number and percentage of 
correct identifications out of 30 trials for the right 
and left ears of each 5. The right-car scores were 
significantly better: £ = 3.01, p < .01, 2-tailed test. 
The scores for each ear were both better than 
chance: for the right ear, £ = 6.45, p < 001, and 
for the left ear, 1.9 2.87, p < 01, 2-tailed tests, 


Discussion 


The results show that rhythmic patterns, 
unlike other nonspeech auditory stimuli, are 
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TABLE 1 


NUMBER AND PERCENTAGE OF CORRECT RESPONSES 
IN 30 TRIALS FOR 24 Ss 


Right ear Left ear 
Ss 
No. correct | Percentage | No. correct | Percentage 
1 19 63 13 43 
2 21 70 14 47 
3 21 70 19 63 
4 17 57 20 67 
5 25 83 16 53 
6 22 73 20 67 
7 20 67 18 60 
8 15 50 16 53 
9 20 67 13 43 
10 21 70 12 40 
11 17 57 21 70 
12 18 60 17 57 
13 21 70 15 50 
14 17 57 15 50 
15 16 53 16 53 
16 17 57 21 70 
17 17 57 19 63 
18 22 73 17 57 
19 17 57 16 53 
20 15 50 12 40 
21 13 43 18 60 
22 18 60 17 57 
23 20 67 16 53 
24 20 67 16 53 
x 18,71 62 16.54 55 
SD 2.76 2.58 


ed better by the same hemisphere 
that is dominant for speech stimuli. This 
study supports the heuristic discussions of 
Lashley (1951) and Neisser (1967). The 
results confirm the suggestion by Halperin 
et al. (1973) that temporal patterns as well 
as speech stimuli are processed by the left 
hemisphere. Because rhythm is the only 
nonspeech auditory feature found to be pro- 
cessed by the speech hemisphere, models of 
speech cognition based on rhythmic organiza- 
tion are to be encouraged. 

Because the rhythms used in the present 
experiment were moderately complex, rapid, 
and repeated 3 times, it is likely that they 
were processed hierarchically in the manner 
described by Lashley (1951) and Neisser 
(1967). The interested reader will find a dis- 
cussion of how the details of the hierarchic 
structures underlying rhythmic patterns might 
be organized in Martin (1972). It is probable 
that the left hemisphere is dominant for both 
nonspeech rhythms and speech stimuli be- 
cause it is better able to do the hierarchical 
processing they both require. 

The dichotic listening technique demon- 
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strates directly only a relationship between 
the performance of one ear | handedness. 
Two inferences permit conclusions about 
whether or not a particular stimulus is pro- 
cessed better by the speech hemisphere. The 
first was discussed in the introduction, that is, 
that contralateral auditory connections are 
stronger than ipsilateral ones. In this study, 
material entering the right ear was processed 
(mainly) by the left hemisphere. The second 
inference is that right-handed Ss do their 
speech processing with the left hemisphere. 
Branch, Milner, and Rasmussen (1964), by 
comparing the effect on speech of injecting 
sodium amytal into first one and then the 
other of their Ss’ carotid arteries, determined 
that 90% of right-handers and 60% of left- 
handers have their speech functions mediated 
by the left hemisphere. 

On the assumption that the hypothesis is 
correct, we should not be surprised that our 
data show (as do other dichotic studies) a few 
Ss to have higher scores for their left ears. 
Although it is possible that this is a major 
source of variance, there is no independent 
means to assess it within this study. While 
the distribution tends to be bimodal, the 
tendency is not significant. 

Halperin et al. (1973) asked their Ss to 
name the noises in the order they appeared. 
In our study, the coding sheet was deliber- 
ately kept free of language-related material to 
avoid biasing the data by "priming" the left 
hemisphere. The spatial array used would, 
if anything, invite right-hemisphere proc: 
In addition, the recognition test rhvt for 
the left and right ears appeared 


lom 
order. These procedures bias 
claimed by theories which the di- 
chotic listening effect to di es in the 


order of recall, claiming a gr hort-term 
memory loss for material played to the left 
ear (e.g, Inglis, 1965). Such caution may 
not be necessary. Bryden (1967) reviewed 
the several explanations for dichotic listening 
asymmetry and concluded that Kimura's 
(1961) theory attributing the asymmetry to 
perceptual differences is the only one that is 
consistent with all experimental results. 
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PERFORMANCE DURING CONTINUOUS AND IN' 


RMITTENT 


NOISE AND WEARING EAR PROTECTION 
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The Ss were tested on the 5-Choice Test of Serial Reaction for 40 min. in 
intermittent and continuous broad band noise (Experiment |) and while 


wearing ear protection in continuous noise (Experiment 1I). 
formance during intermittent noise were approximately half those 


Gaps in per- 
in con- 


tinuous noise. Errors were affected equally adversely by both intermittent 
and continuous noise. Ear protection interacted with noise and time-on-task, 
reducing gaps in noise in the first half but not the second half of the test. 
Results are consistent with the theory that there are 2 different effects of 
noise. The improvement of performance when noise was intermittent was 
attributed to a reduction in the monotony experienced during a long exposure 


to continuous noise. 


The value of ear protection was attributed to a reduc- 


tion in perceived loudness and prevention of temporary arousal following 


the onset of noise. 


One recent interpretation of the adverse 
effect of noise was that it functioned as a 
distractor (Broadbent, 1958) causing im- 
pairment in tasks that had characteristics 
rendering them sensitive to distractions. 
However, evidence against this view arose 
from the beneficial effect of noise on per- 
formance of the 5-Choice Test of Serial 
Reaction following sleep loss (Corcoran, 
1962; Wilkinson, 1963) when both condi- 
tions, applied separately, caused impair- 
ment. In addition, incentive augmented 
the adverse effect of noise as compared to 
its normal beneficial action (Wilkinson, 
1963). These results suggested that the 
adverse effect of noise was mediated by 
an increase in arousal which offset the 
effect of sleeplessness but which was further 
enhanced by incentive. One difficulty with 
this view was that the arousing effect of 
noise, and consequent change in perform- 
ance, would probably be greatest when the 
noise was first introduced, but the observed 
adverse effect generally increased. with 
duration of noise exposure, independent of 
time-on-task (Hartley, 1973), The results 
of Hartley's experiment, which also in- 
volved the 5-Choice Test of Serial Reac- 
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tion, ruled out the possibility that the 
adverse effect of noise was dependent on 
prior performance of the test, since noise 
introduced after many minutes of per- 
formance had virtually no effect. Con- 
sequently, the appearance of the adverse 
effect of noise after exposure was attri- 
buted to an accumulating function of the 
noise. 

This conclusion was substantiated by 
performance on a Stroop color interference 
test in noise (Hartley & Adams, 1974). 
Interference was increased by  20-min, 
prior exposure to noise as compared to 


quiet. An additional finding of in st in 
Hartley and Adams’ exper (also, 
Houston, 1969; Houston 3, 1967; 
O'Malley & Poplawsky, 1971) was that 


interference was reduced if noise was given 
only during performance and not also 
prior to it, suggesting different effects of 
short and long noise exposures. ‘There is 
some evidence that reduced interference 
accompanies a raised level of arousal. 
Agnew and Agnew (1963) and Tecce and 
Happ (1964), using threat of electric 
shock, and Callaway (1959), using am- 
phetamine, obtained results indicating re- 
duced interference under these conditions; 
a short exposure to noise may thus be 
arousing. The explanation of the increase 
in interference following a long exposure 


could be in similar terms. The increase 


PERFORMANCE DURING CONTINUOUS AND INTERMITTENT NOISE 


TABL 
N NuMBER or Gars iN Each 10-MrN. BLOCK UNDER EACH CONDITION 


513 


E1 


Knowledge of results 


| No knowledge of results 


Con- 

se — — | —————L————ÀL——- 
Block 1 Block 2 Block 3 | Block 4 | Total | Block 1 | Block 2 | Block 3 Block 4 Total 

1.44 2.67 3.78 4.00 | 11,89 1.83 6.11 5.44 6.00 19.38 
IN 4.78 39 6.83 7.44 24,44 4.56 6.44 8.06 8.17 27.23 
CN 4.39 8.28 9.78 | 9.50 31.95 | 6.33 10.17 11.67 10.94 39.11 
L te i — Je | e =? - 
Note. Abbreviations: Q = quiet, CN = continuous noise, and IN = intermittent noise, 
may indicate some form of reduced arousal, Apparatus. Performance was measured with 


which would be consistent with results 
indicating increased interference following 
amylobarbital (Callaway, 1959) and hyo- 
scine (Ostfeld & Arugete, 1962). The 
initial arousing effect of noise could be 
attributed to the onset of the intense 
stimu'ation to which S later adapted. 
The later reduction in arousal could be 
attributed to the monotony and sensory 
isolation accompanying the longer ex- 
posure to noise. 

To test these suggestions, 2 experiments 
were considered. If the later part of the 
adverse effect of continuous noise is due 
to the monotony and lack of variability, 
then presenting the noise intermittently 
should reduce this part of the adverse 
effect. Secondly, if the adverse reaction 
following the onset of noise is due to its 
loudness or intensity, then wearing ear 
protection should be beneficial at the start 
of the exposure. Wearing ear protection 
should not be of any further benefit when 
adaptation to loudness has occurred and the 
adverse effect of monotony predominates. 


EXPERIMENT l: INTERMITI 
Continuous NOISE 


iNT AND 


Performance of Ss on the 5-Choice Test 
of Serial Reaction was compared during 
quiet (Q), intermittent noise (IN), and 
continuous noise (CN) conditions. 


Method 


Subjects. There were 2 groups of Ss. Each group 
consisted of 18 housewives and professional men, 
ranging 18-45 yr. of age and totaling 36 Ss in all. 
All possessed normal hearing (ie, an absence of 
hearing loss greater than 35 db. in one ear or 30 db. 
in both ears at any of 7 test frequencies covering 
the audible range). 


Leonard's (1959) 5-choice serial reaction apparatus, 
which requires that S be seated before a display 
consisting of 5 neon light sources arranged in a 
pentagon, one of which is always illuminated. 
Arranged on a horizontal response board are 5 brass 
disks corresponding to the light sources. The S is 
required to tap the disk appropriate to the lamp 
illuminated. The light promptly extinguishes and 
another is lit. The S$ works as quickly and ac- 
curately as pc coring as many corr 
as few error: possible. The third score 
of pauses or gaps of 14 sec. between tapping one 
disk and tapping the next. The Ss in this study 
performed with and without knowledge of their 
results (KR and NKR). Immediate knowledge of 
gaps and errors was furnished by a brief illumina- 
tion of a lamp on the left of the response board 
if an error was made, and by a brief illumination 
of a lamp on the right of the board if à gap was 
made. Knowledge of the number of correct re- 
sponses made in each test was provided at the 
end of each test. 

Procedure. Following 40-min. preliminary prac- 
tice on the test, each S performed the test for 
40 min. in Q at 70 db., CN at 95 db., and IN at 
10 and 95 db., measured on the C scale of the sound. 
level meter, Although Ss were tested at different 
times of day, each 5 was given 1 treatment per 
day at the same time of day. The order of presenta- 
tion of the 3 treatments was determined by counter- 
balanced Latin squares for each group of 18 Ss, One 
group of 18 Ss performed the test with KR and the 
other group performed with NKR. The former 
group received brief flashes from the indicator lights 
at the side of the response board whenever a gap or 
error was made, and they were also shown the 
total number of gaps, corrects, and errors made 
by each S in each test in a chart. Broad-band noise 
having equal energy per octave was used in all 
conditions. The IN condition consisted of alternate 
presentations of 95 and 70 db. The mean duration 
of the 70-db. interval was 1.5 sec., with a range 
.3-2.7 sec. The mean duration of the 95-db. in- 
terval was 3 sec., with a range .6-5.4 sec. 


Results 
Performance was scored in 10-min. 
blocks for each 40-min. test. Table 1 
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TABLE 2 


MEAN NUMBER OF ERRORS IN EaAcH 10-MIN, BLOCK UNDER EAC)! 


Knowledge of results No know! 
Con- aa T — LR 
dition ] 
Block 1 Block 2 Block 3 Block 4 Total Block 1 | Block 2 | Bl Block 4 Total 
Q 3.17 4.94 4.17 3.61 15.89 TEEI 6.72 19.99 
IN 5.11 5.83 4.22 4.89 20.05 S39 | 772 | 8.50 31.61 
CN 3.33 6.50 6.39 5.89 22.11 689 | 5.94 9,44 29,44 


Note, Abbreviations: Q = quiet, CN = continuous noise, and IN = intermittent noise, 


shows mean gaps and Table 2 shows mean 
errors for KR and NKR for each experi- 
mental condition. These scores were ana- 
lyzed using analysis of variance as described 
by Winer (1962, p. 329) following arc sine 
square root transformation of the data. 
Considering gaps, there was no difference 
between the KR and NKR groups, F (1, 34) 
= .86, p > .10. Knowledge of results did 
not interact with any of the other treat- 
ments, and hence results from the 2 groups 
were pooled together for further analysis. 
There was a large adverse effect of the 
noise conditions upon the number of gaps 
made, F (2, 68) = 21.92, p < .001. Mean 
number of gaps in Q was 15.6, and in IN, 
25.8. This increase (p < .01) was very 
similar to the increase in gaps in CN 
(X gaps = 35.5) as compared to IN 
(p < .001). The noise conditions did not 
interact with any other factor. 

Considering errors, there was no overall 
reliable difference between KR and NKR, 
but more errors were made in the last 
10 min. of the test with no knowledge 
than with knowledge, F (1, 272) = 3.91, 
p < .05. Pooling the NR and NKR con- 
ditions together, the noise conditions had 
a reliable effect upon errors, F (2, 68) 
= 449, p < .025. Compared to Q, the 
mean increase of 7.89 errors in IN 
(p < 0.05) was virtually identical to the 
mean increase of 7.84 errors in CN 
(p < .05). These results indicate that IN 
caused approximately half of the impair- 
ment apparent in the CN condition in 
terms of gaps, but that the effect of the 
2 conditions upon errors was highly similar. 


EXPERIMENT II: EAR PROTECTION 


The Ss were tested on the 5-Choice 
Test of Serial Reaction in Q and noise (N), 


both with and without 


protection. 


wearing ear 


Method 


Subjects. The Ss were 16 housewives and pro- 
fessional men, ranging 18-45 vr. of age. All pos- 
sessed normal hearing (ie., an absence of hearing 
loss greater than 35-db. in one ear or 30 db. in 
both ears at any of 7 test frequencies covering 
the audible range). 

Procedure. Following 40-min. preliminary prac- 
tice on the 5-Choice Test of Serial Reaction, each 
S performed the test for 40 min. in continuous Q 
and continuous N, both with and without ear 
protection. The order of presentation of the 
4 treatments was determined by counterbalanced 
Latin squares (Williams, 1949). Broad-band noise 
having equal energy per octave was used through- 
out. In the N conditions it was presented at 
95 db., and in the Q conditions, at 70 db., mea- 
sured on the C scale. Each S5 was given one 
treatment per day at the same time of day, al- 
though Ss were not all tested at the same time 
of day. 

The ear protection was provided by Amplivox 
Sonogard ear defenders, specified as providing a 
substantially linear attentuation with frequency 
over the audible range (20 db. at 200 Hz., 30 db. 
at 500 Hz., 38 db. at 1 and 2 kHz., and 45 db. at 
4 kHz., measured at one-fifth octave bands and 
the American Standard Method for the Measure- 
ment of the Real Ear Attenuation of Ear Protection 
at Threshold, ASAZ-24.22-1957). 


Results 


Performance was scored in 10-min, blocks 
for each 40-min. test. Table 3 shows 
mean gaps and Table 4 mean errors for 
each experimental condition. These scores 
were analyzed using analysis of variance 
with repeated measures on each factor, 
following arc sine square root transforma- 
tion of the data. Error terms in the F 
ratios utilized interactions with Ss. 

Considering gaps, as Table 3 shows, 
there was an overall, adverse effect of N 
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TABLE 3 


Mean Gars IN Eacn 10-MiN. BLOCK 
UNDER EACH CONDITION 


Condition | Block1| Block 2 | Block 3 | Block 4| Total 
Q 2:57 5.13 6.69 | 7.31 | 21.70 
Q 4- ED | 4.07 6.57 8.25 | 7.50 | 26.39 
N 5.06 | 11.12 | 10.44 | 8.94 | 35.56 
N+ED | 3.57 6.50 9.81 | 9.88 | 29.76 


Note. Abbreviations: Q = quiet, N = noise, and ED = ear 
defenders. 


as compared to Q, F (1, 15) = 10.88, 
p €.01. Overall, ear defenders had no 
effect, F (1, 15) = .05, p > .10, but ear 
defenders interacted with Test Blocks 
X Noise, F (7, 105) = 2.12, p < .05. This 
interaction was mainly due to the fact 
that in the first 20 min. of the test, when 
ear defenders were worn, the N as com- 
pared to the Q condition had no adverse 
effect at all. Without ear defenders, N as 
compared to Q had a large adverse effect 
(p < .02), reliably larger than when wear- 
ing ear defenders (p < .02). In the last 
20 min. of the test, this antagonistic Ear 
Defenders X Noise interaction disappeared 
(p > .10), and the benefit of wearing ear 
defenders in N as compared to Q was 
reliably less than in the first half of the 
test (p < .05). 

As Table 4 shows, errors were not 
nificantly adversely affected by either noise 
or ear defenders; since the variables did 
not interact, their analysis was not pur- 
sued. These results indicate that ear 
defenders greatly reduce the adverse effect 
of the first 20 min. of the N condition 
but have little or no effect after this point 
in terms of gaps. Clearly there would be 
no point in looking for changes in errors 


TABLE 4 


Mean ERRORS IN Each. 10-Min. BLOCK 
UNDER EACH CONDITION 


Condition | Block 1 | Block 2 | Block 3 | Block 4 | Total 
i | 


2.40 | 4.67 | 6.11 5.72 | 18.60 
Q+ED | 3.30 | 5.22 | 8.80 | 6.25 | 23.57 
N 2.87 | 6.10 | 8.10 | 5.19 | 2226 
N+ED | 345 | 9.90 | 9.73 7.97 | 31.05 


Note. Abbreviations: Q = quiet, N = noise, and ED = ear 
defenders. 
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if noise and ear defenders had no reliable 
effects on this score. 


DISCUSSION 


In general, the pattern of results from both 
experiments is consistent with the suggestion 
of 2 rather different effects on the organism 
of loud noise. In terms of the introductory 
hypothesis, IN greatly reduced the adverse 
effect upon gaps, and although interactions 
between noise conditions and time-on-task 
were not apparent, it is clear from the means 
in Table 1 that the beneficial effect of inter- 
mittency developed only after the first 10 min. 
of exposure. In the first block of a test, 
IN was approximately similar to CN in its 
adverse effect, but substantially less in the 
following part of the test. This pattern fits in 
well with the expectation that any initial 
adverse effect would, in the main, be due to 
arousal, but that any later decrement would 
be due to monotony which would be partially 
alleviated by intermittency or variability in 
the noise. 

Still considering gaps, the benefit of wearing 
ear protection appears to be restricted to the 
first 20 min. of exposure, suggesting that 
intensity causes the initial adverse effect, 
which is consistent with the arousal inter- 
pretation. Similarly, the absence of any such 
benefit in the latter half of the test points 
to an increasing influence of monotony and 
isolation on performance, unaffected by ear 
protection. 

Although the changes in gaps support the 
view that there are 2 different effects of 
noise, the rather similar effects of IN and CN 
on errors has to be accounted for. The pre- 
vailing view of the relationship between gaps 
and errors has been that they represent op- 
posite points on the speed and error dimen- 
sion. Performance under stress has often 
shown a reciprocal relationship between the 
2 scores; noise usually increased errors (Broad- 
bent, 1953) but sleep loss increased gaps 
(Wilkinson, 1963). Increasing arousal was 
conceived as moving the criterion in a risky 
direction and vice versa (Broadbent, 1971). 
Hartley? found just such a reciprocal rela- 
tionship when noise presented over head- 
phones caused more gaps and fewer errors 
than noise in the free-field condition. On 
this view, if the effect of the monotony was 
to reduce arousal, errors would be expected 

ITOR tley. Comparison of headphone and 
free-field noise on the Stroop test. In preparation. 
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to be more numerous in IN as compared 
to CN, particularly toward the end of the 
test when IN should have most benefit in 
opposing the reduction in arousal during the 
monotony of the CN condition. There is 
little evidence here that this is the case. 
Probably the most satisfactory explanation is 
that IN and CN are broadly similar in their 
arousing quality, since they have the same 
intensity, and that the main benefit of inter- 
mittency is in reducing the monotony and 
isolation of continuous noise rather than in- 
creasing the level of arousal. This conclusion 
would rather imply that monotony does not 
simply oppose arousal. Indeed there may be 
2 mechanisms involyed in arousal, one of 
which is affected by monotony and the other 
of which is affected by loudness or intensity, 
in this case. Such a view has support from 
a number of experiments involving the inter- 
action of stresses, including the combined effect 
of noise and sleeplessness, alcohol and incen- 
tive, and personality variables and stresses 
(Broadbent, 1971), 
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RECOGNITION AND RECALL AS A FUNCTION OF INSTRUCTIONAL 
MANIPULATIONS OF ORGANIZATION * 


ROBERT M. 


SCHWARTZ? axb MICHAEL S, HUMPHREYS? 


University of British Columbia, Vancouver, British Columbia, Canada 


Recognition and recall were examined as a functi. 


on of organizational instruc- 


tions, In Experiment I instructions to group items in input order by using 
visual imagery produced increased recall but did not increase recognition 


performance. 


In Experiment I instructions to rehearse overtly the item being 


presented produced decreased recall but did not decrease recognition perform- 


ance, 
recognition and recall. 


A recent controversy. concerns the effect of organi- 
zation on recognition accuracy. Some studies have 
demonstrated a facilitative effect (eg, Jacoby, 
1972), and other studies have found no effect (e.g., 
Bruce & Fagan, 1970). One possible reason for 
the discrepancy is that the manipulations of orga- 
nization which have been used are manipulations of 
the structure of the to-be-presented list; that is, 
Ss in experimental and control conditions have not 
been presented the same lists in the same presenta- 
tion order, and thus the effects of organization may 
be confounded with the effects of the nature of the 
lists. In order to avoid this possible confoundi 
the present experiments, the manipulations used 
volved instructions about processing the list items, 
With instructional manipulations, 5» in experimental 
and control conditions could be presented the same 
lists in the same orders, Also, in addition to recogni- 
tion tests, recall tests were u The use of recall 
tests allows differences in organization to be inferred 
from dilferences in order of output and/or dif- 
ferences in recall performances. 

Experiment I. |n. Experiment I S» in experi- 
mental conditions were instructed to group the 
words to be recalled in their input order by using 
visual imagery. The Ss in control conditions were 
given no special instructions. Insofar as organiza- 
tion includes grouping of items at input, the present 
instructions can be interpreted as a manipulation 
of [SH men (see, e.g., Bower, Lesgold, & Tieman, 
1969). 

The stimuli were 120 concrete (C > 5.50) and 
frequent (F > 20) nouns chosen randomly from the 
Paivio, Yuille, and Madigan (1968) norms Of 
these 120 nouns, 60 were randomly chosen to be on 
the list to be presented to all of the Ss; the other 
60 nouns served as distractors for the Ss in the 
recognition conditions. 

Each of the 60 nouns on the list to be presented 
was mounted as a slide. The slides were then 

] 
tlon aubmitied ix part alten of te Ph D denec i te 


University of British Columbia 
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be sent to Michael S. 
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Humpoxeys: woo who is now at the D of Psychology, 

Northwestern University, Evanston, Illinois 60201. 


The results were discussed as supportive of a 2-process theory of 


randomly arranged to determine order of presenta- 
tion for all of the Ss. Presentation was via a 
Kodak Carousel projector. (model 850) with an 
external timer. Answer sheets were prepared for 
Ss in the recognition and recall conditions, For 
the recall conditions, the answer sheets contained 
2 columns of 30 blank lines, At the top of the 
answer sheet was an instruction asking S to write 
down, in any order he desired, as many words as he 
could remember from the list he had been presented, 
For the recognition conditions, the 60 presentation 
and 60 distractor items were randomly arranged 
and typed in 4 columns of 30 items each. The 
instruction at the top of the answer sheet asked S 
to put a line through all of the words he thought 
were contained in the list he had been presented, 

A2 X 2 factorial design, with both factors varying 
between groups of Ss, was used, The first factor 
was whether the S was given standard learning 
instructions or instructions to group items, The 
second factor was whether S was given a recognition 
or recall test. "The 4 resulting conditions were 
labeled — standard-recall, — standard-recognition, 
grouping-recall, and grouping-recognition, 

The Ss were 57 volunteers from introductory 
psychology classes. They were tested in groups of 
1-3 with all Ss in a group receiving the same in- 
structional manipulation. There were 14 Ss in each 
condition except the standard-recognition condition, 
which had 15 Ss, 

The Ss in all conditions were told that they would 
be presented with 60 nouns that would be familiar 
to them. Upon seeing each noun, they were to 
try to remember it as best they could, They were 
also told that after seeing all 60 nouns, they would 
be tested on how well they remembered them, 
although they were not informed of the nature of 
the test, 

The Ss in the grouping conditions were given addi- 
tional instructions about learning the list. These 
instructions told the Ss to group the words in input 
order by using visual imagery. They were to use 5 
or fewer items in each group, and they were given 
an example of how to group a set of items. 

If the Ss had no questions, the list was then 
presented. Rate of presentation was 4 sec. per item, 
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Immediately following presentation and according 
to their conditions assignment, the Ss were handed a 
recognition or a recall answer sheet. The Ss were 
told that the instructions were at the top of the 
sheet and that they would have 4 min. to complete 
the task, 
Following Kintsch (1968), the score for Ss in all 
conditions was the difference between the number 
of correct and the number of incorrect responses. 
Means for these scores were 45.29, 42.27, 20.71, and 
16.29 for the grouping-recognition, standard-recog- 
nition, grouping-recall, and standard-recall condi- 
tions, respectively. Each S's score was transformed 
using a square root transformation, and the trans- 
formed data were subjected to analysis of variance. 
The results of the analysis indicated that the 
recognition scores were higher than the recall scores, 
F (1,53) = 191.60, p < .001. The grouping instruc- 
tion resulted in significantly improved recall per- 
formance, F (1, 53) = 4.30, p <.05, but not 
significantly improved recognition performance, 
ues 
A second analysis was performed on the data of 
the recall conditions to assure that the superiority 
of the grouping-recall condition could be attributed 
to the instructional manipulation. As the Ss in the 
grouping conditions were told to form groups of 5 
words according to input order, it was predicted 
that the protocols of the Ss in the grouping-recall 
condition would show more clustering according to 
input order than those of the Ss in the standard-re- 
call condition. To test this prediction, the presenta- 
tion list was treated as if it contained 12 categories of 
Swords each. Each category was a group of 5 words 
presented adjacently, starting with Words 1-5, 
6-10, etc. Then each S's recall protocol was analyzed 
according to Frankel and Cole's (1971) adaptation 
of the runs test, and a z score was determined for 
each S. The mean s scores for the grouping-recall 
and standard-recall conditions were 3.53 and 1.93, 
respectively. Both means indicated significantly 
more clustering according to input order than would 
be expected by chance, s = 13.20 for grouping- 
recall and z= 7,21 for standard-recall, both 
ps < 001. However, the grouping-recall condition 
showed significantly more clustering according to 
input order than the standard-recall condition, 
z = 424, p < 001. Thus, it can be concluded that 
the grouping-recall Ss were organizing in accordance 
with the instructions. 

Experiment II. Rundus (1970) had Ss rehearse 
aloud during list presentation and found that items 
that had been rehearsed together were likely to be 
recalled together. Thus, if S could be prevented 
from rehearsing item together, it is possible that 
his level of organization would be depressed. In 
Experiment II Ss in the experimental groups were 
required to rehearse aloud the item being presented, 
thus reducing the opportunity to rehearse 2 or more 
items together (see, e.g., Fischler, Rundus, & Atkin- 
son, 1970), The Ss in the control groups were not 
required to rehearse aloud and were not asked to 
use any particular rehearsal strategy. 

The stimuli were 480 words, 3 from each of 160 
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conceptual categories in the Battig and Montague 
(1969) and Shapiro and Palermo (1969) norms, 

The words were high associates to the category 
names. One word from each category was chosen to 
be on the lists to be presented ; the remaining 2 words 
from each category served as distractors for the 
recognition conditions. 

The 160 words to be presented were typed on 
3X5 in. cards. The cards were then randomly 
divided into 4 lists of 40 words each. Within each 
list, the cards were randomly arranged to form a 
presentation order for that list. 

Recall answer sheets were prepared in the same 
manner as in Experiment I, except there were only 
20 blank lines in each of 2 columns. Four separate 
recognition answer sheets were prepared, one for 
each of the 4 lists, A recognition answer sheet for a 
given list contained a random arrangement of the 
40 words on that list and the 2 distractor words from 
the categories of those 40 words. The recognition 
answer sheets were arranged in 4 columns of 30 
words each, and the instructions at the top of the 
recognition answer sheets were the same as those 
used in Experiment I. 

Like Experiment I, Experiment 1I could be con- 
ceptualized as a 2 X 2 factorial design, The first 
factor was rehearsal conditions, silent or overt, and 
the second factor was type of test, recognition or 
recall. Thus, there were 4 conditions which were 
labeled silent-recognition, overt-recognition, silent- 
recall, and overt-recall. 

However, unlike Experiment I, the 4 conditions 
were varied within Ss so that each S served in each 
of the 4 conditions. Each S was presented each of 
the 4 lists and served in one of the 4 conditions on 
that list. To counterbalance order of conditions 
and presentations of lists, a Greco-Latin square 
design was used. The Greco-Latin square resulted 
in 4 arrangements of lists and conditions for 
presentation. 

The Ss were 20 volunteers from introductory 
psychology classes. Five Ss served in each oí the 
4 Greco-Latin square arrangements. The Ss were 
tested individually. 

Each S was read 2 sets of instructions, one set 
before he served in each of the 2 rehearsal conditions. 
In neither set of instructions was S informed of the 
type of test which would follow the presentation 
of the list. 

For both sets of instructions, S was told that he 
would be presented 40 words that were familiar to 
him. He was shown a deck of cards on which one 
of the lists of words was typed and was told that he 
would be presented the list, one word at a time, by 
E turning over the cards. The S was told that a 
card would be turned over every 5 sec. and that he 
should try to remember as many words as he could. 
In addition, for the silent rehearsal conditions, S 
was told he could study the words in any way that 
he wished. For the overt rehearsal conditions, .S 
was told to repeat aloud the word he was currently 
being presented. He was to repeat the word through- 
out the 5 sec. that he saw it and repeat it at a 1-sec. 
rate. 
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After S was read the set of instructions appro- 
priate for his first 2 lists, the first list was presented. 
The click of an electric timer paced E's presentation. 
Immediately following the completion of first-list 
presentation, S was handed an answer sheet appro- 
priate to the condition and list under which he was 
being tested, He was given 2 min. to complete the 
answer sheet. Then he was told that he would be 
presented another list and he was to rehearse this 
list in the same way as he rehearsed the first list, 
Following the test of the second list, S was read 
instructions appropriate for rehearsal of the third 
and fourth lists. The third and fourth lists were 


then presented and tested. 


The Ss’ protocols were scored as in Experiment I. 
Means for these scores were 27.60, 28.60, 12.75, and 
10.00 for the silent-recognition, overt-recognition, 
silent-recall, and overt-recall conditions, respec- 
tively. Again the data were transformed using a 
square root transformation, and analysis of variance 
was used to examine the transformed data, The 
results of this analysis indicated that recognition 
scores were higher than recall scores, F (1, 48) 
= 905,04, p < .001, The instruction to rehearse 
overtly resulted in a significant decrement in recall 
performance, F (1, 48) = 18.57, p < .001, but no 
significant decrement in recognition performance, 
F <1, Thus, Experiment II has replicated Expe 
ment I by showing an effect of organization in recall 
but not in recognition. 

Discussion. The data from the 2 experiments 
clearly suggest that the instructional manipulations 
affected recall but not recognition performan 
As the present experiments manipulated organiza- 
tion without changing list members or order of 
presentation, the results are an alternative form of 
evidence that organization does not affect recogni- 
tion accuracy. 

The present data are also supportive of a 2-process 
theory of recognition and recall, such as that offered 
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by Kintsch (1970). A 2-process theory includes 
the argument that there are qualitative, rather than 
quantitative, differences between recognition and 
recall. One-process theories, such as the assertion 
that the superiority of recognition as compared to 
recall performance lies in the greater "strength" 
necessary for recall of those items, predict that ex- 
perimental manipulations would have the same 
effect on recognition and recall, The present data, 
which showed that instructional manipulations of 
organization affected recall but not recognition per- 
formances, are an example of the type of data which 
Kintsch (1970) cites as support for his 2-process 
theory. 
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CONCRETENESS OF PEG WORDS IN TWO MNEMONIC SYSTEMS 


DENNIS J. DELPRATO! AND ELIZABETH J. BAKER 
Eastern Michigan University 


Two experiments provide evidence that (a) concreteness of peg words facili 


l tates recall in the one-bun mnemonic system as well as in a paired-associate 
(PA) system, and (b) the one-bun system, in which peg-word retrieval cues 
are not explicitly presented at study and recall, and the PA system, in which 


peg words are explicitly presented, yield comparable results with respect to 
d the effect of peg-word concreteness on recall. These findings are consistent 


with current views of imagery. 


The one-bun. (Bugelski, Kidd, & Segmen, 1968) 
and paired-associate (PA) techniques (Wood, 1967) 
have been used to study the role of imagery in 
1 verbal learning and memory. In the former ap- 
- proach, S first memorizes a list of rhyming numbers 
and peg words (e.g, ONE-BUN, TWO-SHOE). Once 
the rhyme list has been learned, S is instructed 
to learn new words in ordinal position of presenta- 
tion by associating them with images evoked by 
the corresponding rhyming peg words, The PA 
mnemonic system involves presenting S with the 
list of peg words during study and recall trials; 
with the one-bun system, S is instructed to 
integrate to-be-learned words with images aroused 
the pegs. Both techniques are basically typical 
'A tasks on which response terms (to-be-learned 
words) are learned by associating them with stim- 
ulus terms (pegs). The approaches differ because 
the peg words are explicitly supplied by E in the 
PA technique, whereas they must be memorized 
by Sin the one-bun technique. 
. According to contemporary views of imagery 
and the conceptual-peg hypothesis (Paivio, 1969), 
‘the higher the concreteness of stimulus items, the 
(imb their capacity to evoke images that can 
unction a» mediators of associative learning. 
Given this basic theory, the empirical evidence of 
the capacity of concrete. nouns to readily evoke 
images, and the documented facilitative effect of 
concreteness with the PA system (Paivio, 1969), 
concreteness of pegs in the one-bun mnemonic 


This latter finding led Paivio (1968, 1969) to 

that the effect of peg concreteness in the 
‘one~bun system is attenuated because the pegs are 

it; with instructions to use imagery, abstract 
odii just as effective in arousing mediating 
i as concrete pegs. The situation with the 
‘one-bun system is presumed to differ from that 
‘with the PA system, which consistently shows 
coucreteness effects, due to the implicit nature of 
the pegs (retrieval cues) in the former and their 
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explicit nature in the latter system. In other 
words, Paivio assumes a fundamental difference 
in the functional effectiveness of identical pegs 
that is based upon their nominal presence or ab- 
sence, even though 5 can demonstrate they arc 
in memory by his recall of the pegs. This assump- 
tion is not supported by experiments showing that 
well-learned retrieval cues can be just as effective 
in mediating recall when physically absent as when. 
physically present (e.g, Delprato & Garskof, 1969); 
these findings are not surprising if activation of 
the memory trace is critically dependent upon S's 
functional encoding of the nominal stimulus, since 
mere external absence of the stimulus should not 
influence its encoding when it is in memory. 

On the basis of tonsiderations such as the above, 
we submit that it would be more consistent with 
current imagery and encoding theory if peg con 
creteness were a significant factor in the one-bun 
technique than if it were not. Therefore, Experi 
ment | was designed as a partial replication o! 
Paivio's (1968) study. 


EXPERIMENT I 


Method, The Ss were 84 male and female 
undergraduates. 

The concrete and abstract rhymes were identical 
with those used by Paivio (1968), ie., ONE-BUN, 
TWO-SHOR, etc., and ONE-FUN, TWO-TRUK, etc., 
respectively. 

Two 10-item word lists (A and B) to be recalled 
were composed of nouns from the Paivio, Yuille, 
and Madigan (1968) norms, The mean imagery 
ratings, associative meaningfulness, and Thorndike- 
Lorge frequencies for Lists A (first value of each 
pair) and B (second value) were 6.31, 6.30; 5.98, 
5,98; 27.8, 22.7, respectively. 

The Ss were individually tested and were ran- 
domly assigned to 1 of 3 conditions (one-bun, 
one-fun, and control), with 28 Ss in each. All Ss 
were given a single study and recall trial on each 
of the 2 word lists. Half of the Ss in each group 
received List A first, the other half received List B 
first. The instructions for the first list did not 
include mnemonic instructions and were identical 
for all Ss. The Ss were read the list of 10 words 
at a 4-sec. rate, with the words preceded by the 
numerals 1-10. After the list was read once, the 
numerals alone were read in a Prepared random 
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order and 5 was given 10 sec. per numeral to 
- recall the corresponding item. One point was given 
for each word recalled to the appropriate number, 

After S recalled the first list, those who were in 
the rhyme conditions were told that they were 
going to go through the same thing again with 
another list. This time, however, 5s were told 
that an easy technique for learning the 10 words 
‘required that they first learn a simple rhyme, 
then use mental images to picture the object 
represented by the word in the list in relation 
to the word that rhymed with the particular number 
given with the word, The Ss were then read the 
rhyme (abstract or concrete) and tested on it until 
they could recite it 5 successive times without 
error, Examples appropriate to the rhyme condi- 
tion were given, The Ss in the control condition 
did not receive mnemonic instructions between 
lists, Instead, Æ engaged S in casual conversation 
for approximately the same length of time required 
to give the mnemonic instructions to Ss in the 
experimental conditions, 

Results and discussion, The mean recall scores 
on the first list were 3,89, 3,96, and 4.00 for the 
control, one-fun, and one-bun conditions, respec- 
tively, The means on the second list were 4,96, 
5,29, and 6.36 for control, one-fun, and one-bun, 
respectively, The data were analyzed by a 3 X 2 
(Groups X Lists) factorial analysis of variance with 
lists as a within-Ss variable, The between-group» 
effect over both lists was not significant (F = 1.32) 
but the list effect, F (1, 81) 39,85, p < .001, and 
the Groups X List interaction, F (2, 81) = 3.25, 
p <.05, did reach significance, The significant 
interaction indicates differential improvement over 
the 2 lists in the different groups; analysis of the 
simple main effect of the list variable showed that 
all 3 groups did improve significantly from the 
first to the second list, Fs (1, 81) > 4,92, p < 05. 

Individual comparisons via. Duncan's range test 
among the group means on each list yielded results 
consistent with the prediction that peg concreteness 
is a critical variable in the one-bun technique. 
The groups were statistically equivalent prior to 
differential treatment (p 2.05), On the critical 
second list, the abstract-peg one-fun rhyme was 
not more facilitative than the control treatment 
(p.05). The one-bun group was superior to 
the control group (p < .05) and, in contrast with 
the results of Paivio (1968), the concrete-peg one- 
bun rhyme led to significantly higher recall scores 
than the abstract-peg one-fun rhyme (p < .05). 

Paivio's (1968) failure to obtain an effect of peg 

V concreteness in the one-bun system (a finding now 
questioned by Experiment 1), coupled with re- 
peated demonstrations of the effect of this variable 
with PA learning, suggested to him that the 2 sys- 
tems produce different results with respect to peg- 
word concreteness, ie, the Mnemonic System 
X Peg Concreteness interaction. Wood and Bolt 
(1970) tested this implication with the same ma- 
terials in the 2 systems and did not find the pre- 
dicted interaction, ie, comparable results were 
found with the 2 systems. However, since they 


d 

: 

a 
B. 


did not find an effect of peg concreteness with 
either the one-bun or the PA system with their 
materials, Wood and Bolt's finding of equivalent 
effects of peg concreteness in the 2 systems is 
inconclusive in that they did not show the expected 
clfect of concreteness with the PA technique, 


Comparable recall differences between lista 
learned with concrete and abstract pegs in one-bun 
and PA systems are predicted if functional en- 
coding of retrieval cues is not differentially affected 
by the mere presence or absence of the nominal 
stimulus at the time of study and test. While 
the results of Experiment | indirectly support this 
prediction, Experiment II was designed to test it 
directly, Further, the role of peg concreteness in 
the one-bun system, theoretically important in 
itself, was again attacked, 


Exrenment IL 


Method, The Ss were 64 undergraduates of both 
sexes, "Two mnemonic systems (one-bun vs, PA) 
were factorially combined with 2 levels of peg 
concreteness (concrete vs, abstract) to form 4 inde- 
pendent conditions (m = 16/group). 

The materials were identical to those of Experi- 
ment I, In the present study, the abstract and 
concrete peg words became the abstract and con- 
crete stimulus words for the PA conditions, there- 
fore, both stimulus and peg words are referred to 
as pegs. 
All Ss were given 2 study and test trials on a 
single list, with mnemonic instructions prior to the 
first trial, Half of the S» in each condition re- 
ceived List A and half List B. The study-test 
procedure on both trials for Ss in the one-bun 
and one-fun conditions was the same ax that used 
with the second list in Experiment I, 

The procedure for PA Sh was as similar as pos- 
sible to that of one-bun Sa. The PA Sk were not 
required to learn the rhyme, but they were given 
instructions corresponding to those given to one- 
bun Ss on the use of imagery for study and recall, 
The E read the peg words to PA Ss on study and 
recall trials in place of the numbers read to one- 
bun Sx» On recall trials, E read PA Se the peg 
words in a prepared random order (a different 
order on each trial) that corresponded to the 
random order in which the number cues were read 
to one-bun Ss, 


TABLE 1 
Mran Recan Scores: Exrnanent I 


Trad One-bun/fun 


Paired amociate 
Abstract Concrete Abaract | Concrete 
pex pei pes 
1 4.50 se) "mm 64 
2 5.06 aso 6.94 LEL 
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Results and discussion. The mean number of 
words correctly recalled to the pegs on each trial 
by the 4 groups is presented in Table 1. Groups 
receiving concrete pegs in both mnemonic systems 
had higher recall scores than groups with abstract 
pegs. The only significant effects in a Mnemonic 
System X Peg Concreteness X Trials analysis of 
variance were those of trials, F (1, 60) — 143.13, 
p «.001, and concreteness, F (1, 60) = 6.11, 
b «.025. The nonsignificant Mnemonic System 
X Peg Concreteness interaction, F (1, 60) = 1.02, 
supports our suggestion that the functional en- 
coding of stimulus pegs of varying concreteness 
would not be differentially affected by their mere 
nominal presence or absence at the time of study 
and test. 

In conclusion, it appears that peg concreteness 
is a critical factor in both one-bun and PA mne- 
monic systems. These findings are consistent with 
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PROBABILITY OF CONDITIONED RESPONSES AS A FUNCTION 
OF VARIABLE INTERTRIAL INTERVALS! 


KARL HABERLANDT,? K 


Trinity College, Hartford 


Rabbits were trained and extinguished in c 


variable intertrial intervals. 


memory which 


The present experiment examined the probability 
of conditioned responses (CRs) on trial n + 1 as a 
result of the interval since trial » in light of predic- 
tions based on Atkinson and Shiflrin's (1968) mem- 
ory theory and Estes’ (1955a,b) stimulus fluctuation 
theory. Both theories expect that a CR should be 
more likely in early acquisition and less likely in 
early extinction when trial n + 1 is preceded by a 
relatively short rather than a long intertrial interval 
(ITI). Atkinson and Shiffrin suggested that a CR 
occurs on trial n + 1 when information about the 
preceding trial is in the short-term store (STS). 
If the information is not in STS, a search will be 
made for it in the long-term store (LTS). Early 
in training, such information is found only in STS, 


! This report is based on a paper read at the meeting of the 
AR Psychological Association, Honolulu, September 
972. 
? Requests for reprints should be sent to Karl Haberlandt, 
Department of Psychology, Trinty College, Hartford, Con- 
necticut 06106, 


SHORT REPORTS 


IN C. HAILS, ANb ROBERT LEGHORN 


Responding in each rabbit was recorded as a 
result of the interval immediately preceding a trial. Conditioned responses 
were more likely in early acquisition and less likely in early extinction when 
trial n + 1 followed trial n by a shorter rather than a longer interval. These 
results confirm predictions of stimulus fluctuation and 2-store theories of 
uggest that events experienced on a trial, whether in acquisi- 
tion or extinction, are more available and generally influence performance 
more the sooner a subsequent trial is given. 


contemporary imagery theory and the conce 


ptual- 
peg hypothesis (Paivio, 1969). 3 
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sical eyelid conditioning with. 


where it decays relatively quickly. Accordingly, | 
whether in acquisition or extinction, S should recall 
events from trial n better at shorter than at longer. 
ITIs. Similarly, Estes based his prediction on the 
assumption that stimulus elements of trial » are. 
better available on trial n +1 when the ITI is 
relatively short, thus allowing little opportunity 
for stimulus fluctuation. 

Estes (1955, pp. 375-376) cautioned that stimulus 
fluctuation factors may be masked by more tran- 
Sient posttrial processes. Previous work on ITI 
effects confirmed Estes’ caution and showed that 
brief posttrial processes may depress or enhance 
subsequent responding temporarily. Prokasy (1965) 
and Prokasy and Papsdorf (1965) investigated the 
probability of CRs as a function of the preceding 
ITI within the context of evaluating opposing 
predictions from Estes’ (1955) theory and Hull's 
(1951) reactive inhibition concept. The experi- 
ments studied the acquisition of eyelid responses in 
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FIGURE 1, Mean percentage of conditioned responses (CRs) 


following each of 3 intertrial intervals (ITIs) in 3 acquisition 
sessions and 1 extinction session. 


a classical conditioning situation using variable 
ITIs within Ss. Prokasy used ITIs of 2, 4, 6, 8, 
10, 12 and 14 sec. in human eyelid conditioning, and 
Prokasy and Papsdorf used ITIs of 15, 20, 25, 30, 
40, and 45 sec. in rabbit eyelid conditioning. Both 
studies reported that CRs tend to increase with 
longer ITIs, reflecting the dissipation of reactive 
inhibition, as Hull had proposed. Inspection of 
Prokasy’s and Prokasy and Papsdorf’s data suggests 
that the probability of CRs increases most between 
the 2 shortest ITIs, remaining ntially stationary 
thereafter. Apparently, the response decrement is 
temporary, and the influence of reactive inhibition 
decreases or is balanced by stimulus fluctuation after 
about 20 sec. following trial n. Temporary response 
enhancement (up to 5 sec.) was reported by Prokasy 
in those conditions where only 2 ITIs, a short and 
a long one, were randomly distributed. According 
to Prokasy, “S learns to pay attention immediately 
following a trial and then in the absence of another 
trial, relaxes and is subsequently not ready for one 
when it does occur [p. 471]." Temporary response 
enhancement, lasting up to 10 sec., was observed by 
Landauer (1969) in a free-responding situation and 
was taken as evidence for "'posttrial hyperexcit- 
ability" which may be initiated by the unconditioned 
stimulus (UCS). In addition to the choice of 
relatively short ITIs, previous research may have 
made it difficult to detect ITI effects occasioned by 
stimulus fluctuation in another way. That is, ITIs 
are assumed to affect responding differentially in 
early acquisition. Later in training, events from 
trial » are expected to be in STS and LTS and 
equally available, regardless of the length of the 
ITI, and hence CR probability should be invariant 
(Prokasy, p. 471). The studies mentioned above, 
however, do not detail percentage of CRs during 
early acquisition, 

The present study sought to evaluate the expecta- 
tions of Estes’ (1955) and Atkinson and Shiffrin’s 
(1968) theories by minimizing transient posttrial 
processes through the choice of relatively long ITIs 
(20, 100, and 420 sec.) and by observing responding 
from early acquisition through extinction. 

Method. Data were collected from 22 male New 
Zealand white rabbits in acquisition and from 21 of 
these rabbits in extinction. 

A 3,160-Hz. tone of 88 db. as measured at the 
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position of S’s head (General Radio sound pressure 
meter, Scale 20 kHz.) was used as the conditioned 
stimulus (CS) and an 8-ma. shock delivered through 
2 stainless-steel sutures (ga. #32) across the eye 
was used as the UCS, Eyelid responses were mea- 
sured by means of a microtorque potentiometer 
affixed to S's head and an Offner dynograph. Dur- 
ing experimentation, each S was restrained in a 
50.8 X 17.8 X 14.0 cm. Plexiglas box which was 
located in a 66.0 X 63.5 X 53.5 cm. sound-atten- 
uated air-ventilated chamber. After Ss had been 
habituated for 4 hr. to the restraining box and the 
experimental chamber, they received 3 acquisition 
ions and then 1 extinction session each, separated. 
18 hr. On each of 61 acquisition trials per 
session, the CS was presented for 600 msec., over- 
lapping with the UCS for the last 100 msec. On 
extinction. trials, the CS was presented alone. 
"Trials were separated throughout by an average ITI 
of 180 sec. The individual ITIs of 20, 100, and 420 
sec, each occurred 20 times in each session, their 
occurrence following a predetermined order with 
the restrictions that each ITI occur twice within 
6 trials and that during each block of 18 trials, 
the ITIs follow each other with equal frequency. 
In acquisition, all Ss received the same ITI sequence. 
In extinction, Ss received the same sequence as in 
acquisition, except that the sequence was started 
at an ITI of 20 sec. for 7 Ss, at 100 sec. for 7 other 
Ss, and at 420 sec. for the remaining Ss. 

Results. he CRs following the 3 dillerent 
ITIs were analyzed separately within Ss. Figure 
1 presents the mean percentages of CRs following 
each of 3 ITIs during 3 sessions of acquisition anc 
the extinction period. In Session 1 of acquisition, 
Ss responded significantly more following an ITI of 
20 sec. than following the longer ITIs, Friedman 
x°(2) = 9,30, p < .01. Specifically, CRs following 
an ITI of 20 sec. were more likely than CRs follow- 
ing ITIs of 100 sec., / (21) = 1.93, p < .10, anc 
420 sec, ¢ (21) = 2.45, p < .05. There was no 
significant difference in responding following the 
latter 2 ITIs. 

Analyses of runs of consecutive CRs for each .5, 
regardless of the ordinal number of trials on which 
CRs occurred, showed that of the first 10 CRs, 
40.9% followed an ITI of 20 sec., 31.8% followed 
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CONDITIONED RESPONSES ON TRIAL n + 1 
TRIAL INTERVALS AND PRESENCE 
A CR ON TRIAL n 


MEAN PERCENTAC 
AS A FUNCTION OF 
OR AB 


4: Intertrial interval (in sec.) 
Conditioned responses on 


trial n a ig 7 
20 100 420 
83.8 69.8 59.4 
16.5 7.3 6.7 
Extinction» 
Present 71.9 82.5 75.7 
Absent 1.4 16.5 26.2 
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an ITI of 100 sec., and only 27.3% followed an ITI 
of 420 sec., x2(2) = 7.66, p < .05, although there 
was an equal opportunity (33.3%) for CRs to 
follow each ITI equally often. Later in acquisition, 
the percentages converged, such that out of the 
first 40 CRs, 35.1% followed an ITI of 20 sec., 
32.5% followed an ITI of 100 sec., and 32.4% 
followed an ITI of 420 sec. Table 1 summarizes 
percentage CRs on trial n + 1 following each of the 
3 ITIs as a function of the presence or absence of a 
CR on trial n. (Prokasy & Papsdorf, 1965). Per- 
centages are shown only for Session 1 of acquisition 
and extinction, because most Ss had missing data 
in some cells. For the same reason, only 15 Ss were 
included for Session 1 of acquisition. As in the 
Prokasy and Papsdorf experiment, CRs were more 
likely on trial n + 1 when a CR was present on 
trial n than when there was no CR, F (1, 14) 
= 84.45, p < .01. This finding was a reflection of 
the tendency of CRs to occur on adjacent trials 
rather than on separate trials. Similarly, the ITI 
effect was significant, F (2, 28) = 3.88, p < .05. 
However, unlike Prokasy and Papsdorf's study, 
CRs were more likely following the short than the 
longer ITIs. This result obtained regardless of 
whether or not a CR occurred on trial » as indicated 


X ITI interaction, F (2, 28) = 1.22. 
2 and 3 of acquisition there appeared to be some- 
what more responding following the short than the 
longer ITIs, but the difference was not significant. 

In extinction, the pattern of responding was 
opposite to that found in early acquisition, i.e., 
CRs were more likely following the 2 longer ITIs 
than following the short ITI, as indicated by a 
significant main effect, F (2, 40) = 5.20, p < .01. 
Specifically, CRs were significantly less likely after 
an ITI of 20 sec. than after ITIs of 100 s 
1 (20) = 2,58, p < .02, and 420 sec., ¢ (20) = 2.88, 
b €.01. There was no significant difference in 
responding following the latter ITIs. Of the first 
10 failures to respond, analyzed separately for each 
55, 40.5% followed an ITI of 20 sec., 31.9% followed 
an ITI of 100 sec., and only 27.6% followed an 
ITI of 420 sec., Friedman x*(2) = 3.90, p < .15. 
Analysis of variance of percentage CRs on trial 
n + Lasa function of ITI and presence vs, absence 
of a CR on trial » revealed a significant CR presence 
vs. absence effect, F (1, 20) = 181.98, p € .01, and 
a significant ITI effect, F (2, 40) = 8.86, b «€ .01. 
The significant CR Presence vs, Absence X ITI 
interaction, F (2, 40) = 4.12, p < .05, shows that 
ITIs affected the occurrence of CRs on trial n oT 
differentially, depending on whether or not a CR 
was present on trial n. That is, percentage CRs on 
trial x + 1 increased between ITIs of 20 and 100 
sec, regardless of whether a CR was present on 
trial n, ¢ (20) = 2.87, p < .01, or absent, ¢ (20) 
= 247, p < .05. Beyond an ITI of 100 sec., CRs 
on trial » + 1 continued to increase when no CR 
was present on trial w, ¢ (20) = 1.98, p < .10, but 
they appeared to decl if not significantly, when 
a CR was present on trial m, £ (20) = 1.54, p < .20, 

Discussion. The present study found that the 
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length of the ITI preceding a trial influenced th 
likelihood of a CR early in acquisition and extine 
tion. Early in acquisition, a CR was more likely 
when the preceding ITI was relatively short tham 
when the ITI was relatively long (100 sec. and 
longer). On the other hand, in extinction, a CR 
was more likely when the preceding ITI was rela- 
tively long than when the ITI was relatively short. 

The results support a stimulus fluctuation theory 
such as Estes' (1955) and a 2-store theory of memory 
such as Atkinson and Shiffrin's (1968). These 
theories suggest that trial events are more available 
to S the sooner a subsequent trial is given, Even 
if it is assumed for a moment that transient posttrial 
processes, whether hyperexcitability or reactive” 
inhibition, last for intervals as long as used here, 
neither process could account for the present data 
in their entirety. The occurrence of posttrial hyper- 
excitability could explain the present acquisition 
s, because it might be argued that the shock. 
UCS had increased the likelihood of a blink response: 
for a brief period. However, the fact that new) 
learning was more prominent at the shorter ITT 
than at the longer ITIs in extinction cannot be 
accounted for simply by assuming that a shock 
UCS temporarily excited $. Similarly, while a 
reactive inhibition construct would predict the 
higher overall responding at long than short ITIS 
in extinction, it could not readily explain the higher 
occurrence of CRs following long than short ITIs 
when zo CR was present on extinction trial m, or 
the higher occurrence of CRs following short than 
long ITIs in acquisition. 

"The advantage of Estes' (1955) stimulus fluctua- 
tion theory and Atkinson and Shiffrin's (1968) 
memory theory is their ability to handle the present 
data with comparable ease and to integrate them 
with results obtained in human ver learning. 
For example, the results reported here invite com- 
parison with results from some continuous paired- 
associate experiments, especially Bjork's (1966) 
study. In Bjork's research, Ss were tested for the 
recall of paired-associate responses alter varying 
interpresentation intervals (IPIs), Ear y in train- 
ing, i.e., before the trial of the last error, the propor- 
tion of correct responses was higher the shorter the. 
IPI, Bjork suggests that these short-term effects 
may be due to the better retention of items in STS 
with shorter than with longer IPIs. 
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TEMPORAL SEPARATION IN VERBAL DISCRIMINATION TRANSFER ! 


STANLEY J. PASKO anp EUGENE B. ZECHMEISTER ¢ 
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In verbal discrimination transfer: when situational frequency is initially 
superior for incorrect (I) than correct (C) items, Hintzman and Block have 
predicted that the degree to which transfer is inhibited will depend on the 
temporal separation of the original and transfer tasks. As a test of this hy- 
pothesis, Ss learned a reversal list task wherein designation of C and I items 
learned in List 1 was reversed in List 2 (A-B, A-B, where italics indicate 
C item). "Temporal separation between lists (0 or 7 min.) and number of 
List 1 trials (4 or 8) were factors, Support for the hypothesis was found only 
for conditions of low List 1 learning. It is suggested that Ss obtain informa- 
tion concerning List 1 context and intrapair associations as number of List 1 
trials inereas "This information may aid transfer discrimination as frequency. 
accrues to List 2 C items. 
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A serious shortcoming of the frequency theory 
of verbal discrimination (VD) learning (Ekstrand, 
Wallace, & Underwood, 1966) has been the failure 
to predict, completely, transfer performance in 
those situations wherein situational frequency is 
initially greater for incorrect (I) than correct (C) 
VD items. A study by Raskin, Boice, Rubel, and 
Clark (1968) is of this general type. These in- 
vestigators examined VD transfer when pairs from 
List 1 were retained in List 2, with the designation 
of C and I items reversed in the second list (A-B, 
A-B, where italics indicate C item). Positive 
transfer was observed on the initial List 2 trial; 
however, performance first deteriorated slightly and 
then improved on the next few trials, becoming 
inferior to that of a control group on remaining 
trials, The frequency theory assumes that on the 
initial transfer trial situational frequency is greater 
for List 2 I than C items due to their prior de- 
signation as correct in List 1 and that Ss can 
discriminate correctly by choosing the lower level 
frequency items as correct. However, as learning 
demands the continued frequency buildup to C 
items (eg, saying aloud the C items on the test 
trials), the initial advantage to | items is lost and 
a discrimination based on a relative frequency dif- 
ference between pair members is made difficult 
and transfer is inhibited. A strict interpretation 
of the theory suggests that discrimination should 


1 This experiment is part of research submitted by the first 
author in partial fulfillment of the requirements for the MA 
degree at Loyola University, Chicago. 

? Requests for reprints should be sent to Eugene B. Zech- 
meister, Department of Psychology, Loyola University, 
6525 North Sheridan Road, Chicago, Illinois 60626, 


fall to chance levels of responding when the point 
of frequency equality is reached. Although some 
investigators, such as Raskin et al., have reported 
a slight deterioration during early List 2 learning, 
a complete breakdown in discrimination has not 
been observed. s is evident even when indi- 
vidual pair frequencies have been adjusted for the 
effect of wrong responding or, to ensure the hy- 
pothetical equality point for all pairs simultane- 
ously, when individual pair frequencies have been 
rigidly controlled through pronunciation instruc- 
tions (Underwood & Freund, 1970). It has been 
concluded that, at some point during second list 
learning, Ss make use of attributes other than 
frequency to aid discrimination (cf. Underwood 
& Freund, 1970). 

Hintzman and Block (1971) have recently sug- 
gested that frequency theory need mot predict 
deterioration to chance levels of responding in the 
relevant. transfer cases if S's ability to discriminate 
recently from remotely accrued frequencies is taken 
into consideration. These authors reason that if 
early in acquisition of the transfer list Ss ignore 
recent frequencies (List 2), and as frequency ac- 
crues to C items, later ignore older frequencies 
( 1), then deterioration to chance would not 
be expected. Since the ability to separate List 1 


from List 2 frequencies is presumably based on 
nt on the 


the discriminability of "time tags" pre 
list items, it can be expected that “the eff 
of a manipulation, particularly one designed to 
inhibit verbal discrimination performance, should 
depend on the temporal separation of the initial 
and transfer task [p. 305." 


The present experiment is a test of the Hintzman 
and Block (1971) hypothesis as it applies to VD 
transfer. Specifically, transfer in a VD task was 
examined after designation of C and I items learned 
in List 1 was reversed in List 2 (A-B, A-B) and 
when List 2 learning followed List 1 learning either 
immediately (0 min.) or after a short interval 
(7 min.). A reversal task was chosen since both 
members of the transfer list were part of List 1 
learning and, therefore, differed only in terms of 
situational frequency. Also, transfer was examined 
following 2 different degrees of List 1 learning 
(4 or 8 trials). 

Method. Sixty-four 2-syllable nouns varying 
20-50 in Thorndike and Lorge (1944) frequency 
were selected with an attempt at minimizing 
semantic and orthographic overlap. Under the 
restriction that pair members not begin with the 
same letter, 32 words were randomly selected to 
form 16 pairs for presentation to the experimental 
groups. The remaining 32 nouns were randomly 
paired under the above restriction to serve as 
List 1 for the control group. 

In both experimental and control lists, one item 
of each pair was randomly designated the C term. 
For the reversal conditions, 2 forms of List 1 were 
prepared so that each item might be both a C 
and | item. In List 2 the C and I relationships 
were reversed in both form For the control 
groups, one form of List 1 was used for all Ss, 
Control Ss were then equally assigned to the 2 
forms of the reversal list as List 2. Four orders 
of all the above forms were created and were 
presented to Ss equally often in counterbalanced 
form. 

After all Ss received standard VD instructions, 
the experimental groups received 4 or 8 trials on 
List 1. The Ss were not informed as to the number 
of List 1 trials or as to the learning of a second 
list. The C and I items of a single pair were 
printed in juxtaposition and exposed on a memory 
drum, Each pair first appeared for a 2-sec. anti- 
cipation interval, after which the C term was 
shown alone for 2 sec. Across the 4 orders, the 
C term of each pair appeared equally often, but 
nonsystematically, in the left and right positions, 
The intertrial interval for each list was 4 sec. 
"The first trial was always a study trial (S did not 
guess), On the second and succeeding trials, Ss re- 
sponded to each pair, having been told to guess 
if not sure, The List 1 trial numbers given earlier 
include the study trial. The control group was 
arbitrarily given 4 trials on List 1 in order to pro- 
vide warm-up and/or learning-to-learn experience, 

After the requisite number of List 1 trials, ex- 
perimental Ss in the 7-min. temporal separation 
conditions performed a mathematical filler task, 
while Ss in the 0-min. conditions were immediately 
informed as to the nature of the second list and 
that the first trial would again be a study trial. 
The same instructions were repeated to Ss in the 
delay conditions following the math activity. 
Control Ss were informed that the second list was 
unrelated to the first and, for these Ss, List 2 
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learning followed immediately after List 1. List 2 7 
learning consisted of 11 trials for all Ss, with t! 


first trial a study trial. 


A total of 120 Loyola University undergradu- -s 
participated in the experiment in partial fulfilly ont 
of their introductory psychology course reco ve- 
ments, Each of the 5 transfer conditions conta «ed 


24 Ss, Conditions were randomized in blocks {5 
and assignment of Ss was made on the ba: of 
appearance in the laboratory. 

Results, Analysis of variance of the total nuu ber 
of errors made on the first 3 List 1 trials indicated 
that the 4 reversal list conditions did not dier 
on List 1 learning (F <1), For each experimental 
condition and for the control condition the mean 
total errors were as follows: 0-min/4-trial reversal 
list (0/R4), 8.12; 0-min/8-trial reversal list (0/R8), 
8.04; 7-min/4-trial reversal list (7/R4), 6.62; 
7-min/8-trial reversal list (7/R8), 8.08; and control, 
9.29. Within the experimental conditions, how- 
ever, the 2 forms of the VD list could not be 
judged of equal difficulty, F (1,88) = 4.95, p < .01. 
Nevertheless, the differences between forms of the 
list did not vary with experimental conditions, 
F (3, 88) = 2.49, p > .05. 

Figure 1 shows mean errors made on each of 
the 10 transfer trials by those Ss learning a reversal 
list and for control Ss. Statistical analysis of the 
reversal list performance indicated reliability for | 
the pattern of results seen in Figure 1. While 
neither temporal separation nor number of List 1 
trials was significant as main effects, the Number 
of List 1 Trials X Reversal List Trials interaction 
was significant, F (9, 92) = 3.06, p < .01, as was 
the second-order Temporal Separation X Number 
of List 1 Trials X Reversal List Trials interaction, 

F (18, 828) = 2.38, p < 01. These results reflect 
the initially inferior performance, and subsequent — 
faster rate of improvement, of the 0/R4 condition | 
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relative to the remaining reversal list conditions. 
Th versal list trials effect was significant, 
FP O, 92) = 29,33, p < .01, but no other effects 
were reliable, Separate analysis of transfer per- 


formance for the 0/R4 and 7/R4 conditions in- 
dicat ı significant effect for temporal separation, 
F >. 6) = 5.19, p < .05, as well as for the Tem- 
paration X Reversal List Trials interaction, 
1) = 1.94, p < .05. 
wman-Keuls range test of mean errors on 
transfer trial indicated that the experi- 
groups were all significantly different from 
rol group (p < .05). Within the experi- 
groups, the 0/R4 and 7/R4 groups did not 
nificantly, but the 0/R4 group was sig- 
different from the 0/R8 and 7/R8 groups 
i). In terms of overall errors, Ss in Condi- 
R4 were significantly different from the 

valning 4 transfer groups (p < .05). 

Discussion, The frequency theory of VD learn- 
ing requires that several predictions be met regard- 
ing reversal list performance as a function of 
number of List 1 trials, For the present task, an 
increase in number of List 1 trials presumes a 
larger difference in situational frequency between 
C and I items at the point of the first transfer 
trial. If frequency alone serves as a discriminative 

^e larger the initial frequency difference be- 
ind 1 pair members, the easier should be 
diccromnation and the greater should be positive 
transfer on the first reversal list trial, An examina- 
tion of Figure 1 shows that this expectation is 
sitisfiod only when List 2 immediately followed 
the rning of List 1 (0/R4 vs, 0/R8), The same 
on between conditions differing on number 
| trials, but when transfer list acquisition 
was delayed. (7/R4 vs. 7/R8), shows no overall 
difference as a function of increasing trials on the 
original list. A further prediction made on the 
basis of frequency theory concerns the point during 
second-list learning at which deterioration of trans- 
fer performance should be observed, The larger 
the initial frequency differential between C and I 
items, the more transfer trials will be required to 
approximate an equalization of apparent frequency. 
Looking again at Figure 1 and comparing groups 
differing only in number of List 1 trials, there is 
a slight tendency for delayed deterioration between 
Conditions 0/R4 and 0/R8, but little difference 
between Conditions 7/R4 and 7/R8. However, 
differences are slight and, therefore, the results are 
equivocal for this prediction. Finally, frequency 
theory cannot presently account for the difference 
in performance between conditions differing only 
in temporal separation of original and transfer lists 
(0/R4 vs. 7/R4). This latter finding, however, 
is amenable to predictions made from the multiple- 
trace hypothesis of Hintzman and Block (1971) 
and provides evidence that attributes other than 
frequency (e.g, temporal) are involved in transfer 
of a discrimination. 

An explanation for the fact that temporal separa- 
tion was effective in reducing transfer list inhibi- 
tion only for a relatively low degree of List 1 


learning may be related to one of several con- 
sequences of increasing number of List 1 trials, 
First, it can be noted that in the present experiment 
degree of List 1 learning is confounded with tem- 
poral separation between the initial trials of the 
original and transfer lists. It might be logically 
argued that an increase in the number of List 1 
trials would lessen the effectiveness of the present 
temporal manipulation since the more trials on 
List 1, the greater is the time associated with List 1 
practice and, conceivably, the more discriminable 
are the time tags. On the other hand, if increasing 
the number of List 1 trials is correlated with other 
than temporal changes which can mediate list dis- 
crimination, then Ss may ignore List 1 frequency, 
on the basis of these cues, as List 2 frequencies 
become sufficiently disparate to provide a con- 
sistent cue for discrimination performance. Addi- 
tional cues that can mediate discrimination may 
be expected to obviate the use of a temporal cue 
if these added attributes are more dominant in 
discrimination performance, 

Two correlates of increasing number of List 1 
trials can be expected. First, as number of List 1 
trials increases, so too does S's information regard- 
ing List 1 context. If frequency can be assumed 
to be context related, then it is likely that List 1 
frequency is more discri ble after a larger 
number of List 1 trials, For example, this inter- 
pretation would explain the failure of King and 
Levin (1971) to find inhibition in VD transfer 
after 8 List 1 trials when between-lists changes 
involved either designation of a new C item (A-B, 
C-B), or both the reversal of List 1 C items and 
the designation of a new C item (A-B, C-A). 
In both these paradigms, situational frequency is 
greater for I than C items on the initial transfer 
tria nd the frequency theory predicts initial 
positive transfer and subsequent deterioration on 
remaining transfer trials, King and Levin's results 
are in contrast to results seen in Figure 1 where 
reversal list inhibition after 8 List 1 trials was found, 
If context aids list discrimination, less inhibition 
would be expected in those VD paradigms which 
change list composition between initial and transfer 
lists (e.g., A-B, C-B) than in a reversal paradigm 
wherein changes in original and transfer list learn- 
ing are not correlated with changes in list com- 
position (A-B, A-B). 

A second correlate of increasing number of ac- 
quisition trials is an increase in S's information 
regarding intrapair associations (eg, Zechmeister 
& Underwood, 1969). In terms of reversal list 
performance, this fact suggests that as frequency 
accrues to the C items of the transfer list, dis- 
crimination may be aided by S's information about 
intrapair relationships, particularly as equalization 
of frequency between pair members is approximated. 

Finally, all reversal conditions of the present 
experiment showed a slight deterioration on early 
List 2 trials and subsequent inhibition on later 
transfer trials. This finding suggests that Ss are 
not entirely successful at ignoring prior frequency 
during List 2 acquisition, a fact which tends to 


confirm the notion that frequency is the dominant 
attribute governing discrimination performance and 
that Ss have difficulty using less preferred cues. 


REFERENCES 


R., WaLLACE, W. P., & UNpERWOOD, B. J. 


EKSTRAND, B. U x 
nation learning. Psy- 


A frequency theory of verbal-disc 
chological Review, 1966, 73, 566-578. 

Hintzman, D. L., & Brock, R. A. Repetition and memory: 
Evidence for a multiple-trace hypothesis. Journal of Ex- 
perimental Psychology, 1971, 88, 297—306. j 

KinG, M. E., & Levin, I. P. Tests of frequency theory in 
verbal discrimination transfer with varying paradigms and 
List 1 trials. Psychonomic Science, 1971, 25, 226-228. 


Journal of Experimental Psychology 
1974, Vol. 102, No. 3, 528-530 


MOTIVATING EFFECT OF CONTINGENT SELF-REWARD! 


GISELA E. SPEIDEL * 


Kamehameha Early Education Project, Honolulu, Hawaii 


'The motivating effec 


group, yoked to the self-reward < 


‘The variables that are relevant to the establish- 
ment and occurrence of contingent self-reward have 
been charted extensively under laboratory condi- 


tions (for reviews, see Bandura, 1971; Kanfer, 
1970). By comparison, there are relatively few 
studi eking to demonstrate that self-reward can 


have a reinforcing or motivating function. Exami- 
nation of the most frequently cited studies (Kanfer 
& Duerfeldt, 1967; Marston & Kanfer, 1963) has 
revealed methodological problems that make equi- 
vocal the conclusion that self-administered reward 
functions to motivate behavior (Speidel, 1972). 

The present experiment was designed to furnish 
further evidence on the effects of contingent self- 
reward. Specifically, the study tested whether 
sections of a movie, self-administered contingently 
upon the completion of a certain number of ad- 
dition problems, would serve to increase the number 
of problems completed and to increase the rate 
of responding. 


! This research was submitted in partial fulfil 
requirements for the PhD degree at the University of Hawai 
The author wishes to thank Roland G. Tharp, W. J. M. Eva 
Clifford R. O'Donnell, and John G. Carlson for their helpful 
aval cont niente. d 
? Requests for reprints should be sent to Gisela E. 
Kamehameha Schools, Kapalama Height, Honolulu, Hawa 
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s of contingent self-reward on an intellectual though 
boring task were studied. The Ss were instructed to work on simple addition 
problems for as long as they wished. li 
presented themselves with sections of a television film contingent upon a fixed- 
ratio schedule that they had selected from 3 alternative 
s, received the movie sec 
A control group received rest periods instead of the film. The results partially 
supported the hypotheses that contingent self-reward can have motivating 
effects. The self-reward group completed significantly more problems than 
the control group while there were no significant differences in number of 
problems completed between the self-reward and the E-rewarded groups. 
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The Ss in the self-reward condition 


An E-rewarded 
ons automatically. 


Method. The Ss were 45 undergraduate psy 
chology studen Fifteen (7 males and 8 fe 
males) were assigned to each of the following 
groups- self-reward (SR), E-rewarded (ER), andi 
no-reward control receiving rest periods. The S8 
in the ER and the control groups were yoked 
Ss in the SR group in ter of the schedule on 
which they received the reward or the rest periody 
respectively. 

The Ss were tested individually and were pre 
sented with a stack of 70 pages containing 29) 
simple addition problems each. The problems all 
had the same pattern of 2 rows and 3 columns 


of numbers as shown in the example 
A panel with 2 buttons and a counter were con 
nected to a central relay system in E's roomW 
One of the buttons activated a cumulative recordet 
and a counter in E's room and also the counter 
s panel (during the ER and SR conditior 
‘The other button activated a buzzer in E's room 
E then turned on a videotape recorder (for thé 
SR condition only). A television monitor was 
placed in the S room. A videotaped television 
detective story served as the reward. 

All groups received instructions to work on the 
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addition problems and to press the “problems com- 
pleted" button every time they had finished a 
problem. They were given one page of trial prob- 
lems for practice, (The rate with which Ss worked 
on this page was used as a preexperimental rate 
measure.) 

The SR Ss were then shown how they could 
keep track of the number of problems they had 
completed by checking the counter on their panel. 
"They were instructed to treat themselves to some 
“movie time" whenever they had finished a certain 
number of problems by pressing the televi 
button on their panel. The film was briefly de- 
scribed to them and they were shown how to tune 
the monitor, The were presented with a card 
that contained 3 alternative schedules on which 
they could reward themselves. They were asked 
to choose one of the schedules (but not to tell E). 
If they found the schedule on which they worked 
too easy or too difficult they could change sched- 
ules, but not more than once. 

The ER Ss were given the same information 
relating to the counter on their panel, the film, 
and the television-tuning instructions as SR Ss. 
However, they were told that the television would 
come on automatically each time they had com- 
pleted a certain number of problems, For each 5, 
the schedule on which he was to be rewarded 
(which was the same as that on which the SR S 
to whom he yoked had rewarded himself) 
was stated. 

In place of the television instructions the control 
group was informed that occasionally the loud- 
speaker would come on and announce break time. 
During this period they were not to work on the 
problems. Control Ss were yoked to SR Ss in 
terms of the schedule on which they received the 
n period. However, rather than giving the 
period in a single unit at the completion of the 
schedule, it was given in 2 parts—one at some 
point in the middle of the set of problems and 
the other at the completion of the schedule, This 
s done so as not to convey to Ss that the rest 
period was contingent upon performance, 

The final instructions for all 3 groups informed 
Ss that the experiment was open ended in length 
and that they could leave the experiment whenever 
they wished, When they decided to stop they 
were to come into E's room and fill out the ques- 
tionnaire. If S had not signaled that he wanted 
to leave after 900 problems, Z stopped him, The 
Ss were given a questionn and asked to rate 
the movie or the break on a 7-point bipolar scale 
from —3 (extremely boring) through 0 (the neutral 
point) to +3 (extremely interesting or restful). 

Results. The effects of the experimental condi- 
tions were measured on 3 dependent variable: 
(a) the number of problems completed; (b) the 
average number of seconds needed to complete a 
problem, i rate; and (c) the decrease in number 
of seconds per problem from the beginning to the 
end of the experiment, i.e., increase in rate. 

The mean number of problems completed by 
the control, SR, and ER groups was 527.53, 723.33, 
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and 826.67, respectively, The differences among 
the means were statistically significant, F (2, 42) 
= 7.23, p < .005. A comparison of the means by 
the Newman-Keuls method showed that both the 
SR and ER groups completed significantly more 
problems than the control group; the difference 
between the ER and control groups was significant 
at the .01 level; the difference between the SR 
and control groups was significant at the .05 level. 
Although the ER group completed more problems 
than the SR group, this difference was not 
significant. 

Both rate and increase in rate of performance 
during the experiment showed no significant treat- 
ment effects. However, the pretreatment rate 
correlated positively with rate during the experi- 
ment, r (43) = .78, p < .001, and negatively with 
an increase in rate, r (43) = —.61, p < .001. 
"Therefore point-biserial correlations were conducted 
between treatment conditions and each of these 
2 dependent variables with pretreatment rate 
partialed out. correlation coefficients were 
in the predicted direction: The ER group worked 
faster and increased more in rate during the ex- 
periment than the control group while the SR group 
took an intermediate pc However, the cor- 
relations were not significantly different from 
each other. 

The mean rest-period rating given by the control 
group was .33 and did not differ significantly from 
the neutral point. The mean movie ratings given 
by the SR and the ER groups were 1,4 and 2.0, 
respectively. The movie was ed significantly 
more positively than the rest period, ¢ (44) = 4.28, 
p «.001. The more attractive an S rated the 
movie, the more problems he tended to complete, 
r (29) = 28, p < .06. 

Discussion. ‘The results on the number of prob- 
lems completed indicated that self-reward can func- 
tion to motivate behavior in a manner similar to 
E-presented contingencies, Further support for the 
conclusion that the difference in performance be- 
tween the experimental groups and the control 
group was due to the motivating function of the 
film, was provided by the findings that the movie 
s rated significantly more positively than the 
neutral rest period and that the more attractive 
the movie was rated, the more problems were 
completed. 

The fact that the other 2 dependent meas 
showed only tendencies in the predicted dir 
does not provide any evidence against the pre 
hypothesis. Since not even the ER group exhibited 
any significant treatment effects, it is concluded 
that these measures did not constitute a sufficiently 
sensitive test for the effects of contingent reward 
whether externally or self-administered. These 
measures seem to reflect mainly stable individual 
differences in arithmetic skill, as pretreatment per- 
formance accounts for most of their variation. 

It is therefore concluded that the present results 
confirm the findings of Bandura and Perloff (1967) 
who demonstrated that self-administered tokens 
could maintain a motor task in children. A broader 
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data base is thus provided for past and future 
applied studies using self-reward to effect behavior 
change. 

A warning, however, against overenthusiastic 
application of self-reward in a clinical setting, at 
this stage, is appropriate. Experiences gained from 
conducting pilot work and the present experiment 
suggest that the motivating effect of contingent 
self-reward is a delicate one that requires special 
conditions for its demonstration. Some of the 
factors to which special attention must be given 
are the (a) other self-control behaviors an indi- 
vidual already has in his repertoire; (b) appro- 
priate level of aversiveness of the task on which 
the motivating effects are to be demonstrated; 
and (c) appropriate level of attractiveness of the 
reward, so as to ensure motivating effects but not 
to evoke rule violation. Furthermore, most experi- 
ments in self-reinforcement have demand 
characteristics for compliance with the rules for 
contingent self-reward (Bass, 1972). A still un- 
answered question is whether contingent s If-reward 
can occur without such demand characteristics. 
Even in the present experiment demand charac- 
teris existed (e.g. the presence of E in an 
adjoining room, wanting to be a good S, etc.) 
that may have been responsible for the maintenance 
of contingent self-reward, and therefore prevented 
rewarding oneself before the completion of the 
criterion behavior. The design of the present 
study greatly reduced the potential operations of 
such demand characteristics in that (@) Ss were 
given a choice of alternative work schedules and 
the opportunity to change schedules (b) they 
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monitored the number of problems they had com- 
pleted on their counter; and (c) they received no 
signal to indicate that they had reached their 
criterion for self-reward and they consumed the 
reward immediately and did not have to exchange 
tokens for a reward at the completion of the ex- 
periment. The last 2 features had been present 
in the Bandura and Perloff (1967) study. 

In light of the above discussion the conclusion 
permissible from the present study and Bandura 
and Perlofi’s (1967) study is that under circum- 
stances that insure compliance with the rule of 
contingent self-reward, self-administered rewards 
can be shown to motivate and maintain behavior 
in a manner similar to rewards controlled by 
another person, 
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USE OF RULE 1 AND RULE 2 IN VERBAL DISCRIMINATION TRAINING ! 


SLATER E. NEWMAN,? RALPH E. SUG AND CAROL H. AVERITT 


North Carolina State U niversity 


During verbal discrimination training, each S in Experiment I was told on 
study trials which item of each pair was right or wrong. Appropriateness of 
Rule 1 (select the more frequent item of the pair) and Rule 2 (select the less 
frequent item of the pair) was manipulated by having S designate on test 
trials the same item of the pair designated by E on study trials (Rule 1 thus 
being the more appropriate) or the other item of the pair (Rule 2 being the 
more appropriate). In Experiment II, appropriateness of Rules 1 and 2 was 
manipulated by having S designate on the test the same item of the pair E had 


instructed him to learn during study trials or the other item. 


In both experi- 


ments, performance during training was better for Ss for whom the use of 


Rule 1 was the more appropriate. 


According to the frequency theory of verbal 
discrimination (VD) learning (Ekstrand, Wallace, 
& Underwood, 1966), a frequency unit is added 
to an item each time S responds with it. The 
theory asserts that correct responding during VD 
training is directly related to the difference in 
frequency units between items of a pair. According 
to Ekstrand et al, S responds with the more 
frequent (Rule 1) or less frequent (Rule 2) item 
of a pair depending on the requirements of the 
situation. 

A transfer experiment by Underwood, Jesse, and 


Ekstrand (1964) has provided some data on the 


use of these rules. After each S learned a VD list 
to 3 consecutive perfect trials, he learned a second 
list. For half of the Ss, the right items from List 1 
were still right on List 2 but the wrong items were 
new; for the other Ss, the wrong items from List 1 
were still wrong on List 2 but the right items were 
new. Performance on the first test of List 2 was 
almost perfect for the right Ss (for whom the use 
of Rule 1 was the more appropriate). The wrong 
Ss (for whom the use of Rule 2 was initially the 
more appropriate) also made few errors on the 
first test of List 2, though substantially more than 
right Ss. (This difference in performance between 
the right and wrong Ss can be explained by as- 
suming that the frequency difference between old 
and new items for the right group was greater 
than for the wrong group.) These data thus accord 
with the statement that "either Rule 1 or Rule 2 
can be applied depending on the appropriateness 
of the situation [Ekstrand et al., 1966, p. 570." 


EXPERIMENT | 


This experiment studied the use of Rule 1 and 
Rule 2 in a single-list situation. For half of the Ss, 
Rule 1 was the more appropriate from the begin- 
ning of training; for the remaining Ss, Rule 2 was 


1 The authors appreciate the help of Harry K. Dorsett i 
making students at Meredith College available for particip: 
tion in this experiment, and of Donald H. Mershon who 
commented on an earlier version of this manuscript 
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the more appropriate. A 2 X 2 design was used 
which permitted study of the effects of (a) des- 
ignating an item as right or wrong on study trials 
and (b) requiring S to designate the right or wrong 
items on test trials. Thus, on study trials, half 
of the Ss were told which item of a pair was right 
and half were told which item was wrong. On test 
trials, half of the Ss in each treatment were in- 
structed to indicate the righ item of each pair, 
and half were instructed to indicate the wrong 
item of each pair. As in the experiment by Eks- 
trand et al. (1966), S was asked to indicate whether 
the to-be-designated item was first or second rather 
than to pronounce it, thus presumably eliminating 
the buildup of frequency units on test trials. The 
4 groups were designated RR, RW, WR, and WW 
(the first letter refers to the item designated on 
study trials, i.c., R = right, W = wrong; the second 
letter to the item to be designated by Son test trials). 
All Ss were given 4 study trials, each followed 
by a test. 

Method. The 20 pairs were those used in the 
control condition by Ekstrand et al. (1966). For 
each treatment, 2 lists were constructed so that 
on study trials each item of a pair was repeated 
either on List A or on List B (e.g., in Group RW, 
half of the Ss heard "queen-fast, queen is right"; 
the rest heard "queen-fast, fast is right”). During 
training, 8 different orders of the pairs were used, 
4 for study trials and 4 for test trials. Each item 
appeared first in its pair on half of the study trials 
and on half of the test trials. 

The Ss were run individually. A tape recorder 
was used to present the instructions and to present 
the items on study trials and on test trials. The 
instructions told S which item of a pair would be 
designated on study trials and which item he was 
to designate on test trials. On study trials, each 
item of a pair was presented and S was then told 
which item was right (Groups RR and RW) or 
wrong (Groups WR and WW). On test trials, 
each item of the pair was presented and S called 
out that item of the pair (first or second) that he 
believed was right (Groups RR and WR) or wrong 
(Groups WW and RW). A 4-sec. rate was used 
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TABLE 1 
Mean Correct ON Each TRIAL FOR EACH 
TREATMENT: EXPERIMENTS I AND IT 
Trials 
Treatment? 
1 2 | 3 4 Total 
Experiment I 
RR .25 16.55 
RW 7: 14.40 
WR 13,00 1 
ww 16.35 18.95 
Experiment. 11 
RR 12.70 14.95 17.15 17.40 
RW 10.80 | 13.25 | 13.90 | 14.25 
WR 10.75 12.85 14.10 14.45 
ww 12.80 | 14.50 | 17.25 | 16.60 
| 


a Abbreviations: R = right, W = wrong: for further ex- 
planation, see description in text for each experiment, 


on both study and test trials. There were 4 study 
trials each followed by a test. A 50-sec. interval 
separated the first study trial from the first test 
trial. During this interval, E reiterated instruc- 
tions for the test. Subsequent intervals between 
each study trial and its following test trial were 
15 sec, and between each test trial and its sub- 
sequent study trial, 4 sec. 

There were 80 Ss, 56 from the introductory 
psychology course at North Carolina State Uni- 
versity and 24 from a class in educational ps 
chology at Meredith College. A balanced Latin 
square was used in assigning Ss from each popula- 
tion to the treatment conditions. 

Results and discussion. A 2X2X4 (Study 
Trial Repetition X Test Response X Trials) anal- 
ysis of variance was applied to the data for number 
correct on Tests 1-4. Only the effects of trials, 
F (3, 228) = 102.34, p < .001, and the Study Trial 
Repetition X Test Response interaction, F (1, 76) 
= 30.25, p < .001, were significant. Application 
of Duncan's multiple-range test showed that the 
means for Groups RR and WW each differed 
(p < .01) from the means for Groups RW and WR; 
but the RR mean did not differ from that for WW 
(p > .05) and the RW mean did not differ from 
that for WR (p 2.05). The mean correct for 
each treatment on each trial is shown in Table 1. 

In accord with predictions derivable from the 
frequency theory, performance during training was 
affected neither by the item (right or wrong) des- 
ignated on study trials nor by the item to be 
designated on test trials. Performance was affected 
only by whether the item designated during study 
trials was the same as (or different from) the item 
to be designated on the test trials. If it was the 
same (as in Groups RR and WW, for which use 
of Rule 1 was the more appropriate), then per- 
formance was better than if it was different (as in 
Groups RW and WR, for which use of Rule 2 
was the more appropriate). Examination of the 
treatment means on each test shows that the 
between-treatments differences occurred on the 
first test and remained relatively stable throughout 
training. 
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The improvement in performance during training 
may be taken as evidence that during VD train 
“either Rule 1 or Rule 2 can be applied depending 
on the appropriateness of the situation [Ekstrand 
et al., 1966, p. 570]." The present results appear 
to indicate, however, that where use of Rule 1 is 
the more appropriate, performance durin; ning 
(as early as the first trial) may be be 
where the use of Rule 2 is the more appr 


EXPERIMENT IL 


In explaining the results from Experiment | 
was assumed that the difference in frequency unit 
between items of a pair derived from the ! 
of their overt occurrence. In Experinw 
right and wrong items of each pair were p! 
only once on each study trial, S was instruc 
prior to each study trial which item (right or wrong) 
he was to learn (but not which item he was to 
designate on test trials), and was then tested for 
those items or for the other items. If frequency 
units accrue to an item only as a result of its overt 


quency 


occurrence, all groups should perform at chanc 
level since the number of frequency units for the 
2 items of a pair would be equ Four trials 


were run to determine whether performance of any 
of the 4 groups would change with practice. 

Method, The method was similar to that used 
in Experiment | with the following exceptions: 
(a) Ss were instructed prior to each study 
to learn the right (Groups RR and RV 
(Groups WR and WW) item of each j t 
were not told until after the first study tria h 
item they would have to designate on test tr 
(b) on study trials, half of the Ss heard “qi 
right, fast is wrong" (List A) and the rest heard 
“queen is wrong, fast is right" (List B); (©) & 3- 
rather than a 4-sec. rate was used on study and 
test trials; (d) immediately prior to the first test 
and to each test thereafter, half of the Ss in each 
treatment were asked to indicate (by saying "first" 
or "second") the items they had been instructed 
to learn (Groups RR and WW) and the remaining 
Ss were asked to indicate the other item of each 
pair (Groups RW and WR); and (e) 50 sec. inter- 
vened between each subsequent study trial and 
the test that followed, and 24 sec. between each 
test and the study trial that followed. 

The design was 2 X 2 with 20 5s, 12 male and 
8 female, in each treatment. All were under- 
graduates at North Carolina State University. 
Males and females were separately assigned in 
counterbalanced order to each of the treatments. 

Results and discussion. To determine whether Ss 
performed better than chance on Test 1, a l test was 
done comparing the mean correct (11.76) for all 
80 Ss and the mean expected by chance. The 
t (78) = 2.79, p < .01, may be interpreted as in- 
dicating that at the end of the first study trial 
a discriminable difference in frequency units existed 
between the items of each pair. 

An analysis of variance for number correct on 
Test 1 showed a significant effect only for the 
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Study Instructions X Test Response interaction, 
F (1, 76) = 6.94, p < .05. For both main effects, 
F was smaller than 1.00. Duncan's test showed 
no significant differences between the 4 treatment 
means. When, however, the mean for Ss in Groups 
RR and WW was compared with the mean for Ss 
in Groups RW and WR, the difference was sig- 
nificant at the .01 level. 

A separate analysis of variance for number cor- 
rect on each trial during training showed that only 
the effects of trials, F (3, 228) = 53.72, p < .001, 
and the Study Instructions X Test Response in- 
teraction, F (1, 76) = 23.01, p < .001, were sig- 
nificant, The means appear in Table 1. Duncan's 
test showed that the means for Groups RR and 
WW differed from the other 2 means but not 
from one another, and that the RW mean did not 
differ from the WR mean. 

The results from this experiment replicate those 
from Experiment I in demonstrating that having S 
designate the wrong items on the test has the same 
effect as having him designate the right items. 
These results demonstrate also that instructing S 
to learn the wrong items has the same effect as 
instructing him to learn the right items. 

The better-than-chance performance on Test 1 
may be explained by assuming that Ss rehearsed 
covertly those items they were instructed to learn. 
If they did, then the better performance of Ss in 
Groups RR and WW than of Ss in Groups RW 
and WR suggests that performance is better where 
use of Rule 1 is the more appropriate than where 
use of Rule 2 is the more appropriate. This could 
occur if, as in these experiments, S had little time 
to respond and if use of Rule 1 involved fewer 
operations (e.g, "A is more frequent than B, 
select A") and thus took less time than use of 
Rule 2 (e. 'A is more frequent than B, reject A, 
select B"). 

An alternative explanation for these results is 
based on the assumption that Ss rehearse covertly 


“Such behavior for Rule 1 Ss would also result in a dif- 
ference in the acerual of frequency units for the 2 items of 
à pair on test trials. 


those items they expect to designate on the test. 
Since these are the same items that Ss in Groups 
RR and WW were instructed to designate on tests, 
the difference between items of each pair would 
increase during training, Rule 1 would become 
easier to use, and this would be reflected in im- 
proved performance over trials. For Ss in the 
other 2 groups, RW and WR, the results for Tests 
2, 3, and 4 are harder to interpret since Ss were 
instructed to learn one item of a pair, but were 
tested for the other. One possibility is that a 


training proceeded Ss came to expect that they 
would have to designate those items they had to 


designate on previous tests. Thus during Study 
Trials 2, 3, and 4, Ss would rehearse those items 
(rather than those they were instructed to learn) - 
so that Rule 1 would come to be the more ap- 
propriate for Ss in those groups also. The better 
performance by Ss in Groups RR and WW would 
then be due to a greater difference in frequency. 
units between items of each pair in those groups 
than in the other 2 groups. 

The assumption that Ss rehearse covertly those 
items they expect to designate on the tests, can 
also be used to explain the results from Experi- 
ment I. Thus Rule 1 could have been the more 
appropriate for Ss of all 4 treatments of that 
experiment, since all Ss were told at the beginning 
of training the items they were to designate on 
the test and there would be more covert rehearsal 
for those items. The better performance by Ss in 
the RR and WW groups would then be due to 
a greater difference in frequency units between 
items of each pair in those groups than in the 
other 2 groups. 
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ACQUIRED PLEASANTNESS AS A STIMULUS AND A RESPONSE 
VARIABLE IN PAIRED-ASSOCIATE LEARNING ! 


ALBERT SILVERSTEIN * 
University of Rhode Inland 


Two experiments are reported in which nonsense syllables paired with pleasant 
pictures (P-paired syllables) were subsequently learned more rapidly than 
syllables paired with indifferent pictures when they served as responses—but 
not when they served as stimuli—in paired-associate lists, The bulk of the 
superiority of P-paired syllables was in response availability, and only a smaller 
effect was found in association learning. Both effects depended upon Ss learn- 
ing to anticipate responses differing in acquired pleasantness, Superior ratings 
of pleasantness were given to the P-paired syllables regardless of whether 
they were learned as stimuli or responses on the list or whether by anticipation 


training or associative-match training. 


Recent demonstrations that nonsense syllables 
that have been paired with pleasant pictures (P- 
paired syllables) are subsequently learned faster in 
3 ired-associate (PA) lists than are syllables that 
a Es been paired with indifferent pictures (I-paired 
j syllables) have been limited to the case in which the 
© syllables are in the response positions of the list 
1 (Silverstein, 1966, 19728, 1972b). The present study 
i compares the effects of acquired pleasantness of 

nonsense syllables placed in stimulus and response 

] positions of PA lists. 
| The aforementioned studies support the hypothesis 
that the acquired-pleasantness effect is based largely 
upon Sw disposition to implicitly rehearse P-paired 
j syllables more vigorously than l-paired syllables 
- during the response-learning stage. The bulk of the 
d difference between performance with P- and I-paired 
. syllables has been found in the number of trials 
before the first response (whether it is correct or not), 
a measure used to index speed of response learning 
(Underwood & Schulz, 1960). Moreover, instruc- 
tions to contingously pronounce the P- and I- paired 
syllables during the learning task removed the 
superiority of the former (Silverstein, 1972b), Since 
jcs of the stimulus terms are largely 
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syllables in either the stimulus or response positi 
of a PA list. The second task of Experiment 
consisted of standard anticipation learning, while 
that of Experiment I consisted of alternating study 
and test trials of various types. 

Experiment I. 'Thirty-two female and 48 male 
Ss were first given 6 trials of rating the difficulty of 
learning 8 pairs of pictures and nonsense syllables. 
They were told that we wanted to find out how well 
people can judge the difficulty of learning a. pair of 
items às a “unitary pair" and were instructed 10 
mark 9-point scales ranging from “very cary "mw 
“very difficult” that were printed on each page of d 
booklet. The members of each pair were s! ' 
simultaneously for 10 sec., and there was 
intertrial interval. On each trial, half of 
had the syllable on the left side and hall ' 
syllable on the right side. The syllable for coy pat 
was shown on each side on an equal numb | 
but S's task was the same rega. 
the syllable and picture. In the second i > 
and female Se were each randomly divido! > 
groups receiving the syllables from Task 1 asots 
responses to or stimuli for 2-digit numbers for 1: 
anticipation trials, Stimulus and response ters 
were each shown for 5 sec., and there was a 10-« 
intertrial interval, Following this task, all Ss rated 
the syllables for pleasantness on a 7-point scale, 

Materials shown in both tasks were displayed by à 
Kodak Carousel slide projector at a distance of 6 ft. 
from S. Different sets of pictures were wed for 
male and female Ss, having previously been rated 
Ss of the appropriate sex, Four pictures in 
list were in the "very pleasant" range of a 9-point 
scale, and 4 were rated as “indifferent.” All 
tures had very small 


less of the | 


each oup mere ive cach of 2 le m 
in Task 1 
leri d: 


Task 2, such that each syllable was paired 


| 
| 


green ana nde t picture, and each num- 
ber was paired with both a P- and an I paired 
dq Four different random orders of presenta 
used for both lists, with the restriction 
that pleasant and indifferent items appear in first 
and last positions — often, and S» were se 
quentially assigned onder for their first trial, 
Table 1 shows that for both male and female Sx 
i gestr forme or said in percentage 
— eorrect to the ired syllables when they were in 
EUR tar hon vies inex wees te 
‘the stimulus position. Pooling the data for male 
and female Ss produced a significant Pleasantness 
X Position of Syllables interaction, F (1,76) = 5.63, 
p <.05. The superior performance of male Se to 
female Ss was also significant, F (1, 76) = 4.41, 
p <05, but none of the other Fs approached signi- 
» Correlated-means comparisons of the P- 

and l-paired syllables for the various grou 
a marginally significant superiority for de former 
when in the response position, £ (23) = 1,89, p < .10, 
for males, but not for females, The indiferent- 
pleasant difference did not approach significance for 

either of the stimulus position groups. 

No significant influence of acquired pleasantness 
was found In the analyses of the reeponse-learaing 
and asoclation-learni ne m m either as a main 
effect or in interaction with variables (probably 
because of the ened small. overall superiority 


The Popaired syllables 
Welle hewn 


pesti ratings than did the 
pe e 


for all 4 ps, F (0, 76) = 696, 
superiority equally great in the 
stimulus and the response groups. 

Experiment LE. The superiority of P- to PFpalred 
syllables found in Ex, t Í for the response 
position was a weak effect, However, the pairing of 

bles with the pictures in the first. task 
was simultaneous rather than successive as in pre- 
vious experiments! Since the acquired: pleasantness 
effect is presumed to be the consequence of classi 
cally conditioned affective responses to the syllables 


beoeficiary of pleasantness can be advanced 

with certainty only with data from a task that 
completely: n an anticipat n. 

Such =. was sought in x 


In the —Q groups, Se were tested 
either by response recall (in any order) or by asocia: 


ESI Tine orale d 


Postion ed eyllatde 

Measure LI | Mimslus 
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tive matching, both written (the former test being 

sensitive to response learning, while the 

pier is especially sensitive to aeoclative learning), 

rog trained with associative matching were 

only by associative matching and were in- 

cluded to control for any possible effect on similarity 
between test and study trials 

A total of 72 male Se were assigned in randomized 
blocks to the 6 groups, The apparatus, lists, and 
first task were identical to those of Experiment I M, 
except that the number of trials was increased to 
9 in order to strengthen the conditioning of penitive 
affect to the syllables. 

In the ipe given anticipation training in the 
second s S was shown the stimulus term alone 
for S ve, followed by the correct palt for 5 sc, 
with no overt response required. In the groups 
given associative-mateh. training, S was shown the 
stimulus term along with all & response terms for 
5 sec., followed by the correct pair for S see., with 

required. ta both of study 
erent random order of t i 


following study ee A lOs interval eui 
between study and test trial, Fi the second 
task, all Ss rated the nonsense syllables for plessant- 
ness on a T-point scale. 

Table 2 shows the mean number of correct re- 
sponses for both the first test trial and all 4 test 
trials, as well as the mean pleasantnees rat The 
only significant differences between P- and F-paired 
syllables were found when the syllables were in the 
response positions amd when Se had been given 
anticipation training. A surprising result was that 
the P-paired syllables were superiot as response 
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TABLE 2 
EANS OF Major STATISTICS EXPERIMENT IT 
Training-testing sequence 
5 Anticipation- | Anticipation- | Matching- 
SE recall matching matching 
P H R I P I 
Syllable in resi 
M correct. I 
responses 
Trial 1 2.08* 147] 1:92 1.67| 2.08 
AII trials 10.83% | 8,42 | 11.83* | 10,00 | 12.67 
Pleasant- 
ness 
rating 5.05* | 4.10| 4,63*| 3.80 | 5.17*| 4.03 
l 
Syllable in stimulus po: 
| er 
M correct | 
responses | 
Trial 1 1.50 | L33| 1.83 | 2.00| 2.42 
All trials 11.00 |11,75| 11,92 |11.92| 1 
Pleasant- | 
ness | 
rating 4.98" | 3,76] 4.83*| 397| 5.37*| 4.11 
Note. Abbreviations: P = syllables paired with pleasant 


pictures, I — syllables paired with indifferent pictures, 
*p <05. 
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terms to the I-paired syllables even by the associa- 
tive matching test, provided that anticipation train- 
ing had been used. This superiority, however, was 
clearly smaller and appeared later in training than 
that obtained by the response recall test (the latter 
was largest on Trial 1). This pattern is consistent 
with the typical finding that the primary locus of 
indifferent-pleasant differences is prior to the first 
response, whether it is correct or not. Asin Experi- 
ment I, the P-paired syllables were rated as more 
pleasant than the I-paired syllables regardless of 
whether or not PA performance was superior with 
them. 

Discussion. Both of the experiments reviewed 
here confirmed the prediction that acquired pleasant- 
ness affects PA learning when it is manipulated on 
the response side, but not when it is manipulated 
on the stimulus side. The expectation that the 
acquired-pleasantness effect is essentially a response 
effect was based on the hypothesis that the primary 
mechanism for the effect is a disposition to select 
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P-paired items for earlier and more vigorous re- 
sponse rehearsal (Silverstein, 1972a). This hypothe- 
sis received additional support in Experiment II, 
in which response recall was found to benefit from 
acquired pleasantness to a greater extent and earlier 
in training than did associative matching. The 
small superiority found in the associative-matching 
test for P-paired responses suggests a secondary 
mechanism for the acquired-pleasantness effect that 
ope s exclusively within the a ition-learning 
stage. Silverstein (1972a) hypothesized such a 
secondary mechanism after finding a small super- 
iority for P-paired responses in homogeneous (with 
regard to pleasantness) test lists, located exclusively 
within the association-learning stage, and suggested 
that it was based upon P-paired items having ac- 
quired a more reliable capacity to evoke distinctive 
and effective mediators. But even this association- 
learning effect requires active response rehearsal, 
since it appeared only after anticipation training 
and only on the response side of PA learning. 

'The fact that the P-paired syllables were rated 
significantly higher in pleasantness than the I- 
paired syllables in all conditions of both experiments 
demonstrates that the absence of superior perform- 
ance for P-paired syllables in some experimental 
conditions was not the result their having failed 
to acquire affective characteristics. Silverstein 
(1973) has shown that there is often no correlation 
between judgments of affective quality and perform- 
ance in learning tasks. 
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SUBJECTIVE CORRELATION AND THE 
MICHAEL H. BIRNBAUM? 
University of California, San Diego 
MARC KOBERNICK anp CLAIRICE T. VEIT 

University of California, Los Angeles. 
Judgments of the numerousness of patterns of dots vary with the 
background on which the dots are presented. The direction of this judgmental 
illusion depends upon the experimental correlation between background size 
and actual number of dots. 
ousness varies inversely with background size; however, when the correlation 
is negative, the illusion is reversed. The subjective correlations estimated 
from an expectancy-contrast model were found to be a monotonic function of 
the 7 actual correlations used in 2 experiment: 
extend previous findings for the size-numerosit 

An expectancy-contrast model for the size- 


numerosity illusion was proposed and tested by 
Birnbaum and Veit (1973). According to the model, 
the judgment of numerousness reflects an additive 
contrast between the numerousness of the dots 
and an expectancy produced by the context. The 
expectancy is assumed to depend upon the subjec- 
tive correlation between numerousness and back- 
ground size. In simplified form, the model can 
be written: 


J = N — RysB, 


where J is the impression of numerousness, A is 
the numerousness of the dots apart from the back- 
ground effect, Ryg is the subjective correlation 
between numerousness and background size, and 
B is the subjective size of the background. The 
product, RygB, is interpreted as the expectancy 
for numerousness, based on background size. Overt 
ratings of numerousness are assumed to be a linear 
function of the impressions. 

Several implications of the expectancy-contrast 
model have received tentative experimental sup- 
port. First, it should be possible to reverse the 
direction of the illusion by changing the sign of the 
subjective correlation. Second, this interaction be- 
tween background size and correlation is multiplica- 
tive and thus should be located in the bilinear 
component. Third, the effects of background size 
and numerosity should combine additively, inde- 
pendent of the correlation. Although the data 
appeared to be in general agreement with the first 
2 predictions, a small but statistically significant 
discrepancy was obtained in 1 condition for the 
third prediction. 


1Computing funds for this research were supplied by the 
Campus Computing Network, University of California, Los 
Angeles. This paper was completed while the first author held 
a National Institute of Mental Health postdoctoral quee 
at the Center for Human Information Processing, Uni 
of California, San Diego, which provided additional Sipa 
through National Institute of Mental Health Grant MH-15828. 
The authors thank Norman H. Anderson for comments on an 
earlier draft. 

? Requests for reprints should be addressed to Michael H. 
Birnbaum, who is now at the Department of Psychology, 
Kansas State University, Manhattan, Kansas 66506, 


TThis report replicates the previous work, with 
an improved experimental design, to provide larger 
and cleaner effects. The second experiment employs 
more levels of correlation to examine the relation- 
ship between actual and subjective correlation and 
also explores the effect of changing the correlation. 


METHOD 


The general experimental procedure was similar 
to that of Birnbaum and Veit (1973), with several 
improvements. The main differences were as 
follows: (a) greater variation in background size 
relative to the variation in number of dots, (b) 
greater manipulation of the correlation, and (c) a 
longer series of warm-up trials. 

The stimuli were patterns of black dots, 2.5 mm. 
in diameter, on square white cardboard back- 
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Ficure 1. Diagram of the experimental design. (Solid 
Squares represent test stimuli which were the same for all 
conditions. Plus signs represent contextual stimuli for posi- 
tive correlations. Minus signs represent contextual stimuli 
for negative correlation conditions, Note that axes are spaced 
in logarithmic steps.) 
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Panel A presents mean judgments of test stimuli, averaged across 


stimuli, averaged across 


levels of number and plotted as a function of size for each correlation condition 


(Experiment D). 


grounds of varying *. In presenting the stimuli, 
the blank side of the card was shown first, then the 
card was turned over so that .S could see the dots 
for 5 sec. The Ss made their ratings on a 9-point 
scale, with 1 = very very few dots and 9 = very 
very many dots. t 

The experimental design is shown in Figure 1. 
The test stimuli (solid squares) represent a 4 X 4 
(Number X Size) factorial design, in which either 
20, 30, 45, or 68 dots were presented on square 
backgrounds with side lengths of either 10, 15, 
22.5, or 33.8 cm. 

The contextual stimuli were presented to manipu- 
late the overall correlation between background 
size and number of dots. The contextual stimuli 
for the positive correlation (plus signs in Figure 1) 
paired 8-13 dots with background sizes of 5.0-6.5 
cm., and 102-135 dots with background sizes of 
54-60 cm.; thus, small numbers of dots appeared 
on small backgrounds and large numbers of dots 
appeared on large backgrounds. Contextual stimuli 
for the negative correlation (minus signs in Figure 1) 
paired 8-13 dots with background sizes of 54-60 cm., 
and 102-135 dots with background sizes of 5.0- 
6.5 cm., so that small numbers of dots appeared on 
large backgrounds and large numbers of dots on 
small backgrounds, The zero correlation condition 
included half of either type of contextual stimuli. 

In Experiment I, the 21 University of California, 
Los Angeles, undergraduates were divided into 3 
groups of 7 Ss each, with either positive, zero, or 
negative contextual stimuli for each group. Each 
group received an inducing series of 24 randomly 
ordered presentations of contextual stimuli, Fol- 
lowing this series, there were 2 replicates of the 16 
test stimuli randomly interspersed among 48 trials 
of contextual stimuli, giving a total of 104 trials, 


In Experiment If, the ratio of contextual trials 
to test trials (and hence the correlation between 
size and number) was manipulated. The Ss were 
30 University of Cal i Angeles, under- 
graduates divided into 6 groups of 5 Ss cach. Foch 
group received 16 presentations of 
stimuli followed by either 36, 24, or 16 positiv 
negative contextual stimuli interspersed among tb 
basic 16 test stimuli, 

After the first series came the counterinducing 
series, in which 16 contextual stimuli of th« 
sign were presented, followed by the 16 test «t: 
interspersed among 24 contextual stimuli of the 
opposite sign from the first series, 


a, Los 
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RESULTS 


Experiment I. Figure 2A plots mean ratings of 
numerousness as a function of background size, 
with a separate curve for each level of numerosity, 
averaged over correlation conditions, According 
to the expectancy-contrast model (Equation 1), 
the curves in Figure 2A should be parallel, because 
the effects of background size and number of dots 
combine additively. Although the graphical ap- 
pearance is somewhat rough, there do not appear 
to be any systematic deviations from parallelism. 
The parallelism prediction is supported by a non- 
significant overall Background X Numerosity inter- 
action, F (9, 162) = 1.42. Analyzed separately 
for each correlation condition, this interaction was 
also nonsignificant, Fs (9, 54) = 1,39, 1.75, and 
2.01 for the positive, zero, and negative conditions, 
respectively. 

Figure 2B shows mean judgments of numerous- 
ness as a function of background size, with a sepa- 
rate curve for each correlation condition, Con- 


sistent with theoretical predictions, judgments vary 
inversely with background size when the correlation 
is positive and directly when the correlation is nega- 
live, The illusion for the zero correlation group is 
intermediate, but similar in direction to the positive 
correlation, consistent with previous results (Birn- 
baum & Veit, 1973). The interaction between 
background size and correlation is statistically signi- 
ficant, F (6, 54) = 20.23, with about 98% of its 
variance in the bilinear component. The residual 
interaction is nonsignificant (F <1) as were the 
Context X Number, F (6, 54) = 2.11, and the 
Context X Background X Number interactions, F 
(18, 162) = 1.67, These results replicate previous 
findings for the size-numerosity illusion (Birnbaum 
& Veit, 1973). 

The expectancy-contrast model implies that the 
curves in Figure 2B should be linear functions of 
B with slopes proportional to Rwa. The curves 
are very nearly linear functions of the a priori 
levels of background size, which were chosen in 
equal log steps, The slopes (reversed in sign) are 
3.7, 1.7, and —1.4 for the positive, zero, and 
negative correlations, respectively. The present 
illusory effects are much greater than those reported 
by Birnbaum and Veit (1973). The larger illusions 
are probably due in part to the improved experi- 
mental design and the greater manipulation of 
the experimental correlation, Part of this increase 
may be due to a range effect on the category rating 
scale, attributable to the smaller variation in actual 
number of dots relative to the variation in back- 
ground size. 

Experiment I. The values of Rys for the first 
series, estimated from Equation 1, were 5.6, 3,9, 
2.8, —1.1, —1,5, and —1,8 for the 6 conditions, 
in order of decreasing actual correlation. The trend 
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is clearly monotonic. This finding lends further 
support to the notion of subjective correlation by 
showing that Rwa is sensitive to the actual correla- 
tion employed. 

The estimated values of Rwa for the second series 
of Experiment IT showed a small and nonsignificant 
effect of the correlation used in the first series; d 
they were positive when the second correlation was 3 
positive and negative when the second correlation L 
was negative, The intervening series of 16 reversed 3 
contextual trials may have been sufficient to largely 4 
erase the preceding experience. 

Discussion. The fact that the illusion can be 4 
manipulated and reversed through manipulation 
of the correlation supports the expectancy inter- 
pretation of the illusion, The slope for the zero 
correlation condition would be interpreted to imply 
that everyday experience produces a positive cor- 
relation between size and number (Birnbaum & Veit, 
1973). By analogy, these results support an expe 
tancy account of the size-weight illusion (Anderson, 

1970). 

These results replicate and extend the previous 
findings for the size-numerosity illusion, The im- 
proved experimental design appears to produce 
larger and cleaner effects (Experiment I). The 4 
fact that subjective correlation estimated from the 
model is a monotonic function of actual correlation 
(Experiment H) provides further support for the 
expectancy-contrast model of Birnbaum and Veit 
(1973), 
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Three experiments (91, 42, and 68 Ss) were conducted to study spatial stimulus 
ary S response as a function of white noise and 
ation level, Activation was assessed by several verbal report measures. 
creased with noise, but changes were 
apparently unrelated to measured activation level. The results do not en- 
tirely rule out activation as a mediator, and if a relationship ex 
involve more than 1 activation dimension, Arguments are made concerning 
use of the present experimental task and procedure to as 


t could 


ss hypotheses con- 


Several studies have been conducted on spatial 
stimulus generalization with voluntary responses. 
Typically, Ss are trained to respond rapidly to a 


MEAN RESPONSES PER SUBJECT 


I 2 3 cs 5 6 7 
TEST STIMULI 
FIGURE 1, Spatial generalization gradients as 


a function of white noise. 


center stimulus and are tested on successive trials 
when peripheral stimuli are occasionally presented. 
Generalization gradients as a function of stimulus 
proximity are then observed. 

Previous research has focused on generalization 
as a function of such variables as “speed-up” in- 
structions, drive, anxiety, and shock (Harleston, 
1961; Mednick & Freedman, 1960). Activation is 
probably common to all these variables. (See 
Thayer & Moore, 1972, for a further discussion of 
this point.) However, if it is to be argued that 
activation is a mediating variable, it is important 
to provide some kind of direct arousal assessment. 
Otherwise, diverse underlying physiological states 
might be confused witha single activation dimension. 

While physiological activation measurement is 
quite complicated, the development of the Activa- 
tion Deactivation Adjective Check List (AD ACL; 
Thayer, 1967) offers an easy way of determining 
multidimensional activation changes from experi- 
mental manipulations. The AD ACL is composed 
of a group of activation-descriptive adjectives which 
‘Ss use to rate their momentary activation feelings. 
The adjectives have been factor analyzed into 
4 independent dimensions: General Activation (e.g., 
lively, active, full of pep), High Activation (e.g., 
clutched up, jittery, stirred up), General Deactiva- 
tion (e.g. at rest, still, leisurely), Deactivation- 
Sleep (e.g., sleepy, tired, drowsy). These activation 
dimensions were shown to substantially correlate 
with physiological measurements and to be sensitive 
to a variety of activation-inducing conditions. (See 
Thayer & Moore, 1972, for a current review of 
validation research.) 

The purpose of the present research was twofold. 
‘The first purpose was to study spatial stimulus gen- 
eralization with voluntary S response as a function 
of activation and to assess activation directly with 
the AD ACL. A second research purpose was to 


SHORT REPORTS 


study the effects of white noise as an influ n 
generalization. Noise is associated with ax 1 
effects, but previously has not been show 
crease generalization. 


EXPERIMENT I 


Method. The Ss were 91 male and fema) 
ductory psychology students. The apparat 
cluded 4 visual displays constructed in a 
pattern, thus accommodating 4 Ss at a rur 
display board was straight and included a hc: 
row of 7 white 6-v. lamps, 14 in. apart, an i 
6-v. warning lamp 22 in. above the cent 
lamp. The Ss sat in fixed chairs so their « 
approximately 46-in. from the center w! 
Responses to the stimulus lights were 1 
presses of a noiseless button. Either 40, 60, o = 
db. of white noise was presented by a Gras 
Stadler Model 901B white noise generator and 
S-in. speaker equidistant from each S, The exper: 
ment was conducted in a soundproof room. 

Using a between-Ss design, 31 Ss (balanced for 
sex) were tested in 40- and 60-db. conditions, and 
29 Ss were tested in the 80-db. condition, Tho Ss 
were given 5 practice trials and 25 conditioning 
trials, during which the center lamp was flashed 
every 10 sec sec. duration), The test trials 
followed immediately and included 7 presentations 
of each peripheral lamp and 10 booster presentations 
of the center lamp. con- 
ditioning and test presentation, a 
flashed (.3-sec. duration). The 
center presentations were distributed thr 
52 test trials in 1 of 3 random orders. The A} 
was given to each S at the end of the test seric: 

The Ss were told that this was a reaction i: 
experiment, and that they must press the button : 
fast as possible whenever the center lamp was lit. 
"They were also told not to be concerned about 
errors, but to try to press only when the center 
lamp was lit. 


Five seconds before ever 


periph 


TABLE 1 


MEANS AND STANDARD DEVIATIONS (IN PARENTHESES) OF 
PERIPHERAL ERRORS (PEs) AND CENTER Responses (CRs) 
AS A FUNCTION OF WHITE Norsk 


Noise level and response measures 


Experi- 
ment 40-45 db. 60 db. 80 db. 

PE PE CR PE CR 

I 6.1 9.5 26.8 9.0 25.3 
(3.9) (5.0) (5.4) | (4.1) | (5.7) 

H 5.7 6. 28.8 
(3.4) (4.0) | (4.5) 

Ir 4.8 7.9 30.1 
(3.4) (6.6) (3.8) 


Note. Half of the center responses in Experiment, II were 
multiplied by 1.66 to equate frequency of conditioning trials 
across counterbalanced conditions. 


* In this and the 2 subsequent experiments, noise was pre- 
sented continuously throughout conditioning, testing, and AD 
ACL administration, 
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Results, The response measures were defined as 
follows: Peripheral errors occurred during the test 
period when S responded to a peripheral light as 
though it were a center light; center responses were 
correct responses to the center light during the con- 
ditioning and test periods. 

Figure 1 indicates the shape of the generalization 
gradients for the 3 noise conditions. It can be seen 
that gradients for 40 db. are flatter than those for 
qu and 80 db., particularly from Test Stimuli 1, 2, 

6, and 7. 

' Table 1 presents the means and standard devia- 
tions of peripheral errors and center responses in 
each of the noise conditions. Analysis of variance 
in peripheral errors showed the expected significant 
effect of intensity, F (2, 88) = 5.34, p < .05. Post 
comparison tests showed significant differences in 
the 40:80 and 40:60 db. comparisons, /s (58, 60) = 
2,79 and 2,94, 

While the above results indicate increased gen- 
eralization as a function of noise, changes might be 
due to greater responsiveness to all stimuli, includ- 
ing center stimuli, or primarily to peripheral- 
stimuli responsiveness. Analyses of center responses 
showed no significant differences, though the re- 
sponses did increase as a function of noise. A com- 
parison of the strength of effects (omega squared) 
for the 40:80 db. conditions showed that noise 
counted for 10% of the peripheral-error variation, 
and 4% of the center-response variation, 

The anticipated reported-activation differences as 
a function of noise were not obtained, nor was there 
a correlation between errors and activation. The 
differences across noise conditions were small and 
inconsistent. There was some tendency for the AD 
ACL General Activation subtest to be related to 
peripheral errors within each condition, but the 
relationships were weak. 


EXPERIMENT II 


Since Experiment I showed noise effects on gen- 
eralization, but highly variable activation reports, 
an attempt was made to further investigate the 
activation relationship with somewhat less error 
variance, Thus, the manipulations of Experiment I 
were repeated in a within-Ss design. 

Method. Forty-two male and female Ss from in- 
troductory psychology cla. were run in a counter- 
balanced design (half received 45 db. first and half, 
80 db. first). The apparatus and materials were the 
same as those described in Experiment I, except 
that tests were not conducted in a soundproof room. 

'The Ss were again given 25 conditioning trials 
and 52 test trials, followed by the AD ACL. They 
then received 15 more conditioning trials, a second 
set of 52 test trials, and another AD ACL. An 
attempt was made to duplicate most of the condi- 
tions in Experiment I; however, 45 db. was chosen 
instead of 40 db. because of a higher uncontrollable 
ambient noise level in the test room, 

Results, Consistent with Experiment I, peripheral 
errors increased with noise (Table 1), / (41) = 1.69, 
b «.10. Center responses also increased slightly. 
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Because of the within-Ss design employed, it is 
somewhat difficult to estimate strength of indepen- 
dent variable effects (omega squared), but the 
most conservative estimate would be that noise 
accounted for 2% of peripheral-error variation, and 
essentially zero center-response variation. 

The AD ACL subtests General Activation and 
Deactivation-Sleep apparently were not influenced 
by noise, but substantial noise effects were observed 
for the High Activation and General Deactiva- 
tion subtests, / (41) = 2.53, p < .02; t (41) = 4.42, 
b «.001. Although the activation measures indi- 
cated a noise effect, the activation change scores 
were not correlated with the perceptual-performance 
change scores. 


EXPERIMENT III 


Since the generalization differences in Experi- 
ments I and II apparently were not related to re- 
ported activation, the present experiment was de- 
signed to investigate alternative individual-difler- 
ence measures. To investigate the possibility that 
noise was creating activationlike changes not sensed 
by the AD ACL, 2 anxiety trait measures were in- 
cluded. Also, Ss were asked to rate and verbally 
describe their reactions to the noise. 

Method. As in Experiment I, the experimental 
design was a between-Ss type; it included one 45- 
and one 80-db. condition. Sixty-eight male and 
female introductory psychology students were used. 
The apparatus and procedure were the same as in 
Experiment I, except that Ss were again tested in 
a nonsoundproof room, 

In addition to the AD ACL, the present experi- 
ment employed the Manifest Anxiety Scale, a 
shortened version of the S-R Inventory of Anxious- 
n and 2 5-point rating scales on which Ss could 
rate the extent that noise generally disturbed their 
concentration and, specifically, the degree to which 
the experimental noise influenced their performance. 
Finally, one question requested a paragraph de- 
scription of how the experimental noise disturbed 
performance. 

Results, Similarly to Experiments I and II, high 
noise resulted in increased peripheral errors (Table 
1), t (66) = 2.42, p < .01. Center respon: also 
increased, but not to a statistically significant 
degree. The omega-squared index indicated that 
7% of the variation in peripheral errors could be 
accounted for by noise, while less than 1% of the 
center-response variation was due to noise. 

The AD ACL High Activation subtest may have 
been influenced by noise, £ (66) = 1.69, p <.1, 
but the other subtests showed small changes. 
Again, none of the AD ACL scores was reliably re- 
lated to peripheral errors or center responses. 

Ratings by Ss indicated greater interference from 
the 80- than the 45-db. condition, ¢ (66) = 2.80, 
p <.01, but these ratings were not correlated with 
errors. The other individual-difference measures 
did not predict peripheral errors or center responses 
in any reliable and consistent way. Only 56% of 
the Ss in the 80-db. condition indicated that noise 


interfered with their reactions to the light. Al- 
though there was no obvious consistency in Ss’ 
verbal explanations of how the sound interfered, one 
provocative note is that several Ss said the noise 
interfered by making them tense and nervous, 
while several others said it interfered by making 
them sleepy and drowsy. 

Discussion, Consistent results from all 3 experi- 
ments clearly demonstrate that loud noise (i.e, 80 
db.) increases spatial stimulus generalization, Fur- 
thermore, this effect is primarily due to increased 
peripheral errors, not increased general responsive: 
ness, Comparisons of the strength of effects of 
noise between peripheral errors (incorrect responses) 
and center responses showed that the peripheral- 
error changes were substantially greater than center 
response changes in all 3 experiments, The influence 
of noise on generalization seems to be the same as 
special instructions continually prompting Ss to 
speed their responses (Harleston, 1961), and it may 
be equivalent to such variables as anxiety and drive 
(Mednick & Freedman, 1960). 

However, contrary to expectations, stimulus gen- 
vralization was not obviously related to 
activation, Although the AD ACL has been shown 
to substantially correlate with psychophysiological 
measures and to reliably ict activation-inducing 
independent variables yer, 1967; Thayer & 
Moore, 1972), the activation measures did not 
consistently predict the observed perceptual-per- 
formance effects, This is true even though certain 
AD ACL subscales apparently monitored noise 
effects reliably in one experiment, Other activation- 
related individual difference measures, particularly 
2 measures of anxiety, did not predict generalization 
effects either, 


Nevertheless, the similarity of the t noise- 
generalization effects to previ demon- 
strated effects caused by instructions sug- 


gesta the common construct of activation. Several 
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states has been demonstrated elsewhere (1 
Moore, 1972), and it creates substantia! « 
tions in predicting behavior without use 
priate individual-difference measure 

The present experimental task avd ; 
an interesting possibility for testing t 
noise and activation level on attentional » 
Broadbent (1971) has proposed that inv 
arousal result in the focusing of attentis 
peripheral to central cues when peripheral o 
unimportant, 

In order to demonstrate selectivity in t 
results, it would be necessary to show hi 
responsiveness to the center stimulus and 
duced responsiveness to peripheral lights unde ù 
activation conditions, Although loud ‘ 
produce more responses to the center stonu 
all 3 experiments, there was not reduced respon l 
ness to peripheral cues; rather, as described ntx 
prid ree ae with Ca wod 
greater degree than center responses, " 
results appear to be contradictory to Broutlbent's 
(1971) expectations, However, 2 points lead to 
caution in direct application of these rool. First, 
the nature of the present experimental task ow 
have introduced confounding psychomotor perlore 
mance effects in addition to the expected percept 
attention effects, Second, Broadbent and his aw 
sociates have used higher noise levels to produce 
arousal than those employed in the present research, 


* The authors wish to thank Keith (or 
the relationship be 
and att welectivity, 
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Ratings of the likableness of persons described by 2 adjectives showed con- 
sistent violations of additive and Pearman 2 t averaging models: The 
effect of either adjective varied directly with the likableness of the other 
adjective, Monotonic rescaling could Rs oe interactions, raising the 
theoretical question of whether Interactions were due to nonlinearity in the 


of hypothetical persons, each person described by 2 adjectives, were ondinally 
inconsistent with additive models, confirming the interpretation that the 
interactions are "real" and should not be mald away. Theoretical and 
methodological implications are discussed. 


This research is concerned with how recently received new analytic attention 

impressions of personality are formed. (Anderson, 1968b, 1971, 1972, 1974, in 

This topic, introduced by Asch (1946), has press). The theoretical problem in to ex- 

T PTh at article le is used on a dimwertation submitted eee un ne is ——— to rem 

E tel rear E re 2 je a overall esc In the majority of the 

LI t u jui it» for t d 

ow [ Pimper nog doge dae research to be reported here, S's task was 

Education Act Tithe IV to read 2 adjectives, imagine a person who 

from would be described by both adjectives, 

and rate how much he would like such 
à person, 

Figure | provides a schema for the 
analysis of impression formation. The 2 
stimulus adjectives are referenced by the 
indices, 4 and j. The psychological rep- 
resentations of the adjectives, s, and J» 
are combined by the integration function, 
1, to form the psychological impression, 
vu, which is transformed by the response 
function, J, to the overt response, Ri» , 

"There are 3 problems to be solved: 
(a) finding the appropriate stimulus rep- 
resentation (5), (6) determining the inte- 
gration function (J), and (c) finding the 
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FIGURE 1. Schema for discussing research on 


information integration. 


relationship (J) between the overt re- 
sponses and the integrated impressions. 
Any theory of impression formation must 
deal, at least implicitly, with all 3 of these 
problems. 


ADDITIVE AND CONSTANT-WEIGHT 
AVERAGING MODELS 


A simple linear model provides an 
example of such a theory. The adjectives 
are assumed to be represented as single 
values on a likableness continuum. The 
I function, defined on all adjective com- 
binations, would be a linear combination 
of the values of the adjectives: 


[1] 


where w and ws are the weights reflecting 
position in the set, s; and s; are the scale 
values of the adjectives, and woso is the 
postulated initial impression, reflecting 
what the impression would be in the ab- 
sence of information. 

Anderson (1962) pointed out that if sets 
of adjectives were constructed according 
to factorial designs, then the analysis of 
variance test for interactions provides a 
powerful test of Equation 1. If the ad- 
jectives were to change their meaning non- 
linearly in combination, if the 7 function 
was nonadditive, or if the judgment func- 
tion (J) was nonlinear, then significant 
interactions would be expected. If they 
were nonsignificant, there would seem to 
be no reason to postulate such complica- 
tions. The data for the majority of 
Anderson's (1962b) 12 Ss appeared to be 
in rough agreement with Equation 1. 


Vj; = Woso + wis; + wssj, 
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The constant-weight averaging model (An- 
derson, 1968b) is a special case of Equa- | 
tion 1 that requires the weights to sum 
to 1; the additive model places no restric- 
tion on the sum of the weights. Critical 
tests of averaging vs. adding models (in ` 
which the number of items in the set are 
varied) favor the averaging formulation | 
over the additive (Anderson, 1971, 1974, 
in press). However, when the number of 
items in the set is held constant, as in 
the present experiments, the models are 
equivalent. Therefore, the present article 
uses the term additive to include both 
additive and constant-weight averaging 
models. 

Anderson's (1968b) review of the evi- 
dence for the constant-weight averaging 
model concluded that 


The model always fits the data quite well, but 
there are almost always small, significant discrep- 
ancies. Inspection of the data has failed to reveal 
the origin of the discrepancies; they may reflect 
some fundamental error in the model, or they 
may result from remaining shortcomings in the 
experimental technique [p. 736]. 


Recent studies have shown large dis- 
crepancies from the constant-weight aver- 
aging model for several information inte- 
gration tasks. Interactive models have 
been proposed to account for these non- 
additivities (Anderson, 1972; Birnbaum, 
1972a, 1972b, 1973; Birnbaum, Parducci, 
& Gifford, 1971; Lampel & Anderson, 
1968). However, there are 2 possible 
interpretations of the discrepancies that 
have been observed in these studies: (a) 
the integration (I) of information is not 
an additive or simple averaging process, 
or (b) the judgment function (J) is non=- 
linear. The next section shows how 
previous conceptualizations cannot dif- 
ferentiate these alternatives. 


MopeL TESTING AND MEASUREMENT 
Assuming Validity of Responses 


When the J function is assumed to be 
linear, models of impression formation can 
be tested by comparison of theoretical 
predictions with the raw data. Functional - 
measurement (Anderson, 1970) finds the 
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stimulus representation in accord with the 
model to be tested. These stimulus pa- 
rameters are then used as the basis for 
statistical tests of fit. If either the model 
or the response scale was incorrect, it 
would show up as a significant discrepancy. 
When the data fit the model, the fit is 
usually interpreted as joint support for 
both the model and the response scale. 


Ordinal Tests 


When the J function is only assumed 
to be strictly monotonic, it becomes more 
difficult to discriminate among different 
models, Conjoint measurement analysis 
(Krantz & Tversky, 1971) describes con- 
ditions that ideal data would have to 
satisfy to be ordinally consistent with the 
theory. For example, crossover interac- 
tions would be ordinally inconsistent with 
additive models, for no monotonic trans- 
formation could make the data fit the 
model. In this case, everyone agrees that 
the model should be rejected. It should 
be noted that ordinal violations of ad- 
ditivity will show up as significant inter- 
actions in the analysis of variance. The 
problem arises when significant interactions 
occur in the absence of ordinal violations. 


Assuming Validity of the Model 


When the additive model is assumed to 
be valid, then the analysis of variance 
tests the linearity of the J function. 
A nonlinear J function will produce sig- 
nificant interactions even though the un- 
derlying integration is additive. In the 
absence of ordinal violations of the model, 
it is possible to find a monotonic trans- 
formation, J+, which rescales the data to 
additivity. 


When Both Model and Response Scale are 
in Doubl 


Krantz, Luce, Suppes, and Tversky 
(1971, p. 445) have taken the extreme 
view that when discrepancies from ad- 
ditivity can be removed by monotonic 
transformation, they should be attributed 
to nonlinearity in the response scale, rather 
than to nonadditivity of the integration 


function. However, when a monotonic 
transformation would bring otherwise con- 
tradictory data into line with the model, 
the status of the model remains uncertain. 
This procedure assumes the validity of the 
model; hence, the existence of such a 
transformation does not mean that the 
model is validated. To resolve this diffi- 
culty, it is necessary to place transforma- 
tions within the scope of psychological 
theory and to provide additional con- 
straints which determine their appropriate 
application. 


ADVANCES IN MODEL TESTING 


These 4 experiments provide a progres- 
sive sequence that systematically elimi- 
nates the additive or constant-weight 
averaging models. The experiments illus- 
trate novel approaches to model testing 
that remove the difficulty of deciding 
whether or not to rescale the data. Since 
these techniques will be of interest to 
psychologists in many areas, they are 
outlined briefly below. 

The first 2 experiments apply criteria 
of stimulus and response scale consistency. 
Experiment I assumed the validity of a 
priori values for the adjectives obtained 
in previous work (Anderson, 1968a) and 
required that the integration model yield 
scale values that are linearly related. 
Experiment II investigated the effects of 
different response procedures, thought to 
produce different J functions. By a prin- 
ciple of convergent operationism (Garner, 
Hake, & Eriksen, 1956), if similar inter- 
actions are obtained with different response 
procedures (operational definitions of Y), 
then the interpretation. that the inter- 
actions are "perceptual" is enhanced. 

Experiment II illustrates how the si- 
multaneous evaluation of 2 or more inte- 
gration processes can provide the leverage 
to define the concept of an appropriate 
transformation (Birnbaum, 1972b; Birn- 
baum & Veit, 1974). Stimulus scale 
invariance requires that the scale values 
(s in Figure 1) be independent of task. 
Response scale invariance requires that the 
J function be independent of task. 

In Experiment III, Ss performed 2 in- 
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tegration tasks, rating the difference in 
likableness between the 2 adjectives as 
well as the likableness of a person de- 
scribed by the combination. Since the 
same stimuli and response scale were 
employed for both tasks, response scale 
invariance requires that the same J-in- 
verse transformation be applied to both 
tasks. The stimulus scale convergence 
criterion defines an appropriate rescaling 
of the combination ratings as one which 
both makes an hypothesized model fit the 
data and leads to the derivation of scale 
values that agree with those derived from 
the difference task. 

Although rescaling of the data might 
make the model fit a single set of data, 
the transformation that reduces the inter- 
actions implies which psychological dif- 
ferences are greater than or equal to 
others. Experiment IV obtained direct 
ratings of these differences, and it tested 
whether these ratings are qualitatively con- 
sistent with the transformation that makes 
the data additive. 

Experiment IV required only the ordinal 
information in the data to reject the ad- 
ditive and constant-weight averaging mod- 
els. This leverage was provided by a 
compound integration task in which .Ss 
rated the difference in likableness between 
pairs of hypothetical persons, each de- 
scribed by a pair of adjectives. The ratings 
can be rescaled to fit the subtractive 
model; this rescaling may or may not 
make the additive model of impression 
formation fit. Thus, difference judgments 
can be used as a basis for response re- 
scaling to provide a scale-free test of the 
constant-weight averaging model of im- 
pression formation. 


EXPERIMENT I: TEST OF ADDITIVE 
MopErs 


The first experiment was designed to 
uncover the nature of the discrepancies 
previously observed. Although the basic 
procedures were similar to those of the 
research described by Anderson (1968b), 
these were modified to permit a clearer 
assessment of the expected interactions. 
Thus, only 2 factors were employed, but 
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each was represented by a greater number 
of levels covering a wider range. This 
permitted a greater variation of within- 
set range. 


Method. 


The Ss were presented with pairs of personality- 
trait adjectives and were instructed “your task is 
to imagine a person who would be described by 
both of the traits and judge how much you would 
like such a person." The Ss recorded their judg- 
ments in numerical form, using 1 of 9 ratings for 
each pair: 1 = dislike very very much, 2 = dislike 
very much, 3 — dislike, 4 — dislike slightly, 5 
neutral (neither like nor dislike), 6 — like slightly, 
7 = like, 8 = like very much, and 9 = like very 
very much. 

Subjects. The Ss were 300 University of Cali- 
fornia, Los Angeles, undergraduates fulfilling a re- 
ment in introductory psychology. One hun- 
different Ss served in each replicate of the 
experiment. 

Stimuli. 


The adjectives were taken from Ander- 


et of 25 adjective pairs produced from a 5 X 5 


(Adjective A X Adjective B) factorial design. The 
5 levels of likableness of each adjective factor 
were separated by steps of approximately 1.28 on 


Anderson's 0-6 normative scale. The adjectives 
are printed (with normative values in parentheses) 
in the margins of Figure 2. 

Procedure. The 25 adjective pairs were printed 
in 1 of 6 random orders on the same page with 
the instructions and the labeled response scale. 
"The adjectives in each pair appeared side by side, 
with the adjective from Factor A on the left for 
half of the forms. The Ss were instructed to read 
through the entire list before beginning to record 
their ratings. 


Results and Discussion 


Additive and constant-weight averaging 
models. The 3 panels of Figure 2 show 
the mean judgment of each adjective pair, 
averaged over all Ss within the replicate. 
The mean judgments are plotted as a func- 
tion of the normative value for the ad- 
jective from Factor A, with a separate 
curve for each adjective from Factor B. 
The slope of the curves represents the 
effects of the Factor A adjective. The 
vertical differences between the curves 
represent the effects of the Factor B 
adjective. According to additive or con- 
stant-weight averaging models, the curves 
in each panel should be parallel; that is, 
the effect of one adjective should not 
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(Anderson's, 1968a, normative values for individual adjectives are listed 
above and to the right of each panel.) 


depend upon the particular adjective with 
which it is paired. Instead, the curves in 
each panel diverge to the right; the effect 
of either adjective appears to be propor- 
tional to the likableness of the other ad- 
jective describing the same person. 

This divergence was characteristic of the 
data for most of the individual Ss: 88%. 
88%, and 89% of the Ss in the respective 
replicates showed this form of interaction. 
In Experiments I-IV between 85-95% of 
the single Ss resembled the group data. 
The A X B interactions are highly sig- 
nificant for each of the replicates; / values 
for these interactions are indicated in the 
panels of Figure 2. Of course, the F is 
much larger when the data from all 3 
replicates are combined, Æ (16, 4752) 
= 72.80. If the response scale of this 
experiment is assumed to be valid, the 
interactions shown in Figure 2 can be 
interpreted as evidence against the ad- 
ditivity of impressions. 

Although the A X B interaction varies 
between stimulus replicates, F (32, 4752) 
— 8.28, this higher order interaction is 
small in comparison with the similarity 
of the nonadditivities. There are, how- 
ever, a few peculiarities that appear to 
depend upon the meanings of the particular 
adjectives. For example, SELF-CONFIDENT 
& maticrovs is less likable than suy & 


MALICIOUS although SELF-CONFIDENT is 
more likable than suy in combination 
with other traits. A SELF-CONFIDENT, 


malicious person may be perceived as more 
likely to carry out malicious actions than 
a suy one. But the divergent interaction 
occurs in all 3 replicates and represents 
the main source of difficulty for the ad- 
ditive models. 

Multiplicative model. The multiplicative 
model, W;; = sisj, predicts that the curves 
in each panel should be a diverging fan 
of linear functions. The interaction should 
be located entirely in the bilinear com- 
ponent (Anderson, 1970). Although the 
major portion of the variance of the inter- 
action (81.6%, 56.2%, and 86.9% for the 
3 replicates) was located in the bilinear 
component, the residuals were also statis- 
tically significant, Fs (15, 1485) = 6.36, 
20.68, and 3.26. The nature of the dis- 
crepancy can be seen most clearly in the 
second panel of Figure 2. According to 
the multiplicative model, the curves should 
be linear; instead, the upper curves are 
negatively accelerated relative to the lower 
curves. The other 2 replicates show the 
same discrepancy, but to a lesser degree. 

Range model. The range model was fit 
to the data (averaged over replications) 
following the general procedures outlined 
in Birnbaum et al. (1971), and using 
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Anderson's (1968a) normative values as 
estimates of the scale values. The range 
model, using just one additional parame- 
ter, accounts for some 80% of the variance 
left unaccounted for by the constant- 
weight averaging model. 

Averaging model with differential weights. 
The mean absolute discrepancy was only 
-05, indicating a very good fit to 15 points 
using 10 parameters. The best-fit estimates 
of scale values for the 5 levels were 1.49, 
3.02, 5.24, 6.48, and 7.83; the best-fit 
weights were 1.92, 1.18, .67, .73, and .90. 
Fewer parameters are required with the 
assumption of the validity of Anderson's 
(1968a) values and by estimating the 
weights as a polynomial function of scale 
value. Although this procedure uses more 
parameters than the range model, the fit 
was not as good. 

Rescaling. Interactions in the analysis 
of variance might be due to a nonlinear 
relationship between the impressions of 
likableness and the overt responses of 
the Ss. Thus, the actual impressions may 
be additive, but the overt responses may 
be related only ordinally to the theoreti- 
cally correct response scale. The pos- 
sibility of response scaling was investigated 
using 2 techniques of data transformation. 

The first technique assumes that Ander- 
son's (1968a) normative values for the 
single items are appropriate estimates of 
the scale values and that judgments of 
pairs of items of equal value should be 
linearly related to the judgments of the 
single items (Birnbaum, 1972a) The 
transformation is then estimated as a poly- 
nomial by a least squares criterion. The 
advantage of this technique is that al- 
though it is capable of producing a radical 
rescaling, it will not eliminate "real" inter- 
actions that depend upon differences in 
within-set range, if the a priori scales are 
appropriate. As can be seen in Figure 2, 
the judgments of pairs of minimal within- 
set range (connected by dashed lines) are 
already nearly linearly related to norma- 
tive ratings of single values. If anything, 
this function (treated in this procedure as 
the J function of Figure 1) is slightly 
negatively accelerated, so that the rescaling 
that makes it linear would actually in- 
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crease the magnitude of the divergent 
interaction. 

If the divergent interaction is real, then 
the distribution of psychological impres- 
sions is somewhat positively skewed (as 
given by the projection of the data points 
on the ordinate of each panel of Figure 2). 
Positive skewing would lead to a nega- 
tively accelerated judgment function 
according to  range-frequency theory 
(Parducci & Perrett, 1971). This sug- 
gests that contextual effects operating on 
the J function would tend to counteract 
the effects of true interactions. Birnbaum 
et al. (1971, Experiment V), demonstrated 
that manipulation of the frequency dis- 
tribution of physical means of sets of 
psychophysical stimuli influences the form 
of the interaction between the components 
in a manner predictable from range-fre- 
quency principles of judgment 

The second transformation procedure 
assumes that the integration is additive 
(or constant-weight averaging) and at- 
tempts to transform the data to fit the 
additive model (Kruskal, 1965). The 
monotone analysis of variance (MONA- 
NOVA) computer program (Kruskal & 
Carmone, 1969), applied to the mean 
judgments, greatly reduced the percentage 
of total variance in the interaction (from 
5.0% to .4%). Thus, the data appear 
roughly consistent with the constant-weight 
averaging or additive model at an ordinal 
level. The fact that the data can be 
transformed to fit raises a difficult theo- 
retical problem: Is the nonparallelism in 
Figure 2 due to nonadditive integration 
of information or to a nonlinear response 
function ? 

If the additive model is assumed to be 
correct, the MONANOVA analysis indi- 
cates that the judgments are a positively 
accelerated function of the impressions. 
Additionally, it would mean that the scale 
values for adjectives presented in pairs 
are a negatively accelerated function of 
Anderson's (1968a) values for the same 
adjectives presented singly. The positively 
accelerated function for J derived from 
the MONANOVA analysis can be inter- 
preted by range-frequency theory (Par- 
ducci & Perrett, 1971) to indicate that 
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the psychological distribution of stimuli is 
negatively skewed. Kanouse and Hanson 
(1972) have also hypothesized that the 
distribution of evaluative stimuli is nega- 
tively skewed, but based their arguments 
on other considerations. 

In short, Experiment I demonstrates 
that ratings of likableness are inconsistent 
with the additive models. However, 2 in- 
terpretations are consistent with the data. 
The first assumes that the ratings are 
valid measures of psychological impressions 
and concludes that the integration of in- 
formation is nonadditive. The second as- 
sumes the validity of the constant-weight 
averaging (or additive) model and con- 
cludes that the responses are a positively 
accelerated function of the impressions. 
Both interpretations can account equally 
well for the data of Experiment I. Con- 
sequently, the following experiments were 
designed to discriminate them. 


EXPERIMENT ll: VARIATION OF 
RESPONSE SCALES 


The earlier experiments yielding data 
more consistent with the additive models 
have used other procedures for obtaining 
S's response (see Anderson, 1974, in press). 
The interactions observed in Experiment I 
may be due to nonlinearity in the J func- 
tion that depends upon the particular 
procedure S uses to indicate his impres- 
sions. Therefore, Experiment lI tested 
the additive models using several different 
procedures for responding to assess whether 
the interaction obtained in Experiment | 
is specific to the 9-category rating scale 
used in that experiment. Each of the 4 
conditions of the present experiment tested 
a different interpretation of how the in- 
teractions might depend upon the method 
for responding: reversing the scale, end- 
point anchoring with 20 categories, line- 
mark responses, and matched pairs. 

Although the matched pairs procedure 
has not been used before, it has the ap- 
parent advantage that it does not require 
a metric response from the S. Thus, this 
procedure avoids the argument that metric 
responding may induce Ss to integrate 
separate implicit responses to the indi- 


vidual adjectives rather than forming an 
overall impression before responding. 


Method 


As in Experiment 1, Ss read pairs of personality- 
trait adjectives and judged how much they would 
like hypothetical persons who would be described 
by both adjectives. The conditions differed with 
respect to the instructions for the response. Each 
set of instructions was used in a separate small 
experiment, with different Ss. 

For the reversed scale, 1 = like very very much 
and 9 = dislike very very much. The Ss were 
consequently instructed to rate how much they 
would dislike the hypothetical persons. The 20 Ss 
rated the adjective pairs of both Replicates 1 and 2 
of Experiment I. 

The 20-category scale was anchored by instruc- 
tions that 1 represented the likableness of a person 
who would be described by MEAN, PHONY, MALI- 
OBNOXIOL 20 represented the 
likableness of SINCE NTELLIGENT, UNDER- 
STANDING, and Di The Ss were in- 
structed to judge each adjective pair relative to 
these end anchors and to ign a numeral be- 
1 1 and 20 to represent the appropriate position 
relative to the end values. The 34 Ss rated the 
25 adjective pairs of Replicate 2. 

For the linemark response, Ss were instructed 
to indicate each judgment of likableness by making 
a short vertical mark on a line so that the length 
between the margin and the mark would be pro- 
portional to the likablene The 46 Ss in this 
condition judged the adjective pair of Replicate 2. 

The list for the matched pairs procedure was 
constructed from Anderson's (1968a) normative 
data. Each pair contained 2 adjectives of equal 
scale value. The 22 pairs covered the baseline 
range .35-5.60, in .25-category steps. The 50 Ss 
were instructed to judge each of the adjective 
pairs of Replicate 2 by selecting the pair of ad- 
jectives from the list of 22 pairs, “most nearly 
equal in likeableness to the pair you are judging." 
The value of S's response was taken to be the 
normative value of the adjective pair selected by 5. 
"The 22 pairs are listed in Birnbaum (1972b). 

Supplementary scaling. These 22 adjective pai 
and the 30 adjectives used in Experiment | were 
printed in random order. Each of 100 5s judged 
3 aspects of each adjective or pair: (a) the likable- 
ss of a person who would be described by the 
adjective or pair, (b) the activity of such a person, 
and (c) the range of likableness of persons who 
would be described by the adjective(s). The Ss 
judged all of the adjectives on one aspect before 
proceeding to the next task.  Nine-point scales 
were used for all 3 scaling tasks, with 9 — like 
very very much, very very active, or very very 
wide range of possibilities. 


Results and Discussion 


Figure 3 shows the mean responses, 
averaged across Ss, in each condition. 
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ment II. (Each panel shows results for a different response procedure.) 


Additive and constant-weight averaging 
models again predict parallel curves for 
each panel. Instead, the interactions are 
all of the same general form (divergence) 
as in Figure 2, and the F values listed in 
each panel show that they are all highly 
significant. 

The ordinate of the upper left-hand 
panel of Figure 3 has been relabeled for 
direct comparison with Figure 2. The 
data show the same interactions, and 
ratings of "dislikableness" appear to be a 
linear function. (with negative slope) of 
ratings of "likableness." Since reversing 
the scale did not reverse the interaction, 
the deviation from additivity cannot be 
attributed to anything like preference for 
smaller numbers. The interaction for the 
anchored scale (upper right-hand panel) 


shows the same divergent form as those 
shown in the other panels and is highly 
significant. The percentage of variance 
associated with the interaction is actually 
larger for the linemarks condition than for 
any of the other conditions. The posi- 
tively accelerated relationship between 
linemarks and category ratings may ac- 
count for the larger interaction. 

The matching procedure assumes that 
the normative values obtained by Anderson 
(1968a) are valid estimates of scale values. 
One check on this technique is to see 
whether the pairs having minimal within- 
set range are linearly related to Ander- 
son's scale values. As can be seen by 
connecting these points (the diagonal) in 
the lower right-hand panel of Figure 3, 
this assumption is supported. Never- 
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theless, the interaction has the same 
divergent form as the others and is highly 
significant. The results for this non- 
numerical response technique do not sup- 
port the objection that perhaps S's inte- 
gration strategy is affected by the numerical 
response procedures typically used in this 
type of study. The error terms for this 
method were greater than for any of the 
other methods, perhaps because Ss must 
judge the pairs of items defining the scale 
in addition to the usual stimulus pairs. 

In summary, the main conditions of 
Experiment II show that variation of ex- 
perimental procedures for responding did 
not eliminate the divergent interaction. 
Each procedure can be considered an al- 
ternative operational definition of the im- 
pression. By the principle of converging 
operations (Garner et al., 1956), the di- 
vergent interactions in Figure 3 support 
the interpretation that impressions of per- 
sonality are nonadditive. In order to 
retain the additive model, it would be 
necessary to assume that the responses for 
all of these procedures are nonlinearly 
related to the impressions. 

Supplementary scaling. One purpose of 
the supplementary scaling was to check 
a previous prediction (Birnbaum, 1972a) 
that items of lower value have narrower 
dispersions. Successive interval scale val- 
ues and dispersions (Torgerson, 1958) were 
obtained for each adjective and adjective 
pair. Mean ratings were linearly related 
to Anderson's normative values and to 
Thurstone scale values derived from the 
same data. In agreement with the dis- 
tributional interpretation, Thurstone scale 
values were positively correlated with dis- 
persion values (r = .65). Since each of 
these dispersions was produced by the 
differences between the responses of dif- 
ferent Ss to the item, they are probably 
not the best estimates of the distribution 
of meaning for the adjectives. Only if it 
is assumed that different Ss select a ran- 
domly sampled “person” from the dis- 
tribution of “persons” who possess the trait 
would the Thurstone dispersion reflect 
this dispersion directly. Nevertheless, the 
correlation between value and dispersion 
seems consistent with this prediction. 
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Subjective estimates of the range of 
likableness seem a more direct measure of 
dispersion. These were highly correlated 
with mean ratings of likableness (r — .91). 
The plot of this relationship showed that 
although range is a nearly linear function 
of likableness for the dislikable traits, 
neutral and positive traits are all seen as 
implying similar wide ranges of likable- 
nesses. This finding is consistent with the 
fact that the interactions in Figure 2 ap- 
pear to be located in combinations that 
include dislikable traits. In a post hoc 
analysis of the 2 X 2 subdesign containing 
only positive adjectives for Experiment I, 
the interaction was nonsignificant (F < 1). 
The 2 X 2 subdesign containing the nega- 
tive traits had a significant interaction, 
F (1, 297) = 5.93. 

In summary, the supplementary scaling 
suggests that adjectives should be repre- 
sented by distributions, with the lower 
valued items having smaller variance. 

Supplementary test of activity. One in- 
terpretation of the crossover interaction 
observed in Experiment I, Replicate 3 
(Figure 2) is that the activity component 
of the meaning of one adjective can mul- 
tiply the evaluative component of the 
adjective with which it is paired. Hence, 
à SELF-CONFIDENT MALICIOUS person is 
more actively malicious than a SHY one. 
Similarly, an IMPULSIVE CRUEL person may 
be more likely to act cruel than an UNTIDY 
one (Experiment I, Replicate 2). 

This hypothesis—that one adjective can 
behave like an adverb—was further in- 
vestigated by having 130 additional Ss rate 
the likableness of 36 hypothetical persons, 
each described by 2 adjectives produced 
from a 6 X 6 factorial design. The ad- 
jectives for the first factor varied in 
likableness (RUDE, IMMATURE, TROUBLED, 
DIRECT, INTELLECTUAL, and HONEST). The 
adjectives for the second factor were rela- 
tively neutral in likableness, but 3 were 
"active" (IMPULSIVE, AGGRESSIVE, and 
CHANGEABLE) and 3 were “passive” (QUIET, 
HESITANT, and SHY). 

Consistent with the prediction, the lika- 
bleness effect of an adjective was greater 
when paired with an active adjective. 
The interaction was large and significant, 
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F (25, 3225) — 14.49, and even showed 
reliable crossovers; for example, HESITANT 
& HONEST is less likable than AGGRESSIVE 
& HON although nu ANT & RUDE is 
more likable than AGGRESSIVE & RUDE. 

Ratings of the activity of the 30 ad- 
jectives used in Experiment I were un- 
correlated with ratings of likableness of 
the same adjectives but were correlated 
with the Thurstone dispersions (r — .43). 
The activity hypothesis is not sufficient to 
account for the overall divergent inter- 
actions in Figures 2 and 3, but it appears 
to explain some of the second-order effects 
(which would otherwise be called ‘“pecu- 
liarities") for particular adjective combi- 
nations in Experiment I. 


EXPERIMENT III: SUBTRACTIVE 
PREFERENCE VERSUS ADDITIVE 
COMBINATION 


Experiment III had the following ob- 
jectives: (a) to evaluate a subtractive 
model for preference judgments reflecting 
the difference in likableness between 2 hy- 
pothetical persons, each described by one 
adjective; (b) assuming that the sub- 
tractive model were to fit, to compare 
Scale values for the preference task with 
those obtained for the usual combination 
task, in which Ss judge the likableness of 
a person possessing both traits; (c) to use 
the scale values obtained for the preference 
task as the basis for rescaling the data 
from the combination task; (d) to use the 
scale values for the preference task to 
evaluate Anderson's normative values for 
single items; and (e) to evaluate the effects 
of a change in instructions for the com- 
bination task, in which the adjectives are 
given equal importance and accuracy. 

The subtractive model asserts that pre- 
ference ratings can be represented as the 
algebraic differences between the values of 
the items: 

PaP 


pac oe 


[2] 
where W;,? is the psychological difference 
in likableness (preference for Stimulus i 
over j), and s; and s; are the scale values 
of these 2 stimuli. If the subtractive 
model can be fit to the data, then the 
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adjectives can be located as point i 


unidimensional likableness continuum 
Two principles of scale converge: 
be applied to this experime 
sponse scale invariance: Wit! 
set of stimuli and the same r 
cedure, the judgment function is 
to be independent of task. (b) Sti 
scale invariance: With the samc 
stimuli, the scale values of the stir 
assumed to be independent of task. 
ciple a implies that whatever transfos 
tion is applied to fit the data to : 
ditive model for the combination 
should also be applied to the data 
the difference task. Principle b implic 
that irrespective of whatever transforma- 
tions of the responses fit the data for the 
2 tasks to their respective models, the 
scale values derived from the subtractive 
model should be linearly related to the 
scale values for the same adjectives in 
combinations. Stimulus scale invariance 
can be considered a necessary (but not 
sufficient) condition for establishing mean- 
ingful scales and psychological laws. A 
failure of scale convergence can provide 
the leverage for rejecting either the sub- 
tractive or additive model, even thougl: 
the J functions are unknown. 


Method 


Stimuli. The stimuli were the adjective pairs 
of Experiment I. 

Subjects. The Ss were 90 University of California, 
Los Angeles, undergraduates, 30 serving in each 
replicate. 

Procedure, Each S performed both ta 
half of the Ss in ea 
difference task first. 
task order. 

In the difference task, Ss were instructed to 
imagine 2 different people, each described by one 
of the adjectives of each pair, and then to judge 
the difference in likableness between the 2 persons. 
The 9-point scale had labels varying from 1 = like 
the person (described by the adjective) on the 
left very very much more than the person (de- 
scribed by the adjective) on the right, through 
5 — like both persons equally, to 9 — like the 
person on the right very very much more than 
the person on the left. 

The procedure for the combination task was in 
all respects identical to that of Experiment I, 
except that the instructions were modified to em- 
phasize that the adjectives should be considered 
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ch replicate performing the 
There were no effects of 
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FIGURE 4. Mean ratings of difference in likableness between 2 hypo- 
thetical persons, each described by a single adjective (upper panels), and 
mean ratings of likableness of hypothetical persons, each described by the 
same 2 adjectives (lower panels) : Experiment III. 


of equal importance and accuracy, each word 
having been contributed by a different acquaintance 
"who knows the person well." 


* Results and Discussion 


Difference task. The upper panels of 
Figure 4 show the mean judgments of the 
difference in likableness. Because ratings 
reflect the difference between Factors A 
and B, the slopes are positive, but the 
order of the curves is negative. 

Since the subtractive model is a special 
case of an additive model, the parallelism 
of the curves supports its validity. Only 
the interaction for Replicate 2 is signifi- 

cant, and examination of the figure in- 
dicates that the interaction is minor and 
mainly due to one point. Since Anderson's 
(1968a) normative values are plotted on 
the abscissa, the near linearity of these 
functions supports the validity of the 
normative values as estimates of scale 
values. Similarly, the vertical separations 
between the curves are nearly proportional 
to differences in Anderson's scale values. 


The good fit of the subtractive model 
supports the validity of the category rating 
scale and indicates that the adjectives can 
be located on a unidimensional likableness 
continuum. 

Combination task. The lower panels of 
Figure 4 show the mean judgments of the 
likableness of persons described by both 
adjectives. The data appear to replicate 
Experiment I closely, including the pecu- 
liarities for ratings of particular adjective 
combinations. The interactions for this 
experiment diverge in the same fashion 
as Experiment I and are highly significant 
for each replicate, as indicated by the F 
values printed in the panels of the figure. 
The F values are smaller than those for 
Experiment I because there were fewer Ss 
in this experiment. The change in in- 
structions, emphasizing equal importance 
and accuracy, appears to have had no 
discernible effect on the interactions. 

Scale convergence. Both the stimulus 
and response scale invariance assumptions 
would require that either the subtractive 
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model for differences or the constant- 
weight averaging model for combinations 
must be rejected. Response scale invari- 
ance implies that any transformation of 
the ordinate of the lower panels must be 
applied to the upper panels as well. Be- 
cause the subtractive model fit the data 
directly, any nonlinear transformation 
would induce an interaction. Therefore, 
if the validity of the subtractive model 
and the principle of response scale invari- 
ance are assumed, the constant-weight 
averaging model must be rejected. 

The fit of the subtractive model implies 
that the marginal means for these data 
constitute an interval scale for the ad- 
jectives (Anderson, 1970). However, when 
the combination data were rescaled to 
additivity using Kruskal's (1965) MONA- 
NOVA, scale values derived from this 
procedure were nonlinearly related to those 
for the same adjectives derived from the 
subtractive model. Therefore, the prin- 
ciple of stimulus scale invariance implies 
that this transformation would not be 
appropriate. 

Further support for the validity of the 
rating scale is provided by the finding 
that ratings of homogeneous combinations 
(adjectives of similar normative value) are 
nearly linearly related. (slightly sigmoidal) 
to subtractive model scale values. Hence, 
the assumption of stimulus scale invariance 
implies that the J function is roughly 
independent of task. Thus, 3 procedures 
agree—Anderson's (1968a) normative val- 
ues, subtractive model scale values, and 
ratings of homogeneous combinations are 
all nearly linearly related. 

This analysis also indicates that a simple 
multiplicative model, W; = s,5;, would be 
inappropriate for the combination task; it 
would incorrectly predict that judgments 
of adjectives of equal value are a positively 
accelerated quadratic function of the scale 
values for the difference model. However, 
à geometric averaging model (square root 
of the product of the scale values) would 
still be consistent with the subtractive 
model scale values. 

The data of Experiment III indicate 
that with the same stimuli, Ss, and general 
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experimental procedure, ratings o! 
ence are consistent with a 
model, but ratings of combinatior 
again inconsistent with tlie additive m E 
This suggests that the intera: 

due to trivial details of experi 


subi 


cedure. If scale values for the € s 
are assumed to be independent ot a 
the fact that the ratings of homo: i$ | 
combinations are a nearly linear f n 
of the scale values for the difference : s 
implies that it would be inapprop:: o 
rescale the ratings of combinatio! " 


order to retain the additive n: i 
would be necessary either to reje 
subtractive model (in spite of its goo 

to the data in Figure 4), or assume t. t 
both the scale values amd the respoi 
function depend upon the task. 


EXPERIMENT IV: QUALITATIVE 
NONADDITIVITY 


Experiment IV was designed to provi 
a test of additivity that would requir: — 
only the ordinal information in the data. 
The evidence against the additive and 
constant-weight averaging models obtaine i 
in the first 3 experiments is illustrated a | 
Figure 5A, where the difference in rati g | 
between asb; and asb; is greater than te | 
difference between ab: and abı. An i l- 
ditive model requires that these differences — 
be subjectively equal. If the J function 
is assumed to be linear, then differences | 
in judgment are proportional to differences 
in the impressions; therefore, the diver- 
gence (previously shown in Figures 2, 3, 
and 4) would be contrary to additive 
models. However, if J were positively 
accelerated, impressions might be additiv 
The same ratings have been transformed 
to parallelism in Figure 5B. Although the 
difference in rating (Figure 5A) due to. 
the change from bs to b, depends on 
whether a, or as was in the same set, 
the difference in psychological value (Fig- 
ure 5B) is assumed by additive models to 
be constant. 

Experiment IV tested this possibility by 
asking Ss to judge directly the differences 
in likableness between integrated impres- 
sions. The success of the subtractive 
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FiGvgk 5. A, Schematic diagram of nonadditive data. B. Schematic 
j diagram of additive impressions (following transformation) showing predic- 
tion for difference judgments (note that differences in transformed ratings 


are assumed to represent psychological differences), C. Matrix of hypo 
thetical data representing ratings of the difference in likableness between 
the person described by the pair of adjectives for the column and the person 
described by the pair for the row, assuming nonadditive impressions but 
subtractive preferences, D, Matrix of hypothetical data for column minus 
row differences, assuming additive impressions and subtractive preferences. 
[Matrix entries for Panels C and D are equal to 4(d) + 5, where d is the 
diflerence between the ordinate values for Panels A and B.] 
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nodel in Experiment III suggests that it 
can be used to infer the impressions of the 
combinations. It is only necessary to as- 
sume that ratings of differences in likable- 
ness are strictly monotonic to differences 
in psychological impressions. If the inter- 
action is real, then the preference for abs 
over deb, should exceed the preference for 
abs over aibi, as illustrated by the length 
of the dashed arrows in Figure 5A, If the 


‘interaction reflects only a nonlinearity in 


the J function, however, then the 2 prefer- 


ences should be equal, as shown by the 
dashed arrows of Figure 5B. 

The matrices in the lower panels of 
Figure 5 illustrate 2 patterns of data that 
might be obtained for ratings of the dif- 
ference in likableness between 2 persons, 
each described by a pair of adjectives. 
Figure 5C shows the pattern that would 
be obtained if the integration of informa- 
tion were nonadditive, Each pair of 
circled entries is an example of a predicted 
rank-order comparison. The dashed ar- 
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TABLE 1 
MEAN RATINGS OF THE DIFFERENCE IN LIKABLENESS BETWEEN Pairs or Hyroriw 
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Person on the right 


Person on the left 
PHONY & 
MEAN 


PHONY & 
TRUSTWORTHY 


Replicate 1 
PHONY & MEAN 
PHONY & TRUSTWORTHY 
SINCERE & MEAN 
SINCERE & TRUSTWORTHY 


Replicate 2 
DECEITFUL & CR 
DECEITFUL & DEPE 
HONEST & CRU 
HONEST & DE 


ENDABLE 


6.34 t 
5.07 
4.89 
2.60 
DECEITFUL & HONEST & 
DEPENDABLE CRUEL 
5.03 6.26 8.2t 
3.65 4.88 7.89 
3.89 5.06 7.95 
2.09 2.12 4.83 


MALICIOUS & 
OBNOXIOUS 


MALICIOUS 
UNDERSTANDING 


LOYAL & 
OBNOXIOUS 


LOYAL & 
UNDERSTANDING 


Replicate 3 
MALICIOUS & OBNOXIOUS 5 
MALICIOUS & UNDERSTANDING cen 
LOYAL & OBNOXIOUS 3.5 
LOYAL & UNDERSTANDING 1 
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rows represent the same comparison as the 
dashed arrows in Figure 5A. Since the 
ratings represent judgments of differences 
between impressions labeled by the column 
and the row entries, the direction of the 
order is reversed below the diagonal. 
Figure 5D represents the type of pattern 
that would be obtained if the adjectives 
combined additively. Each pair of circled 
entries would be equal. Both matrices in 
Figure 5 were generated by assuming the 
subtractive model for preference (Equa- 
tion 2). However, the rank order of the 
matrix entries would remain the same if 
subjected to a strictly monotonic trans- 
formation. Thus, the data can be rescaled 
to fit the subtractive model without pre- 
cluding the ordinal test for the additive 
integration models for impression forma- 
tion (Birnbaum, 1972b). 

For example, if the interaction shown 
in Figure 2 is real, then the judged dif- 
ference in likableness between someone 
who is UNDERSTANDING & LOYAL and 
someone who is UNDERSTANDING & MALI- 
crous should exceed the judged difference 
between someone who is OBNOXIOUS & 
LOYAL and someone who is OBNOXIOUS & 


MALICIOUS. But if adject nte: 
grated according to an addio on 
stant-weight averaging model, i! ine 
differences would be equal. 

Method 


The Ss were instructed to judge the 
in likableness between pairs of hypothetical 
each of whom was described by 2 adjectives 
Two adjectives were printed on the left side of 
the page and 2 on the right side of the page. Tl 
Ss were instructed to form impressions of thé 
personalities of the 2 persons before judging t 
difference in likableness. The 2 adjectives d 
scribing each person were described as being 
equal accuracy and importance; S's task was 
imagine that they were contributed by different; 
but equally reliable, sources. Judgments were if 
terms of a 9-point se: labeled as in the difference 
task of Experiment III. 

Subjects, The Ss were 195 University of Cali: 
fornia, Los Angeles, undergraduates, 65 in each 
of 3 stimulus replicates. 1 

Stimuli. The stimulus for each judgment cone 
sisted of 4 adjectives, 2 printed on the left side: 
of the page and 2 on the right. The adjecti 
pairs describing the “person on the left" were 
produced from a 3 X 3, A X B, factorial design 
the adjective pairs on the right were constructed: 
from a 2 X2, C X D, factorial design. Eai 
person on the left was combined with each person 
on the right producing a 3 X 3 X 2 X 2, (A X B) 
(C X D), factorial design. 


TRANSFORMED SCALE 


# REPLICATE 3 
= 


3 o 


on ds l 


LEVEL OF ADJECTIVE A 


FrGuRE 6. Transformed impres 


tives of Experiment I. o 
= SINCERE, for Replic: 


compare this figure with Figure 2.) 


The basic idea of the stimulus construction can 


be understood by consideration of Figure 2. For 
the third replicate, the adjectives were d, = MALI- 
CIOUS, dy = CHANGE. a = by = op- 
NOXIOUS, by = SHY, by = UNDER 3, €) ™ MA 


* LOYAL, d; = OBNO) , and dy = 
ANDING. With respect to Figure 5C, the 
last entry in the first row (9) would represent the 


rating of the difference in likableness between 
someone who is LOYAL & UNDERSTANDING and 
someone who is MALICIOUS & OBNOXIOUS. Nine 


additions 
each replicate by pairing each person on the 
with an additional person on the right who 
INFORMAL & LIGHTHEARTED, Therefore, the entire 
design represents a 9 X 5 factorial design with the 
factors representing the person on the left and the 
person on the right (see Birnbaum, 1972b). 

Procedure. ‘The 45 trials were printed in 1 of 3 
different random orders in booklets with different 
page orderings for different Ss, The Ss read 
through the entire list before beginning to record 
their judgments. 


] cells in the design were constructed for 
ft 


Results and Discussion 


The mean judgments of difference in 
likableness are presented in Table 1 for 
that part of the design illustrated in 
Figure 5. Comparison of the rank order 
of the ratings with the direction of the 
differences for the circled values shows 
that the pattern resembles that of Figure 
5C representing an interaction: Differences 


sion values for the per: 
derived from MONANOVA applied to difference ratings 
(The 3 levels of subscripts refer to low, medium, and high li 
example, a; = 
e l. To assess metric information in mean ratings, 


o» on the left 
»eriment. I 
)leness adjec- 
PHONY, dy = BLUNT, and d; 


in rating from Experiments I, II, and III 
predict the ratings of differences in the 
present experiment. 

The crucial comparisons (circled entries 
in Figure 5D), which must be equal for 
the additive or constant-weight averaging 
model, are significantly in the direction 
predicted by the interactions in Figures 
2-4 for all 3 replicates, Fs (1, 64) = 71.13, 
115.88, and 89.54, respectively. The indi- 
vidual ratings by single Ss show the same 
pattern of rank orders for these crucial 
comparisons as the mean judgments in the 
table. Of the 195 Ss, 81 were in perfect 
ordinal agreement with the interaction 
obtained in the first 3 experiments for all 
4 comparisons, Only 12 Ss showed a 
greater number of rankings in the direc- 
tion opposite from that predicted by the 
interaction, compared with 170 Ss whose 
rankings were in the direction predicted 
by the interactions obtained in Experi- 
ment I. These results are clearly incon- 
sistent with the additive model which 
would predict an equal split of these 
rankings between the 2 directions. 

The larger, 9 X 5 design was used to 
test the subtractive model for preferences 
between combinations. The subtractive 
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model also predicts parallelism, and the 
data appeared roughly parallel when plot- 
ted. However, statistically significant in- 
teractions were obtained for each stimulus 
replicate, Fs (32, 2048) — 3.72, 4.98, and 
5.10 for Replicates 1, 2, and 3, respectively. 
Application of MONANOVA to these 
ratings of preference reduces the inter- 
action to less than 1% of the total vari- 
ance, so it is apparent that the preference 
ratings are at least ordinally consistent 
with the subtractive model. 

Following the application of MONA- 
NOVA to the 9 X 5 subtractive model 
design, the derived impression values 
for the combinations were inconsistent 
with the additive and constant-weight 
averaging models. Figure 6 plots the im- 
pression values derived from the MONA- 
NOVA transformation for the 9 persons 
on the left. Since these were produced 
from a 3 X 3 factorial design of adjectives, 
the nonparallelism of these scale-free values 
refutes the additive models of impression 
formation. Similar results were obtained 
for the 2 X 2 design for persons on the 
right. 

In summary, the minimal assumption 
that the preference ratings are monotoni- 
cally related to subjective differences leads 
to the conclusion that the additive and 
constant-weight averaging models must be 
rejected. Comparison of scale-free values 
in Figure 6 with the scale-dependent 
ratings in Figures 2, 3, and 4 shows that 
ratings contain metric information that cor- 
rectly describes the divergent interaction. 

Krantz et al. (1971, pp. 445-447) have 
argued that if an interaction can be re- 
moved by monotonic transformation, then 
it should be attributed to the rating scale 
and should not be given a psychological 
interpretation. However, Experiment IV 
shows that this procedure could lead to 
erroneous conclusions. Although the in- 
teraction obtained in Experiment I could 
be removed by monotonic transformation, 
ratings of differences in impression are 
predictable from the differences in ratings. 
This indicates that the interactions are of 
psychological significance. Rescaling those 
data to parallelism would be inappropriate, 
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as demonstrated by the qualitative con- 
tradictions of Experiment IV. 

However, the ratings also appear to 
contain some nonlinearity that reduces the 
apparent magnitude of the interaction for 
nonnegative items. It would be improper 
reasoning to conclude that nonnegative 
items combine additively based on the 
lack of interaction in the ratings. The 
scale-free values in Figure 6 appear to 
show a continued divergence. Nonlinear- 
ity in the response scale may actually 
make nonadditive data appear additive. 
The procedures of Experiment IV avoid 
the rescaling problems and provide a truly 
scale-free test of the additive models. 


GENERAL DISCUSSION 


This research demonstrates that impressions 
of likableness cannot be represented as simple 
sums or constant-weight averages of the value 
of the adjectives. Instead, one bad trai 
results in an unfavorable overall impression 
with the other trait having less influence. 
This effect reflects a real psychological inter- 
action and cannot be attributed to the re- 
sponse scale. 

Some of the early research on impression 
formation concluded that the constant-weight 
averaging model could give a reasonable fit 
to ratings of adjective combinations. The 
parallelism prediction appeared to be roughly 
satisfied, and this finding was interpreted to 
validate both the model and the rating scale. 
Systematic deviations did occur in this re- 
search, but were often attributed to experi- 
mental difficulties or nonlinearity of the 
response scale. However, it is now clear that^ 
the constant-weight averaging model is not 
an appropriate general description, but may: 
give a reasonable approximation when the 
stimulus range is restricted and the experi- 
mental design lacks power.* ; 


3 An alternative view might maintain that the 
constant-weight model is descriptive of impression 
formation, but only under limited experimental 
circumstances. The E would be advised to care- 
fully select adjectives, instructions, response pro- 
cedures, and other conditions to minimize the | 
nonparallelism. When the data appear parallel, 
this view contends that E would have the right - 
to conclude that the parallelism jointly supports 
the constant-weight model and the response scale. 
However, this approach seems unsatisfactory for 
several reasons. First, the present research shows 
that different response procedures, "equal impor- 
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Theoretical Implications 


It is useful to consider a set of conditions 
that yield the parallelism prediction to con- 
sider what nonparallelism might mean. The 
following conditions underlie the parallelism 
prediction of the averaging model: (a) the 
integration function is an averaging process; 
(b) the adjectives within each factor have equal 
absolute weight; (c) the adjectives do not 
change value nonlinearly in combination, 
nor do the weights depend upon the partic- 
ular stimulus configuration; and (d) the 
J function is linear. Based on a single ex- 
periment such as Experiment I, nonparallelism 
could be interpreted as evidence to disprove 
at least one of the premises. Without further 
constraints, it would be impossible to specify 
whether nonparallelism was due to a non- 
additive 7 process (Conditions a-c) or a non- 
linear J function (Condition d). These ex- 
periments provide the leverage to indicate 
that the nonparallelism is not attributable to 
the J function. There are several remaining 
possibilities: The integration function may not 
be an averaging process, the stimulus pa- 
rameters may depend upon the configuration, 
or the weights may not be equal for the 
adjective 

The differential-weight averaging model 
(Anderson, 1971, 1972, 1974) can account for 
the interactions by allowing the weight of an 
item to depend upon its scale value. Dif- 
ferential weighting requires many more pa- 
rameters and seems unnecessarily complicated 
to fit the simple divergence interactions. The 
model can account for a wide variety of 
interactions, making it difficult to disprove. 
However, with large enough designs, there 
are degrees of freedom left over to permit 


tance and accuracy" instructions, and different- 

selections of adjectives all yield similar divergent 

interactions. If the model is to be deemed cor- 
"rect, then it must apply under very special con- 

ditions indeed. Second, since it is possible to 

manipulate the form of J, it follows that selection 
p of experimental procedures could lead to an "ex- 
perimental rescaling" of the data. "Thus, it should 
be possible to select end anchors and filler stimuli 
to reduce the interaction in the analysis of variance. 
"Therefore, it would be inappropriate to select ex- 
perimental procedures that yield parallel data and 
then conclude that the parallelism is joint support 
for model and response scale. That parallelism 
could result from a combination of nonadditive 7 
and nonlinear J is not merely an untestable philo- 
sophical possibility but represents a plausible hy- 
pothesis that can be tested by the procedures of 
Experiment IV. 


PERSONALITY IMPRESSIONS 
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tests (e.g., Anderson, 1972; Birnbaum, 1973). 
Configural weighting (Birnbaum, 1972b), in 
which the weight of an item depends in part 
on its rank within the set, requires fewer 
parameters and gives a slightly better fit to 
the data.^ 

The configural-weight model predicts steady 
divergence for Figures 2, 3, 4, and 6. The 
differential-weight model is more flexible and 
could account for reconvergence, among other 
patterns. It is thus of theoretical interest to 
ask if the curves show any evidence of re- 
convergence. The scale-dependent ratings of 
Figure 2 appear nearly parallel for non- 
negative traits. However, the scale-free values 
in Figure 6 show steady divergence for Repli- 
cates 1 and 3. The ratings may contain a 
small scale-end effect that reduces the ap- 
parent interaction. The bulge in Figure 2, 
Replicate 2 is apparently due to IMPULSIVE 
multiplying the effect of the other adjective, 
rather than having less weight on its own. 
Since there is no evidence for more than a 
simple divergence, the existing data cannot 
test between differential and configural-weight 
models. Methods for distinguishing these 
models have been suggested by Birnbaum 
(1973). 


Methodological Implications 


Previous conceptualizations of the stimulus 
concatenation problem have not separated the 

* An averaging mechanism can be analogized to 
a balance plank and fulerum. In the differential- 
weight model, each adjective corresponds to a 
weight (w) placed at a certain location (s) on the 
plank. The integrated impression (W) is the loca- 
tion where the fulcrum must be placed so that 
the plank will balance. This location is the weighted 
average, VW = Ywjs;/w;. The configural-weight 
averaging model assumes that the weight of a 
stimulus depends upon its rank within the set to 
be judged. For 2 stimuli, the simple ee model 
(Birnbaum et al, 1971) Wi; —.5(s; +s) + 
«|s; — s;|, can be rewritten as a configural- weight 
model, Wij = (.5 + )si + (.5 — w)s;, when s; > sj. 
This model can also be represented by a balance 
plank mechanism; however, the configural-weight 
model does not require each location on the plank 
to have its own weight. For 2 stimuli in the set, 
there would be only 2 weights, one for the lower 
valued item, and one for the higher valued item. 
The same weight can be placed at any location 
on the plank, if it holds the same rank. This 
model becomes a minimum model when w = —.5 
a constant-weight model when w = 0, and a m 
mum model when w = .5, and can describe a family 
of simple convergent or divergent interactions. 
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J and J functions of Figure 1. This article 
offers a conceptualization that separates these 
problems. The propriety of rescaling the 
data has been uncertain in previous work. 
Functional and conjoint measurement (Ander- 
son, 1970; Krantz et al, 1971) both allow 
for monotone transformation, but differ in 
their outlook about when transformation is 
appropriate. 

Krantz et al. (1971, pp. 445-447) reanalyzed 
the data of Sidowski and Anderson (1967) and 
concluded that since the interaction could be 
eliminated by a monotonic transformation of 
the mean ratings, the original authors were 
incorrect in attributing psychological signifi- 
cance to their findings. An investigator fol- 
lowing this rescaling procedure with the data 
of Experiment I would have erroneously con- 
cluded that the additive model was an ap- 
propriate description of impression formation. 
The present research, by demonstrating that 
data transformation can lead to erroneous 
conclusions, provides a warning against this 
practice. 

Anderson? originally attempted to rescale 
the data for ratings of the severity of dis- 
turbed behaviors and to attribute the non- 
additivity to the rating scale, but he has 
recently reinterpreted the same data as evi- 
dence for the configurality of clinical judg- 
ment by fitting these data to an averaging 
model with differential weights (Anderson, 
1972). The latter interpretation assumed the 
linearity of the J function. 

The present research supports this attitude 
toward rating scales, but it also provides 
scale-free constraints that make the inter- 
pretations of Krantz et al. (1971) and Ander- 
son (1972; see, also, Footnote 5) matters for 
experimentation rather than assumption. 

In Experiment IV it would be possible to 
distinguish multiplicative from additive mod- 
els on the basis of ordinal information (see 
Figure 5). The simple view of conjoint scaling 
would not allow this distinction, since a mono- 
tonic (logarithmic) transformation of a posi- 
tive product yields a sum. By assuming that 
ratings of differences are monotonically related 
to subjective differences, the 7 function can be 
separated from the J function. Experiment IV 
allows rescaling, but rescaling does not pre- 
clude the possibility of testing the 7 function. 

Although the rating scale contains metric 


5 N. H. Anderson. Application of an additive 
model to impression formation. Paper presented 
at the third annual meeting of the Psychonomic 
Society, St, Louis, August 1962. 
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information, it does not seem 
assume that J is linear in ev 
The J function can be prec 
It depends upon contextual ci: 
explainable (Birnbaum et al., 1 
& Perrett, 1971). A 
could result in 2 possible er 
additive data when the underlyin 
are additive, or (b) additive 1 when the 
impressions are nonadditive. Thus, the deci 
sion to rescale or not to rescale cannot be 
convincingly settled without further com 
straints, such as those applied in Experiments) 
III and IV. 


appropriate ti 

experiment; 
sly nonlinear, 

ts that are 
1; Parducei; 
J function 
s: (a) nons] 
impressions: 


nonlinear 


Conclusions 


Impressions of likableness cannot be reps 
resented as simple sums or averages of single 
values of the adjectives. They appear to be 
a predictable, but nonadditive function of the 
component values. The data show consistenti 
regular deviations from additivity that arg 
similar for different selections of adjectives 
When one adjective is dislikable, the persai 
is rated as dislikable, and variation of thé 
other trait has less effect. 

Differential or configural wei 
more dislikable traits can account for the 
interactio Representation of the stimuli 
by distributions could explain why the ad: 
pues are integrated by a nonadditive func- 

It is less likely for a person possessing’ 
ikable trait to be likable than for à 
person with a likable trait to be dislikable. 

Finally, this research illustrates that rating 
scales contain metric information that should) 
not be uncritically scaled away. The proce- 
dures employed in Experiments III and IV 
provide constraints that allow a model to be 
tested without having to assume the validity) 
of the rating scale. 


;hting of the 
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Accuracy. Gerbrands tachistoscopes all have direct-reading single and multichannel 
digital timers with accuracy of + .005% or better. Lamp drivers are completely solid state 
and insure accurate and repeatable control of illumination. Variable intensity controls are 
standard. 


Versatility. Gerbrands tachistoscopes are available with a complete line of 
interchangeable stimulus backs for presenting almost any type of stimulus. Inexpensive 
polarized filters are available for experiments using dichotic fields. All lamp drivers 
operate from +5 volt logic and are computer compatible, 


Convenience in Programming. Our new push button logic unit permits instant 
programming of the fields and externally powered devices in a wide variety of sequences. 
To change programs, simply press a few buttons. No need for cumbersome patchboards 
or custom interfaces. 


Ease of Operation. All timers utilize easy-to-use thumbwheel switches - no confusing 
array of rotary switches. In addition, all Gerbrands tachistoscopes have the new, center 
pivoted, gimbal mount mirror system that permits fast, accurate alignment of the fields 
and masks from outside the cabinet. 


Gerbrands Tachistoscopes. The result of over 40 years of continuous development in the 
design and manufacture of quality instruments for psychology. 


Q Ralph Gerbrands Company 


8 BECK ROAD, ARLINGTON, MASSACHUSETTS 02174 (617) 648-6415 


The 5800 Data Cube series is the first in a family of low 
cost, high quality instruments designed especially for 
introductory, undergraduate and general research work 


in human and animal behavioral lab- 
oratories. Use them independently or in 
any combination. Each Data Cube in the 
5800 series is guaranteed 100% com- 
patible and all units with electronic cir- 


cuitry are solid state with built-in power [| 


supplies. 

Identical size 6" x 6" x 4" cubes can be 
stacked like blocks, eliminating the need 
for expensive racks and power rod strips. 
Front panels are engineered for easy 
operation. And color coding on binding 
posts is consistent. With the 21 Data Cubes 


now available in the 5800 series, the experimenter, 
lecturer or researcher can control all basic human 
and animal behavioral experiments inexpensively and 


For information on individual Data Cubes, including 
price list and specifications, check the following list and 


mail to: 5800 Data Cube Series, Lafay 
ette Instrument Company, P. O. Box 1279. 
Lafayette, Indiana 47902. 
@ indicator panel B switch pane! 

selector panel B data recorder 
W Power supply W shocker M stop 
clock B drinkometer W counter 
Bi timer W photocell control W pre- 
determining counter W- auxiliary relay 
B stimulus display B numeric indica 
tor B power control B flicker fusion 
W GSR W probability generator 
E scrambler B pulse former an 


P. 0. Box 1279 + Lafayette, Ind. 47902 — 


LAFAYETTE INSTRUMENT COMPANY € ) 
Papel d 


LECTURE SUPPLEMENTS—INDIVIDUAL STUDY 


EXPERIMENTS IN CONTEMPORARY PSYCHOLOGY: TEXTS AND MAT 
JOHN BRADSHAW, Monash University, Australia 


Professor Bradshaw has assembled a library of didactic experiments drawn from his own recent research 
and from current developments in cognition, human performance, information processing and perception. 
Each experiment is a complete unit containing background information, references, glossaries, instruc- 
tions for the general conduct of the experiment and gathering data, worksheets, data sheets and guide- 
lines for statistical analyses. In addition, the materials needed to do the work are available: slides, 
tapes and stimulus cards, The experiments are theoretically sophisticated yet clearly described and 
refreshingly free of the need for complex apparatus or tedious preparation. Most of the nine units can 
be worked with either no apparatus or an ordinary slide projector or tape recorder, These experimental 
units are ideal for filling out departmental repertoires of laboratory research opportunities. Among the 
many applications are: undergraduate and graduate experimental courses, individual research projects, 
topical research for non-laboratory courses, and simply "getting one's feet wet" in a research area. The 
format of the written material is suitable for a loose-leaf binder. The entire series requires approximately 
18 sessions, each of 2-3 hours experimental time. The experiments and stimulus materials are available 
singly as listed below or in complete sets. Experiment modules are: 1) Introduction, 8 

and Report Writing; 2) The Nature of the Visual Tra 3) List Searching Experi 

Rehearsal and Acoustie Confusions; 5) Sensory and Motor Effects in Articulation; 6) Later 

in Verbal and Rhythmic Tasks; 7) Identification of Mirror-Reversed & Non-Reversed Material in 
Same & Opposite aal Fields; 8) Concept Learning; 9) Skilled Motor Performance and Information 
Theory; and 10) Classical Psychophysics, the Power Law, Signal Detection and Absolute Judgments. 
Lab manual, 9. Complete library of stimulus materials (slides, tapes and stimulus cards), incl, 
2 lab manuals, $175.00; with Texas Instruments SR-10 calculator, $249.95; with TI SR-11 calculator, 
$274.95. Individual modules available from our catalog, 


THEORY AND EXPERIMENTATION IN SIGNAL DE "ION 
JOAN GAY SNODGRASS, New York University 
"This is a three-part instructional unit consisting of an introduction to signal detection theory, a d 


scription of experimental methods and methods of data analysis, and a sample experiment using SDT 
Ene to analyze recognition memory for high and low imagery words, suitable for a one-week labora- 
tory in either an introductory or basic experimental course. The only apparatus required is a set of 
word cards. Part 1: Introduction, $0.90. Part 2: Experimental methods and data analysis, $0.90, 
Part 3: Applications of SDT to recognition memory, $0.90, Word cards for Part 3, $4.80. Computer 
program (approx. 150 statements, Fortran IV) to simulate ar al observer, 2nd ed., $18.00, Double 
probability graph paper, pkg. of 100, $4.00. Complete instructor's set : parts 1, 2, & 3, cards, program 


and graph paper, $28,00, 
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25 minutes, 17 slides, booklet, $26.5 
speech, ‘Tape, approx. 30 minutes, 
COMPUTER GENERATED STATISTICS TESTS FOR TEACHI TO MASTER Y 
RONALD L, KO'TESKEY, Asbury College 
Those programs generate many unique versions of calculation tests for a non-enleulus based statistics 
course, ‘The wording of the questions in the calculation tests remains the same in each version, but the 

i sh version are chosen randomly within limits so that each test is unique. Those programs 

student answer key for each test which gives the final answer and suggestions for review 

if the question is missed. Finally, they generate a teacher key for each test which gives not only the 
final answer, but answers for each step in the solution of the problem. The wording of the questions 
in the calculation tests may be easily changed. Programs are written in Fortran IV and range from 
frequency distributions to analysis of variance in six separate tests plus a final examination i corporating 
all. Complete program deck (approx. 3000 statements, Fortran IV), instructions, $90.00; with TI SR-10 
calculator, $164.95; with TI SR-11 calculator, $189.95, 


FORTRAN PROGRAMS FOR PRODUCING COMP TION EXERCISES 
ROBERT J, TRACY & PAUL G, SALMON, DePaul Unive 


There are four programs plus subroutines designed to aid instructors in developing students statistical 
skills by generating exercises on 1) z scores; 2) Pearson r correlation with regression or matched groups 
t test; 3) MES groups t test; and 4) Chi square. The programs draw random samples from 


populations and produce a unique exercise (data values and instructions) for each student, corresponding 
solutions for the instructor and intermediate summary statistics, Parameter values can be chosen to 
produce marginal statistical significance. Complete program deck (approx. 600 statements, Fortran 
V), instructions, $28.00, ] 
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The Human Subject 


The Social Psychology of the 
Psychological Experiment 
John G. Adair 

University of Manitoba 


Use of the human subject in the 
psychological laboratory raises many 
methodological and moral issues. This 
supplementary book examines those 
issues from both a practical and philo- 
Sophical standpoint, providing an inte- 
grated perspective on a highly complex 
area of experimental psychology. An 
historical survey of man's persistent 
efforts to develop an appropriate study 
method is followed by an overview of the 
contemporary critical research on the 
psychological experiment, dealing with 


Such topics as experimental bias, volun- 
teers versus non-volunteers, demand 
characteristics, and ethical considerations 
Paper 109 pages 1973 
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fmicroswitch to detect when the subject takes food. The tray, levers, 
and floor are stainless steel. There are three stimulus lamps in the 
intelligence panel plus a house lamp and speaker in the roof. With 
standard levers, either a pellet dispenser or liquid dipper, housing, 
control panel, and connecting cable the price is $525. The same set-up 
with retractable levers is $699. Other chambers start at $240. We 
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THE STUDY OF BEHAVIOR 
Learning, Motivation, Emotion, 


and Instinct 

Editor: John A. Nevin, Columbia University 
Consulting Editor: George S. Reynolds, University of California, Sarr Diego a 
This balanced, distinctive text covers the topics of learning, motivation, emotion, and instinct, 
with special emphasis on learning problems. An innovative discussion of both recent and 
traditional issues is offered with a clarity of presentation that is unusual among texts of 

this type. 

1973, 418 pages, illustrated, cloth $10.95 


CONTROL OF HUMAN BEHAVIOR 


Roger E. Ulrich, Western Michigan University 
Thomas J. Stachnik, Michigan State University 
John H. Mabry, Porterville State Hospital, Porterville, California 


Volume |: Expanding the Behavioral Laboratory 
1966, 349 pages, illustrated, paper $5.95 


Volume II: From Cure to Prevention 
1970, 378 pages, illustrated, paper $5.95 


Volume III: Behavior Modification in Education 

We've extended the coverage in this excellent series by adding a third volume — focusing on the 
application of behavior modification to varied education settings. Ranging from infancy to 
adulthood, a thorough, comprehensive look at new programs in behavioral education is 
presented. 


March 1974, approx. 448 pages, illustrated, paper, approx. $6.95 


EXPERIMENTAL SENSORY PSYCHOLOGY 

General Editor: Bertram Scharf, Northeastern University 

Consulting Editor: George S. Reynolds, University of California, San Diego 

An introduction to the problems and methods of sensory psychology, including chapters on 
psychophysical procedures, biology of the sensory systems, and the individual sensory systems. 
June 1974, approx. 384 pages, illustrated, cloth, approx. $10.95 
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Are stimulus preparation and rapid stimulus — 
changes your latest Tachistoscope problems? - 


TRY THE AUTOMATIC TACHISTOSCOPE - 
SOLUTION 


Simply photograph those complex stimuli (color or 
B&W) and mount them in 2" x 2" slide holders. The 
Scientific Prototype AUTO TACH does the rest — 
quickly, efficiently, and with positioning repeatability well 
within .005 inch. Three color-matched fields with up to bO 
foot-lambert clear field brightness that are viewed through 
a comfortable, adjustable double eyepiece make this tach 
a natural for many areas of visual research. 


Present Typewritten word lists — automatically 


OR 
Compare reactions to scenic stimuli — rapidly, automa- 
tically. 

OR 
change backgrounds — rapidly, automatically. 

AND 


Insert standard filters in standard 2" x 2" filter holders 
for precise threshold work. 

AND 
Program the slide changes from the control unit timers. 


You may also purchase just the optical unit and changer 
control units as accessories to your existing Model GB elec- 
tronics unit — they simply plug in. 

AND 


Watch for news of our new variable magnification auto E 
tach. 3 


For more details on our expanding line of tachistoscopes 
B write for Scientific Prototype Catalog No. 101. 
PROTOTYPE 615 W. 131 ST., NEW YORK, N. Y. 10027, (212) 368-6000. 
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Beginning in 1975 the Journal of Experimental Psychology will appear as four independently edited and 
distributed sections: 
The Jm. of Experimental Psychology: Human Learning and Memory (Lyle E. Bourne, Jr., Editor]. 
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ARDIAC ORIENTING DURING “GOOD” AND "POOR" 
DIFFERENTIAL EYELID CONDITIONING! 


LEONARD E. ROSS, anp FRANCES K. GRAHAM * 


LOIS E. PUTNAM, 


University of Wisconsin— Madison 


The relation between orienting responses (ORs) and acquisition of a differen- 
tially conditioned eye blink response was studied in a group of 10 Ss showing 
good differential conditioning and a group of 10 Ss who failed to condition 
differentially. With Good conditioners, the predictions of OR theory were 
confirmed. Orienting, measured by heart rate (HR) deceleration, increased 
during acquisition and habituated as conditioning stabilized. With Poor con- 
ditioners, an OR-interference effect was found which is consistent with OR 
theory but was unexpected on the basis of most previous work. Another 
unexpected result was the occurrence of HR accelerations which were larger 
to a signaled than to an unsignaled unconditioned stimulus (UCS). It was 
suggested that both findings could be explained by the effects of orienting on 
processes active during short as opposed to long CS-UCS intervals and on 
skeletal conditioned responses (CRs) as opposed to autonomic CRs that are 
components of the orienting system. 


too weak. He also noted that the OR 
tends to disappear as conditioning is estab- 
lished (Lynn, 1966; Razran, 1961). The 
more fully articulated theory of orienting 
developed by Sokolov (1963) hypothesizes 
two generalized systems of responses with 
opposed functions, an OR which enhances 


\ major reason for interest in the orient- 

i^ response (OR) is its presumed relation 
) learning processes. Pavlov initially be- 
lieved that the pattern of overt physical 
reactions which he termed the "investi- 
gatory" or “orienting” reflex had little im- 
Portance for learning, but he later noted 
that conditioning proceeds best in the pres- stimulus reception and a defensive re- 
ence of an OR if it is neither too strong nor sponse system (DR) which attenuates the 
Th d effects of stimulation. The relation of these 


"M E ME systems to changes occurring in learning 
Award was described for two paradigms—classical 
HDOO117 Rise tke U.S. Public Health Service. 


and by Training Grant esc 
1 conditioning of the vasomotor response to 
Computer services were provided through Grant 


FR00249 to the Laboratory Computer Facility and 
through a National Science Foundation grant ad- 
ministered by the University of Wisconsin Research 
. Committee, Portions of this paper were presented 
at meetings of the Society for Psychophysiological 
Research, St. Louis, Missouri, October 1971. 
$ Requests for reprints should be sent to Leonard 
E. Ross or Frances K. Graham, Department of 
Psychology, University of Wisconsin—Madison, 
Madison, Wisconsin 53706. 
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shock and the conditioning of an instructed 
motorresponse. In both cases, a vasomotor 
OR appeared on initial pairings of condi- 


tioned stimulus (CS) and unconditioned -4 


stimulus (UCS), intensified on subsequent 
pairings, persisted until the conditioned 
response (CR) stabilized, and disappeare 
on later trials. After vasomotor conditi 
ing to shock was accomplished bth CS 
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and UCS elicited a stable vasomotor DR. systems and of using a res; ot «lel 
Since the instructed motor response situa- pendent on verbal instructi re ap 
tion did not employ an aversive UCS, a pear to be no studies of hum iwhich 


DR was not elicited and, after conditioning a cl illy conditioned s mota 
was stabilized, vasomotor responses dis- response and an OR-DR «d iati 
appeared entirely (Sokolov, 1963, pp. 172- autonomic response have bee: igat 
173, 185; Vinogradova, 1965). concurrently. This paradig: sed 
Findings obtained from these two para- the present study. The eye | spon 
digms constitute the primary evidence that to air puff was the to-be-con ed 
an OR is a prerequisite for normal con- sponse, and orienting was m ed 
ditioning and that the OR changes during heart rate (HR) deceleration. rt ral 
the course of conditioning. However, both deceleration appears to meet ti rite 


paradigms have limitations. Instructions of an OR and HR acceleration t) cit ork 
are a means of manipulating attention and of a DR (Graham, 1973; Jackso 1 O 72] 
may thus elicit a different OR pattern Since acquisition of a differenti CR | 
than would occur with the traditional con- less affected than a simple CR 

ditioning situation. With an autonomic eralized arousal effects (Prokasy, 1% 7 ) 
conditioning paradigm, the measures of eye blink response was differential v con 
OR and CR may not be independent. As  ditioned. While OR theory predicts rel 
Morgenson and Martin (1968) have pointed tively unambiguous OR changes in res v» 
out, when the same response measure to the positive CS (CS+) as condi : rif 
indexes both the OR and CR, relationships progresses, it is less clear what ch agesi 


between them may be due more to general the OR should be expected wit ^ neg 
responsiveness within systems than to any tive CS (CS—). If differentia! Jic con 
role of the OR in conditioning. Although  ditioning is to occur, both li tioned 
the occurrence of changes in the direction of stimuli should presumably elii rient ing 
cephalic vasomotor responses, dependingon and changes in orienting might “pected 
whether an OR or DR is elicited, may to follow a similar course, pos bl with 
exempt the Soviet studies from this less pronounced orienting and sore rap 

criticism, the existence of OR-DR differ- extinction to the nonreinforood ë CS 


entiation by vasomotor responses has not Sokolov (1960, p. 221) gives a0 « an ple of 
been unequivocally replicated by Western more rapid extinction of orien: ng te» 
investigators (Graham, 1973). In any case, negative stimulus, but the situation is 
the conditioning studies of Western investi- directly comparable to the present one si 
gators who have been concerned with OR differential conditioning was preceleg 
changes have mainly employed the galvanic simple conditioning. 
skin response (GSR) or the peripheral In addition to investigating HR Ch a ngi 
vasomotor response, neither of which dis- during differential eyelid conditioni xy g- +h 
tinguishes between ORs and DRs. These present study also compared these ch a 
studies are clearly vulnerable to the objec- with HR responses to the condition e«c4 aj 
tion of possible OR and CR confounding unconditioned stimuli when they were 
unless latency criteria are used to separate paired. Finally, the relation of orienti 
the responses and, even then, to the objec- to successful conditioning was SUD eg 
tion that any OR-learning association is selecting two groups of Ss who cli ¢yered 
due to the reactivity of the particular re- markedly in how well they achieved <q jffer 
sponse system. The objection is especially ential eyelid conditioning. If the O R 
pertinent to studies concluding that ORs, a role in facilitating acquisition, **« 346d" 
axed by the GSR on initial trials, pre- conditioners should show the exp ected 
ilitated GSR conditioning (eg., pattern in response to CS+, i.e., initially 
increasing HR decelerations which would 
habituate as differential eyelid cond à tį oning 
reached asymptote. "Poor" conditioners, 
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on the other hand, might show either un- 
usually large and persistent ORs or smaller, 
less stable ORs. The latter finding would 
be in line with Maltzman's conclusion that 
low-OR .Ss are poorer conditioners (e.g., 
Maltzman & Mandell, 1968). 


METHOD 


Subjects. Fifty undergraduates, receiving experi- 
mental “points” for their participation, served as Ss. 
Five additional Ss were tested and discarded in order 
to complete groups of Good and Poor differential 
conditioners. Of these five Ss, three failed to meet 
a criterion of producing at least five CRs, and two 
were tested to complete the Poor conditioning group, 
but showed good differential conditioning. Two 
other Ss were also discarded, one due to equipment 
failure and one for a heart anomaly. 

Apparatus, The subject-testing room contained 
an upright ophthalmologist's chair which faced a 
sereen displaying the fixation point—a black, cir- 
cular, cardboard disc 8.5 cm. in diameter. A rec- 
tangular patch of light (85 X 50 cm.), projected on 
the screen, provided the only illumination. 

Recording and stimulus generating equipment 
were located in a separate room having intercom 
communication with the S room. Two Hewlett- 
Packard ABR audio oscillators (Model 200 CDR) 
produced pure tones of 800 and 2,100 Hz. whose 
rise and decay times (710 us.) were controlled by a 
Grason-Stadler Model 829C electronic switch. A 
Grason-Stadler 901B noise generator provided con- 
tinuous background white noise which, with tones, 
was presented binaurally to S through Sharpe HA 
10A earphones. Intensity of tones and noise w: 
determined from matches of four judges who ad- 
justed the earphone-presented noise to the loudness 
of free-field white noise, measuring 65 db. re .0002 
microbars on the C scale of a General Radio 1551C 
sound-level meter, and adjusted the earphone- 
presented tones-over-noise to the loudness of a 
1,000-Hz. free-field tone 85 db. above the judges' 
mean threshold. Stimulus durations, interstimulus 
intervals, and intertrial intervals (ITIs) were con- 
trolled by Tektronix timing equipment and an 
Ohrtronics tape reader, Tones were automatically 
triggered on the positive slope of the first electro- 
cardiographic R-wave occurring after the ITI had 
elapsed, 

Movement of the .S's right eyelid was recorded 
using a standard potentiometer-polygraph ap- 
paratus (Wilcox & Ross, 1969) and heart rate, 
detected by a lead II placement of Beckman bio- 
potential electrodes, was tape recorded and com- 
puter read. 

Design and procedures. Five treatment groups of 
five males and five females each, to which Ss were 
assigned randomly until full, included two differ- 
ential eyelid conditioning groups (Good and Poor 
conditioners) and three control groups (Unpaired, 
Two-Tone, and One-Tone). Subjects retained in 
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*/o DIFFERENTIAL RESPONSE 


Distribution of differential eyelid 
conditioning performance. 


FIGURE 1. 


the Good group showed 55% to 90% differential 
conditioning on the last half of the conditioning 
trials (percentage CRs on the last 20 CS+ trials 
less percentage CRs on the last 20 CS— trials), 
while Ss in the Poor group ranged between —15% 
to 25%. The distribution of percentage difference 
scores for all Ss tested, including the two discarded 
Good conditioners, appeared to be bimodal, as 
illustrated in Figure 1. 

Conditioning Ss received 40 CS+ and 40 CS— 
trials with an ITI of 18 to 30 sec. from CS onset to 
CS onset. A Gellerman presentation order, repeated 
every 20 trials, provided that each block of 10 trials 
contained equal numbers of CS+ and CS— trials 
and that neither stimulus occurred more than three 
times in succession. The UCS was a 100-msec. 
puff of .75 psi compressed air delivered to S’s right 
cornea and coterminous with the 900- tone 
CS+. Six of the Good conditioners received a 
2,100-Hz. CS+ and an 800-Hz, CS— and four re- 
ceived the reverse pattern. In the Poor condition- 
ing group, four Ss had the higher frequency tone as 
C84. 

Three other groups provided control comparisons, 
An Unpaired group received 40 tones and 40 un- 
paired puff presentations with the temporal schedul- 
ing identical to that of puff and CS— in the con- 
ditioning groups. Procedure for a Two-Tone group 
also retained the order and temporal intervals of 
trials in the conditioning groups but omitted the air 
puff. A sensitization c ntrol procedure with random 
interspersion of the UCS among both tones was not 
feasible since the mean intertone interval would have 
had to be lengthened to allow time for HR response 
recovery. A third control group, the One-Tone 
group, received only 40 tones on the same schedule 
as CS— in the paired group, and thus had an ITI 
ranging from 36 to 60 sec. Frequency of the tones 
was balanced within each group. 

Instructions to all Ss indicated that reactions “to 
certain distracting stimuli" were being investigated 


566 


and that heart rate, along with other responses, 
would be recorded.  Eyelid responses were not 
mentioned explicitly and, as the eye blink recording 
headset was positioned, 5 was informed only that it 
would be used to record "responses." The S was 
cautioned to sit quietly and was warned that re- 
ceiving credit for participation was contingent on 
remaining awake. 


RESULTS 


Eyelid conditioning. Percentage respond- 
ing to CS+ and CS— is shown in 5-trial 
blocks in Figure 2. Consistent with pre- 
vious work in this laboratory, a CR was 
defined as the first pen deflection of at 
least 2 mm. (1 mm. eyelid closure) occurr- 
ing in the interval from 200 msec. after CS 
onset until UCS onset. Good conditioners 
showed asymptotic responding to CS+ 
(96%) by Trial Block 5 (Trials 41-50) and 
achieved considerable differential respond- 
ing (64%) by Trial Block 3 (Trials 21-30). 
Asymptotic differential responding of ap- 
proximately 75% was reached by Trial 
Block 4. Poor Ss, on the other hand, 
showed only a small increase in responding 


loo- GOOD 


PERCENTAGE CRs 


(0) 
frials | 


CT TNT, ak ee I A: 
182 


TRIAL BLOCKS 


Fictre 2. Percentage of conditioned responses 
for Good and Poor conditioners in blocks of five 
CS4- and five CS— trials. 
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between Trial Blocks 1 virtua 
no differential responding cal te 
over the last three trial i on firme 
that Good Ss responded ily mg 
to CS+ than to CS— 96, p 
.001, that Poor Ss did no ] diff 
entially, £ (9) = 2.18, p > í that 
differential responding of 1! d grol 
was significantly better thai t of Qj 
Poor group, £ (18) = 6.88, p OL. 
Good-Poor differences could be í 
cribed to differing numbers of .5- «lassi 
as voluntary responders since « assifi 


tion, according to the Hartman and 
(1961) criteria, indicated that there Wt 
four voluntary responders in the Good aj 
three in the Poor group. 


ditioning and control groups, The diff 
ential conditioning procedure ) pi 
duced highly significant HR 
which were time locked to th: 
both CS+ and CS— in both ood al 
Poor conditioning groups. Thes 
averaged over all trials, are sho 
3 for differences between HR curing W 
first second before stimulus onset and 
on each of 6 sec. following onset. TM 
l-sec. averages are the bpm rate duri 
all full and partial beats, weighted accore 
ing to the fraction of the second occuple 
by each. A 6-sec. poststimulus period wa 
selected for illustration and analysis $i 
it was the longest period during whic 
single trend accounted for the major pa 
of the variance and because different 
from prestimulus were rarely reliable W 
yond this point (Putnam, 1972).  Diflé 
ences in prestimulus level, between group 
were not significant and did not change di 
ferentially across the experimental session 

As Figure 3 illustrates, HR respons 
during the 6-sec. poststimulus period W! 
generally triphasic with an initial decelera 
tion lasting for 2 sec., a subsequent phase 
of acceleration also lasting for 2 sec., anc 
a slower recovery phase. 1n the contro 
groups, the accelerative phase did not al 
ways increase HR beyond prestimulus, and 
the response was, on the whole, less regula 
and less pronounced. A comparison Ol 
responses to “CS—" tones showed tha 
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Ficurg 3. Mean heart rate (HR) responses to CS — and to CS+ alone, 


puff alone, or paired CS+ 
differences among the five groups.in over- 
all cubie trend were significant (Table 1, 
Col. A) and were largely due to the paired- 
control comparison. The trend in each of 
the control groups, compared individually 
with. the conditioning groups, was also 
significantly different but the controls did 
not differ from one another. When the 
three response components were tested 
separately by measuring changes over the 
first 2 seconds, over Seconds 2 to 4, and 


over Seconds 4 to 6, differences among 


and puff, for each paired and control group. 


groups and between paired and control 
groups were significant during the initial 
deceleration (Table 1, Col. B) and the 
following acceleration (Table 1, Col. C) 
but not during the recovery phase (Table 1, 
Col. D). Each of the control groups, con- 
sidered separately, also differed from the 
conditioning groups in the degree of accel- 
eration, but only the Unpaired group differed 
in initial deceleration. Again, there were 
no significant differences among control 


groups. There was also no significant dif- 
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ference among groups in the changes across CS+ and CS— and which did :abit- 
trials, pooled into blocks of eight trialseach. uate. The relative stability ol nses 

The relatively large accelerations to CS— to CS—, as well as the consist: roup 
in the conditioning groups were unexpected. difference in magnitude of decele n, is 
Since the acceleration was significantly illustrated in Figure 4. In con! re- 
larger than in the control group given both sponses to CS+ (Figure 5) shoy con- 
air puff and tone unpaired, sensitization siderably more variation across tri cks 
could not account for the full effect. although they were clearly less i lar 
Further, the conditioning and the un- than responses to puff in the l red 
paired groups also differed in their response group. When decelerative respo! to 
to the puff. When puff was unsignaled, both CS+ and CS— and to each sti: lus 
it elicited a significantly smaller accelera- alone were analyzed for changes 088 
tion than when it was preceded by CS+, blocks of eight trials, the differences be- 


F (1, 27) = 11.34, p < .0t. 

Comparison of heart rate responses of Good 
and Poor conditioners. While the HR re- 
sponse during conditioning differed from 
control group response both in initial 
deceleration and in the subsequent accelera- 
tion, significant differences between Good 
and Poor conditioners were confined to the 
initial 2-sec. period of deceleration which 
presumably reflects orienting (Table 1). 

Poor conditioners were not deficient in 
orienting but, rather, they showed large, 
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tween Good and Poor conditioners wer: 
flected in a significant Groups X Stimul; X 
Linear Trials effect when both stimuli w: -e 
included in the analysis (Table 2, Col. . ) 
and in a significant Groups X Linear Triais 
effect for response to CS4- (Table 2, Col. 
B). Only the Groups effect was significant 
in response to CS— (Table 1, Col. C). 
'The finding of greatest interest was ! 
change in the decelerations elicited 
CS+. Both Good and Poor conditio: 
showed a similar increase in deceler 


e- 


stable ORs which did not differ following over the first three trial blocks but, on | ‘er 
TABLE 1 
‘Tests or DIFFEREN| è HEART RATE RESPON F Two PAIRED (PR) 
Cow'rROoL (C) Groups 
s ae | A. Cubic C. Acceler- | 
Source of variation df | trend | celera ation | D.R ry 
F ratios 
Groups (G) 4 | 7.4** 4.1** | 9.5** 13 
Pr-C 1 26.2** 6.3* |  3à5* 1 
Among C 2 al | 3 B 6 1.5 
Good-bad 1 32 999 5 | 30 aig 9 
Nonorthogonal | i . 
Group comparisons (cfs) | | | | 
Pr-unpaired 1 | | 
Pr-1 tone 1 | ane: | : 
| 20. 7 
1 | | 21.5** 3.9 
z | 1.0 i3 
Tee] | | 8 2 
16 13 9 
4 | I ns ns 
Xe CEA EN Error mean squares 
Ss/G 45 = |: CSRS E 
pana 38.7 3 2.8 
EX tna RNS eee EE 
BE Anez 5 $ ^ 
3 near 45 12.5 | 54 5.0 
Note, Abbreviation: CS— — negative conditioned stimulus, AC NE x 
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GOOD CS- 


HR DIFFERENCE (bpm) 
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POOR CS- 


Pre- +35, +6 Pre- +3 +6 
SECONDS 
Figure 4. Heart rate (HR) responses of Good and Poor conditioners to 


CS— in five trial blocks of eight trials each. 


trials, Poor conditioners continued to give 
large decelerations while the response 
showed habituation in the Good condition- 
inggroup. These changes can be seen more 
clearly in Figure 6, which also shows the 
average decelerative résponse of those 
Poor conditioners who avoided the puff on 
at least 18 of the final 20 CS+ trials by 
giving a CR coincident with the UCS. Al- 
though there were only four subjects in this 
category, their failure to show any habitua- 
tion seems to rule out the possibility that 
ORs of Good conditioners habituated be- 
cause they were able to avoid the puff. 
The temporal correspondence between ori- 


enting and conditioning is illustrated in 
Figure 7 which shows the differential eyelid 
responding of Good conditioners superim- 
posed on their HR deceleration data. It is 
evident that the period of maximum de- 
celeration corresponds roughly to the 
time when differential eyelid conditioning 
reached an asymptote. 

For comparison with studies that have 
described the learning behavior of Ss 
categorized as high and low orienters on 
the basis of some measure of initial orient- 
ing, such as response to the first CS, differ- 
ential conditioning scores of all Ss were 
correlated with deceleration to the first 


HR DIFFERENCE (bpm) 


FiGURE 5. 
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8-trial 
blocks 


SECONDS 


Heart rate (HR) responses of Good and Poor conditioners to CS+ 


and of Unpaired Ss to air puff in five trial blocks of eight trials each. 


CS— and to the first CS+. Neither cor- 
relation was significant and they differed in 
direction (Pearson product-moment r = 
— .30 for first CS— and r = + .36 for first 
CS+): 

DISCUSSION 


The present study of the relation between 
orienting, measured by HR deceleration, and 
learning, measured by acquisition of a differ- 
entially conditioned blink response, confirms 
the descriptions of Soviet investigators (e g., 
Sokolov, 1963; Vinogradova, 1965). The OR 
intensified during acquisition and habituated 
as conditioning stabilized. Similar iesults 
have also been reported in a recent study of 
HR changes accompanying conditioning of the 
rabbit's corneoretinal potential (Powell, 1972). 
Since the Soviet studies providing the best 
empirical basis for the OR-learning relation 
either conditioned an autonomic response and 
thus confounded OR and CR measures, or 
employed an instructed motor response and 
thus might have imposed attentional require- 
ments specific to a "voluntary" task, the 
present results place on a firmer basis the 


generalization that during learning 
first increases and then declines while a 
response increases and stabilizes. 
rise and fall during learning is also of inter 
because it suggests that the common finding 
of similar inverted-U functions in autono 
conditioning studies may reflect changes in 
orienting rather than in a learning process 
Two other findings of the present study 
that Ss failing to condition had large, stabi: 
ORs and that HR accelerations to the UCS 
were larger to a signaled than to an unsignaled 
UCS, stand in contrast to most previous work. 
The existence of larger ORs in Poor than in 
Good conditioners, both on the first trial and 
throughout conditioning, is compatible with 
Soviet versions of OR theory which emphasize 
that very marked orienting, as well as very 
weak, interferes with conditioning, but the 
finding does not agree with Western versions 
that propose only a positive relation between 
orienting and conditioning (e.g., Maltzman, 
1967). Maltzman views the evidence of a 
positive relation and the absence of evidence 
of any interfering effects of high OR on learn- 
ing as important for the argument that orient- 
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TABLE 2 


Tests or DIFFERENCES IN THE DECELERATIVE 
Heart RaTE RESPONSES OF GOOD AND Poor 


CONDITIONERS 
: T A. CS4- NM E: 
Source of variation af | and cet. | cs- 
F ratios 
Groups (G) } 4} 34 | 8 | 64* 
Stimuli (S) 1 at =f er 
* Trials (T) 2 1-3:1* 6:51*- 1 1:0 
Linear T PARAS 9.5** |. 0.0 
Quadratic T | 1| 48* | 20.5** 9 
GXT | 4| 13 Qaid p se 
G X Linear T 1| 3.8 6.0* | 1 
GXS | 1| 34 — |— 
SXT 4| 46** — — 
5 X Linear T 1 6.4* 
5 X Quadratic T 1| 9.0** | 
GXSXT 4| L5 — = 
G XS X Linear T 1| 47* 
Error mean squares 
[1| 42.2 | 22.8 
x5 18| 7.8 — 
XT 72| 44 5.5 4.6 
X Linear T 18| 54 7.5 3.0 
G X Quadratic T |18| 3.4 3.2 6.3 
GXSXT A5 - — 
Ss/G XS X Linear T |18| 5.4 — 


Note. Abbreviations: CS+ = positive conditioned stimu- 
lus, CS— = negative conditioned stimulus, 

*p «.05. 

* p «.01. 


ing responses are not measuring a generalized 
arousal or drive state. The argument is 
reasonable but it is questionable that the 
previously existing empirical data permit a 
distinction between arousal and OR effects on 
learning. 

A major flaw in the evidence for a solely 
positive OR-learning relation not due to 
arousal is that a number of studies have classi- 
fied Ss into high- and low-OR groups on the 
basis of only a single trial which has sometimes 
been the first habituation and sometimes the 
first UCS trial (e.g., Maltzman, 1967). As 
Zeiner and Schell (1971) point out, the re- 
sponse to a noxious UCS qualifies as a DR. 
Base levels of skin conductance have also been 
employed (Maltzman & Mandell, 1968) as a 
measure of the tonic OR but might equally 
well be considered a measure of arousal 
(Malmo, 1959). A second major problem 
with the evidence is that studies have not 
adequately controlled for general reactivity 
effects, Many have measured CR acquisition, 
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heart rate (HR) deceleration (prestimulus HR less 
Second-2 HR) of Good conditioners, Poor condi- 


tioners, and Poor avoiders. 


which is more likely than differential condition- 
ing to reflect arousal. Even differential con- 
ditioning may not rule out activation effects if 
OR and CR are measured by the same re- 
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between differential eyelid responding of Good 
conditioners and their peak heart rate (HR) de- 
celerations to CS+. 
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sponse system and there are Ss who fail to 
respond to both CS+ and CS— in a given 
trial block. In such cases, there is no measure 
of whether discrimination is or is not present 
and their inclusion as instances of no dis- 
crimination artifactually lowers the discrimi- 
nation score, Since low-OR Ss are more likely, 
on the basis of a response reactivity factor, 
to have no-response trials, reactivity could 
thus affect the OR-differential conditioning re- 
lation as well as the relation of orienting to 
CR acquisition, Zeiner and Schell (1971) 
provide an example of this in showing that a 
positive relation between a GSR-OR and 
differential conditioning of first-interval GSR 
responses disappeared when discrimination was 
measured only on trials yielding some nonzero 
response, 

While much, if not all, of the data pur- 
portedly showing a positive OR-learning rela- 
tion may thus reflect the effects of a DR, 
arousal, or specific response reactivity rather 
than an OR, these effects could not have pro- 
duced the OR-learning relation of the present 
study since the OR was measured by HR 
deceleration, which is not a measure of arousal, 
and learning was measured by the differential 
conditioning of a response in a different system. 
However, findings of a significant negative 
relation between the OR and learning appear 
to be nonexistent in the Western literature, 
except for the present study, and it is possible 
that there are critically important differences 
between how orienting affects processes de- 
veloping during conditioning with a short as 
opposed to a long CS-UCS interval. 

Paradigm differences may also be important 
in accounting for the second discrepant finding 
of the present experiment, that HR accelera- 
tions were larger in response to a signaled 
than to an unsignaled UCS, A number of 
studies, generally measuring GSR, have found 
that unconditioned response (UCR) amplitude 
is smaller, not larger, in response to a signaled 
UCS (e.g., Kimble & Ost, 1961; Kimmel, 1967; 
Peeke & Grings, 1968). Since one interpreta- 
tion of UCR diminution is that a response to 
a signaled UCS “is due to a carryover of the 
response processes begun during the inter- 
stimulus interval [Grings, 1969. p. 607]," it 
might be thought that while a unidirectional 
response like GSR would be interfered with 
by a prior response, a bidirectional system 
like HR might be enhanced, if only by virtue 
of an initial level effect. A recent report of 
UCR diminution with HR as the response 
measure (Lykken, MacIndoe, & 


Tellegen, 


1972) makes it unlikely that the explanation 
lies solely with GSR-HR differences, but an 
alternative is suggested by considering differ- 
ences in the length of the CS-UCS interval, 
The Lykken et al. (1972) study, in common 
with GSR studies of UCR diminution and 
many OR-learning studies, employed a shock 
UCS and a relatively long CS-UCS interval, 
initiated 5 sec before shock onset by the first 
of two successive CSs, In contrast, the present 
study employed a CS-UCS interval of only 
800 msec. which would require that any 
preparatory processes be mobilized very 
quickly. Just what this implies for HR 
changes is problematic but there is evidence 
that the cardiovascular system participates 
both in orienting and in preparations for 
action and that preparations for action result 
in HR acceleration (e.g., Chase, Graham, & 
Graham, 1968), Since HR reflects only one 
aspect of cardiovascular preparations, is slow 
relative to skeletal responses, and shows lag 
as a result of feedback to the heart from blood 
pressure, volume, and Os changes, HR changes 


associated with mobilizing for rapid action 
would be expected to outlast the action and 
thus might summate with the UCR. This 
hypothesis, of an interaction between length o! 


the -UCS interval and the type of UCR 
measured, receives support from another recent 
study (Furedy & Klajner, 1972) which al« 
employed a short CS-UCS interval (500 mse 
and also obtained a larger UCR toa signale: 
UCS when the response was a cardiovascula 
change, peripheral vy somotor constriction 
There was no signaling effect when GSR wa 
measured. 

The argument, then, agrees with Grings 
(1969) that the signaled UCR reflects a carry 
over of response processes begun with the CS 
but suggests additionally that whether the 
UCR is increased or decreased depends upor 
the length of the CS-UCS interval and the 
particular response that is measured. These 
same factors may also determine whether an 
orienting response, with which brief suspen- 
sion of movement is associated (Obrist, Webb, 
& Sutterer, 1969) interferes with or facilitates 
the occurrence of a particular CR. With 
short CS-UCS intervals, a prolonged, pro- 
nounced OR may not be required in order to 
establish an association between two stimuli 
in close temporal proximity and, if it doesoccur, 
might be expected to interfere with execution 
of a skeletal response such as blink. With 
longer CS-UCS intervals, prolonged orienting 
may be more important in establishing an 
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association between stimuli more widely 
separated in time and even if it were very 
pronounced would not interfere with an auto- 
nomic CR which was itself part of the orienting 
complex. 
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PRAGMATIC NORMALIZATION : 


FURTHER RESULTS FOR SOME CONJUNCTIVE 
AND DISJUNCTIVE SENTENCES ! 


SAMUEL FILLENBAUM 2 
University of North Carolina, Chapel Hill 


On a paraphrase task involving pragmatically extraordinary sentences Ss nor- 
malized two sorts of disordered conjunctive sentences as well as disjunctive 
sentences involving "perverse" threats more than half the time. Later when 
comparing their paraphrases with the source sentences they were often unable 


to detect differences in meaning. 


These results were interpreted in terms of a 


basic assumption made by many Ss that messages are sensible, that what is 
described in messages must conform to the customary order and connection 
among events, and that apparent anomalies can be attributed to inadequacies 


in the encoding of the messages, 


The tendency to normalize connected dis- 
course or stories in memory has been well 
known since at least the time of Bartlett 
(1932). It may be the case that such 
normalization involves not so much or not 
only memorial processes, but may largely 
be a consequence of the way in which 
strange material is initially interpreted and 
stored, such that extraordinary descriptions 
are transmuted into ones more mundane 
and commonplace, given knowledge of the 
ways things usually transpire, and a strong 
presumption of the sensibleness of the ma- 
terial encountered. The present research 
constitutes a replication and extension of 
part of an earlier study (Fillenbaum, 1971) 
in which it was found that on a paraphrase 
task verb order was often permuted for 
disordered conjunctive sentences, normaliz- 
ing them and assimilating the order of 
events described to conventional order. It 
is an extension in two regards: (a) in that 
(7) a new kind of ordered conjunctive sen- 
tence is examined in addition to the kind 
studied before, and (i) a class of strange 
disjunctive sentences ("perverse" threats) 
is also studied, and, perhaps more im- 

! This work was supported in part by U.S. Public 
Health Service Research Grant MH-10006 from the 
National Institute of Mental Health. The writer 
is indebted to R. Hall for his aid in gathering and 
analyzing the data. 

* Requests for reprints should be sent to Samuel 
Fillenbaum, University of North Carolina, Psycho- 
metric Laboratory, Chapel Hill, North Carolina 
27514, 


portant, (b) in that we also s 
determine whether S could detect : 
changes in his paraphrases, by havin 
review each of his paraphrases at the en 


and having him say whether there was any 
difference in meaning between the oi iginal 
Sentence and his paraphrase, and, if so, 
to indicate what sort of difference was 
involved. 
METHOD 
There were 40 sentences to be paraphrased. There 


were 5 filler items at the start and then, in ran- 
domized order (a) 10 ordered conjunctive sentences 
in normal order (e.g., "John got off the bus and 
went into the store"), (6) 10 ordered conjunctive 
sentences with normal order reversed, these sen- 
tences being extraordinary but semantically co- 
herent (e.g., “John dressed and had a bath"), (c) 4 
ordered conjunctive sentences with normal order 
reversed, these sentences violating temporal entail- 
ment relations between their main verbs (e.g. 
“John finished and wrote the article on the week 
end”), (d) 5 disjunctive sentences involvi 
monplace threats (e. "Stop that noise or call 
the police"), (e) 5 disjunctive sentences involving 
conditional threats that were “perverse” in that the 
consequent clause indicated that the speaker would 
not bring about some undesirable event commonly 
to be expected in the circur stances (e.g., “Don't 
print that or I won't sde lou», plus a single 
"perverse" warning, the sentence “Get a move on 
or you will catch the bus.” 

Each of the 51 5s, who were college students 
meeting a class requirement for an introductory 
psychology course, was run individually being given 
a deck of 3 X 5 in. cards each containing one of the 
Sentences to be paraphrased, and a blank deck for 
writing down his paraphrases. The instructions, 
which were the same as the version two (strong) 
instructions used in the earlier study (Fillenbaum, 
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1971), stressed that the task was to paraphrase or 
rephrase “the sentences as accurately as you can, 
conserving meaning as completely as possible" and 
that S was not “to improve the sentences or make 
them more sensible, but to paraphrase them, re- 
wording each in a way that captures its meaning as 
accurately as possible." After S had finished the 
paraphrase task at his own pace he was told that 
he would have to go through his paraphrases, that he 
would be asked "about the remaining differences 
IF ANY" between each sentence and its paraphrase, 
and that if he did "see some shred of difference” to 
say what seemed to be involved. For each item S 
was then shown the original sentence and his para- 
phrase and asked ‘Is there any difference in meaning 
between the sentence and your rewording?” and if 
this question was answered “yes,” S was asked 
“What sort of difference?” After running through 
all the items in this fashion if S had actually changed 
the meaning of the sentences in his paraphrases and 
noticed this at all during the review of the items 
(which was the case for most $s), he was confronted 
with this fact, reminded of the instructions to pre- 
serve meaning, and asked “How come?" 

The analy of paraphrases was relatively 
straightforward. For each conjunctive sentence 
one could determine whether or not the order of 
verb phrases had been maintained. Change in the 
order of verb phrases for the disordered conjunctive 
sntences almost always resulted in normalizing the 
sentences. In addition there were occasional para- 
phrases which did not change the unusual order of 
verbs but which elaborated or emended the sentences 
in such fashion as to turn them into sensible, normal 
sentences. The disjunctive sentences, threats, were 
paraphrased in various ways perhaps most com- 
monly as conditionals. One could generally un- 
ambiguously determine whether or not these para- 
phrases normalized meaning. Thus, for example, 
given “Clean up the mess or | won't report you," 
both (a) “If you don't clean up the mess I'll report 
you" and (b) "If you clean up the mess 1 won't 
report you" may be regarded as changing and 
normalizing the original sentence, while both (c) 
“Tf you clean up the mess I'll report you” and (d) 
“If you don't clean up the mess I won't report you" 
may be regarded as c rving some of the strange 
properties of the original sentence. 

The judgments Ss made on reviewing their para- 
phrases, as to whether or not these differed in mean- 
ing from the original sentences, could be coded 
simply as “different” or "same," The observations 
Ss made when they did judge a paraphrase to be 
different from its source sentence, and the various 
things said by way of explanation or self-justifica- 
tion at the end, when confronted with the fact that 
their paraphrases had often involved meaning 
changes, were found to be codeable in a limited 
number of categories (see results below). 


SION 


RESULTS AND DISC 


For normally ordered conjunctive sen- 
tences meaning was preserved in 99% of the 


paraphrases (with 90% judged same in 
meaning and 9% different in meaning) and 
was perverted in 1% of the paraphrases 
(with all of these judged different in mean- 
ing). For disordered conjunctive sentences 
violating a contingent relation meaning was 
preserved in 36% of the paraphrases (with 
31% judged same in meaning and 5% dif- 
ferent in meaning) and normalized in 64% 
of the paraphrases (with 22% judged same 
in meaning and 42% different in meaning). 
For disordered conjunctive sentences violat- 
ing an entailment relation meaning was 
preserved in 30% of the paraphrases (with 
28% judged same in meaning and 2% 
different in meaning) and normalized in 
70% of the paraphrases (with 43% judged 
same in meaning and 27% different in 
meaning). For normal conditional threats 
meaning was preserved in 99% of the para- 
phrases (with 95% judged same in meaning 
and 4% different in meaning) and perverted 
in 1% of the paraphrases (with all of these 
judged different in meaning). For "per- 
verse" conditional threats meaning was 
preserved in 46% of the paraphrases (with 
40% judged same in meaning and 6% 
judged different in meaning) and nor- 
malized in 54% of the paraphrases (with 
34% judged same in meaning and 20% 
different in meaning).* 

It is clear that ordered conjunctive sen- 
tences in normal order and normal con- 
ditional threats are hardly ever perversely 
paraphrased, and that in each case para- 
phrases that preserve meaning are over- 
whelmingly regarded as doing just that. 
For these sorts of sentences when Ss claim 
that paraphrases change meaning the most 
common ground given is that change in a 
particular single word led to a change in 
sentence meaning. Results for the two 
sorts of disordered conjunctive sentences 


3 These results, insofar as comparable, are very 
similar to those obtained earlier (Fillenbaum, 1971) 
where meaning was preserved in 96% of the para- 
phrases of normally ordered conjunctive sentences, 
but for only 39% of the paraphrases of disordered 
conjunctive sentences violating a contingent rela- 
tion. In a previous (unpublished) study it was 
found that paraphrases preserved meaning for 99% 
of normal threats, but for only 43% of "perverse" 
threats. 
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and for the "perverse" conditional threats 
are quite different. For the disordered 
conjunctive sentences 64% (contingent 
relation violated) and 70% (entailment 
relation violated) of the paraphrases in- 
volve normalization, while for the "per- 
verse" conditional threats 54% of the 
paraphrases involve normalization (the 
results for the single “perverse” warning 
are roughly similar to those for "perverse" 
threats and will not be mentioned further). 
The differences in normalization for the 
three kinds of Strange sentences do not 
reach significance when proportions of nor- 
malization of each sentence kind are con- 
trasted for each S by means of sign tests. 
Turning next to j udgments of whether or 
not paraphrases changed the meaning of 
the various kinds of Strange sentences, we 
find that for disordered conjunctive sen- 
tences where a contingent relation between 
verbs has been violated normalizing para- 
phrases are detected almost twice as often 
as not (the figures are 42% and 22%), 
while for disordered conjunctive sentences 
where an entailment relation between verbs 
has been violated normalizing paraphrases 
are detected less often than they are over- 
looked (the figures are 27% and 43%). 
The difference in detection rates is signifi- 
cant (p < .01) when proportions of detec- 
tion of normalizing changes for the two 
kinds of sentences are contrasted for each 
5 by means of a sign test, indicating that 
Ss are responsive to differences in the 
semantic properties of these two kinds of 
disordered sentences even though the actual 
percentages of normalizing paraphrases are 
quite similar (64% and 7095). For 
"perverse" threats we find that paraphrases 
that normalize the source sentences are de- 
tected less often than they are overlooked 
(the figures are 20% and 34%), and that 
the detection rate for these normalizing 
changes is significantly less than that for 
normalizing changes for disordered con- 
junctive sentences violating a contingent 
relation (p < :05), and no different from 
that for normalizing changes for disordered 
conjunctive sentences violating an entail- 
ment relation. Giyen considerable con- 
ceptual differences between the disordered 
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conjunctive sentences and the strange d 
junctive sentences it is difficult to say any 
thing as to the interpretation of differenc. 
in detection rate for normalizing changes 

Consider next what Ss s y is involve 
when they do detect differences betwee: 
strange source sentences and their para 
phrases of them. For disordered conjunc- 
tive sentences of the two kinds studied we 
find that the great majority of comments 
refer to order change, change in temporal 
sequence, and change in causal sequence, 
with 85% of the comments for sentences 
where a contingent relation is violated and 
65% of the comments for sentences where 
an entailment relation is violated falling 
into these three categories. These com- 
ments are quite realistic for it is just these 
sorts of changes that have occurred in the 
meaning normalizing paraphrases. If we 
consider the comments made when para- 
phrases of "perverse" threats are judged to 
be different from the source sentences, we 
again find some use of the above three 
categories but now the most common obser- 
vation is that the paraphrase is a sentence 
opposite in meaning to its source. Again 
such comments are quite realistic, for 
normalization of perverse" threats charac- 
teristically involves just that, see, e.g., the 
difference between the Source sentence 
"Clean up the mess or | won't report you” 
and a paraphrase such as "If you clean up 
the mess I won't report you,” 

The results just presented raise two 
questions: (a) If Ss can detect differences 
between their paraphrases and the source 
Sentences with regard to such significant 
matters as changes in temporal and causal 
order and changes which result in opposite 
meaning, then why did they provide such 
misleading paraphrases, given strong in- 
structions to preserve and conserve mean- 
ing and warnings against attempting to 
improve or make the sentences more sen- 
sible? (5) If Ss cannot detect such sub- 
stantial differences between the source 
Sentences and their paraphrases then what 
might be responsible for such obtuseness or 
blindness ? What is empirically true, and 
Presupposed in both of the above ques- 
tions, is the fact that extraordinary sen- 
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tences are often pragmatically normalized 
in the paraphrases, and one needs to ask 
(c) why this should be so. 

It seems plausible that the same general 
answer holds for all three questions. Even 
in the peculiar circumstances of the psycho- 
logical laboratory Ss seem to be acting on 
the basic assumption that what is described 
in discourse will be sensible, that what is 
described will conform to the customary 
order of events and will satisfy normal 
qualitative and causal relations. between 
events or actions. If a sentence is en- 
countered which appears to violate this 
assumption one may consider it to be a 
clumsy, inadequate phrasing or descrip- 
tion, and turn it into a more appropriate 
version in one's paraphrase. Thus what 
is taken as awry or extraordinary is not the 
world, but the linguistic account of it. As 
to why Ss should have provided normaliz- 
ing paraphrases when they could detect 
significant differences between these and 
the source sentenc it may be that as 
far as Ss are concerned a difference de- 
tected is not so much one between descrip- 
tions of two different sorts of events as one 
between two different descriptions of the 
same event, with the paraphrase expressing 
properly what is intended and badly ex- 
pressed by the original sentence. Finally, 
why might Ss have failed to notice differ- 
ences between the source sentences and 
their normalizing paraphrases? This may 
just constitute further testimony to the 
strength of the basic assumption, such that 
the original “malphrased” sentences and 
their own "improved" versions are really 
regarded as amounting to the same de- 
scription of sensibly ordered and organized 
events. 

The above suggestions while speculative 
find support in the comments Ss made at 
the end, by way of explanation and justifi- 
"ation for their paraphrases. Of the 51 Ss 
10 or 11 hardly ever changed meaning in 
their paraphrases, so we need to consider 
only the responses of the other 40 Ss.‘ 

4 Tn the earlier study there were consistent in- 
dividual differences, with some Ss normalizing the 
majority of the disordered sentences and others 
hardly normalizing any (Fillenbaum, 1971). In 
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Comments which claimed (a) that the 
paraphrases made things clear and more 
sensible, (b) that the paraphrases put things 
into natural order, (c) that the original 
sentences violated expectancies, (d) that 
the original sentences were illogical, and 
(e) that they knew what the original sen- 
tences were trying to say and so they said 
it, account for the responses of 25 of these 
Ss. All of them appear to be claiming, 
in one way or another, that their para- 
phrases somehow put more adequately 
what was badly phrased in the original 
sentences. In addition there were 7 Ss 
who said that they misread or read in- 
correctly the original sentences, which 
suggests, if their comments are taken at 
face value, that they may not even have 
noticed the strange properties of the orig- 
inal sentences, and that in the very act of 
reading and coping with them they cor- 
rected or normalized these sentences. Thus 
for 32 of the 40 Ss there is some evidence 
for the operation of the assumption that 
what is described in discourse must be 
sensibly organized. One further observa- 
tion is worth reporting, almost every one 
of the 40 .Ss who was trying to explain 
or justify his (normalizing) paraphrases 
seemed in part inarticulate, and was con- 
siderably flustered. It seemed as though it 
was somehow very important for Ss to ad- 
just things to conventional order and se- 
quence, and to exhibit the normal relations 
among events, so as to say what the speaker 
of the source sentence must have intended 
or meant. 

People may make the basic assumption 


that sentences are sensible just because they 
believe that the events of which such sen- 
tences purport to be accounts occur or are 
organized in characteristic ways, and that 
it is much more likely that the world that 
is being described is the commonplace one 
and that the description is burdened with 
the errors and malapropisms of the speaker 
than that extraordinary events are being 


the present study similar results were obtained both 
with regard to whether or not strange source 
tences were normalized and with regard to detection 
of such changes. 


accurately and faithfully presented. It is 
as though people focus not on linguistic 


E messages per se, but on the information 
they embody or appear to convey, consider- 
E and assimilating this information in re- 
. lation to their preexisting knowledge of the 
J ways of the world. 
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MODALITY DIFFERENCES IN RECOGNITION MEMORY 
FOR WORDS AND THEIR ATTRIBUTES ' 
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The S» were given a continvoustrial mixed-modatity recognition test. In 


Experiment |, 20 Se gave recognition responses for whi 
tion time data were collected. In Experiment H1, 20 new Se gave 
responses and, in addition, reported modality of presentation for 
Intramedality recognition performance war superior to int 
‘This advantage tended to lavor audite qun visual 
short and long retention intervals, respectively. ^. 


formance 


ich accuracy and reac- 
thon 
items. 
per 
at 


modality report da 


ta 
indicated a high level of retention for the between-modal attribute as 
compared with similar experiments involving within-modalities attributes. 


The results were t 
of recognition facilitat 


When a person hears or sees à word, one 
possible assumption is that the word is 
immediately translated. into its semantic 
form. That is, the meaning of the stimulus 
is extracted and retained while normally 
redundant information about the physical 
characteristics of the stimulus are lost. 
From the viewpoint of a model of memory, 
it is of interest that information concerning 
the physical nature of the stimulus may be 
retained for a period of time apparently 
beyond that which is necessary for semantic 
analysis, ‘The present study was imple- 
mented with a view to (a) establishing the 
longevity of this representational storage 
and (b) gaining some insight into the coding 
processes involved in the retention of the 
representational as distinet from semantic 
attributes of verbal stimuli. 

= The present study used à modification 


Teghtsoonian (1961). In their procedure, 


wehbe the autho 


ht to be consistent with à 
and the modality report data, 


tational explanation 


Se were exposed to à series of words within 
which each item was repeated once, at one 
of several possible retention intervals. The 
technique was modified here so that an 
item could be repeated in the same modality 
(og. visual -visual—VV) or in the alterna» 
tive modality (e; auditory visual—AV) 
on the 2 presentations, Two distinct ques- 
tions are considered. The first question 
concerns the duration of persistence of 
modality-specific information in memory, 
In the present paradigm this may be in- 
ferred (rom. the duration of intramodality 
facilitation in recognition performance ; for 
exa from the of an advantage 
for VV relative to AV. The second ques 
tion follows Mann (1970) in adopting a 
direct a in which S is required to 

modality of presentation of each 
item recognized as ed. The distinction 
between recognition facilitation and knowl- 
edge about modality of presentation may 
be important in that, in one case, the infor: 
mation may facilitate recognition even 


though it is not directly accessible, whereas, — 
in the other case, the information may le. — 
directly accessible without benefiting recoge - 


nition performance per se. 
In experiments that have used words as 
stimuli, research has shown that auditory 
tation i associated with better peor- 
formance than vieual presentation over a 
variety of memory taske However, the 
duration of this auditory superiority ap: 
pears to he technique t, ranging 


s 


f 
| 
4 
4 
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from the last 2-3 serial positions in free 
recall (Murdock & Walker, 1969) to as 
many as 7-10 serial positions or seconds in 
probe recall (Murdock, 1967) and recogni- 
tion memory (Kirsner & Craik, 1971). As 
the auditory advantage was found over all 
sampled serial positions in the probe recall 
and recognition experiments, the present 
study was designed to sample recognition 
performance over a substantially longer set 
of serial positions. 

The position with regard to visual per- 
sistence for linguistic stimuli is complicated. 
On one hand, it has been widely assumed, 
following Sperling (1967), that apart from 
transient retention in an iconic form there 
is little or no visual persistence for linguistic 
stimuli. In line with this conclusion a 
number of mixed-modality studies using 
discrete-trial recognition and probe recall 
procedures have found no evidence for 


short-term visual persistence (Burrows, 


1972; Kirsner & Craik, 1971; Murdock, 
1967). Against this, however, results from 


clinical and shadowing studies using recall 


as well as same-different techniques have 
found evidence that visual information for 
1-3 letter stimuli persists for up to 25 sec. 
(Kroll, Parks, Parkinson, Bieber, & John- 
son, 1970; Parkinson, 1972; Parks, Kroll, 
Salzberg, & Parkinson, 1972; Warrington & 
Shallice, 1969). One possible explanation 
of the inconsistency may be that visual 
persistence is masked in mixed-modality 


. studies by a short-term auditory compo- 


nent. |f this is so, then a longer sampling 
period may be required in order to reveal 
visual facilitation effects. This condition 
is met in the present study, wherein perfor- 
mance was sampled over a range of 31 
intervening items in the continuous list. 

The purpose of the experiments reported 
here was to explore the duration and char- 
acteristics of representational persistence in 
audition and vision. Experiment 1 com- 
pared performance, as measured by both 
accuracy and recognition reaction time 
(RT), for intra- and intermodality recogni- 
tion decisions. Experiment II also provided 
data on recognition accuracy and, in addi- 
tion, required Ss to report the modality of 
first presentation for recognized items. 
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Subjects. Twenty undergraduate Ss from the 
University of Toronto § pool were used in each 
experiment. They were naive with respect to the 
experiment and were paid for their services on com- 
pletion of the single session. All Ss were tested 
individually. 

Design, The experiments were conducted using a 
within-Ss design for the 2 X 2 X 6 factorial com- 
bination representing test modality (auditory or 
visual), similarity (same or different modality on 
first and test presentations), and lag (1, 2, 4, 8, 16 
and 32). Lag = 1 refers to an immediate repetit: 
with no intervening items, 

The Ss were presented with a continuous seric 
of 772 items. The first 52 items were intended to 
absorb short-run practice effects and were not con- 
sidered in the analyses, The 720 critical items 
comprised 360 words presented twice each, with 15 
replications within each of the 24 experimental con- 
ditions, Each replication of the 24 test trials was 
completed within a concealed block of 48 consecu- 
tive tria nd the expe ental conditions were 
ordered differently within each replication, 

The words, drawn from a pool of common, 2- 
syllable nouns of 5, 6, 7, or 8 letters, were presented 
at a rate of 4 sec/word. Visual items were exposed 
for .5 sec. and were followed by a blank interval for 
the remaining 3.5 sec. of the intertrial interval 
This was followed immediately by an auditory item 
or another visual item in accordance with the prede 
termined sequence. Two formats were drawn up, 
each format constituting a random sample from the 
word pool, Each format was presented to 10 Ss in 
each experiment. 

Procedure, The visual items were prepared in 
lowercase on an IBM Selectric typewriter, For 
Presentation, the list was inserted in the typewriter; 
the list was advanced by a solenoid attached to the 
return button, A Shibadan television camera was 
focused on the return ca: , and Ss viewed the 
stimuli on a Shibadan monitor. The typewriter 
was advanced twice for each visual word, once to 
bring the item onto the screen and once to introduce 
the blank phase, The screen remained blank during 
Presentation of the auditory words, 

Auditory item», previously recorded in a female 
voice at intervals appropriate to the mixed-modality 
sequence were presented through a loudspeaker 
located immediately above the television screen 
from one track of a stereo tape recorder. The 
auditory and visual sequences were synchronized 
by feeding tones, located for the visual sequence, 
from the second auditory track through a speech 
relay to the solenoid of the typewriter. A Uber 
diopilot system was used to start a millisecond clock 
at the onset of each auditory and visual word. 

In Experiment I, Ss were required to decide 
whether or not each word had occurred before in 
the list. They responded by saying yes or no on 
each trial in the continuous list. The RT was mea- 
sured from the onset of each word, and the clock 


Ww 


MOD, 
was stopped by S's vocal responses amplified from 
a microphone signal, 

In Experiment IL, Ss were required to say some, 
different, or no, M à word had occurred. before in 
the list, Ss said same or different contingent on their 
decision as to whether or not the word had been 
presented in the same physical form on the 2 occa- 
sions. The Ss said no if the word had not appeared 
in the list before. The RT was not measured in 
this experiment, 


RESULTS 


Recognition accuracy. The recognition 
data for the inter- and intramodality combi- 
nations are presented in Table 1. As there 
was no significant effect attributable to the 
task variable, the accuracy data are shown 
pooled across Ss in the RT and modality 
judgment experiments. Each value is 
therefore based on 600 observations repre- 
senting 15 observations per S for the 
40 Ss in the RT and modality judgment 
experiments, 

The auditory comparison reveals an ad- 
vantage for AA relative to VA for all 6 
retention intervals, An analysis of vari- 
ance (ANOVA) (Task X Similarity X Lag) 
showed that the main effects of similarity, 
F (1, 38) = 17,3, p < .001, and lag, / 
(5, 190) = 34,7, p < .001, were significant. 
‘There were no other significant effects 
(at the .05 level), Thus, the tabular and 
statistical evidence are consistent. with the 
conclusion that the AA advantage does not 
diminish over the sampled range of reten- 
tion intervals. 

"The visual comparison shows a consistent 
advantage for VV at the 4 longest retention 
intervals and some inconsistency at the 2 
shortest intervals, An ANOVA (Task 
X Similarity Lag) showed that the main 
effects of similarity, F (1, 38) = 48, 
p €.05, and lag, F (5, 190) = 30.5, 
p < 901, were significant. The Similarity 
X Lag interaction was also significant, 
F (5, 190) = 8.7, p < .001, indicating that 
the VV advantage increases significantly at 
the longer retention intervals, There were 
no other significant effects. 

An overall ANOVA (Task X Ss X Test 
Modality X Similarity X Lag) was com- 
pleted in order to test the hypothesis that 
the advantage found for the intramodality 
conditions followed different retention func- 
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combination 


Test Modality X Similarity X Lag interac- 
tion, F (5, 190) = 6.7, p < .001, showed 
that this was the case. 

The overall performance level was similar 
in the 2 experiments. The mean detection 
and false alarm rates were 91.0% and 10.5% 
in Experiment 1 and 92.9% and 6.6% in 
Experiment IH. Thus, the data show that 
recognition performance was not impaired 
by the additional input and output de- 
mands associated with the judgment task 
in Experiment IH. 

Recognition reaction time, The results 
presented in Table 2 show recognition RT 
for correct recognition decisions for the 4 
modality combinations. The values are 
based on not less than 233 observations 
per point, 

For the auditory comparison, the recog- 
nition RT data show a pattern similar to 
that for recognition accuracy, The audi- 
tory-auditory combination enjoys a con- 
sistent advantage at most retention inter- 


tions in audition and vision. The critical — 


TABLE 2 


RECOGNITION Reaction Trae (18 Msc.) rok WORDS 
CowmskcrLvy RkcoownED AS A FUNCTION OF 
Movaurry COMBINATION. AND. LAG 


Mortality mw 
com benation - — 
is 2 4 a} we) n | M 
AA | 786 | 829| 824 | 918 | 926 (1017 | 883 
VA |788 |898| 982 | 968 | 973 | 965 | 920 
WV 17131 764 | 797 | 763 | 774 | 80S | 769 
AV (768 | 771 | 786 | 818 | B48 | 826 | 802 
[3 1 


Note. Abbreviations: AA = e p -auditory, VA = viraal- 
auditory, VV = visual-visual, and AV = auditory -vival. 
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vals, however, unlike the accuracy data, 
the AA advantage is reversed at the longest 
retention interval and substantially reduced 
in magnitude at the shortest interval. An 
ANOVA (Similarity X Lag) showed that 
the AA advantage, F (1,19) = 9.4, p< .01, 
lag effect, F (5, 95) = 31.4, p « .001, and 
the Similarity X Lag interaction, F (5, 95) 
= 5.34, p< .001, were all statistically 
significant. 

The visual comparison supported the ac- 
curacy data with a clear VV advantage at 
the longest retention intervals and a lesser 
advantage at the shorter intervals. An 
ANOVA showed that the effects of simi- 
larity, F (1, 19) = 14.0, p < .01, lag, 
F (5, 95) = 8.8, p « .001, and Similarity 
X Lag interaction, F (5,95) = 2.5, p < .05, 
were significant. 

To summarize, the evidence shows that 
recognition performance is sensitive to the 
modality manipulation. In the case of 
auditory persistence, the tabular data indi- 
cate an AA advantage relative to VA at 
the 5 shortest retention intervals for both 
accuracy and recognition RT. The recogni- 
tion RT data indicated that auditory per- 
sistence might be limited to a period of less 
than 2 min. or 31 intervening items, but 
this may be due to technique-specific or 
sampling factors. The data clearly sup- 
ported the conclusion that there is an over- 
all advantage for VV relative to AV and 
suggested that, if anything, this may be 
enhanced at long retention intervals. 

Knowledge of modality of presentation. 
The data presented in Table 3 show the 
proportion of correct judgments for each 
combination of the experimental conditions. 
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The data are n« rily conditionalized 
on correct recognition. The results of 
Experiment lI show an advantage for the 
intramodality conditions, particularly at 
long retention intervals. An ANOVA 
(Test Modality X Similarity X Lag) 
showed that the main effects of Similarity, 
F (1,19) = 11.4, p < .01, and lag, F (5, 95) 
= 27.9, p < .001, were significant. The 
Test Modality X Similarity X Lag inter- 
action, F (5, 95) = 5.5, p < .001, was also 
significant. 

There are 2 possible sources of artifact 
in the modality judgment data; first, Ss 
could improve their performance by giving 
more zo responses when uncertain about 
modality of first presentation and, secondly, 
Ss might give more same responses to items 
in one or both modalities. The finding that 
accuracy was greater in the modality report 
group appears to preclude the general 
criterion effect. 

The response bias problem is more com- 
plicated. The false alarm data presented 
in Table 3 indicate a tendency to give more 
same than different responses, particularly 
for visual items. A signal detection analy- 
sis was implemented to give an unbiased 
estimate of modality knowledge. In this 
analysis, the d’ statistic was calculated 
separately for each condition for each .S. 
The hit rate was the number of correct 
modality judgments expressed as a propor- 
ton of the number of correct detections 
within each condition. The false alarm 
rates were calculated from the allocation 
of same and different judgments to new 
items. For this purpose it was assumed 
that the population of new items was 


TABLE 3 


Proportion or Correct MODALITY JUDGMENTS AS A Fu 


NCTION OF MODALITY COMBINATION AND LAG 


Modality ZAN DE 
combination 
1 2 4 8 16 | 32 | M hee 
AA 99.7 99.7 97.9 91.7 892 | 80.0 93.0 34 
VA 98.3 90.7 96.3 89.2 860 | 77.7 89.7 31 
VV 99.0 92.3 94.7 89.9 87.3 86.0 91.5 3.9 
AV 98.3 99.7 94.6 85.3 84.9 76.7 89.9 2:1 


Note. Abbreviations: AA = auditory-auditory, VA = visual-auditory, VV 
^ For example, 3.4 and 3.1 indicate the proportion of same and different res; 


= visual-visual, AV = auditory-visual, 
‘ponses, respectively, given to new auditory items. 
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divided equally into same and diferent 
events. Thus, for the VV condition, the 
false alarm rate was the number of same 
judgments given to new visual items. The 
same procedure was adopted for different 
judgments. The d’ values were 4.04, 3.94, 
3.96, and 3.96 for the AA, VA, VV, and AV 
modality combinations, respectively. An 
ANOVA completed on the d' values showed 
that the main effect of similarity was not 
significant (F < 1), but the Test Modality 
X Similarity X Lag interaction, F (5, 95) 
= 44, p < .01, remained significant. Thus, 
the effect of this calculation was to elimi- 
nate the overall advantage associated with 
the intramodality conditions. 


DISCUSSION 


The results show that information about 
modality of presentation remains in memory 
for a period that must be considered in minutes 
rather than seconds. This is evident from both 
the direct evidence that 5s can give reliable 
judgments about modality of presentation over 
a 2-min. period, and from the indirect evidence 
that recognition performance is facilitated 
when a word is presented in the same modality 
on the presentation and test trials. 

Detailed analysis of the recognition data 
indicated some asymmetry in the retention of 
auditory and visual material. Auditory per- 
sistence was at a maximum over short reten- 
tion intervals of 16 items (64 sec.) or less, but 
the data did not consistently support the view 
that auditory persistence was of limited dura- 
tion. Visual persistence, on the other hand, 
appeared to be masked at short retention 
intervals but was present under all conditions 
at retention intervals beyond 4 intervening 
items or 16 sec. Unlike recognition, however, 
the modality judgment data showed that when 
response biases were controlled there was no 
evidence of either (a) asymmetric retention of 
auditory and visual information or (b) an 
advantage attributable to the presentation of 
a word in the same modality on the presenta- 
tion and test trials. 

‘The most important theoretical questions 
concern the way in which modality informa- 
tion is encoded and stored in memory and how 
it is retrieved and used in the recognition and 
modality report situations. There appear to 
be a number of possible explanations of the 
data, ranging, at one extreme, from the view 
that they are influenced by an extended form 
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of sensory memory, to the view that recogni- 
tion facilitation as well as modality knowledge 
is based on abstract information stored in 
long-term memory. 

As argued by Serfass and Dong (1972), the 
fact that Ss can remember input modality is 
not readily explained by reference to informa- 
tion-processing models such as those advanced 
by Atkinson and Shiffrin (1968), Morton 
(1969), or Sperling (1967). Certainly, it 
appears true that these models have neglected 
the processes involved in storing attributional 
information about an item in favor of an analy- 
sis of the operations involved in the identifica- 
tion and retention of the item per se. How- 
ever, it may be more appropriate to address 
modality report data to the question of mem- 
ory for attributes raised by Underwood (1969). 
In this context the modality report data may 
not be critical for any one model of attribute 
retention. 

Hintzman, Block, and Inskeep (1972) have 
considered 2 explanations of attribute report 
data. The first of these is that an attribute is 
encoded as an abstract proposition, associated 
with the meaning of the word, and bearing 
only a referential relationship to the stimulus 
conditions. The second is that attribute infor- 
mation is retained as a literal copy of the per- 
ceptual experience. The present data bear on 
these issues at a number of points. First, as 
Hintzman et al. have argued, if the modality. 
information is stored as an abstract attribute 
it should make no difference to recognition 
whether or not an item is re-presented in the 
same modality. In this respect the present 
results clearly support the literal or “represen- 
tational" position. Furthermore, the asym- 
metric relationship between auditory and 
visual persistence in recognition may be ad- 
duced as evidence against the view that 
modality information is encoded only in an 
abstract form. It is possible, however, that 
attribute information is encoded and used in 
different ways for recognition and attribute 
report. Thus, modality report could be based 
on abstract information even though recogni- 
tion facilitation is determined in some other 


The abstract explanation of the report data 
received some support from the finding that 
modality knowledge is generally insensitive 
to the similarity manipulation. Unlike recog- 
nition, performance on the modality report 
task is not facilitated in the same modality 
condition. On the other hand, there is evidence 
that performance on the between-modalities 
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attribute is superior to performance on within- 
modalities attributes. This trend was reported 
by Hintzman et al. (1972), and is apparent in a 
comparison between the results of the present 
experiment and the results of (a) an analogous 
visual experiment using 2 type fonts (Kirsner, 
1973), and (b) an auditory experiment using 
2 voices (Craik & Kirsner)? Thus, if attribute 
information is stored only in an abstract form 
dependent on the verbal unit in memory, then 
it seems necessary to conclude that the organi- 
zation of the structure surrounding the word 
unit is influenced by perceptual as well as 
linguistic factors. 

In summary, it has been argued that (a) 
modality information is stored in a representa- 
tion form in memory, (b) this representational 
information is used directly in recognition, and 
(c) modality judgments also use information 
which may be stored in a representational 
form, but that access to this information is 
achieved via the word unit. These conclusions 
are of course tentative, and an accurate de- 
scription of the relationship between represen- 
tational and linguistic systems in memory 
depends on further research. 


*F. I. M. Craik and K. Kirsner. The effect of 
speaker's voice on recognition. In preparation. 
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INFORMATIONAL DISCREPANCY ON ADAPTATION 
OCULAR IMAGE MAGNIFICATION * 
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The Ss adapted to uniocular image magnification which transformed the 
preexperimental relationship between objective depth slant and retinal dis- 


parity. 


The adaptation treatment consisted of continuous inspection of a 


single luminous figure representing for different Ss three different degrees of 


perspectival slant. 


Two levels of depth processing were contrasted: depth 


registration—an activity which diverted .S from active utilization of depth 


information—and depth proce 
tion of depth information. 


crease as the discrepancy between perspectival and stereoscopic 


ing—an activity that required frequent utiliza- 
he magnitude of adaptation was found to in- 


slant increased 


and to be greater following depth processing than following depth registra- 


The results were dis 


tion. 


which stresses resolution of informational discrepancy. 


issed in the context of an account of adaptation 


Other findings in the 


literature were reinterpreted in the light of the current experiments. 


In earlier. experiments (Epstein, 1971, 
1972a, 1972b; Epstein & Daviess, 1972; 
Epstein & Morgan, 1970) we showed that 
the relationship between retinal disparity 
and perceived stereoscopic depth is modi- 
fied by a period of exposure to uniocular 
horizontal image magnification. produced 
by a meridional size lens (MSL). The Ss, 
who engaged in binocular visual explora- 
tion of a typical indoor environment, while 
wearing a vertically oriented MSL before 
one eye, showed subsequent adaptive shifts 
in depth judgments based on binocular dis- 
parity. The shifts were interpreted as 
evidence that binocular disparity had been 
recalibrated as a consequence of the pairing 
of disparity with the nontransformed dis- 
crepant information provided by exposure 
to the normal environment. 

The present experiments had three ob- 
jectives. First, we wished to test a pro- 
cedure for bringing the adaptation treat- 
ment under experimental control. Although 
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wish to thank Krishan D. Broota, S. M. Ebenholtz, 
and Kenneth Paap for help with various aspects of 
this investigation. 
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the early contemporary research (reviewed 
by Epstein, 1967, Chap. 9) employed con- 
trolled laboratory tests to produce adapta- 
tion, eg., target-marking, much recent 
work, including our previous work on 
uniocular image magnification, has favored 
the less controlled expedient of giving S a 
period of exposure to the normal extra- 
laboratory environment. The advantage 
of this latter treatment is that it combines 
all of the nontransformed informational 
inputs from the environment in joint op- 
position to the optically transformed input, 
thus providing optimal opportunity for 
adaptation. The disadvantages are all 
those associated with lack of control. 
Therefore, in Experiment 11 we set about 
to demonstrate adaptation to uniocular 
image magnification with a strictly con- 
trolled treatment. 

Our second objective was to examine the 
relationship between the degree of infor- 
mational discrepancy and the magnitude of 
the adaptive shift. In the present experi- 
ments the discrepancy was between per- 
spective slant and stereoscopic slant. We 
chose perspective as the source of discrepant 
input because there is evidence that in the 
absence of other information, perspectival 
cues are effective determinants of per- 
ceived slant in depth (e.g., Clark, Smith, & 
Rabe, 1955; Freeman, 1966), and because 
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there is evidence that in a cue-conflict 
situation perspective can modify the effect 
of binocular disparity (Gillam, 1968). 
Despite the aforementioned evidence, Ex- 
periment I was designed to assess S’s 
depth judgments in a cue-conflict situation 
involving disparity and perspective. The 
choice of perspective is also recommended 
upon considerations of experimental con- 
venience. It is a simple matter to vary 
perspectival slant and thus vary the degree 
of informational discrepancy. In Experi- 
ment II, stereoscopic slant, determined by 
the MSL, was kept constant, while per- 
spectival slant, determined by the figural 
properties of the target that was inspected 
during the adaptation period, was varied. 
The adaptation treatment consisted of 
exposure to discrepancy between perspec- 
tival slant and stereoscopic slant, and the 
measure of adaptation was the shift in 
binocular, no-lens, judgment of the ap- 
parent frontal parallel plane of a single 
luminous rotatable test line. 

A third objective was to examine the in- 
fluence of level of processing on adaptation. 
It now is commonplace that adaptive 
shifts are greater following self-generated, 
self-controlled movement than following 
passive, other-controlled movement. Al- 
though a variety of interpretations of this 
result have been advanced, there is a 
relatively straightforward account which 
rarely has been made explicit. On the 
view that adaptation represents a response 
to informational discrepancy, it is plausible 
that the degree of adaptation would depend 
on the degree of processing of the inputs 
that are potentially discrepant. If process- 
ing of the input has no behavioral signifi- 
cance, because behavior is guided exter- 
nally, as in the case of passive movement, 
then processing may be aborted and the 
informational discrepancy may not be 
sufficient to motivate adaptation. Accord- 
ingly, adaptation following passive activity 
will be less than adaptation following self- 
generated activity. 

In the foregoing account, the important 
difference is not between active and passive 
movement, but between levels of process- 
ing; simple registration of input versus 
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processing of the input in : ice of a 


relevant perceptual discrimi There- 
fore, in Experiment II we ned the 
effects of two types of exp isks on 
adaptation to uniocular imag wgnifica- 
tion. In both tasks the displa litions 
of viewing, and duration of visual exposure 


were identical. In the depth 
condition S's attention was d 


egistration 
awn to a task 


that did not require utiliz of depth 
information; in the depth-p condi- 
tion S's attention was draw: isk that 
required processing of dept! rmation. 
Our expectation was that dv; cessing 
would lead to greater adaptation 
EXPERIMENT | 

Prior to the investigation o principal 
questions it was necessary to assess the re 
sponses of the perceptual sysiem to the 
discrepancy between perspectival and 
stereoscopic slant. What is the apparent 


slant in depth of a target whose retinal 
perspectival correlates and retinal-disparity 
correlates are not in agreement? ‘To answer 
this question, judgments were secured of 
the apparent slant of frontal parallel pro- 
jections of a rectangle rotated in depth 
about its vertical axis, presented under 
three conditions of viewing: monocular 
viewing (M), binocular untransformed 
viewing (B), and binocular viewing with 
MSL before one eye (MSL). On some 
trials in the MSL condition, the direction 
of the theoretical rotation of the frontal 
plane induced by the MSL, i.e., the direc- 
tion of stereoscopic slant expected in the 
absence of conflicting input, was consistent 
with the direction of perspectival slant; 
on other trials, the directions of perspec- 
tival and stereoscopic slant did not agree, 
With these variations in mind, the specific 
questions to be resolved in Experiment I 
were the following: (a) In the absence of 
discrepant input (Condition M), will the 
perspectival cue determine apparent slant- 
in-depth reliably? (b) When normal access 
to binocular disparity is available (Condi- 
tion B), will the effect of perspective be 
attenuated? (c) When the discrepancy 
between stereoscopic slant and perspectival 
slant is accentuated (Condition MSL), is 
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SUBJECT'S VIEW 
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DETAIL OF 
VIEWING APERTURE 
AND MIRRORS 


Figure 1. Arrangement for presenting stimuli. (Explanation is in text.) 


the result a perceptual compromise or does 
one or the other of the two inputs dominate 
completely ? 


Method 


Apparatus. Figure 1 shows the arrangement for 
presenting the stimuli. The S always viewed two 
targets simultaneously, one above the other: a 
monocularly viewed standard presented to the left 
eye and a comparison which was presented for either 
monocular or binocular inspection as the experiment 
required. The comparison target was 145.7 cm. 
directly in front of S. The 50% mirrors just inside 
the viewing aperture at 45° angles allowed the 
monocularly seen standard to appear at the same 
distance and directly below the comparison. The 
standard and comparison targets were equivalent 
in subjective brightness. Even though the monoc- 
ular target was viewed only with the left eye, a 
mirror was placed in front of both eyes so that both 
eyes would receive similar amounts of light. The 


Ss viewed the targets through an aperture 13.75 X 
6.25 cm. while supported by a chin rest. 

Stimuli. The targets were photographic negatives 
of a 20.4 X 10.2 X 1.9 cm. rectangle, photographed 
at slants of —60° to +60° in 10? steps. A 20.4 X 
10.2 cm. rectangle subtends a visual angle of 
8° x 4? at the viewing distance used. The targets 
were placed in slots in front of frosted glass and were 
back-illuminated. They appeared to S to be 
luminous white against a totally black background. 
In addition, there was a horizontal line target, 
20. X 6 cm. The line was cut into a black sur- 
round and when back-illuminated like the other 
targets, it too appeared to be a luminous white 
against a black background. 

Lens. A 6% meridional size lens and two plain 
glasses ground by the American Optical Company 
were used. 

Design. "Twelve presentation orders were gen- 
erated from a Latin square. Each of the 12 slants 
of the comparison (2-10? to +60°) appeared in 
each serial position only once and each slant was 
preceded by and followed by every other slant only 


588 


once. The zero-slant rectangle was always the first 
and the last comparison target in each viewing con- 
dition. It was the only target presented twice. 
The standard was always a rectangle, a zero-slant 
target. Slant estimates were secured for each com- 
parison relative to the standard. Every S judged 
all comparison slants under each of the three viewi 
conditions in succession: M, B, and MSL. 
order of viewing conditions was determined ran- 
domly for each S. 

Procedure. The S was seated with head in a chin 
rest, at the viewing aperture. The viewing aperture 
was parallel with the forehead and eye level was at 
the height of the midpoint of the aperture. Fi 
the effectiveness of the MSL was determined. This 
was accomplished by presenting the horizontal line 
for binocular inspection in the place where the com- 
parison targets would later be seen. The monocular 
standard was also in view and .5 was instructed to 
provide an estimate of the slant of the line relative 
to the standard using a reproduction procedure: 


You will see a rectangle and a horizontal line, 
one above the other. The rectangle will be of 
zero slant, that is, parallel to your forehead. 
Your task will be to reproduce the apparent 
visual slant of the line relative to the rectangle; 
that is, to draw an angle which corresponds to 
the apparent angular separation of the rectangle 
and the line. I am only interested in how 
slanted the line appears to be, and you need 
not worry about where it actually is. 


The S was given a clipboard with sheets of paper 
attached toit. Each sheet showed a circle 11.75 cm. 
in diameter with a small dot marking its center and 
à line drawn through its center bisecting the circle. 
Demonstrations of drawings of the line to reproduce 
the angular rotation of the horizontal line were 
provided. 

Throughout the entire experiment S wore an 
American Optical trial frame. For the screening 
test a. plain glass was inserted in one frame while 
the other had the 6% MSL. The MSL eye was the 
one subsequently to be used in the MSL condition 
for that S. The S observed the line and rectangle 
for 45 sec. before responding. The 45-sec, viewing 
time was chosen because the stereoscopic effect of 
the MSL is not immediate (Seagrim, 1969). The 
lights were then turned on, S removed the glasses, 
and then drew the apparent angular rotation of the 
horizontal line on the top sheet of paper. 

"Then S was exposed to the three viewing condi- 
tions and the succession of 14 targets in each condi- 
tion. In Condition M, S wore a plain glass before 
the left eye and an eye patch over the right eye, in 
Condition B the trial frame had two plain glasses 
(PG), and in Condition MSL the MSL was before 
one eye and a PG was before the other. Half of the 
5s wore the lens before the right eye and half before 
the left eye. The following instructions were read: 


You will see two rectangles, one above the other. 
The top rectangle will be changed in slant from 
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trial to trial at random but the bottom wi 

always be a rectangle of zero slant, that is 
parallel to your forehead. Your task will be 
to reproduce the apparent visual slant of th« 
top rectangle relative to the bottom rectangle 

that is, to draw an angle which corresponds t« 
the apparent angular separation of the two 
rectangles. I am only interested in how slanted 
the top rectangle appears to be, and you need 
not worry about where it actually is 


The room lights were turned on while S rey 
duced the angular rotation of the comparison and 
changed the targets. After each reproduction .* 
moved the response sheet so that preceding repi 
ductions were not open to inspection by S. 

Subjects. The 24 Ss were University of Wiscon 
undergraduates. Persons who normally wor 
framed glasses were not selected but wearers o 
contact len: were accepted, To secure these 24 
Ss, 34 were tested. Ten were rejected because they 
did not exhibit an MSL effect when viewing the 
horizontal line. 


Results and Discussion 


On the screening test, slant estimates of 
the horizontal line were 30.83? (SD — 9.099) 
for the 12 Ss who wore the MSL before 
the right eye, and also 30.83? (SD = 8.16?) 
for the 12 Ss who wore the MSL before 
the left eye. The theoretical geometric 
effect (Ogle, 1964) for the 6% MSL at an 
observation distance of 145.7 cm. is 51°5’. 
Therefore, in estimating the slant of an ob- 
jectively frontal parallel horizontal line in 
the absence of veridical slant cues S should 
estimate it as being slanted 5195^ with the 
side in front of the lens farther away. Al- 
though no S exhibited the full effect, all 
24 Ss did exhibit large effects in the pre- 
dicted direction. The principal data are 
summarized in Table 1. Analysis of vari- 
ance showed that neither MSL eye (left or 
right) nor condition of viewing (M, B, or 
MSL) were significant sources of variance. 
However, the effect of degree of perspectival 
slant was highly significant, F (6, 132) = 
769.13, p < .001. Table 1 shows that each 
interval of increased perspectival slant was 
accompanied by a corresponding increase 
in estimated slant. Also significant was 
direction of perspectival slant, F (1, 22) = 
6.05, .025 > p > .01, although this effect 
was much smaller than the effect of mag- 
nitude of slant. The targets designed to 
appear with the left side farther away were 
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erally estimated at larger slants than 
"gets of the same magnitude slanted with 
right side farther, although the differ- 
s between a left- and right-slanted 
get of the same magnitude was small. 
iso significant was the Direction X Mag- 
nitude of Slant interaction, F (6, 132) — 
169, .05 > p > .01, and the MSL Eye X 
ction X Magnitude of Slant inter- 
on, F (6, 132) = 2.45, .025 > p > .01. 
"he three questions that Experiment I 
designed to answer have been answered 
quivocally. Under the experimental 
ditions, perspectival input had signifi- 
int and systematic effects on perceived 
slant depth. The effectiveness of the per- 
spectival input was in no way modified by 
conflicting stereoscopic slant depth. Per- 
spectival slant was clearly dominant. 


EXPERIMENT II 


‘he results of Experiment I provided a 
ass for inferring the degree of informa- 
¿onal discrepancy that will be present 
during an adaptation period in which Ss 
are exposed to a constant stereoscopic slant 
coupled with varying degrees of perspec- 
ival slant. Consider the case of an S who 
spends 10 min. in binocular inspection of 
one of the comparison stimuli of Experi- 
ment I while wearing an MSL before one 
eye. Suppose that the geometric effect of 
the MSL is a 60° rotation. If the inspection 
figure is a rectangle (zero slant), the dis- 
crepancy between perspectival and stereo- 
scopic slant will be 60°, but if without 
changing the MSL, the inspection figure is 
changed to a 60? projection in the same 
direction as the MSL-induced slant, the 
discrepancy will be reduced to zero. And 
by choosing an intermediate projection, 
c.g, 30°, an intermediate degree of dis- 
crepancy is introduced. This plan was 
followed in Experiment II. Three groups 
of Ss were tested for adaptation to uniocular 
image magnification following a period of 
inspection of one of the perspectival pro- 
jections used in Experiment I. It was ex- 
pected that the magnitude of the adaptive 
shift would be larger the greater the degree 
of discrepancy, despite the fact that dis- 
parity was transformed to the same degree 
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TABLE 1 


MEAN ESTIMATES OF SLANT OF COMPARISON TAR- 
GETS UNDER THE THREE VIEWING CONDITIONS 


Binocular 

Com- 

pari- | Monocular | Grand Af 
son With Without 

MSL* SL 
OS ot weno. 0 0 0 

10° 9.50 | . 9.31 9.08 9.30. 
20* 2040 | 19.00 19.55 
30* 29.38 28.75 29.00 
40° 37.75 37.50 37.50 
50° 46.70 46.50 46.75 
60° 56.63 57.04 57.00 


» MSL = meridional size lens. 


for all Ss. Each S adapted twice, once in 
the depth-registration condition and once 
in the depth-processing condition. 


Method 


Apparatus. The apparatus used in Experiment I 
was employed in Experiment II with a few altera- 
tions. The viewing distance to the comparison 
target was increased from 145.7 cm. to 154.48 em. 
The slot where the standard target was shown in 
Experiment I was covered and not used in Experi- 
ment II. Only the comparison target area was used 
to present the adaptation figures. At a distance of 
101 cm. to the left of the chin rest used to look at the 
targets in Experiment I, a second chin-rest and head- 
clamp arrangement was set up on the same level as 
the first chin rest. At a distance of 154.48 cm, a 
rotatable electroluminescent line, 29 X .5 cm., was 
located. The height of the line was at 5's eye level. 
The line could be rotated in the horizontal plane on 
a vertical axis about a pivot point which remained 
fixed directly in front of 5. The position of the line 
could be read to an accuracy of .5? of rotation. 
A reading of 180? signified that the line was per- 
pendicular to 5's line of sight, that is, in the fronto- 
parallel position; a reading over 180° indicated that 
the left side was rotated back; a reading under 1809 
indicated that the right side was rotated back. 
The chin rest and head clamp inhibited large head 
movements by S. In order to increase the range 
of discrepancies that could be studied in our experi- 
mental arrangement, an 8% MSL was used in 
Experiment II. At the viewing distance of 154.58 
cm. the geometric effect of the MSL is an apparent 
rotation of 609, The trial frame used in Experiment 
I was used again in Experiment Il. 

Stimuli. The inspection figures presented to S 
during the adaptation period were the 0°, 30°, and 
60? figures previously used in Experiment I. 

Design. ‘Three independent groups of 20 Ss were 
tested. Each S was exposed to only a single adapta- 
tion figure, but each S participated in both adapta- 
tion activities. Within each group, half of the Ss 
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had the depth-registration activity on the first day 
and the depth-processing activity on the second day, 
while the other Ss had the two adaptation activities 
in reverse order. The two adaptation sessions were 
separated by 48 hr. Half of the Ss in each group 
wore the MSL before the right eye during the adapta- 
tion period and half wore the MSL before the left 
eye. The adaptation figures and the MSL eye were 
always paired together in such a way that per- 
spectival slant and stereoscopic slant were in the 
same direction. 

Procedure. All Ss followed the identical pre- 
adaptation procedure. First, S made eight judg- 
ments of the frontoparallel position of the electro- 
luminescent line. The S was seated at the chin- 
rest and head-clamp arrangement and viewed the 
electroluminescent line binocularly in total darkness. 
The first four judgments were made wearing the 
trial frame with PG in both frames, The second four 
judgments were made wearing the trial frame with 
a PG in one frame and the 875 MSL in the other. 
Two of the settings of each set of four started with 
the line at 155? and two with the line at 205°. 

The S instructed E to adjust the position of the 
line until it appeared parallel with her forehead. 
Head movements were explicitly discouraged. In 
order to reduce the tendency to make judgments 
based on line length rather than slant depth, Ss 
were told to disregard the length of the line and 
judge exclusively on the basis of the line's orienta- 
tion relative to her forehead. This instruction was 
necessary since, with a predicted geometric effect of 
60°, apparent line length could become a confound- 
ing variable, 

Following the preadaptation tests, the glasses 
were removed and then S moved to the other chin 
rest. The Ss in the depth-registration condition 
heard the following instructions: 


"This stage of the experiment is concerned with 
signal detection. We wish to learn whether 
the ability of an observer to detect a faint signal 
varies over a ten-minute period of observation 
of a simple unstimulating visual field. When 
you look into this box you will see a luminous 
four-sided figure in an otherwise dark back- 
ground. Atinfrequent and irregular intervals a 
faint light will be flashed inside the figure. The 
light will appear in one of the four corners, The 
corners at which the light will appear have been 
selected randomly, so it will not be possible 
for you to predict where the signal will appear. 
When you have detected a signal press one of 
the four buttons to show its location in the 
figure. 


The S was shown how to use the response box but- 
tons. The glasses were restored with the MSL still 
in place and then she observed the figure and re- 
sponded for 10 min. The Ss in the depth-processing 
condition heard the following instructions: 


This stage of the experiment is concerned with 
another form of depth judgment. We wish to 
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learn whether the depth at which objects app: 
varies over a ten-minute period of continuo 
observation. When you look into this box : 
will see a luminous four-sided figure in an ot 
wise dark background. At infrequent 
irregular intervals a faint light will be flash 
inside the figure. The light will appear in 

of the four corners. The corners at which t! 
light will appear have been selected randomly 
so it will not be possible for you to pred 
where the signal will appear. Each time a lig} 
flashes I want you to answer two ques 
about the corner in which the light appear 
(a) Does the corner, that is, the corner wl 
the light appeared, appear nearer, farther 
equal in distance from vou as the corner to 
right or left? (b) Does the corner app: 
nearer, farther, or equal in distance from yo 
as the corner below or above it? 


An example was presented and when S understox 
the procedure, the)glasses were restored with th 
MSL in place and She observed the figure and re 
sponded for 10 min. 

The light flashes were brief and appeared 
random times at intervals ranging from 30 sec. | 
90 sec. in randomly ordered corners. After th. 
10-min. adaptation activity, S kept her eyes closed 
while E changed the lenses and while she returned 
to the other chin rest, All Ss made two PG post- 
adaptation judgments of the frontoparalle! positio! 
with the electroluminescent line. Forty-eight hour 
later, the procedure was repeated with the second 
adaptation activity. 

Subjects. Sixty undergraduates from the 
population that provided Ss for Experiment I 
in Experiment Il. Twenty Ss were assigned to each 
of the three adaptation figures. Since the principa! 
objective of this investigation was to examine the 
recalibration of binocular disparity as a stimulus 
for perceived depth we could use as Ss only observers 
who gave evidence of effective use of disparity prior 
to the introduction of the experimental treatment. 
To secure 60 Ss who satisfied this requirement it was 
necessary to test a total of 97 Ss. Twenty-four 
were rejected because the mean of the 4 fronto- 
parallel PG settings deviated from the true fronto- 
parallel by more than +10°; 11 were rejected 
because of failure to exhibit an MSL effect of at 
least 20°; and 2 were dropped due to a failure to 
return for the second day of testing. With the 
exception of the latter 2 Ss, all rejected Ss were 
terminated before the initiation of the adaptation 
period on the first day. None of the retained Ss 
wore framed corrective glasses but several were 
regular users of contact lenses and these Ss con- 
tinued to wear the contact lenses throughout the 
experiment. 


Results 


Preexposure plain glass judgments. The 
results are summarized in Table 2. The 
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TABLE 2 


MEAN PREADAPTATION SETTING OF FRONTAL P. 


MEAN SHIFT, AND PERCEPT 


ADAPTATION: DEPTH REGISTRATION First; DEPTH PROCESSING FIRST 
0” adaptation figure | | 30 adaptation figure 60° adaptation figure A 
verage 
Level of processing T T E PNA Tr 7 adaptar 
Prear |Snittm | %8¢9P°| Pre | Shift %ada-) Prent | Shift M | % adar- | tion 
| | 
E 1 —— — Ex = 
Depth registration first 
TS PA dos TERRAE BE fi E E | 
Depth registration | 181.02 | 19.48 | 56.86 | 179.78 | 14.40 | 46.58 | 180.38 10.90 | 34.48 | 45.97 
Depth processing | 180.72 | 26.22 | 74.24 | 180.85 | 22.65 | 74.20 | 180.48 16.88 | 53.50 | 67.31 
Depth processing first 
Ka. AXD NEUE Y ERE | PERI 3 
Depth registration | 179.58 | 16,50 | 49.80 | 180.60 | 11.40 35.21 | 180.55 6.90 | 21.31 | 35.44 
Depth processing 180.52 | 24.52 | 69.84 | 179.92 | 15.52 | 49.05 | 180.65 9.62 | 30.43 | 49.77 


mean preexposure frontoparallel j udgments, 
based on the four settings, show that .Ss 
were highly accurate. A between-Ss (adap- 
tation figure and MSL eye), within-Ss 
(adaptation activity) analysis of variance 
of the preexposure settings showed no 
significant main effects or interactions. 

Preexposure meridional size lens judg- 
ments. The MSL produces an apparent 
rotation of the frontal plane, so that when 
the electroluminescent line is objectively in 
the frontoparallel position, the side on the 
same side as the lens appears farther back. 
"Therefore to adjust the line to the apparent 
frontoparallel plane the right side of the 
line must be moved forward when the right 
eye is exposed to the lens and the left side 
of the line must be moved forward when 
the left eye is exposed to the lens. As 
noted earlier, the predicted geometric effect 
is 60°. 

All Ss made adjustments in the predicted 
direction. When the line was judged to be 
in the frontoparallel position, the mean 
MSL settings differed from the mean PG 
settings (the MSL effect) by 34.38°, 31.47°, 
and 32.93° for the groups assigned to the 
0°, 30°, and 60° adaptation figures, re- 
spectively. A  between-Ss (adaptation 
figure and MSL eye), within-Ss (adapta- 
tion activity) analysis of variance was com- 
puted to determine whether the groups 
differed before the start of the different 
adaptation treatments. There were no 


significant main effects. There was one 
significant interaction: Adaptation Figures 
X Adaptation Activity, F (2, 54) = 3.42, 
p <05. 

Postadaptation judgments. Table 2 shows 
the mean postadaptation plain glass shift 
(difference between the pre- and post- 
adaptation judgments) and the percent 
adaptation for the three adaptation figures 
and the two adaptation activities. Adapta- 
tion is evident if at the removal of the MSL 
S judges the line to be frontoparallel when 
the line is set to produce the same disparity 
that was produced by the MSL. There- 
fore, the effect will be a deviation of judg- 
ments of frontoparallel from the preadapta- 
tion frontoparallel, with the side of the 
line on the same side occupied formerly by 
the MSL set farther back. 

All Ss showed aftereffects in the pre- 
dicted direction with mean shifts ranging 
from 6.90? to 26.22?. The degree of shift 
depended on the inspection figure and the 
type of adaptation activity. The effect of 
these variables is readily discerned in 
Figure 2 which presents percent adaptation 
averaged over the two orders in which the 
two adaptation activities were experienced. 
Figure 2 shows that the magnitude of the 
adaptive shift decreased as the perspectival 
slant of the inspection figure increased and 
that for all inspection figures the adaptive 
shift was greater following the depth-pro- 
cessing activity compared to the depth- 
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FIGURE 2. Percent adaptation for three inspec- 
tion figures (degree of discrepancy) and two types 
of adaptation activity averaged for the two orders. 


registration activity. The average percent 
adaptation was 63%, 51%, and 35% for 
the 0°, 30°, and 609 inspection figures, 
respectively. The average percent adapta- 
tion was 64% and 45% for the depth- 
processing and depth-registration activities, 
respectively. A between-Ss (adaptation 
figure, MSL eye, and order of adaptation 
activity), within-Ss (adaptation activity) 
analysis of variance was performed on the 
percent adaptation data. The main effects 
of adaptation figure, F (2, 48) — 19.88, 
p «.001, MSL eye, F (1, 48) = 4.76, 
p < .05, adaptation activity order, F (at 
48) = 15.10, p < .001, and adaptation 
activity, F (1, 48) = 90.48, b < .001, were 
significant. There were no significant in- 
teractions. The effect of MSL eye was 
due to the slightly greater adaptation for 
Ss who wore the MSL before the left eye. 
The order of activity effect was due to the 
fact that the passive-active order resulted 
in greater adaptation than the reverse 
order. 
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Discussion 


Experiment II showed that for a con 
transformation (enhancement) of 
disparity the magnitude of the adaptive 
varied as a consequence of variations of 
spectival slant of the adaptation figure. Th 
are no strictly analogous observations in 
literature. However, our findings can 
related to the results of experiments that h: 
varied the degree of optical transformati 
e.g., degree of optical tilt (Ebenholtz, 1966 
degree of sideways displacement (Efstathi 
1969). In these cases, the adaptive shift | 
been found to increase as the magnitude of t! 
optical transformation has increased. j 
present findings strongly suggest that the ess 
tial variation, common to both procedures, i 
a variation in the degree of informational dis- 
crepancy. The magnitude of discrepancy can 
be varied by manipulating the degree of optica! 
transformation or by varying the informatio: 
correlated with the nontransformed input 
The effect is comparable. Nevertheless, ou: 
proceduře has the advantage of avoiding con- 
founding due to unwanted variations in the 
ocular system that may be associated with vari 
ations of the degree of optical transformation 

The model of adaptation to uniocular imag: 
magnification which we have advanced in our 
earlier reports (Epstein, 1971, 1972a; Epstein 
& Morgan, 1970), and which has been ind: 
pendently formulated by Wallach (Wallach & 
Frey, 1972a) in accounting for adaptation in 
volving the oculomotor cues, has a ready ex- 
planation for the dependence of magnitude of 
adaptation on degree of informational dis- 
crepancy. In the present instance, the claim 
is that binocular disparity has been recalibrated 
or reevaluated as a consequence of being con- 
tinuously paired with a dominant, discrepant 
input. The degree of disparity imposed by the 
MSL acquires a new significance for perceived 
depth, one which is consistent with the domi- 
nant perspectival slant. The farther apart are 
the two slants, the greater must be the shift 
in the significance of disparity if the prevailing 
degree of disparity is to become equivalent in 
its effect on perceived depth to the effect of 
perspective. 

It is important to note that in our formula- 
tion, the directing factor in the relationship 
between the discrepant inputs is not that one 
elicits veridical percepts and the other, being 
transformed, elicits nonveridical percepts. In 
the present experiment, since the target was 
in fact frontoparallel, both inputs induced non- 
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veri percepts. The directing factor in our 

e nt was cue dominance. As Experi- 

showed, under our conditions per- 

al slant was dominant. Accordingly, 

ily was recalibrated. The direction of 

iit, therefore, is determined by dominance, 

not idieality. Of course, if the dominant 

inp! untransformed and yields veridical 

on the adaptive shift in the subordinate 

ill be toward veridicality. That shifts 

eridicality have been the rule in the 

| adaptation is a coincidence of the 

;:ental designs that have been favored. 

imple matter to demonstrate "'counter- 

»iation" (Wallach & Frey, 1972b) or 

maladaptive adaptation" (Canon, 1971), in 

which the shift is from veridical to nonveridical 

All that is required is to pair a 

subordinate nontransformed input with a 
dominant transformed input. 

The vital role of cue dominance as the direc- 
tive factor in adaptation may also explain the 
way in which selective attention effects adapta- 
o informational discrepancy. Canon 
(19 «posed S to an intermodal discrepancy 
tween visual and auditory localization. Dur- 
ing the adaptation period, S tracked a moving 
target presented in both modalities, under in- 
to attend to the visual input ignor- 

the auditory input, or vice versa. Follow- 

ing the adaptation period, measures of locali- 
ti ere secured with auditory input and 
visual inputs separately. Canon found that 
adaptation was restricted almost entirely to 
the unattended modality. If we may assume 
that selectively attending to a cue or to the 
input from a specific modality confers domi- 
nance to the attended cue or modality, 
Canon’s finding can be understood. Domi- 
nance relationships among cues or modalities 
are no doubt well-established by preexperi- 
mental perceptual experience, perhaps on the 
basis of probabilistic considerations similar to 
those proposed by Brunswik (1956). But 
dominance in a given situation may be affected 


perception. 


tion t 


tructions 


by contemporary or situation-specific factors, 
in Canon's (1970) case, distribution of atten- 
tion. This result is all the more significant 


because it seems likely that without selective 
attention visual localization dominates audi- 
tory localization (Weerts & Thurlow, 1971), so 
that if our interpretation is correct, selective 
attention is able to reverse preexperimental 
dominance relationships. 

The 60° inspection figure in Experiment II 
was selected because in this case perspectival 
slant and stereoscopic slant matched. Accord- 


ingly, informational discrepancy was assumed 
to be absent and significant adaptation was not 
expected. However, the experimental results 
showed that although the 60° inspection figure 
yielded the smallest shift the adaptation effect 
was significant. This result may suggest that 
a portion of the adaptive shift is not a response 
to informational discrepancy, just as in the case 
of tilt adaptation a part of the total adaptive 
shift may be attributed to Gibson-type nor- 
malization (Morant & Beller, 1965). A more 
likely explanation of the occurrence of signifi- 
cant adaptation to the 60° figure is that motion 
parallax, arising from unavoidable small head 
movements during the 10-min. adaptation 
period, offset perspectival slant and introduced 
a discrepancy between motion parallax slant 
and stereoscopic slant. This difficulty would 
not be present in Experiment I because S was 
not required to maintain the head in a station- 
ary position for any extended time. When we 
retested eight high adapters selected from the 
Ss of Experiment II some time after the con- 
clusion of Experiment II, using a modified 
procedure which reduced head movements to 
the practical minimum, the adaptive shift in 
response to the 60° figure was reduced. 
Nevertheless, since adaptive shifts were not 
totally eliminated the question remains open. 

The other principal finding of Experiment II 
is that an adaptation activity that requires 
depth processing yields greater adaptation than 
passive registration of depth. The effect of 
processing level may be attributable to two 
related factors: (a) depth processing may 
supply an added motive for resolution of the 
informational discrepancy, inasmuch as the 
requirement to localize the target may compel 
the visual system to opt for one outcome, and 
(b) depth processing may enhance the potency 
of the dominant cue by encouraging S to 
utilize the cue. Although the experimental 
results clearly show an enhancement attribut- 
able to depth processing it is also clear that 
adaptation occurs even when S is directed 
away from the relevant discriminations. Of 
course, the procedure did not purport to con- 
trol S's processing completely, only to intro- 
duce a difference in level of processing. It 
seems likely that in the depth-registration con- 
dition Ss did engage in a degree of depth 
processing. 

It should be understood that we are not 
claiming that depth processing is a necessary 
condition for the occurrence of adaptation to 
transformed disparity. Even were depth 
processing absent completely, the fact that dis- 
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crepant inputs are present is likely to be suffi- 
cient to produce some adaptation. The situa- 
tion is analogous to that which prevails in the 
case of the difference between passive and 
active movement. While adaptation usually is 
greater following active movement, significant 
adaptive shifts do occur following passive 
movement (e.g., Mack, 1967; Singer & Day, 
1966). According to our argument, adaptation 
is greater following active self-generated move- 
ment for reasons similar to those we offered 
in explaining the effect of level of processing 
in the present experiment. Nevertheless, 
adaptation following passive movement will 
occur as long as informational discrepancy is 
present, 
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Are unattended verbal materials stor: 
of their sensory features, or do they rec 
this question, eceived dichotic pi 
lists being selected either from the sa 
were instructed to attend to only or 
list presentation, a probe 
been presented in the attended list. 
similarity with the attended or unat 
probe item was presented that had 
channel, long latency and high error 
unattended item was the same as th 


This was interpreted as indicating that unattende 


semantic analysis. 


Several recent studies have pointed to 
the existence of short-term memory (STM) 
storage for unattended verbal items (Davis 
& Smith, 1972; Norman, 1969; Peterson 
& Kroener, 1964). However, there is some 
ambiguity concerning the extent to which 
this material is processed prior to storage. 
Glucksberg and Cowan (1970) argued that 
it is only the sensory features of the 
stimulus, rather than semantic or cate- 
gorical information about it which is held 
in STM. This conclusion was based upon 
2 findings. In a shadowing task, where 
narrative passages were dichotically pre- 
sented, digits were occasionally presented 
to the unattended ear. When Ss were 
later asked if a digit had been presented, 
homophones of digits that occurred in the 
narrative such as “too” and "for" were 
incorrectly identified as digits. This in- 
dicates that context had no effect, and 
that information about meaning was there- 
fore not being extracted from the unat- 
tended channel. Secondly, Ss were in- 
structed to report if there were any 
instances where they believed a digit had 
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was presented, wit 


ed in short-term memory solely in terms 
:eive some semantic analy: "To answer 
resentation of lists of words, with the 2 
me or from different categories. The Ss 
ne ear, and ignore the other. Following 
h Ss indicating if that probe had 

Probe type was varied as to category 
tended lists. It was found that when a 
been an actual item in the unattended 
rates were found if the category of the 
e attended list, but not if they differed. 
d materials receive some 


been presented, but were unable to recall 
exactly which digit it was. However, it 
was found that unless a specific digit was 
reported, Ss reported that no digit had 
occurred. There was thus no apparent 
memory for the category digil that was 
separate from specific item occurrence. 
'This is in contrast to memory-attended 
events, where Ss can report, correctly, 
that a category member (e.g., a digit or 
a word) has occurred without being able 
to report the identity of the specific item. 

Some data of Treisman (1964) support 
this contention. When an unattended 
message was the same as the one being 
shadowed, all Ss noticed this identity when 
the time separation was sufficiently short. 
The critical time interval depended upon 
whether the shadowed or rejected message 
y leading. When the relevant message 
was leading, identity was noticed with an 
interval up to 4.5 sec., whereas when the 
irrelevant message was leading, it was only 
1.4 sec. Treisman considered these dura- 
tons to be a measure of STM for the 
2 kinds of materials. The important find- 
ing with regard to the question of semantic 
analysis of unattended input relates to the 
differences she found in the memory dura- 
tion of attended and unattended messages 
as a function of the nature of the material 
employed. When the relevant message 
was leading, the interval for noticing 
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identity was 4.5 sec. for prose. This cor- 
responds to about 12 words, the usual 
memory span for this type of material. 
However, when S was shadowing isolated 
words, the critical lag was much smaller, 
about 6 words. Thus, memory in the 
attended channel is a function of meaning. 
On the other hand, the number of words 
by which the irrelevant message led when 
identity was noticed did not depend upon 
the kind of material used at all, suggesting 
that semantic analysis was not occurring. 

Other studies, however, provide evidence 
that some semantic analysis of unattended 
events is done. Lewis (1970) found that 
unattended material that was semantically 
similar to the attended material (syno- 
nyms) caused Ss more difficulty in a 
shadowing task than did semantically dis- 
similar items. Similarly, Davis and Smith 
(1972) found that the dichotic presenta- 
tion of a list of nonsense words which .Ss 
were instructed to ignore did not cause 
as much interference in the recall of a list 
of attended words as did the presentation 
of a to-be-ignored list of real words. 

In the study reported here, a reaction 
time (RT) paradigm introduced by Stern- 
berg (1966) was employed to study the 
question of semantic analysis of unattended 
material. In this procedure, Ss are pre- 
sented with a list of words to remember 
(memory set) followed by a probe item. 
The task is to determine whether the 
probe item had been presented as a member 
of the memory set. It generally has been 
found that RT increases linearly with the 
size of the memory set, and that the slope 
of the function is the same whether or not 
the probe was one of the presented items. 
Sternberg interpreted these results as in- 
dicating that Ss perform a serial exhaustive 
scan of the items in memory. 

Recent elaboration of this procedure has 
indicated that in some situations Ss may 
be capable of categorizing the items in the 
memory set so as to reduce the required 
search time. De Rosa and Morin (1970) 
found that when the memory set con- 
sisted of a set of consecutive digits, the 
time required to determine that the probe 
digit was not a member of this set de- 
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creased as the difference 

metic value of the presen 
the highest (or lowest) dig 
set increased. Lively and ford. (1972), 
presenting lists consisting cither of digits 
or of letters, found that the time to decide 
that a probe item was not a member of 
the memory set was faster if the probe 
was a member of the other category to 
that employed in the memory set. Further 
evidence for such categorization of items 
is presented by Smith and Abel (1973). 
They presented a list of items drawn from 
several categories followed by a probe con- 
sisting of a category name. It was found 
that the time required to decide whether 
the given category had b represented 
in the memory set varied directly with the 
number of categories, rather than with the 
actual number of items of these 
studies suggest that upon sentation of 
the memory set, some categorization of the 
list items occurs. 

The above experiments emploved only: 
memory sets to which Ss ittending. 
Does the same kind of y analysis 
occur for unattended inpui Lo study 
this, Ss were dichotically presented with 
2 lists of words, one of which they were 
instructed to ignore. All items within a 
list were drawn from the same category. 
By varying whether the unattended list 
on a given trial contained items belonging 
to the same category as the attended list, 
as well as the category from which the 
probe item was selected, it was hoped that 
something could be learned about the 
nature of the semantic analysis of unat- 
tended input. 

If unattended materials are not seman- 
tically analyzed, but have only acoustic 
representation in STM, as suggested by 
Glucksberg and Cowan (1970), then cate- 
gory similarity between attended and un- 
attended lists should have no influence on 
RT to the probe. Any effect of category 
similarity, on the other hand, w ould be 
taken as evidence that some semantic 
analysis is done of the ignored channel. 
To get more information about the memory 
representation for unattended input, several 
different probe words were employed. 


een the arith 
stimulus and 
ı the memor: 


SEMANTIC ANALYSIS OF UNATTENDED ITEMS 


Positive probes (i.e., those requiring a yes 
response) were items that had actually 
been presented in the attended channel. 
Negative probes were selected from (a) the 
category employed in the attended list, 
(b) the category employed in the unat- 
tended list, (c) a different category entirely, 
or (d) an actual item in the unattended 
list. A monaural condition in which the 
category of the negative probe could be 
the same or different from the memory set 
was included for comparison. 


METHOD 


Subjects. Five male university students with 
normal hearing volunteered to participate in this 
experiment, and were paid for their services. 

Apparatus. All lists were recorded using a mag- 
netic Sony (TC-355) 4-track tape recorder, and were 
presented individually to Ss through headphones. 
In recording the lists for dichotic presentation, the 2 
channels were matched approximately for loudness. 
By pressing a voice-operated relay key (Uhr Model 
F422) before recording each probe word, an in- 
audible mark was made on the second track of the 
tape at the onset of the probe. There was a variable 
delay of 20-40 msec. between actual probe onset and 
the mark on the tape. During playback, a sensing 
device, on detecting this mark, triggered a Hunter 
Klockounter (Model 120A, Series D) which mea: 
ured RTs to the nearest millisecond. The timer was 
stopped when S manually depressed 1 of 2 micro- 
switch response buttons to indicate his response 
choice. Because of the distance on the tape re- 
corder between the recording and playback heads, 
the probe word was not actually heard to begin until 
256 msec. after the tape mark had triggered the 
timer. Thus, this time was subtracted from all 
recorded RT measures. 

Procedure. Six hundred 2-syllable words from 45 
different categories were selected from Battig and 
Montague's (1969) category norms. Only those list 
words that had been included as category members 
by at least 5 of the students who established the 
norms were included. No word was included in 
more than one category. Prior to testing, E read 
S a list of all the words to be used to ensure famili- 
arity with all words and pronunciations. 

Each S attended 7 sessions over a 4-day period. 
Two sessions were presented on each of Days 1-3, 
and the last session was presented on Day 4. On 
Day 1, S received only monaural presentation of the 
lists, the first session representing a practice session 
and the second the test. Dichotic lists were pre- 
sented on Days 2-4, and consisted of 1 practice 
session followed by 4 test sessions. 

In each session, 144 trials were presented. A 
single trial consisted of the following sequence: a 
warning signal (metallic tap), a list of 2, 4, or 6 
words, another warning signal, and finally, the probe 
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word. A 6-sec. interval elapsed before the beginning 
of the next trial. In the monaural trials, words and 
signals occurred at the steady rate of 1/sec. For 
dichotic trials, one pair of words per second was 
heard, one word from each tape channel per second. 
Words in a pair were approximately simultaneous, 
with the onset of the attended word always preced- 
ing the onset of the ignored word by 100-150 msec. 
Half of the probes were positive; i.e., they were items 
that had occurred in the preceding list and required 
a yes response. The other half of the probes were 
negative, requiring S to respond no. Positive probes 
were selected from each serial position with equal 
frequency. 

For each S, one ear was chosen as the “relevant” 
ear. Instructions emphasized that he was to attend 
only to that ear, and to ignore materials presented 
to the irrelevant ear. The Ss were warned that 
some probes would occasionally be drawn from the 
list presented to the irrelevant ear, but that these 
required a no response, and that they would do 
better in the task if they totally ignored the irrele- 
vant materials. In an attempt to aid Ss in blocking 
out the irrelevant ear, the monaural and dichotic 
attended lists were always presented to the same 
ear. Further, in the dichotic condition the warn- 
ing signals and probe words were presented only 
in the relevant ear. The S always made his positive 
response with the same hand. Each of the 4 possible 
ear-hand combinations was gned to each of 4 
One of the combinations was then randomly 
assigned to the remaining S. 

Words within a single list represented only one 
category. For each trial, a category was selected at 
random, and then the appropriate number of words 
was randomly chosen from that category without 
replacement in a single session. A given word was 
used only once in the monaural test session, but 
twice in each dichotic session, once in an attended 
list and once in an unattended list. 

The number of trials in each condition differed 
for monaural and dichotic sessions. In monaural 
sessions, half of the negative probes were drawn 
from the same category as the list items, and half 
from a different category. Dichotic trials were first 
subdivided according to the categories of the 2 lists. 
On half of the trials the lists presented to each ear 
were from the same (S) category; on the other half, 
they were from different (D) categories, Fifty per- 
cent of the probes were positive. The remaining 
50% of the negative probes were subdivided into the 
following types: (a) a member of the same category 
as the attended list (Types Sı and Dj); (b) an 
actual item from the ignored list (Types S» and D); 
(c) an item from a category different from both lists 
(Types Ss and Ds); or (d) if the 2 lists were from 
different categories, an item from the same category 
as the unattended list (Type D4). 3 

Under Condition S, the first 3 negative probe 
types were tested with equal frequency. For 
Condition D, all 4 types were used, again with 
equal frequency. Samples of each type of probe 
appear in Tables 1 and 2, together with their fre- 
quency of presentation in each session. If the 


TABLE 1 
EXAMPLES OF Prope TYPES FOR MONAURAL CONDITION (List LENGTH 
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List item | Probe | Type Description Frequency 
GIRAFFE | GIRAFFE Positive Serial Position 1 
LION TIGER Negative Same category 
ZEBRA LEMON Negative Different category 
MONKEY | 
Total | | 


^ This figure derived from 4 Serial Position X 6 Replications, 
b The total of 144 was derived from 48 Total Frequency X 3 List Lengths. 


negative probe was an actual item from the ignored 
list, each serial position tested equally often. 
The specific order of conditions within a session 
was randomized separately for each set of 144 trials. 
Each session was recorded on a separate tape. Four 
dichotic test tapes were required in order to test 
each serial position for Sz and Ds probes at least 
twice for every list length. The: 


gory), and Ss. Incorrect 
not included in the ana! 
significant factor found 
F (2, 8) = 26.7, p< 01. ^ 
the variable of interest witi 
classification, did not result n 


'sponses were 
The only 
list length, 
obe types, 
spect to 
ficantly - 


/ were presented in " non ees P sot final 

a different random order to each S. different RTs, suggesting lassifica 
tion was not used as the : strategy 
RESULTS in responding to negative p Despite 


this finding, all Ss spontan y reported 
that probes from different categories were 
easier to reject. 

Dichotic presentation, Mean RTsat each 
list length as a function of whether the 
category of the attended and unattended 
lists are the same or different are shown 


Monaural presentation. A 3X3X5 
analysis of variance was performed on the 
mean RT data, the factors being list 
length (2, 4, or 6 words), probe type 
(positive, negative from the same category 
as list, and negative from a different cate- 


TABLE 2 
OR Dicnoric Conpition (List LENGTH 4) 


List 


| | | | AE 
E | Probe Type | Description Frequeney 
Attended | femore | | | 
| AB NC uu | 
Two lists from same (S) category 
OANNE | EILEEN | Berry | Serial Position 2 | 128 
Berry | SUSAN M ARY | Same category as attended (S1) | 4 
SHARON Lois || CaROLE | N 1 Item from ignored list, Serial Position 4 (S) | 4 
NANCY CAROLE | SQUIRREL | Nega Different category (Sa) 4 
Total | | | i 240 
A | m | E 
Two lists from different (D) category 
|| LEMON Positive Serial Position 3 12 
| MANGO | Negative | Same category as attended (Di) 3 
J Se | CELLO Negative Item from ignored list, Serial Position 2 (Ds) 3 
CHERRY | FIDDLE | R | Negative Different category (D) 3 
em | mrar | Negative Same category as ignored list (D4) 3 
ota | 
: | i 
Grand total || 
x | 1447 


* This figure was derived from 4 Serial Position X 3 Replication. 
© A total of 72 was derived from 24 Total Frequency X 3 List Lengths, 
* The total of 144 was derived from 72 Total from Same Category 4- 72 Total from Different Category. 


in Figure 1. Monaural RT data, in which 
positive and negative probes have been 
combined, are also shown for comparison. 
A 4-way analysis of variance was per- 
formed on correct mean dichotic RT mea- 
sures, the factors being list length (3); 
probe type, ie., positive or negative (2); 
category classification, i.e., attended and 
unattended lists from the same or different 
categories (2); and Ss (5). Different types 
of negative probes were combined for this 
preliminary anal Only the effects of 
list length and category type were signifi- 
cant (p < .01). As lists became longer, 
RT increased, F (2, 8) = 70.6, and re- 
sponses to probes were made more slowly 
when both lists were from the same 
category than when the categories were 
different, F (1, 4) = 26.7. None of the 
interactions reached the .05 level. A rep- 
lication of the analysis of variance using 
median scores yielded the same results, 

It can also be seen in Figure 1 that 
monaural RT was faster than dichotic RT 
at all list lengths. This effect did not 
reach signifieance at the 05 level, perhaps 
because monaural trials always preceded 
dichotic trials, and Ss were thus not as 
well. practiced. Least squares linear re- 
gression equations were calculated for the 
3 functions. They are shown in Figure 1. 
The slope for the 2 dichotic conditions, 
44.7 msec, per list item for same category 
lists and 41.4 msec, for different category 
lists, indicate that the 2 functions are 
essentially parallel. Both are slightly 
steeper than the monaural condition (33.7 
msec. per item). 

Mean correct RT for each of the 7 nega- 
tive probe types as à function of list length 
is shown in Figure 2. An independent 
analysis of variance of the various types 
of negative probes was carried out to 
clarify the interpretation of the observed 
differences, The S: probes, i.e., those in 
which an actual item from the unattended 
list was presented under the condition 
where both lists were from the same cate- 
gory, were not included in the analysis 
because of the large number of errors that 
occurred in this condition. The remaining 
6 probe types were examined in à Probe 
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Type X List Length X Ss factorial anal- 
ysis of variance design. Both probe type, 
F (5, 20) = 5.5, P < O1, and list length, 
F (2,8) = 21,5, p < 01, were found to be 
significant variables. For this analysis, 
only correct responses of less than 2 sec. 
latency were included in determining 
means. 

All sets of paired comparisons of probe 
types were made by the least significant 
difference test. When a .05 level was 
assumed, the 6 probe types fell into 2 dif- 
ferent groups. No differences were found 
for S Di, and Dz items. These 3 types 
were all responded to more slowly than 
S, Ds, and D, probes, with the latter 
3 not being differently responded to. 

To summarize the data in Figure 2, the 
slowest negative RTs were made to actual 
items from the unattended list in the con- 
dition where both lists were drawn from 
the same category. Collapsing across list 
length, mean RT for the S; condition was 
713 msec, The second longest RTs were 
found when a probe was presented from 
the unattended list, the 2 lists being drawn 
from different categories (D,), and in those 
conditions where the negative probe was 
from the same category as the attended 
list, regardless of category similarity of the 
2 lists (Sı and Dy). The mean RTs for 
these 3 conditions were 544, 534, and 547 
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msec., respectively. Finally, the fastest 
negative responses occurred in the 3 con- 
ditions in which the probe was drawn 
from a different category from the attended 
list, even if that item represented the 
category of the unattended list, providing 
it was not an actual item from the un- 
attended list. The mean RTs for these 
3 conditions were 505 (S;), 491 (D3), and 
508 (D4) msec. 

Negative probes that represented actual 
items from the unattended list (S: and D;) 


Negative reaction times in dichotic condition for each probe type. 


were examined further for serial position 
effects. Plotting the mean RT as a func- 
tion of serial position for each list length 
did not reveal any consistent trends. 
Errors. The mean percent errors at each 
list length are shown in Figure 3. Positive 
and negative responses have been combined 
in the monaural condition. Error rate in 
the monaural condition did not vary as a 
function of list length, and averaged 3.1%. 
With dichotic presentation, the errors for 
negative probes with the exception of the 
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S. condition, were the same as for mon- 
aural presentation, in that they did not 
vary across list length, and averaged 2.9%. 
For positive responses in the dichotic con- 
dition, errors were found to be greater 


than for negative probes, and to increase 
with increases in list length. 

By far the greatest number of errors 
occurred in the S. condition, where the 
probe is an actual item from the unattended 
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list, both lists being drawn from the same 
category. For List Length 2, error rate was 
a relatively small 7.5%. However, for the 
longer list lengths it was approximately 
35%, indicating that Ss have great diffi- 
culty distinguishing these items from items 
in the attended list. In contrast, only 2.8% 
of the responses to Ds probes were errors. 
To determine whether the error rate for 
the S; probe was a function of serial posi- 
tion in the unattended list, error rate was 
plotted as a function of serial position for 
each list length. However, no consistent 
pattern was found. 

Serial position effects for positive probes. 
Mean RT and percent errors for positive 
probes as a function of serial position were 
examined for both the monaural and vari- 
ous dichotic conditions, For List Length 2, 
RT and errors were greatest in Serial Posi- 
tion 2, For List Length 4, a bow-shaped 
function was found for both RT and errors, 
with slowest responses and greatest number 
of errors occurring toward the middle of the 
list. Finally, at List Length 6, while the RT 
curve was again generally bowed for the 
monaural condition, a marked recency 
effect was found in the dichotic conditions. 
Similarly , errors were greatest for the items 
at the beginning of the list, with fewest 
errors occurring at the later serial positions. 


Discussion 


The results of this experiment provide evi- 
dence that the unattended input is not stored 
solely in terms of sensory features. Rather, 
some category analysis appears to take place. 
The strongest evidence for this conclusion 
comes from the 2 conditions in which an actual 
item is drawn from the irrelevant channel and 
presented as a probe (requiring a negative 
response), [f all unattended items are stored 
às acoustic representations, as suggested by 
Glucksberg and Cowan (1970), the category 
membership of the unattended message should 
be unimportant. However, it was found that 
if the items in the unattended channel belonged 
to the same category as those in the attended 
list (S+), RT was far greater than in the con- 
dition where the unattended message belonged 
to à different category (D;). 

The 2 conditions also differed greatly with 
respect to errors, If a probe, selected from the 
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unattended channel, belonged to a differer 
category from the attended list (D+), @ 
rate was low, resembling that found for t 
monaural condition, and did not vary with 
length. On the other hand, if the probe wi 
an item that had been presented in the ignor 
ear and did belong to the same cate 
error rate was very high, increas: 
with list length. 

It is of interest to note that th: 
irrelevant list category on either RT or 
is found only when the probe is an actu 
from the unattended list. A negative probe 
from the same category as the unattended lis 
(D4) is not responded to differently than a 
probe from a category that was not present 
at all (Ds). Hence, category membership ol 
the unattended items is only an importan 
variable when the sensory or acoustic repre 
sentation of the items are actually present. 
This would lend support to the conclusion of 
Glucksberg and Cowan (1970) that there doi 
not appear to be any abstract category repre 
sentation for unattended items. 

The finding that negative probes from thi 
same category as the attended message (Si 
and D;) are rejected more slowly than word. 
from a different category (Ss, Ds, and Dı), ii 
in agreement with the results of Lively an 
Sanford (1972). It indicates that categoriza- 
tion occurs for the attended message, with this 
information being used as a basis for rejection. 
The failure to find this effect in the monaural 
condition (i.e., shorter latency for noncategory 
negative probes) may be due to the fact that 
unlike the Lively and Sanford study, where 
only 2 highly learned categories were used 
(digits and letters), many different categories 
were employed in this experiment, Since the 
monaural condition was always presented first, 
‘Ss were less practiced than in the dichotic con- 
dition, and perhaps registration of category 
information was sufficiently variable to elimi- 
nate any differences. 
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FEEDBACK PRECISION AND POSTFEEDBACK 
INTERVAL DURATION ! 


CECIL A, ROGERS, Jn.* 


University of Arizona 


Precision of feedback grain was manipulated in a simple positioning t 
An optimum was found; an increase in precision past that optimum prod 
deleterious effects upon rate of acquisition. 


In a second study, increasing 


postfeedback interval removed that optimum. The feedback precision el! 


were then replicated in a timing task. 


The combined results of the 3 stud 


were interpreted as supportive of an information-processing approach to t 
study of postfeedback interval events for simple motor skills. ‘The findi: 
additionally supported specific predictions by Bilodeau and deductions fro 


Adams' 1971 theory of motor learning. 


Trowbridge and Cason (1932) provided 
early evidence that increases in feedback 
precision normally promote rate of acquisi- 
tion in perceptual-motor tasks. Reviews 
of the literature by Adams (1971) and 
Bilodeau (1966, 1969) confirm this relation- 
ship for a variety of tasks. Bilodeau did 
suggest, however, that for any specific 
task, some optimal level of feedback grain 
might exist. Increases in precision beyond 
that level might fail to enhance and could 
be detrimental to the learning process. 
Extrinsic feedback might be refined past 
the point of human response capability or 
could exceed S's information-processing 
capacity. 

One previous study by Bilodeau and 
Rosenbach (1953) found few effects from 
varying feedback grain in a knob position- 
ing task. Since their range of manipulation 
of feedback precision was somewhat narrow, 
Experiment I was designed to investigate 
broader variations of feedback precision 
in the same task. 

The first study was, in essence, a search 
for Bilodeau's (1966, 1969) predicted opti- 
mum. The task and procedures were 
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Rosenbach 
ioning task 


adapted from Bilodeau a 
(1953). Their micrometer p 
met criteria that the amour’ of intrinsic 
feedback should be relativ small and 
that response error measurement should be 
finely grained. 


EXPERIMENT I 
Method 


Subjects. The Ss were 80 and 
volunteers from introductory psychology 
the University of Arizona. The e randomly 
assigned to 1 of 4 groups with the restriction that 
each group contain equal numbers of each sex. 

Apparatus and procedure. A micrometer (Starrett 
Model 263RL) was mounted behind a black wooden 
screen so that its turning knob projected through a 
hole in the center of the screen (Bilodeau & Rosen- 
bach, 1953). The Ss were seated and their arms 
were positioned perpendicular to the apparatus, 
which rested upon a table. The view presented to 
S was that of a black knob on a black background. 
The body of the micrometer rested behind the screen 
and was visible only to Æ. Verbal instructions to S 
indicated that he must rotate the knob and learn 
to stop it at the correct position. One complete 
rotation of the knob produced a mi eter shaft 
displacement (not visible to S) of . unit; the 
goal response was set at 4 units Æ ,5 unit. Response 
error was read directly from the micrometer by Æ 
before rounding and reporting it orally to S. Error 
was recorded to the nearest .001 unit for all groups. 
The micrometer was calibrated such that 1 unit 
equaled 1 in. 

Ten acquisition trials were given to each S, A 
single trial consisted of a verbal ready signal 
followed in 2 sec. by a go signal which was followed 
in turn by, S’s response, oral feedback 43 sec. after 
the go signal, and the next ready signal 5 sec. after 
termination of feedback. Duration of feedback was 
approximately 4 sec, and the total trial cycle was 
about 55 sec, 


female 
ses at 


male 
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On the first trial, all Ss were told to turn the knob 
as far as they cared to. If S hit the target on a 
subsequent trial, he was told that his response was 
correct and that he should try to repeat it on the 
next trial. 

Design. The Ss in the 0-order treatment were 
informed of direction of response error relative to 
the goal (too short or too far). The 1-digit group also 
received magnitude information rounded to the 
nearest whole unit. Response error for 2- and 4-digit 
groups was reported to the nearest tenth and 
thousandth of a unit, respectively. For example, an 
actual response error of +3.214 in. would be re- 
ported as "too far," +3, +3.2, or +3.214 units, 
depending upon feedback condition. 


Results and Discussion 


Absolute error data were consolidated 
into 5 2-trial blocks as displayed in Table 1. 
Variance analyses of Block 1 revealed no 
significant differences among groups at the 
onset of practice, F (3, 76) = .40. Subse- 
quent analyses revealed a significant prac- 
tice effect, F (4, 304) = 72.38, p < .01, 
and a significant feedback grain effect, 
F (3, 76) = 341, p «.025. No Feed- 
back X Practice interaction was indicated, 
F (12, 304) — 1.62. As shown in Table 1, 
mean response error showed progressive 
diminution with practice and the increase 
of feedback precision from 0 to 1 to 2 
digits of information. The 4-digit group 
showed a reversal of this trend by per- 
forming more nearly like the 0-digit con- 
dition. Examination of Block 5 data 
revealed a feedback precision effect; F (3, 
76) = 3.83, p < .025. An unequal interval 
trend analysis of precision levels at Block 5 
indicated a curvilinear component, F (1, 
76) = 11.0, p <.01.  Bilodeau's (1966, 
1969) hypothesis was then confirmed in 
that the most precise level of feedback 
administered in Experiment I not only 
failed to produce optimal acquisition rates, 
but slowed acquisition relative to the 2 
intermediate feedback levels. 

An analysis of within-trial events sug- 
gested a possible mechanism for operation 
of the feedback grain effects found in 
Experiment I. Postfeedback interval, from 
termination of feedback to the next 
response, was 7 sec. Adams (1971) re- 
viewed the effects of interpolating motor 
responses in the postfeedback interval and 


TABLE 1 
MEAN RESPONSE ERROR (IN IN.) FOR F 
CONDITIONS OVER TRIAL BLOCKS: 
EXPERIMENT | 


Trial block 


Digits of | 


feedback x 
1 2 | 3 4 5 
0 491 | 447 3.14 
1 4.05 | 2.90 1:92 
2 4.05 | 2.63 .87 
1 | 4.81 | 4.17 281 


found few effects on motor-skill acquisition. 
He hypothesized that the interval is pri- 
marily one of verbal-cognitive activity by 
S, wherein feedback is processed and a 
response modification is computed. Adams 
further suggested that acquisition might 
be slowed either by interpolating verbal 
activity in the postfeedback interval or by 
excessively foreshortening it. If some such 
minimal information-processing time is 
required between feedback and the next 
response, it seems likely that feedback 
precision would also in part determine 
optimal interval duration. As precision 
increases, more information is imparted to 
S. As Adams notes, such precision nor- 
mally provides S with information to make 
more finely graded responses. However, 
as S's informational load is increased, more 
time is required for his selection of a 
response modification. For a given task, 
acquisition might then be hindered by 
very precise feedback combined with a 
short postfeedback interval. The 7-sec. 
postfeedback interval in the present study 
may have been inadequate for the 4-digit 
condition. The present results considered 
in the context of Adams’ theory suggest 
a possible interaction between feedback 
precision and postfeedback interval 
duration. 

Weinburg, Guy, and Tupper (1964) pre- 
viously demonstrated the deleterious effects 
of shortening the postfeedback interval 
while holding grain constant. For one 
positioning task, they found that 5 sec. 
was the minimal period required between 
feedback and the next response. While 
increases beyond 5 sec. did not aid acquisi- 
tion, reduction below 5 sec. was harmful. 
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Experiment II] was then designed as a 
replication of Experiment I, but with the 
postfeedback interval lengthened. An im- 
provement in the ranking of 4-digit Ss 
might then be expected in the context of 
Adams' (1971) theory. Bilodeau's (1969, 
p. 277) position, which emphasizes the 
information-processing aspects of the post- 
feedback interval, would also be suggestive 
of such a finding. Such an outcome would 
further be consistent with data obtained 
on postfeedback intervals by Bourne and 
Bunderson (1963) for concept formation 
tasks. 


EXPERIMENT II 
Method. 


Subjects. Sixty undergraduate volunteers from 
Tulane University were divided into 3 groups of 
20 Ss. Each group was comprised of an equal 
number of males and females. 

Apparatus and procedure. No change was made 
in apparatus. Procedure differed from Experiment 
I only in lengthening of postfeedback interval from 
7 to 14 sec., with a consequent foreshortening of 
the prefeedback period from 43 sec. to 36 sec. The 
total trial cycle remained 55 sec. 

Design. The 0-digit feedback condition present 
in the first study was omitted from Experiment II. 
Feedback grains of 1, 2, and 4 digits were admin- 
istered as in Experiment I to the 3 remaining groups 
over 10 trials, 


Results and Discussion 


Mean absolute response error per 2-trial 
block is shown in Table 2 for the 3 feed- 
back grain conditions. A reordering of 
rank-order positions obtained in comparison 
to those found in Experiment I. The 
relative performance levels of the 1- and 
2-digit conditions remained the same, but 
the 4-digit group improved from worst to 


TABLE 2 


MEAN Response ERROR (IN IN.) FOR FEEDBACK 
CONDITIONS OVER TRIAL BLOCKS: 
EXPERIMENT IT 


Trial block 
Digits of 
feedback 
1 2 3 | 4 | 5 
1 5.75 | 2.79 | .1.91 | 1.33 | 122 
2 5.27 | 2.93 | 1.49 .88 .83 
4 4.68 | 2.00 .88 .68 54 
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best of the 3 conditions. Analysis © 
Block 1 data indicated ' difference 
among groups early in | ice, F ¢ 
57) < 1.0. By Block 5, precision levels di 
fered, F (2, 57) = 4.05, p < .025. Furth 
analysis of the Block 5 data revealed 


linear trend across precision levels, F (1 
57) = 7.56, p « .01. This linear tren 
contrasted with the curvilinear trend foun 
in Block 5 of Experiment |. This dis 
appearance of an optim) in Experi- 
ment II was consistent wii) the theori 
of both Adams (1971) and Bilodeau (1966, 
1969) as described above. 

The data for the 1-, 2-, 4-digit com 
ditions from Block 5 of bot) studies wert 
combined for further analysis. A feed 
back precision effect, 7^ (2, 114) — 3.21 
p < .05, and an experiment effect, F (f, 


114) = 12.10, p < .01, were found. Ai 
Experiment X Feedback Precision inter 
action was also shown, F (2, 114) = 5.34, 


< .01. That interaction supported the 
suggested dependency between precision 
of feedback and postfeedback interval 
length. On the whole, these findings were 
consistent with an information-processing 
approach to the learning of a simple posi- 
tioning task. Such an approach might 
further suggest that other variables could 
be expected to interact with feedback pre- 
cision and/or duration of postfeedback? 
interval. Information-processing load upon 
S during that interval might vary with 
response complexity, number of response al- 
ternatives, feedback transformations, dura- 
tion or rate of presentation of feed- 
back, idiosyncratic information-processing 
Strategies, and related variables which 
are extensively reviewed by both Adams 
and Bilodeau. 


EXPERIMENT III 


The phenomenon of a feedback precision 
optimum found in Experiment I might 
have been idiosyncratic to the micrometer 
positioning task. A third study was then 
conducted with a different apparatus and 
procedure. An effort was also made to 
apply a more specific information-proces- 
sing approach to the present problem. 
Bernstein, Blake, and Hughes (1968) view 
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information processing as occurring in 3 
successive stages: filtering or detection, 
processing or stimulus analysis, and re- 
sponse selection. Separate mechanisms are 
postulated to operate at each stage and 
rarely operate in parallel (Decker & Rogers, 
1973). The most likely locus for operation 
of feedback precision effects would be in 
the stimulus analysis and/or response 
selection stages during the postfeedback 
interval. By reducing the number of 
response alternatives to 1, while continuing 
to present extrinsic feedback at supra- 
threshold levels, varying precision levels 
might be studied relative to the stimulus 
analysis stage alone. A delayed single- 
choice reaction time task was then chosen 
for Experiment II. Occurrence of a 
precision optimum with such a task would 
then lend limited generality to the phe- 
nomenon and associate it in part with 
the theorized stimulus analysis stage of 
Bernstein et al. 


Method 


Subjects. Forty-five undergraduate volunteers 
from the University of Arizona were randomly 
assigned to 3 groups without regard to sex. 
was paid for his participation. 

Apparatus. A modified Lafayette reaction time 
apparatus (Model 322B) was interfaced with an 

- electronic timer (Hewlett-Packard Model 522B). 
Onset of the single blue stimulüs light activated 
the timer. Depression of a telegraph key terminated 
elapsed time measurement and provided offset of 
the light. The calibrated timer measured elapsed 
time to the nearest ,00001 sec. 

Procedure. The Ss were seated before a table 
with the index finger of their preferred hand resting 
upon a telegraph key. They were instructed that 
their task would be to learn to wait, following onset 
of the light, for a specific interval prior to depressing 
the key. Following a response, feedback was 
presented for direction (too soon or too late) and 
magnitude (in seconds) of timing error. Target 
time was set at 9 sec. + .5 sec., but Ss were not so 
informed. en trials were provided. The event 
sequence consisted of a verbal ready signal followed 
in 2 sec. by onset of the stimulus light, followed in 
turn by S's response, feedback 36 sec. after the 
ready signal, and the next ready signal 5 sec. after 
termination of feedback. Feedback duration was 
about 4 sec., providing for a total trial cycle of 
45 sec, When S responded during the target inter- 
val, he was told that he was correct and should 
repeat his performance on the next trial. 

Design. Feedback grain was varied among the 
3groups. The 1-digit group received error rounded 


TABLE 3 


TIMING ERROR (IN SEC.) BY I 
CONDITION AND TRIAL BLOCK: 
EXPERIMENT III 


MEAN 


= = 
| Trial block 

Digitsof | 

feedback 


1 LE 149 | 1.54 
dress 1.05 | .89 
8 | 7.2 1.77 | 1.45 


to the nearest whole second. The 4-digit group was 
given feedback rounded to the nearest .001 sec., 
while 8-digit Ss received feedback refined to the 
nearest .0000001 sec. In the latter group, the last 
2 feedback digits were taken from a random numbers 
table because the timer was only accurate to 6 digi 


Results and Discussion 


Table 3 presents mean absolute response 
error for each feedback condition over 
5 2-trial blocks. No differences among 
groups were found in Block 1, F (2, 
42) < 1.0. Further analysis indicated a 
practice effect, F (4, 168) = 41.75, p < .01, 
and a Practice X Feedback Precision inter- 
action, F (8, 168) = 2.48, p < .05. No 
simple precision effect was found, / (2, 
42) — 2.07. Examination of Block 5 data 
showed a curvilinear trend over precision 
levels, F (1, 42) = 10.0, p < .01, as was 
found in Experiment I. Optimal feedback 
precision was the 4-digit condition. Maxi- 
mum precision produced intermediate level 
performance with the 1-digit group rank- 
ing lowest. The feedback precision effect 
found in Experiment I with the positioning 
task was then replicated for the timing 
task. The optima for the 2 task-procedure 


combinations were different despite a 
7-sec. postfeedback interval for both 
studies. The higher 4-digit optimum 


obtained in Experiment III may have 
resulted from any number of differing 
variables between the 2 studies, including 
the shorter trial cycle and simplified 
response demand characteristics in the 
timing task. The deleterious acquisition 
effect of excessively precise feedback in the 
third study was less severe than in the 
first, perhaps for similar reasons. On the 
whole, Experiment IHI findings lend limited 
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generality to the feedback optimality 
phenomenon. The existence of such an 
optimum, in a task where extrinsic feed- 
back was at suprathreshold levels and 
response selection demands were low, sug- 
gests that the information-processing mech- 
anism affected may operate in part during 
the stimulus analysis stage described by 
Bernstein et al. (1968). 

Further extension of the Bernstein et al. 
approach to simple skills tasks would sug- 
gest that the detection stage operates 
roughly between onset and offset of feed- 
back. This feedback duration interval 
occurs between pre- and postfeedback 
intervals but has rarely been extracted or 
examined in simple positioning tasks. 
Previously established signal detection 
variables would presumably operate during 
this feedback presentation interval. Occur- 
ring next, during the early part of the post- 
feedback interval, would be the stimulus 
analysis stage. Its minimal length would 
be determined by feedback characteristics, 
long- and short-term verbal and motor 
memory demands, and possibly cognitive 
or problem-solving requirements in some 
tasks. Bernstein's response selection stage 
would occur last in the postfeedback 
interval, provided such time were avail- 
able. The Bernstein et al. formulations 
may represent a convenient vehicle for 
deriving increasingly specific hypotheses 
as to the nature, duration, and sequencing 
of postfeedback interval events in simple 
motor-skills tasks. Such a microscopic 
analysis of within-trial events follows 
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directly from the suggestions of bc dams 
(1971) and Bilodeau (1966, 1969) t; imph 
motor tasks have strong verbal- itive 
components active during the tfeed- 


back interval. 


REFERENCES 


Apams, J. A. A closed-loop theory of m 
ing. Journal of Motor Behavior, 1971, < 
BERNSTEIN, I. H., BLAKE, R. R., & Huc I. 
The effects of time and event 


uncert y 


Pei 


sequential information processing. 
Psychophysics, 1968, 3, 177-184. 
BiLopEAU, E. A., & RosENBACH, J. H. A 
of response proficiency as a function of 
error in informative feedback. 
5. 


(Res. B 

21) Lackland Air Force Base, Tex 

Air Force Human Resources Research Center, 
1953. 

BiLopEAU, l. McD. Information feedback 
E. A. Bilodeau (Ed.), Acquisition of skill. 

cademic Press, 1966. 

Bitopeau, I. McD. Information 
E. A. Bilodeau & I. McD. 
ciples of skill acquisition. 
Press, 1969. 

BounwE, L. E., Jr., & Bunperson, C. V, Effects 
of delay of informative feedback and lengt!: of 
postfeedback interval on concept identification. 
Journal of Experimental Psychology, 1963, 65, 1-5. 

Decker, L, R., & Rocers, C. A., Jr. Forced guid- 
ance and distribution of practice in sequential 
information processing. Perceptual and Molor 
Skills, 1973, 36, 415-419. 


In 
ew 


In 
ine 
ade nic 


feedbac! 
Bilodeau (Eds.) 
New York: / 


An experi- 
mental study of Thorndike's theory of learning. 
Journal of General Psychology, 1932, 7, 245-258. 
'EINBERG, D. R., Guy, D. E., & Tupper, R. W. 
Supplementary report: Variation of postfeedback 
interval in simple motor learning. Journal of 
Experimental Psychology, 1964, 67, 98-99, 


(Received June 25, 1973) 


Journal of Experimental Psychology 
1974, Vol. 102, No, 4, 609-614 


PRODUCTION, 
INTERVALS DURING I 


ESTIMATION, AND REPRODUCTION OF TIME 
HALATION OF A G 


NERAL 


ANESTHETIC IN MAN' 


NILLY ADAM, ANGELINA D. CASTRO, Axp DONALD L. CLARK 


School of Medicine, University of Pennsylvania 


The internal clock model for time judgments of short intervals describes 
adequately the relations among the methods of estimation, production, 


and reproduction observed under normal conditions. 
thetics affect time productions by slowing of this hypothesize 
ions predicted by the model would 


study investigated whether 


produced, and reproduced 3—40-sec. 


the relat 
hold during inhalation of a general anesthetic—fluroxene. 
intervals before and during fluroxene 


Sine 


general anes- 
ed clock, this 


Six Ss estimated, 


inhalation. The model predicted the results from the majority of Ss, thus 


supporting the internal clock concept. 


The anesthetic increased the slope 


of the production function and accordingly decreased the slope of estimation, 


leaving the slope of reproduction unchanged, thus mai 
An alternative model, viewing time judg- 
depending on memory content, could not account for the effects 


relations among the methods. 
ments as 


ntaining the predicted 


of the anesthetic on time reproductions seen with some Ss. 


The 3 most common measurement meth- 
ods of time perception are estimation, 
production, and reproduction. The rela- 
tionships among time judgments obtained 
through these 3 methods have been the 
subject of numerous investigations (for 
reviews, see Adam, 1971; Bindra & Waks- 
berg, 1956). The most explicit formula- 
tion of these relationships was put forward 
by Carlson and Feinberg (1968). They 
based their reasoning on the assumption 
of a “clock” underlying time perception. 
Presumably, S uses the same clock to 
judge time intervals measured by these 
methods. 

In estimation, the slope of the function 
depicting time estimates against objective 
time gives the ratio of the subjective to 
the objective clock. That is, in the func- 
tion Tg = kxto, where Tg = time esti- 
mates, to = objective time, and kg = the 
slope or the ratio of the number of ‘‘sub- 
jective’ seconds per each “objective” 
second.’ Thus, if S receives a 2-sec. stimu- 


This research was supported by U.S, Public 
Health Service Research Center Grant 5-PO1- 
GM-15430-05. The authors appreciate the tech- 
nical assistance of K. Van Horn and E. Dubroff. 

? Requests for reprints should be sent to Nilly 
Adam, P.O. Box 4563, Philadelphia, Pennsylvania 
19131. 

3A linear function was chosen since time judg- 
ments at the stimulus range of the present study 


lus which he estimates as being 1 sec. long, 
his subjective ‘objective ratio is .5 and ac- 
cordingly the obtained function T (to) 
should subtend 26°30’ (arctan .5). 

In production, S is expected to keep the 
same subjective/objective ratio as in esti- 
mation, although this is not necessarily so. 
If the same ratio is used, then when the 
function of time production—Tp = krts 
where Tp = time produced in actual 
seconds, kp = the slope, and t, = the 
stimulus stated verbally—is plotted on 
the same graph as time estimates, the 
y axis now gives the objective seconds as 
produced by S and the x axis now gives 
the subjective seconds. Hence, 


kp = 1/kn. 


Lt] 


Reproduction also depends on the same 
clock. One way of formulating the events 
during reproduction is assuming that first 
S estimates the presented interval and 
then produces the estimated interval. If 
this describes adequately the events in 
reproduction, then based on the previously 
(3-40 sec.) have been shown by numerous in- 
vestigators to follow a linear trend. This is also 
true for judgments obtained during anesthetic 
inhalation (Adam, Rosner, Hosick, & Clark, 1971). 
Intercepts are usually very small (Adam et al., 
1971; Carlson & Feinberg, 1968) and are negligible 
for the purpose of the analysis, 
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stated relations between estimation and 
production, the following should be true: 


[2] 
where Tg = the reproduced interval, t, 
— the presented stimulus, and kg — the 
slope of the function. Since first S is 


assumed to estimate the stimulus, this 
estimate is 


Tr = krt, 


Tr = ket. [3] 
This estimate, Ty, in turn, is the stimulus 
for production; thus, 


Tp = keTz. [4] 


Since the final value of reproduction is 
equal to the production of the estimated 
interval, it follows from Equations 2-4 that 


Tr = Tp = kphzt,. [5 
It follows from Equations 2 and 5 that 
kr = kp-kp. [6] 


Since kp has already been defined as kp 
= 1/kz for the case where S uses the same 
clock rate in estimation and production, 


then 
[7] 


In other words, the function of reproduc- 
tion against time should always have a 
slope of 1.0 (i.e., 45°), 

The relations delineated above have 
been shown to closely approximate the 
actual results obtained from normals and 
schizophrenics (Carlson & Feinberg, 1968). 
In a more recent paper, Carlson and Fein- 
berg (1970) have shown that production 
and estimation may involve different 
"clock rates" when not obtained from the 
same S. Also, the slopes of production 
and estimation may vary with experience 
even when obtained from the same .S. 
However, it was demonstrated again that 
if all 3 measures are obtained from the 
same individual the relations outlined 
above are maintained, 

The purpose of this study was to use 
anesthetic drugs in order to observe 
whether the relations among the 3 methods 
still hold under conditions where it is 
known that at least one of these methods 
gives an altered slope. Adam, Rosner, 


kg = 1. 
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Hosick, and Clark (1971) have shown that 
some general anesthetics increase the slope? 
of production, that is, slow down the sup- 
posed clock. The present experiment 
employed low concentrations oí fluroxene 
—a general anesthetic. Time judgments 
were obtained before and during drug 
inhalation. 


METHOD 


Subjeci The Ss were 6 young paid male volun- 
teers participating in a study of the neurophysio- 
logical effects of fluroxene in conscious man.* After 
giving informed consent to participate, each S 
underwent a complete physical examination, blood 
count, urine analysis, chest X ray, clectrocardio- 
gram, and determinations of blood ure nitrogen, 
and serum glutamic oxaloacetic ir insaminase, 
Only Ss within the normal range on cach of these 
tests were accepted for study. 

The S avoided all solids, liquids, and drugs for 
12 hr. prior to the study. During the study, 
respiratory raté and end-tidal carbon dioxide were 
monitored, Blood pressure, heart rate, and rectal 


temperature were recorded throughout the study." 
The anesthetic was delivered through a non- 
rebreathing circuit vented to the t-ol-doors. 
A mouthpiece was used rather than mask for 
easier control of drug concentration Dose was 


justed individua 
tested when equ 
concentration 
concentration), 
Material and procedure. Reproduction was tested 
by having S listen to a tone for a specified number 
of seconds. The E then started a second tone 
and S had to stop the tone by pressing a micro- 
switch when he felt that the second tone equaled 
the first one in duration. For production, S ac- 
tivated a tone by pressing a microswitch which 
he released when he felt the duration of the tone 
was equal to an interval stated verbally by E. 
In estimation, S5 heard a tone activated by E. 
He then had to state its duration in seconds. 
Reproduction was tested for 3-, 10-, 20., and 
40-sec. intervals, e: imation for 3-, 20-, and 40-sec. 


to allow testing. 
brium was reached 
within .1—295 of the 


The S was 
(end-tidal 
inspired 


‘This effect cannot be attributed to the slow- 
down which occurs with practice (Carlson & Fein- 
berg, 1970). The removal of the anesthetic brought 
about a decreased slope, whereas the effect of 
practice on production alone would be to con- 
tinuously increase the slope. 

*D. L. Clark, A. D, Castro, and N. Adam. 
Neurophysiological effects of fluroxene. Manuscript 
in preparation. 

* No correlation was found between these vari- 
ables and time judgments in another study with 
other general anesthetics (Adam et al, 1971). 
Hence this aspect was not assessed in the present 
study. 


PERCEPTION OF TEMPORAL INTERVALS UNDER ANESTHESIA 


intervals, and production for 6-, 20-, and 40-sec. 
intervals. Each S was tested with all 3 methods. 
The order of methods was counterbalanced over Ss. 
Stimuli for each method were presented randomly, 
with the constraint that not more than 2 identical 
stimuli were given consecutively. 

Control measures were obtained first, with the 
anesthesia system hooked to S and delivering zero 
concentration. Testing during drug inhalation was 
done when equilibrium was achieved. The only 
time S gave a verbal response was during estima- 
tion. Usually it was possible to understand his 
answers despite the mouthpiece. If not, a throat 
microphone was used. If this was not effective, 
S was asked to signal his answer with his fingers. 

When testing was completed, the mouthpiece 
was removed and S was taken to the hospital 
recovery room until ready for discharge. 


RESULTS 


Time judgments were plotted separately 
for each individual for the 3 methods. 
The best-fitting linear functions for indi- 
vidual Ss for control and anesthetic con- 
ditions are shown in Figure 1. Table 1 
gives the arctan of the slope of each 
function, the expected arctan slope of 
production according to the relation Slope 
Production — 1/Slope Estimation, and the 
expected arctan slope of reproduction ac- 
cording to the relation Slope Reproduction 
— Slope Estimation X Slope Production 
(see Equations 1 and 6). Intercepts were 
very small, with M — .21 and SD -.. 
Since the anesthetic did not affect the 
intercept significantly (see also Adam et al., 
1971), the intercepts are not shown in 
the table. 

As seen, 4 Ss (RS, RO, KC, and RM) 
showed the expected relations among the 
3 methods in both control and drug con- 
ditions. The deviations of the actual 
arctan slope of production from the ex- 
pected values were, respectively, 2°10’, 
2910', 4°20’, and 3?20' for control, and 
0°50’, 2°20’, 1°, and 1?10' for the drug 
condition. The deviations of the arctan 
slope of reproduction from the expected 
values for these 4 Ss were 1930', 0°30’, 
0°40’, and 0°10’ for control, and 0°20’, 
3°50’, 3°20’, and 2°20’ for the drug con- 
dition. All of these values represent de- 
viations of Jess than 10% from the ex- 
pected values. There was no tendency 
for the actual values to be either con- 
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sistently larger or smaller than the ex- 
pected values. 

Although the anesthetic did not alter 
the relations among the results from the 
3 methods in these 4 Ss, it did affect the 
slopes of estimation and production. The 
arctan slope of production was increased 
in all 4 Ss, ranging from an increase of 
1°30’ for RS to 13°50’ for KC. This 
confirms the results obtained by Adam 
et al. (1971) for the production method 
with several other inhalation anesthetics. 
The slope of estimation was decreased 
accordingly, ranging from a decrease of 
3°20’ for RM to a decrease of 17?15' 
for KC. 

The 2 remaining Ss, however, deviated 
in some respects from the expected trends. 
For both TB and AS, all 3 control re- 
sponse curves had slopes smaller than 1.0 
(arctan smaller than 45?) (see Figure 1). 
This is in contrast to the expected curves, 
where reproduction should subtend 45° 
and production and estimation should be 
symmetrical around the reproduction 
curve. This implies that both these Ss 
did not follow the relation shown in Equa- 
tion 1. Table 1 gives the expected and 
obtained values for kp. 

Note that if kp = 1/kg, then the other 
relation derived from the model is kr 
= kp'kp = 1 (Equations 6 and 7). How- 
ever, if kp Æ 1/ke but if reproduction still 
follows the suggested sequence of estima- 
tion and production, then the relation 
kr = krky still holds, only it is not equal 
to 1 (Equatior 6). The control results of 
TB and AS secm to fit this latter relation. 
The deviation of the obtained from the 
expected arctan kx was 0°55’ for TB and 
2°50’ for AS, a deviation of less than 10%. 

The anesthetic affected the results in the 
predicted direction for production and esti- 
mation. The slope of production increased 
and that of estimation decreased. The 
only S to be tested at 2 different anesthetic 
concentrations was AS. As predicted, as 
concentration increased from 1.01% end- 
tidal to 1.21% end-tidal, the arctan slope 
of production increased from 49° to 63°. 
Accordingly, that of estimation decreased 
from 36°45’ to 31°25’, indicating the dose- 
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ViGURE 1. Best-fitting linear functions 


for estimation, production, and reproduction before 
and during fluroxene inhalation. (Data presented for individual Ss, The corresponding arctan 
slopes are given as obtained values in Table 1.) 


related effect of the drug. The relation for the lower and higher drug concentra- 
kr = kp:kg was held also during the anes- tion, respectively. Note that for AS at 
thetic condition for both Ss. The dey end-tidal concentration of 1.0195 arctan 
tion of obtained from expected values was kr came very close to the expected 45? 


1?20' for TB and 3?10' and 2930' for AS (obtained value — 43°50’). However, this 
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TABLE 1 
ARCTAN SLOPE OF RESPONSE CURVES FOR INDIVIDUAL Ss BEFORE AND DURING FLUROXENE [NHALATION 
Arctan slope m 
à ON (in CS E Anesthetic que Es 
San" | Order Obtained values Expected values 
| Estimation | Production | Reproduction | Production» Reproductions | Inspired End-tidal 
RS fi 
Control o | 3e | 57°40 44°30! 55°30! 46° 
Drug Saad 59°10! 44°20! 60° 44° 135 | 1.20 
SE 38°20! 49°30! 43°30! 51°40" 43° 
_Drug PRE | yeny | 60°20" 44°10! 62°40! 40°20" 145 | 122 
I Castrol : 54°15’ | 40°10! 47°30! 35°50! 48°10! 
Drug isto io. 54° 43° 53° 46°20! 1.78 | 1.40 
EA ond d 32530! | 54^10' 41240 57°30' 41°30" 
. Drug EPOR | opes! | 59°50! 40°10" 61° 42°30" 1.37 | 1.07 
S arol _, , | 3830 | 43°40! 36*05' 51°42’ 376 
Drug ERP | ees | 6740 55°20! 60°32" 54° = 1.05 
Aral 41°10! | 33°10" 32°30! 48°50! 29°40! 
R,P,E | 36°45’ | 49° 43°50! 53°14’ | 40°40! 114 | 1.01 
Drug 31°25’ | 63° 52°40! 58°34! 50°10! 139 | 1.21 


» Abbreviations for order of presentation of methods, 
v Predicted on the basis of the relation Slope Produce’ 


on = 1/Slope E 


production, R = reproduction, 
imation. 


o Predicted on the basis of the relation Slope Reproduction = Slope Estimation X Slope Production, 


has no significance since further increase 
of concentration again removed kr from 
45° (see Table 1). 

The order in which S had to perform 
the 3 methods had no effect on the results. 
Table 1 gives the order. Note that each 
S was tested in a different order. How- 
ever, any order effect would be mainly 
due to the order of estimation and produc- 
tion in relation to one another. Note that 
the 2 exceptional Ss, TB and AS, per- 
formed estimation and production in a dif- 
ferent order. 


DISCUSSION 


Control results showed that the model first 
suggested by Carlson and Feinberg (1968) 
and delineated mathematically in the intro- 
duction, describes adequately the time be- 
havior of 4 Ss but only partially the results 
from the other 2 Ss (TB and AS). For the 
first 4 Ss all predicted relations were found. 
The obtained slope of production was reason- 


ably close tlo the expected 1/Slope Estimation 
and the sope of reproduction was reasonably, 
close to the product of the slopes of estima- 
tion and production. This is in line with the 
findings of Carlson and Feinberg (1968, 1970) 
for the case when each 5 gave time judgments 
on all 3 methods. This gives support to the 
theoretical notion of a single "clock" under- 
lying estimation and production and explain- 
ing reproduction as involving estimation of 
a presented interval followed by production 
of the estimated interval. The fact that in 
these 4 Ss the anesthetic altered the slopes 
of production and estimation but without 
changing the expected relations among the 
3 methods validated the results obtained 
under normal conditions. 

Two Ss, however, gave somewhat different 
results. In both cases, all 3 psychophysical 
methods had a slope smaller than 1.0. This 
implies that the slope of production did not 
conform to the predicted 1/Slope Estimation. 
This may mean one of two things. First, 
for some reason these 2 exceptional Ss may 
have used a different clock rate for production 
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than for estimation. This, in turn, resulted 
in a slope for production that is not the in- 
verse of that for estimation. Another pos- 
sibility is that the same clock was used for 
both methods but some unidentified response 
or processing variables affected the final result. 
Such variables could, of course, affect either 
estimation or production or both. It is rea- 
sonable to consider estimation as a process 
involving "starting the clock" and obtaining 
a reading when the stimulus is over; produc- 
tion then may be conceptualized as a process 
in which $ produces his own imaginary 
stimulus which he estimates every so often 
and compares the result with the required 
interval. He stops when the 2 match. If 50, 
then obviously different response variables 
may affect the 2 processes. The present 
results do not give any clue as to why only 
2 out of 6 Ss were affected in this way. 

Since for these 2 Ss kp did not equal 1/kp. 
it follows from Equation 6 that kp = kp:kg 
but need not equal 1.0. This relation was 
found to describe adequately the results from 
the 2 exceptional Ss, thus supporting the 
notion that reproduction depends on estima- 
tion with later production of the estimated 
interval. The anesthetic increased the slope 
of production and decreased that of estima- 
tion, as it did with the other 4 Ss. 

No correlation was found between the serial 
order of response and the magnitude of a time 
judgment. Also, the order of presentation of 
methods did not seem to have a noticeable 
effect. It is particularly important that the 
order of estimation and production was dif- 
ferent for the exceptional Ss. 

Another approach to the mechanism under- 
lying time perception Suggests that time judg- 
ment depends on memory content (Franken- 
haeuser, 1959). Thus, the obtained linear 
functions describing time judgments of periods 
longer than the immediate present reflect the 
decay of memory traces. According to such 
à concept, the anesthetic altered the slopes 
of estimation and production because the 
memory content deteriorates faster under the 
anesthetic. However, Frankenhaeuser has 
not presented any specific model for predicting 
the relationships among the 3 psychophysical 
methods, nor is it clear from her work how 
reproduction is carried out. If she adhered 
to the reproduction process as presented in 
the present article, her model would predict 
the same anesthetic effects on reproduction 
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there is qj 
reproductiom 
7n the point 
luction may 
gnition pro- 
is is found 
when it is] 
first stim- 
This latter 
4 of our Ss. 
were also 
znition task 


as the clock model. How: 
alternative way to conceptuali 
which fits her framework. | 
of view of memory theory, 
be thought of as reflectir 
cesses; that is, the second 
matching to the first stimul 
recognized as being similar to the 
ulus already stored in mem: 
possibility fits well the data 
As part of another study, 
tested for performance on a 
of 6-word lists during flur: inhalation, 
The effects were negligible í ill Ss: Thus, 
the fact that the slope of duction. did 
not change in the anesthetic dition in 4 Ss 
can be accounted for by the negligible effects of 
fluroxene on recognition at t studied con- 


centrations. However, 2 of "s did show 
altered reproduction slopes in the anesthetic 
condition. These changes predictable 
on the basis of the relation mong the 3 
psychophysical methods provided by the 
clock model. "These results cannot be ex- 


plained on the basis of impaired 
performance since the latter was 
fected. Some anesthetics do have 
effect on recognition even at very low con- 
centrations (Adam, 1973). It should be 
interesting to observe behavior on time judg- 
ments during inhalation of these anesthetics. 


recognition 
hardly af- 
a marked 
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Choice reaction time (CRT) judgments of same-different responses were 
measured for all possible pairs of nine English consonants by two groups of 
10 Ss each. A multidimensional analysis (IND SCAL) of the CRT values, 
for the first group of Ss showed that the magnitude of Ss’ CRTs was governed 
by the distinctive feature differe: een consonant pairs. A four-dimen- 
sional analysis revealed that the distin e features, voicing, sibilant, front/back 
and palatal were those used by the group. Examination of the individual Ss 
showed differing patterns of weighting these features. Analysis of the second 
group of 10 Ss served to cross-validate the results obtained from the first 
group. Results showed no significant difference in the relationship between 
CRT and distinctive feature differences for the two groups. Comparison be- 


tween these results and the 1969 res 
suggest that CRT is a good measure 
factors contributing to CRT are mini 


Judgments of consonant similarity have 
been used to find a set of phonological fea- 
tures that may be distinctive within a lis- 
tener's repertoire (Singh, Woods, & Becker, 
1972). These judgments of similarity have 
been made on the basis of subjective evalua- 
tions using methods of magnitude estima- 
tion and psychological scaling (Singh et al., 
1972) or by obtaining perceptual errors of 
direct judgments while listening to signals 
acoustically distorted by noise or filtering 
(Miller & Nicely, 1955; Singh, 1966, 1971). 

The results of several investigations have 
indicated that choice reaction time (CRT) 
varies directly with the similarity of stimu- 
lus alternatives, (Crossman, 1955; Thur- 
mond & Alluisi, 1963; Weiner, 1973). 
Reaction times are short when discrimi- 
nating between items which are obviously 
dissimilar and increase when discriminating 
between similar stimuli. 

Chananie and Tikofsky (1969) attempted 
to evaluate the similarity of consonants by 
relating CRT with the number of distinc- 
tive feature differences between consonants. 
They were unable to show that CRT was 
a measure of interphonemic similarity de- 
termined by distinctive feature differences. 


! Requests for reprints should be sent to Frederick 
E. Weiner, Department of Speech and Hearing 
Science, University of Maryland, College Park, 
Maryland 20742. 


ults of Chananie and Tikofsky seems to 
of phoneme similarity when extraneous 
mized. 


The distinctive feature system used was 
that of Miller and Nicely (1955). Recently, 
investigators have demonstrated that this 
feature system is a relatively poor predictor 
of phonemic perception (Singh & Becker, 
1972; Singh et al., 1972; Wickelgren, 1966). 
Others have varied the experimental con- 
ditions and found this system a good or 
better predictor of phoneme perception 
than other feature systems (Singh, 1970; 
Wang & Bilger, 1973). 

Rather than beginning with an a priori 
feature system, such as Miller and Nicely's 
(1955), to evaluate similarity, the present 
investigation was designed to find, by 
means of multidimensional scaling tech- 
nique, the feature system used by a group 
of Ss and then to determine whether CRT 
relates to similarity in terms of the features 
retrieved. 

METHOD 


Subjecis. Two groups of 10 undergraduate native 
American English speakers without history of hear- 
ing loss served as The responses of the first 
group of Ss served as an input to the multidimen- 
sional analysis for purposes of retrieving the group's 
perceptual feature system. The responses of the 
second group of S e used to cross-validate the 
relation between CRT and consonantal similarity 
observed in the first group of Ss, 

Stimuli. The stimuli were pairs of consonant— 
vowels (CVs) composed of one of the following 
English fricatives: /f,v,0,5,s,z,5,3,h/ followed by the 
vowel /a/. Half of the stimuli were constructed so 
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that both pairs of CVs for each item were identical, 
The remaining items were constructed so that the 
fricative of the second member of each pair was 
different from the fricative for the first member of 
that pair. Thus, 72 stimuli were recorded on 
channel 1 of a two-channel tape recorder. A pure 
tone was recorded on channel 2 at the initiation of 
the second member of each pair. All stimuli were 
presented so that the members of each pair were 
separated by 1 sec., with 5 sec. elapsing between 
stimuli, 

Apparatus, The stimuli were 
of a Sony (TC 200) stereo tape recorder with a 
frequency response of 50-10,000 Hz. + 3 db. and 
an overall frequency response of 50-14,000 Hz. 
The Ss heard the stimuli through TDH-39 head- 
phones. The output from channel 2 was connected 
to a pulse generator which in turn was connected 
to a (model DP 210) Hickock time interval meter. 
Each time a signal appeared on channel 2, a pulse 
was sent from the pulse generator to start the time 
interval meter. The Ss were seated in front of a 
reaction time box having one button labeled “S” 
and the other labeled “D.” The reaction time box 
was connected to the time interval meter so that 
depression of one of the two buttons stopped the 
meter. Choice reaction time, then, was measured 
from the initiation of the second CV to the eventual 
response of pushing one of the two buttons. Re- 
sponse times and button choice were recorded on a 
Hickock (PR 4900) printer connected to the time 
interval meter, 

Subjects were instructed to respond by pressing 
the button marked "S" if the two phonemes they 
heard were identical or to press the button marked 
“D” if the two phonemes were not the same. They 
were told to respond as quickly as possible without 
committing errors, Each S was presented with 30 


presented by way 


(1ubIIqis) Za 


fe 


DI (voicing) 


FiGURE 1. Plotting of Dimension 1 (D1) and 
D2 of the four-dimensional IND SCAL configura- 
tions showing voicing and sibilant, 
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below. 


practice items and was encouraged to res 
500 msec. Most Ss made their determi 18 be- 
fore the termination of the second syllab Thus, 
Ss with unusually long CRTs were traine make 
their decisions before completion of tl cond 
syllable for each stimulus. The Ss then pa vated 
in the experiment, which consisted of foi lom- 
ized presentations of each stimulus pair 
RESULTS AND DISCUSSION 

The median of the four CRTs { ich 
stimulus pair was taken to repres S's 
score on that pair, Utilizing thes es, 
10 symmetric matrices for each Sin C oup 
1 were analyzed by Individual Scaling D 
SCAL) developed by Carroll and ng 
(1970). This procedure is a multi: n- 
sional scaling program designed to d er- 


mine the number of dimensions Ss v. lize 
in making psychological judgments. fhe 
output provides stimulus and S weigh: gs 
on each of the dimensions of an n-di- -n- 
sional space, 

The choice of dimensionality was b. «d 
on the criteria of uniqueness and interpreta- 
bility of solution, A unique solution is one 
in which! the stimuli and S points in a 
given dimensional Space converge to ihe 
same solution irrespective of the random 
Starting configuration. Interpretability is 
determined by the solution yielding the 
most definable phonetic features. In the 
present investigation a four-dimensional 
solution was considered unique where Di- 
mension 1 (D1) was interpreted as voicing ; 
Dimension 2 (D2) as sibilant; Dimension 3 
(D3) as front back; and Dimension 4 (D4) 
as palatal. 

2 Figure 1 is a plotting of D1 versus D2. 
From this Tepresentation it can be seen 
that the voiceless consonants of the set 
/f,0,S,/ are separated from the voiced con- 
sonants /v,8,z,3,h/, Although /h/ is usu- 
ally considered voiceless in English, per- 
ceptually it was positioned at the extreme 
end of the voicing dimension. The most 
probable explanation for the placement of 
/ h/ is that /h/ is homorganic with /a/, with 
which /h/ was combined. The phoneme 
/h/ is also the only consonant which is 
considered minus-consonantal (Chomsky & 
Halle, 1968), 4 feature regularly assigned 
to vowels. Thus, S judgment may have 
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(1040/04) vq 


D3 (front/back) 


Ficure 2. Plotting of Dimension 3 (D3) and 
D4 of the four-dimensional IND SCAL configura- 
tions showing front/back and palatal. 


been grossly influenced by /a each time 

ha/ was compared with other CV se- 
quences. Dimension 2 is clearly a sibilant 
dimension wherein the sibilants /s,Z, 55 
are separated from the nonsibilant frica- 
tives /f,v,0,8,h/. 

Figure 2 is a plotting of D3 versus D4. 
Dimension 3 can be labeled as a front/back 
dimension where the consonants /1,v,0,0,5,Z 
produced anteriorly in the oral cavity are 
separated from the consonants §,3,h/ which 
are produced posteriorly. Obviously, these 
Ss were not using other distinctions of place 
of articulation on this dimension, For ex- 
ample, in Figure 2 the /6/ is positioned 
closer to back than /s/. Hence, the extent 
of the psychological reality of articulatory 
features for these Ss was limited only to a 
front/back distinction. On D4 the palatal 
consonants /$,5/ are separated from the 
nonpalatal consonants f,v,0,5,s,z,h/. Thus 
more than one place dimension was utilized 
by these Ss. 

The four dimensional solution discussed 
above was found to account for 72% of the 
variance in S responses. Of this 72%, D1 
(voicing) accounted for 24%; D2 (sibilant) 
19%; D3 (front/back) 15%; and D4 
(palatal) 14%. An examination of the 
individual S weights of these dimensions 
revealed three different trends. These 
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Ficure 3. Plotting of individual S$ weightings 


as a function of distinctive features retrieved for 
Ss 4, 7, and 8. 


three trends are reported in Figures 3, 4, 
and 5. 

Figure 3 shows that Ss 4, 7, and 8 
weighted D2 the highest. Thus, incon- 
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Ficure 4. Plotting of individual S weightings 
as a function of distinctive features retrieved for 
Ss 10, 9, 6, and 2, 
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FIGURE 5. Plotting of individual 5 weightings 
1 function of distinctive features retrieved for 
» 3, and 5, 


sistent with the overall trend, these Ss were 
weighting sibilant inordinately high. In 
contrast, Ss 2, 6, 9, and 10 in Figure 4 
weighted voicing high and sibilant low. 
Again, in contrast with the overall trend, 
these four Ss weight the sibilant dimension 


lowest. The weightings of the third group 
of Ss are shown in Figure 5, where Ss 1, 3, 
and 5 


seem to confirm the overall trend of 
the general stimulus space. 
In the past, various investigations have 
attempted to determine a feature system 
and the relative importance of the features 
for a group of Ss (Miller & Nicely, 1955; 
Singh & Black, 1966; Wickelgren, 1966). 
Results of this study show that, while it 
is possible to determine a feature system 
for a group, it is difficult to draw conclu- 
sions about the relative weightings of fea- 
tures for that group. This difficulty arises 
from the evidence presented in Figures 3, 
4, and 5 wherein it can be seen that group 
feature weights represent a mean of ex- 
treme individual weightings. Therefore, 
while it is interesting and possible to deter- 
mine the feature system of a group, it is 
much more revealing to examine the way 
the individual Ss weight these features, 
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That is to say that examination : 
ual Ss provides information regard ng thei 
perceptual strategy, which may be d 
ferent from the strategy of the other Ss. 
Even though the weightings of the indi 
vidual Ss were different on each dimension: 
the general conclusion is that all Ss utilize 
all four dimensions. On the basis of thi 
four dimensions recovered in this study, i 
is assumed that CRT judgments can bi 
used à criterion for determining ph 
nemic similarity. Ina phonological system 
involving binary features, a pair of © 
syllables, which is different by any one Ql 
the four features, is considered minimally 
distinct and thus should yield a larger CRT 
than a pair of CV syllables having a two 
feature difference. This assumption wat 
not validated by Chananie and Tikofsky 
(1969) who performed a similar study using 
the Miller and Nicely (1955) distinctivi 
feature system, Figure 6 shows CRT as a 
function of "zero", one, two, three, and 
four feature differences, Zero difference 
represents the phoneme pairs /{-6 and 
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FIGURE 6, Choice reaction time as a function of 
distinctive features for data collected from groups 
one (GI) and two (GII) in the present study as 


Compared with a previous study of Chananie and — 
Tikofsky (1969). 
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/'v-®/ since these pairs remained indistinct 
on all of the four dimensions reported in 
this study. A one-feature difference repre- 
sents a difference by any one of the four 
features. The two-, three-, and four-fea- 
ture differences are combinations of any of 
the features. 

The top curve in Figure 6, taken from 
Chananie and Tikofsky (1969), clearly 
shows a lack of relationship between CRT 
and distinctive feature differences. The 
curve labeled “Weiner & Singh GI" repre- 
sents the mean CRTs for pairs of conso- 
nants representing zero, one, two, three, 
and four feature differences obtained for 
the first group of Ss. A single-factor, 
repeated measures, analysis of variance was 
performed to test the difference between 
these mean CRT scores. The results 
showed that the mean differences were 
significant, F (4, 36) = 16.08, p < .05. A 
Tukey (a) (Winer, 1962) showed that CRT 
values for zero-feature differences were 
significantly greater than values associated 
with any other feature difference. In addi- 
tion, the CRT value associated with one 
feature difference was significantly greater 
than differences of three and four features. 

Since the above mentioned curve in 
Figure 6 was derived from the same data 
utilized to find the feature system, the 
slope was expected. It was therefore 
deemed necessary to cross-validate these 
results. The bottom curve is a plotting of 
the second group of Ss. To determine 
whether these functions showed similar 
relationships, a comparison of slopes pro- 
cedure (Dixon & Massey, 1969) was per- 
formed. The result was that the two beta 
coefficients were not different from each 
other, ¢ (6) = .494, p < .05. Thus, the 
slopes were not significantly different from 
each other. There was a difference, how- 
ever, in terms of the magnitude of the two 
curves. This is attributed to substituting 
photoelectric cells for the “push-type”’ re- 
sponse buttons used by the first group; a 
change made before realizing that a cross- 
validation would be necessary. However, 
for the purposes of this investigation, mag- 
nitude of CRT is not important. The 
purpose was to determine whether CRT was 
a function of distinctive feature differences. 
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A single-factor repeated measures analy- 
sis of variance was performed to test the 
difference between the mean values of CRT 
obtained for the second group. The results 
showed that the mean differences were 
significant, F (4, 36) = 11.17, p « .05. A 
Tukey («) showed that CRT values for 
zero-leature differences were significantly 
greater than values associated with any 
other feature difference. Additionally, one- 
feature difference was significantly greater 
than a two-, three-, and four-feature dif- 
ference; while differences of two, three, 
and four features were not different from 
each other. 

Since a number of investigations have 
demonstrated that psychological judgment 
of the proximity of a phoneme pair is a 
function of the phonological difference be- 
tween them, it is surprising that Chananie 
and Tikofsky (1969) did not find such a 
relationship. In an attempt to determine 
the reason for this discrepancy, we plotted 
the CRT values obtained from the two 
groups of Ss in the present investigation 
as a function of the Miller and Nicely 
feature system as plotted by Chananie and 
Tikofsky. The curves labeled "Miller & 
Nicely GI" and “Miller & Nicely GIL" in 
Figure 6 represent Miller and Nicely fea- 
ture system plottings of the S data in the 
present investigation of groups one and 
two, respectively. Not only is it apparent 
from these results that the differences in 
features play a role in determining mean 
reaction times, but there is no significant 
difference between the slopes of Weiner & 
Singh GI and Miller & Nicely GI, + (5) 
=.524, p < .05, or Weiner & Singh GH 
and Miller & Nicely GII, £ (5) = .621, 
p «.05. Thus it appears that either sys- 
tem of distinctive features explained the 
similarity of phonemes as evaluated by 
CRT. This finding is expected and should 
occur using any distinctive feature system 
since all feature systems draw on the same 
articulatory events. 

The difference between the results of the 
present investigation and that of Chananie 
and Tikofsky (1969) may be attributed to 
their level of reaction time which was con- 
siderably higher than the reaction time 
obtained in the present study. Figure 6 
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suggests that the number of features dis- 
tinguishing one phoneme from another 
becomes more important the lower the 
overall magnitude of reaction time. The 
magnitude was highest in the Chananie 
and Tikofsky study and number of features 
played no role. The magnitude was lower 
in Weiner & Singh GI and there was a 
slope. The magnitude was lowest in Weiner 
& Singh GII and the slope was steepest. 
In other words, it appears that when ex- 
traneous factors contributing to reaction 
time are minimized the differences in num- 
ber of features begins to play a role. 
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INTERACTIONS AND RANGE 


EFFECTS IN EXPERIMENTS 


ON PAIRS OF STRESSES: 


MILD HEAT AND LOW. 


-FREQUENCY NOISE ! 


E. C. POULTON? anp R. S. EDWARDS 


Medical Research Council, Applied Psychology Unit, Cambridge, England 


(Twelve men performed 3 tasks in 
389/33? C. 
a control condition, The three 
five-choice task, and visual vigilance, in 
had a beneficial effect upon all three 
on the tracking, and on false detectior 
related to behavioral arousal. 
on mild heat and loss of a night's sleep, 
was found to be influenced by the str: 


(1009/92? F.), and with the two stresses combined, 
were tracking with peripheral lights, the 


102-db.(C) low-frequency noise, at 
well as in 


that order. The low-frequency noise 
s. It interacted with the mild heat 
in the vigilance task, The results are 


When compared with a previous experiment 


performance in the control conditions 
s included in the within-subjects ex- 


perimental designs. This raises doubts about the validity of the interactions. 


A number of experiments examine the 
interactions of pairs of stresses (Grether, 
1970; Poulton, 1970, chap. 25). In some 
experiments reliable interactions are re- 
ported. In other experiments the effects 
of a pair of stresses appear to add. The 
combination of heat and noise does not 
produce reliable interactions, except pos- 
sibly in the 1946 experiment by Viteles and 
Smith (see Wilkinson, 1969b, p. 267). The 
present experiment investigates the com- 
bination again, using tasks which are found 
to reveal reliable interactions between 
other pairs of stresses. 

The left side of Figure 1 shows the results 
of a hypothetical experiment on the inter- 
action of two stresses A and B. The con- 
trol condition 00 is represented at the 
bottom. Stress A alone at A0, and stress 
B alone at OB, both increase the number 
of errors. Errors are rather more frequent 
still at AB, where the two stresses are 
combined. The interaction is reliable if 
the combined effect of A and B, repre- 


1 This work forms part of a program of research 
for the Royal Naval Personnel Research Committee. 
'The experimental subjects were supplied by the 
British Royal Navy. The authors are grateful to 
A. Carpenter for building the noise equipment and 
for advice on its use. Muriel M. Woodhead kindly 
tested the hearing of three people with and without 
a thermistor in the external auditory meatus. 

2 Requests for reprints should be sent to ESO 
Poulton, M.R.C. Applied Psychology Unit, 15 
Chaucer Road, Cambridge CB2 2EF, England. 
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sented by the height (x + y + 2), is re- 
liably different from the sum of the two 
separate effects, represented, respectively, 
by the heights z and (y + z). This reduces 
to x being reliably different from z. Since 
x represents the effect of A in the presence 
of B, and z represents the effect of A alone, 
an equivalent statement of the interaction 
is that the effect of A in the presence of B 
is reliably different from the effect of A 
alone. 

Where reliable interactions are reported 
in the literature, the combined effect upon 
performance is always smaller than the 
sum of the two separate effects (Grether, 
1970; Poulton, 1970). The only apparent 
exception is in experiments with knowledge 
of results as one of the two stresses. ‘The 
relationship in Wilkinson's (1961) experi- 
ment on loss of sleep paired with knowledge 
of results, is illustrated on the right of 
Figure 1. In the first illustration on the 
right, A stands for the stress of no sleep 


Sleep Incentive 
ATB No7No AyO AC 
x 
| O78 * x x 
12 c) 
c 
5| alo Yes t+ No O1O OFC 
5 y 4 y 
è t No1 Yes A B ATO 
t 
o+0 ved Yes OB O*O 
FIGURE 1. Interactions between two stresses. 
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on the previous night, and B stands for 
the stress of immediate knowledge of re- 
sults whenever the man makes an error, or 
does not respond for 1.5 sec. The loss of 
Sleep increases errors, while the knowledge 
of results reduces errors. As usual the 
combined effect, represented here by —y, 
is smaller than the sum of the two separate 
effects, represented, respectively, by x and 
(—y —2). This is equivalent to stating 
that zero is smaller than (X — z), or that 
x is greater than z. 

Wilkinson's (1961) alternative descrip- 
tion of the interaction is illustrated on the 
extreme right of the figure. An O has been 
substituted for B, and a C for O. The C 
stands for loss of incentive, just as A stands 
for loss of sleep. Here the combined effect 
is represented by (x + y +z). Itisgreater 
than the sum of the two separate effects, 
represented, respectively, by z and (y +z), 
because x is greater than z. 

Fortunately in most experiments on in- 
teractions between stresses, it is clear which 
conditions present the stresses, It is only 
with knowledge of results that there is any 
doubt. Probably this is because knowledge 
of results tends to be more effective when 
a person has been deprived of it, than it 
does when it has always been given (Brown, 
1966). A within-subjects experimental de- 
Sign improves performance with knowledge 
of results, and depresses performance with 
no knowledge. Neither of the two changed 
levels of performance can be taken as the 
true baseline level of performance. 


RANGE EFkkCTS 

The observed heights of x 
distorted by a range effect. All experi- 
ments investigating the interactions be- 
tween pairs of stresses use within-subjects 
designs. The same people perform in a 
control condition, under each stress singly, 
and under the combined Stresses. Within- 
subjects designs can produce range effects 
which bias the results (Poulton, 1973). 
Performance in one condition is influenced 
by the other conditions included in the 
experimental design. A subsidiary aim of 
the present experiment is to look for pos- 
sible range effects. Part of the experiment 


and z may be 
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is designed to be identical with a previo 
experiment on mild heat and loss of sh 
(Poulton, Edwards, & Colquhoun, in pre 
A reliable difference in the control condi 
tion which is common to the two exper 
ments, would indicate the influence of rang 
effects, 
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Low-FREQUENCY Norsk 


Low-frequency or green noise is weighted 
heavily against the higher frequencies, [ 
is used to reduce the chances of a permanei 
hearing loss, while maintaining a high overs 
all sound-pressure level. Apparatus and 
speech sounds are not masked as effectively 
as they are by white noise, which has à 
flat spectrum. Low-frequency noise arouses 
the man, without isolating him from hi 
auditory environment. "Thus it may have 
à rather different effect upon performance 
from the conventional white noise, Thi 
is an additional point investigated here. 


METHOD 


Tracking with peripheral lights. Two of the thi 
experimental tasks are described in detail in a previ- 
ous paper (Poulton et al., in press). In the tracking 
task (Hockey, 1970a, 1970b) a target marker moves! 
irregularly from side to side, In the present exs 
periment the top track frequency is about 45 cycles: 
per minute (cpm) instead of the 25 cpm used by) 
Poulton et al. The man has to keep a response 
marker in line with a target marker, by raising and 
lowering a lever with his right hand. He is told 
that this is his main task. Time on target is noted 
for each 3-min, period. 

A The six peripheral lamps are arranged in a semi- 
circle. Each lamp is surrounded by a lightproof: 
case, except for a white circle of tape, 1.5 cm. in 
diameter, which lies in front of it, W hen the lamp 
is switched on, the white circle turns yellow. The 
brightness of the circle increases from 1 to 7 cd/m? 
(.3 to 2 equivalent ftc.) In the present experiment 
the man is provided with a chin rest. When he j 
places his chin on the chin rest and looks at the 
central tracking display, the two nearest lamps lie 
20° out from his line of sight, one on each side. 
Each lamp lights up on average once every 30 sec. 

he remaining four lamps, two on each side, lie, 
respectively, 50? and 80? out from the line of sight. 
Each lights up on average once every 2 min, The 
time interval between lamp signals varies irregularly. 
from 6 to 24 sec, in steps of 3 sec. When a lamp - 
lights up, the man has to extinguish it as quickly 
às possible by pressing the corresponding micro- 
switch with a finger of his left hand. His response 
time is recorded. The combined task lasts 30 min. 
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The five-choice task, For this task (Leonard, 1959) 
five neon lamps are arranged in a pentagon of side 5 
cm, They are mounted on an almost vertical panel. 
Five brass discs, each 3 cm. in diameter, are arranged 
in a corresponding pentagon with 9 cm. between 
centers, They are mounted flush in a horizontal 
panel which lies directly in front of the neon lamps. 
The man uses a stylus 18 cm. long to tap the disc 
corresponding to the neon lamp which is lit up. 
Tapping any disc extinguishes the lamp and lights 
up the next lamp in the series, The order of lamps 
in the series is irregular. The man has to work at 
the task without stopping for 30 min. The number 
of correct taps, and the number of times the wrong 
disc is tapped in error, are recorded automatically 
and noted for each 5-min. period. Gaps of 1.5 sec. 
or longer between responses are also recorded. 

Wilkinson visual vigilance task. The task is a 
visual analogue of the Wilkinson (1969a) auditory 
vigilance task of the previous paper (Poulton et al., 
in press). A white circular window 1.5 cm. in 
diameter lies in the middle of a black panel 18 X 20 
cm, The panel slopes backward at an angle of 45°. 
Immediately behind the window is a neon lamp 
The window has a brightness of 2 cd/m? (.5 ftL.) 
when the lamp is off. The window turns yellow 
and increases in brightness to 30 cd/m? (8 ftL.) 
when the lamp is on, The black panel has a bright- 
ness of .15 cd/m? (.04 ftL.). The neon lamp lights 
up for .4 sec. every 2 sec. A signal is a shorter 
flash of only .3 sec. There are five signals, scattered 
irregularly during each 7.5 min, The task lasts 30 
min. 

In the present experiment the man has three push 
buttons to indicate his degree of confidence, instead 
of the single push button of the previous experiment. 
When he detects a signal, he has to press one of the 
three buttons as quickly as possible. The three 
push buttons have diameters of 1.2 cm. They are 
mounted in a row 8 em. below the circular window, 
with their centers separated by 4 cm. The blue 
push button on the right indicates certain signal. 
The green push button in the middle indicates 
probable signal. ‘The yellow push button on the 
left indicates possible signal. A record is kept of 
each response button pressed. The response times 
of correct responses are also noted. 

Experimental subjects and design. The 12 naval 
enlisted men have ages ranging from 18 to 28 yr., 
median age 21 yr. The experimental design is gi 
in Table 1. There is a practice control condition 
before the start of the Latin square, as in the previ- 
ous experiment (Poulton et al, in press) This 
enables the groups of naval enlisted men used in 
the two experiments to be compared directly, before 
they commence the conditions of the Latin square. 
The Latin square ensures that nobody works in 
noise on two successive days. 

The hot and cool climates are similar to those of 
the previous paper (Poulton et al., in press). The 
hot climate has dry-bulb/wet-bulb temperatures of 
38°/33° C. (100°/92° F.). This gives an Effective 
Temperature for men stripped to the waist of 34° C. 
(93° F.). The cool climate has temperatures of 
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TABLE 1 
EXPERIMENTAL I 


IGN FOR 


Two-WEEK PERIOD 
Main Test 1 Test 2 Test 3 | Test 4 
Team* | practice (Thurs- (Mon- (Tues- | (Wed- 
(Wednes-| day) day) day) nes- 
day) day) 
A |Control| Noise Heat | Noise |Control 
+ 
| Heat 
B | Control Control | Noise | Heat | Noise 
+ 
Heat ` 
C |Control | Heat Noise |Control| Noise 
AE 
E Heat 
D | Control | Noise | Control | Noise | Heat 
Heat 


^ In all cases, n 


209/179 C. (689/62? F.). This gives an Effective 
"Temperature for men fully clothed of 19° C. (66? F.). 

"The low-frequency noise is presented from 2 loud- 
speakers, each 35 cm. in diameter. They are 
mounted respectively in the wall and ceiling of the 
hot room. The loudspeaker in the wall is at head 
height. It is on the man's right when he is tracking, 
and behind him when he rotates 90? to his left to 
perform the two remaining ta: The ceiling loud- 
speaker is above his head, slightly to his left when 
he is tracking, slightly in front of him for the other 
2 ta: 

The average intensity of the low-frequency noise 
is about 102 db. on the C scale of a Dawe 1400E 
sound-level meter. The intensity varies slightly, 
depending upon exactly where the man has his 
head, The noise is attenuated about 7.5 db. per 
octave by a lowpass filter. The noise had a rumbling 
quality. It is possible to converse by shouting at 
a distance of 1 m. In tracking with peripheral 
lights it is possible to hear the noises made by the 
pursuitmeter program, and to hear the clicks made 
by pressing the response switches for the lights. 
With the five-choice task it is possible to hear the 
difference between tapping a brass disc, and tapping 
the panel in which the discs are mounted, 

The fans which circulate the air produce an aver- 
age sound-pressure level of about 80 db. (C) in the 
quiet condition. Here the frequency spectrum is 
fairly flat up to 1 kHz. Above 1 kHz. the intensity 
is attenuated about 7.5 db. per octave. 

Procedure. This follows the procedure of the 
previous experiment (Poulton et al, in press). 
Each man is always tested at the same time of day, 
One man of each team is tested in the morning, the 
other two men in the afternoon. The order of the 
three tasks is always the same: tracking with 
peripheral lights, five choice, and Wilkinson vigilance. 

Thermistors are worn in both ears, to record the 
temperature of the external auditory meatus. Wear- 
ing a thermistor does not affect the threshold of 
hearing. 
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FIGURE 2. 
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The average percent time on target 
ve S-min. periods when tracking with 
lights, 


Statistical tests. Nonparametric tests (Siegel, 
1956) are always used. All tests are two-tailed 
except where stated. 


RESULTS 


Heat noise interaction in tracking with 
peripheral lights. Figure 2 shows that in 
the control condition the average time on 
target is less during the last 5 min. of the 
tracking task than during the first 5 min. 
(b < .05). A similar result is reported in 
the previous paper (Poulton et al., in press), 
and by Hockey (19702, 1970b). 

Compared with the control condition, the 
noise alone produces an improvement which 
is reliable during the last 5 min. of the task 
(b < .05). The heat alone produces a 
decrement which is reliable during the 
first half of the task (p < .05). The direc- 
tions of the changes with noise and heat fol- 
low those of previous experiments (Hockey, 
1970a, 1970b; Poulton et al., in press). 

The novel aspect of the results is the 
reliable interaction, between the heat and 
the noise, which is particularly marked 
during the last 5 min. of the task (5 < .02). 
Here heat by itself. produces only a small 
decrement of .5 in the average percent 
time on target. But when added to the 
noise, the heat changes an improvement of 
3.0 to a decrement of 1.5. The reduced 
percent time on target produced by adding 
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the heat to the noise is therefor: 


cone 
pared with only .5 for the heat itself, 
There are no reliable differenc: n the 
lights task related to the noise, to the 
heat. 

Shift in control condition wit) fives 
choice task. The five-choice tas used 
also in the previous experiment o > loss 
of a night's sleep paired with heat 


(Poulton et al., in press). It is 
second in both experiments, s: 
between tracking with periphe: 


rmed 
iched 
lights 


and the vigilance task, The mai. Jiffer- 
ence between the two experimen that 
the loss of sleep of the previous ex nent 
is changed to low-frequency nois the 


present experiment. Table 2 sh that 


the change has a reliable effect up lean 
percent errors in the control co: tion. 
During the last 5 min. of the task, ti. con- 
trol mean is 3.2% in the present seri- 


ment with noise, compared with 1. % in 
the previous experiment with loss oí. sleep 
(p « .02). 

The bracketed entry at the top the 
table shows that the two groups o; men 
are well matched in the main control 
practice on the day before the experiments 
start. Thus the difference between the 
two control conditions must be due to 
range effects produced by the balanced 
experimental design of Table 1 and the 
comparable design of the previous experi- 
ment. The particular stresses which are 
paired together affect the control condi- 
tion. Since the control condition provides 
the baseline against which the effects of 


TABLE 2 
MEAN PERCENT ERRORS ON THE Fiv 
IN Two EXPERIMENTS on P; 
OF STRESSES 


noice TASK 
RS 


Noise and heat 


[Practice control 2, 1] 
Control Vs 
Noise 1.4 
Heat 2.9 
Noise + heat 22 


[Practice control 
Control 
I of sleep. 


n Data from Poulton, Edwards, and Colquhoun (in press): 

' Two control conditions different at .05 level with a one- 
tailed test. 

à Noise different from all other conditions at the .05 level. 


Control different from all other conditions at the .05 level 
or better, 
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the stresses are measured, it means that 
the measured effect of a stress in an experi- 
ment on pairs of stresses depends upon 
the stress with which it is paired. 

The left side of Table 2 shows that in 
the present experiment the low-frequency 
noise improves performance, while the mild 
heat if anything degrades performance. 
Whereas the right side of the table shows 
that in the previous experiment both the 
loss of a night's sleep and the heat degrade 
performance. Probably the difference be- 
tween the two range effects is due to the 
beneficial effect of the low-frequency noise 
in the present cipe The noise con- 
dition pue 1.4% errors, which corresponds 
to the 1.4% of the control condition of the 
previous pee Thus in the present 
experiment the noise condition appears to 
become the baseline condition, while the 
control condition becomes a noise-deprived 
condition. 

Other measures of performance obtained 
with the five-choice task also indicate the 
beneficial effect of the low-frequency noise. 
During the first 5 min. there are reliably 
(p < .05) more correct responses in the 
noise than in the control condition. In 
addition, combining the noise with the 
heat reliably (p < .02) reduces the number 
of gaps of 1.5 sec. between responses, com- 
pared with the heat alone. 

Heat noise interaction in visual vigilance. 
Figure 3 shows how true and false detec- 
tions are related during the first and last 
7.5-min. periods. For the full 30 min. 
there are more false detections in the heat 
than in the control condition (p < .01), as 
in the previous paper (Poulton et al., in 
press). The proportion of false detections 
is increased also in the noise, although the 
difference between the noise and the con- 
trol condition is not reliable (p > .05). 
When the noise and heat are combined, the 
proportion of false detections is not far 
above the control level. The interaction is 
reliable (p « .02). Thus adding the heat 
to the noise largely eliminates the undesir- 
able increase in false detections which both 
stresses produce on their own. 

The noise gives the greatest proportion 
of true detections during the first 7.5 min., 
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detections. (False detections are 
ntage of the number of nonsignal 
There are 2.2 percent of true signals, 


FIGURE 3. 
the visual vigila 
7.5-min. 


light flashes. 
The arrows point from the value for the first 7.5 


min. to the value for the last 7.5 min. The contours 
represent equal values of d’, an index of the ability 
to detect signals.) 


but changes reliably (p < .01) to give the 
smallest proportion during the last 7.5 min. 
For the full 30 min., there are no reliable 
differences in true detections between any 
of the four conditions. However the noise 
gives reliably more certain detections than 
the control condition (p < .05, one-tailed 
test), and reliably fewer moderately certain 
detections (p « .01). The man responds 
reliably more quickly when he is certain, 
than when he is uncertain (p < .02). 


DISCUSSION 


Interactions described in terms of arousal 
theory. Reliable interactions between the heat 
and the noise are illustrated in Figure 2 for the 
tracking, and in Figure 3 for false detections in 
the vigilance. In both cases, the combined 
effect of the two stresses is smaller than the 
sum of the two separate effects. In the past 
such interactions have usually been described 
in terms of the inverted-U relationship of 
arousal theory. The effects of the combined 
stresses will tend to cancel each other if one 
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stress increases arousal while the other stress 
reduces arousal. Cancellation can also occur 
if both stresses increase arousal, and the person 
thus becomes overaroused (Poulton, 1970, 
chap. 25). 

Noise is generally taken to be an arouser, 
It produces physiological changes which are 
associated with arousal. On the five-choice 
task it reduces the effects of a night without 
sleep, which presumably lowers the level of 
arousal (Wilkinson, 1963). 

The relationship between heat and arousal 
is less straightforward. Arousal probably in- 
creases while a person is being heated up 
(Poulton & Kerslake, 1965), and when he is 
uncomfortably hot (Colquhoun & Goldman, 
1972; Wilkinson, Fox, Goldsmith, Hampton, 
& Lewis, 1964). Whereas a constant mild 
warmth probably reduces the level of arousal 
(Wilkinson et al., 1964). In the previous ex- 
periment on heat paired with loss of a night's 
sleep, the heat reduces the detrimental effect 
of the loss of sleep at the start of the auditory 
vigilance task (Poulton et al, in press). Thus 
here the heat must act as an arouser. 

The results in Figure 3 for the visual vigi- 
lance task can also be described by taking the 
heat as an arouser. Both the heat and the 
noise increase arousal and produce more false 
detections. Adding the heat to the noise pro- 
duces overarousal, and so cancels the increase 
in false detections which both stresses produce 
on their own. 

The reliable interaction in Figure 2 for the 
time on target can also be described in terms 
of the overarousal which results from combin- 
ing the two arousers heat and noise. How- 
ever, the results in Figure 2 cannot all be de- 
scribed in terms of arousal. If the noise 
increases the time on target by increasing 
arousal, the heat which also increases arousal 
should not reduce the time on target as it 
does. Clearly some other factor in addition to 
arousal is required to account for the Opposite 
effects of the heat and noise. A similar diffi- 
culty is reported in the previous paper where 
heat is paired with loss of sleep (Poulton et al., 
in press). 

Alternative explanations of tnteractions. In 
order to discuss the interactions between 
stresses in terms of arousal theory, it has to be 
assumed that an observed interaction reflects 
a genuine interaction in the underlying process 
in the brain. However there are other possi- 
ble causes of an observed interaction, First, 
the measure of performance may not be related 
linearly to the underlying brain process. Thus 
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x and z on the left of Figure 1 can | iably 
different, without an interaction in the inder- 
lying brain process. This possibility diffi- 
cult to disprove, because the underlyiv. brain 
process cannot be observed directly. 

The time-on-target measure of Fi 2 is 
notoriously nonlinear.  Bahrick, and 
Briggs (1957) show that on this mea the 
shape of the learning curve depends u; the 
size of the target. It would be fi to 
claim that performance with the pa ilar 
size of target used here is more likely be 
related linearly to the underlying brain ess 
than is performance with some other of 
target. But the criticism of a possible on- 
linear relationship to brain function appl. > to 
any measure of performance. 

A second possible cause of a reliable | 'er- 


action between two stresses is Suggeste: by 
the results of Table 2. The relative : {x 
and z on the left of Figure 1 may be disto ed 
by a range effect introduced by a wit in- 
subjects experimental design, The present 
experiment provides no clear evidence on ihis 
point, because the range effects appear in the 
five-choice task, while the reliable interactions 
appear in the two remaining tasks. But the 
implications are worth consideri ng. 

Many range effects involve a central ten- 
dency (Poulton, 1973). Usually, the range is 
reduced. If the reduction is symmetrical, it 
need not affect the calculation of the interac- 
tion very much. This is because the interac- 
tion on the left of Figure 1 is represented by 
(x — z). If the scale contracts symmetrically 
about its center, the value of (x — z) may not 
change much. 

But if the range effect has a bias in a single 
direction, it may, for example, reduce the size 
of x without appreciably changing the size of 
z. If so, the difference (x — z) will be smaller 
than it should be. When the value of (x — z) 
is negative, as in experiments where the com- 
bined effect of two stresses is smaller than the 
sum of the two separate effects, the range 
effect will then increase the chances of obtain- 
ing a reliable interaction, Both the reliable 
interactions reported here, and all those re- 
ported in the literature, may possibly be pro- 
duced or exaggerated by an artifact of this 
kind. 

When the value of (x — z) is positive, as in 
experiments where the combined effect is 
larger than the sum of the two separate effects, 
a range effect which reduces the size of (x — z) 
will reduce the chances of obtaining a reliable 
interaction. Since no reliable interactions in- 
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living potentiation have yet been reported, 
an artifact of this kind may well be operating 
in experiments where potentiation is to be 
expected. Clearly, in order to investigate the 
interactions between stresses unbiased by 
range effects, it is necessary to use a separate- 
groups design. This has not yet been done. 

Beneficial effects of low-frequency noise. Table 
2 shows that on the five-choice task the low- 
frequency or green noise reliably reduces the 
mean percent errors, The noise also reliably 
increases the rate of responding during the 
first 5 min. These are entirely new effects of 
noise on the five-choice task. Presumably the 
low-frequency noise increases arousal, without 
having any of the detrimental effects of the 
noise used in previous experiments, 

There are 10 previous experiments where 
the five-choice task is performed in noise. In 
all but two of the experiments the noise is 
white with an approximately flat spectrum. 
In 7 of the 10 experiments, noise reliably in- 
creases errors (Broadbent, 1953, 1957; Colqu- 
houn & Edwards*; Hartley, 1973, in press-a, 
in press-b, Experiment 1; Wilkinson, 1963, 
Experiment 1). In the remaining three ex- 
periments the effect of noise upon errors is not 
reliable (Corcoran, 1962; Hartley, in press-b, 
Experiment 2; Wilkinson, 1963, Experiment 
2). In none of the 10 experiments is there a 
reliable reduction in errors in noise, as there is 
here. 

In only one of the 10 experiments does noise 
have a reliable effect upon the rate of respond- 
ing. Here the rate of responding is reduced 
(Colquhoun & Edwards, see Footnote 3). In 
none of the experiments is the rate of respond- 
ing reliably increased in noise, as it is at the 
start of the present task. 

The quite different results in the present 
experiment must be due partly to the spec- 
trum of the noise. There is a 7.5 db. per 
octave attenuation, which greatly reduces the 
intensity of the high frequencies. It largely 
prevents the masking of taps, clicks, and 
speech, which is produced by the unattenuated 
high frequencies of white noise. Stevens 
(1972) suggests that the masking may be re- 
sponsible for the deterioration in noise, when 
it occurs. 

Broadbent's (1957) experiment partially sup- 
ports this view. He takes machinery noise 
having roughly equal energy per octave. He 


a W. P, Colquhoun & R. S. Edwards. Interaction 
of noise with alcohol on a task of sustained atten- 
tion. Study in preparation. 
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filters it at 2,000 Hz. to give either predomi- 
nantly high-frequency or low-frequency noise. 
Both kinds of noise increase errors on the 
five-choice task, but the low-frequency noise 
has the reliably smaller effect. Presumably a 
noise with still greater filtering of the high 
frequencies, might turn the small increase in 
errors into the reliable decrease shown in 
Table 2. 

However, the spectrum of the noise cannot 
alone account for the results in Table 2, be- 
cause Hartley (1973) reports a reliable in- 
crease in errors using noise with a similar 
spectrum, but at a sound-pressure level of 
100 db.(A), or 118 db.(C). Thus both the 
spectrum and the intensity of the noise are 
important. To obtain a reduction in errors, 
the low-frequency noise must not be too loud. 

The other beneficial effects of the low-fre- 
quency noise are similar to previously reported 
effects of noise with a flatter spectrum, The 
improvement in the tracking illustrated in 
Figure 2 corresponds to the improvement in 
noise reported by Hockey (1970a, 1970b) 
toward the end of the tracking task. The 
effect of the low-frequency noise in increasing 
confidence in the visual vigilance t 
also in visual vigilance with the white noise 
used by Broadbent and Gregory (1963, 1965). 

Most probably, both low-frequency noise at 
102 db. (C) and white noise at 100 db. (A) 
improve performance by increasing the level 
of arousal. Where the low-frequency noise 
improves performance while the white noise 
degrades performance, it may be due to the 
masking of important auditory cues by the 
more intense high frequencies of the white 
noise, as Stevens (1972) suggests. 
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Information rehearsed in short-term store (STS) can be primarily new infor- 


mation, if only tempora 
new information, if t 
STS is of limited ca 
use of STS capaci 
recall learning ta 


storage is required; or it can be a mixture of old and 
fer to long-term store is to be optimized. 
exa a storage, coding trade-off should take place in the 


Because 


TOURS: U Sings a Single: trial free- 


words in immediate recall, but fev "wer S ord d in final free recall, than did Ss 


using a coding strategy. Also, 
words in final free recall than did 


In dual-storage memory models such as 
those of Atkinson and Shiffrin (1968) and 
Waugh and Norman (1965), the process 
of rehearsal serves to maintain new infor- 
mation in short-term store (STS), and at 
the same time transfers some of this in- 
formation with each rehearsal into long- 
term store (LTS). Recent evidence in- 
dicates that rehearsal may not be a unitary 
process that simultaneously maintains in- 
formation in STS and transfers it to LTS. 
Under some conditions, rehearsal may 
maintain information in STS without trans- 
fer to LTS. To support this view, Jacoby 
and Bartz (1972) have shown that increas- 
ing the number of rehearsals in STS does 
not necessarily lead to more information 
being transferred to LTS. Although re- 
hearsal in STS may take place without 
transfer to LTS, the opposite does not seem 
to be true. Atkinson and Shiffrin (1971) 
cite evidence which shows that transfer of 
new information to LTS cannot take place 
without rehearsal. 

The storage and transfer functions of 
rehearsal are interdependent because both 
are control processes in STS and both use 


1 This research was supported in part by Grant 
OURC-437 to the first author from the Ohio Uni- 
versity Research Committee. 

2 Requests for reprints should be sent to Francis 
S. Bellezza, Department of Psychology, Ohio Uni- 
versity, Athens, Ohio 45701. 

®Now at the University of London, London, 
England, 


ing a storage strategy recalled fewer new 
using a coding strategy. 
interpreted as supporting the concept of a storage-coding trade-off in $ 


The results were 


STS capacity (Atkinson & Shiffrin, 1968). 
Shiffrin and Atkinson (1969) discuss how 
rehearsal, under the control of S, can be 
used primarily for maintenance of new in- 
formation in STS or for transfer of infor- 
mation from STS to LTS. The S uses a 
storage strategy if he fills STS to capacity 
with new information and rehearses it. He 
is able to maintain maximum new informa- 
tion in STS, but the rate of transfer to 
LTS is reduced. The S can increase rate 
of transfer to LTS by using a coding 
strategy that consists of adding appro- 
priately chosen old information retrieved 
from LTS to new information in STS and 
rehearsing the entire ensemble. This de- 
scription of transfer from STS to LTS im- 
plies that the amount of new information 
being stored in STS must decrease if S uses 
a coding strategy rather than a temporary- 
storage strategy. Because of the limited 
capacity of STS (Miller, 1956), S must 
trade off storage space for encoding space, 
or vice versa, depending on the strategy 
he wishes to use. 

There is some evidence from recall experi- 
ments which shows that Ss may use a 
strategy involving more encoding into LTS 
if recall is always delayed rather than 
immediate. Bartz (1969) presented Ss with 
9-digit lists in a serial-learning task where 
every third list was identical. All Ss im- 
proved in performance on the identical lists. 
However, those Ss in the delayed-recall 
condition attained a higher performance 
level than those in the immediate-recall 
condition. Ina multitrial free-recall experi- 
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ment, Bellezza and Dixon! alternately pre- 
sented a list of words and a list of numbers. 
The Ss in the immediate-recall condition 
recalled the items from each list im- 
mediately after each list was presented. 
The Ss in the delayed-recall condition re- 
called the other list after each list was 
presented. After 10 trials, the performance 
of the delayed-recall group was better on 
both lists than that of the immediate-recall 
group. Although it may be that Ss in the 
delayed-recall conditions in these 2 experi- 
ments were using STS capacity to optimize 
coding into LTS, there is no way to deter- 
mine if the amount of new information in 
STS decreased correspondingly. 

The present experiment used a gaming 
situation in which all Ss received 1 point 
for each item immediately recalled. How- 
ever, Ss in the coding condition were in- 
formed before the words were presented 
that they would receive 10 additional points 
for all words that they could recall in a 
final free recall. The Ss in the storage 
condition were not informed of these addi- 
tional points until shortly before final free 
recall was requested. It was predicted 
that Ss in the coding condition would do 
more poorly in immediate recall, but would 
do better in final free recall. 


METHOD 


Subjects. The Ss were 48 volunteers, drawn from 
a pool of introductory psychology students at 
Ohio University, who received course credit for 
their participation. 
Procedure. The Ss were run individually, using 
a Lafayette IBM memory drum, and were informed 
that they were in a memory game in which they 
were to try to win as many points as they could, 
All Ss were told that they would be presented 7 
lists of numbers and would receive 1 point for each 
number that they could free recall immediately after 
each list was presented, as long as they did not pause 
for more than 2 sec, between responses. A pause of 
more than 2 sec. was used to indicate that S had 
emptied STS (Murdock & Okada, 1970). Each S 
was then presented 7 lists of 10 2-digit numbers, 
with both digits different and no zeros used. All 
10 numbers on each list were presented at once in a 
row for 15 sec. The Ss recalled orally, and Æ 
* F. S. Bellezza and R. W, Dixon. Storage versus 
coding functions of short-term Store. Paper pre- 
sented at the meeting of the Midwestern Psycho- 
logical Association, Cleveland, May 1972. 
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checked items off a score sheet as the 
As soon as 5 stopped recalling, the ne 
sented. After the lists of numbers w 
the 24 Ss in the coding condition w 


that they would be presented 7 lists o: words 4 
would again receive 1 point for each d immer 
ately recalled. In addition, these S c told tl 
there would be a final free recall in whi hey wo 
receive 10 additional points for eac! rectly 
called word from the 7 previous list Fhe ot] 
24 Ss in the storage condition were n ormed. 


the final free recall. ch S was the 
lists of 10 1-syllable words randomly 
Thorndike and Lorge (1944) and havin; 
of occurrence of 10-40 per million. The 
recall procedure for words was the same as 
procedure used for numbers. Each 
word was used only once with each 5, | 
ent randomizations of the lists of numbers an 
words were devised, and 6 Ss were run on « 
After all 7 lists of words were presented, wel 
presented with a 3-digit number and were asked 
count backward by 7s as fast as they could ford 
min. This was to eliminate any items in ST 
The Ss were then requested to try to recall as mai 
of the words in the 7 lists as possible, and .5s in 
storage condition were then informed oí the 
additional points for each word correctly recalled 
Recall was oral, and Ss were given 1 min. and S 
sec. for final free recall. 
Analysis. The results of the experiment we 
evaluated using a 2 X 8 X 7 analysis of variant 
with the specific factors being strategy conditio! 
randomization form, and list position. Strategi 
condition and randomization form were between-3 
factors. The analysis was done on 3 different de 
pendent variables: the number of 2-digit number 
per list immediately recalled, the number of wordi 
per list immediately recalled, and the number @ 
words per list recalled in final free recall, 
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RESULTS 


The analysis of variance of the number ol 
2-digit numbers recalled showed only the 
significant main effect of list position 
F (6, 192) — 4.02, p < .001, MS, = 1.15 
Figure 1 shows the number of numbers ime 
mediately recalled from each list. "There 
seems to be a slight interlist primacy effec 
and a trend analysis (Kirk, 1968) showed 
the linear component to be significant, 
F (1, 192) = 624, p < .025, MS, = 1.15) 
Because there were no significant effects 
involving Strategy condition, it can be 
concluded that Ss were performing at the 
same level in immediate recall before differ 
ent STS strategies were induced. 

The analysis of variance on the number 
of words immediately recalled per list 
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FIGURE 1. 


showed a significant main effect of strategy 
condition, F (1, 32) = 62.51, p < .001, 
MS, = 2.56. The Ss in the storage con- 
dition recalled a mean of 6.00 words/list 
and Ss in the coding condition recalled 4.64 
words/list. These results are shown in 
Figure 2. There was also a significant effect 
of list position, F (6, 192) — 3.92, p « .001, 
MS, = 1.17, as shown in Figure 1. As in 
the immediate recall of numbers, there 
appears to be an interlist primacy effect. 
A trend analysis showed the linear com- 
ponent to be marginally significant, Fd, 
192) = 3.67, p< 10, MS. = 1.17. The 
one other significant source of variation 
was the Strategy Condition X Randomi- 
zation Form interaction, F (7, 32) = 2.39, 
p «.05, MS, = 2.56. Inspection of the 
strategy condition means in each randomi- 
zation form showed that Ss in the storage 
condition performed better than did Ss in 
the coding condition on all forms except 
the fourth, where the means were the same. 
There is no obvious explanation for this 
except individual differences among Ss. 
When the data from the final free recall 
were analyzed, the main effect of strategy 
condition was again significant, F (1, 32) = 
46.01, p < .001, MS. = 1.91. As shown 
in Figure 2, Ss in the storage condition 
recalled 1.48 words/list, whereas Ss in the 
coding condition recalled 2.48 words/list. 
There was a significant main effect due to 
list position, F (6, 192) = 13.08, p < .001, 
MS, = 1.89. As shown in Figure 1, not 
only is there a slight interlist primacy effect, 


and those not recalled at all. 
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but there is also a pronounced interlist 
recency effect, even though Ss engaged in 
a difficult 60-sec, interpolated task before 
final recall began. No other sources of 
variation were significant in this analysis. 
To determine to what degree the same 
words were given in immediate and in final 
free recall, all the words on the lists were 
placed into 1 of 4 categories: those given 
in both immediate and final free recall; 
those given in immediate recall but not in 
final free recall; those not given in im- 
mediate recall but given in final free recall ; 
The mean 
numbers of words in each of these 4 cate- 
gories recalled by each S in each strategy 
condition are given in Table 1. In order to 
determine if the strategy conditions differed 
in all 4 categories of recall, a 2 X 8 X 4 
analysis of variance was performed, with 
the specific factors being strategy condition, 
randomization form, and recall category. 
Strategy condition and randomization form 
were between-Ss factors. The dependent 
variable was the total number of words re- 
called by each S in each category. The 
Strategy Condition X Recall Category in- 
teraction was significant, F (3, 32) = 28.21, 
p «.001, MS, = 29.03. Tests of simple 
main effects (Kirk, 1968) showed that the 
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CODING CONDITION 


MEAN NUMBER OF ITEMS RECALLED PER LIST 


IMMEDIATE FINAL FREE 
RECALL RECALL 
Ficure 2. Number of words recalled per list 


in the storage and coding conditions. 
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TABLE 1 


MEAN NuMBER OF Worps RECALLED IN EACH 
CATEGORY IN FINAL FREE RECALL 


Category 
Condition - - t - 
IF IF i | m 
Coding 11.75 20.71 5.63 31.91 
Storage 9.00 33.00 1.38 26.62 
Difference | 1.75* | —12.29** | 4.25** | 5.29%% 


Note, Abbreviations: IF = words given in both immediate 
and final free recall; IF = words given in immediate but not in 
final free recall; IF = words given in final but not in immedi- 
ate free recall; and TF = words not recalled at all. 


*p <.05, 
** p <,001. 


means of the strategy conditions were sig- 
nificantly different in each recall category. 


DISCUSSION 


The results support the concept of a storage- 
coding trade-off in STS. The trade-off is 
necessary because STS is of limited capacity. 
If S desires to transfer new information into 
LTS, he must load stored information into 
STS so that the new information can be 
appropriately placed in L'TS (Shiffrin & Atkin- 
son, 1969). This reduces the amount of new 
information that can be stored in STS. Be- 
cause of the gaming situation used in the 
present experiment, 5s in the storage condition 
loaded STS with as much new information as 
possible and stopped rehearsing it only after 
immediate recall. As shown in Figure 2, the 
storage group recalled significantly more words 
in immediate recall, but significantly fewer 
words in final free recall, than did the coding 
group. This indicates that STS capacity was 
being used in different ways by each group. 
It can be seen from Table 1 that the storage 
group recalled 1.38 words in final free recall 
that were not recalled in immediate recall. In 
the coding group, this number was 5.63. With 
a coding strategy, S would be likely to stop 
rehearsing an item wben he thought it was 
coded into LTS, so that other words could be 
coded. This means that these coded words 
would not be available in the immediate-recall 
condition, which required almost uninterrupted 
responding. With the greater amount of time 
given for search and retrieval in final free recall, 
these\words could be recalled. The Ss in the 
storage condition were likely not to stop re- 
hearsing any word transferred to LTS because 
this word would not be available for immediate 
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recall. The results shown in 
predicted from the notion tha 


Strategy necessarily consists of ad pp 

priately chosen old information ret: from 
LTS to new information in STS and ing 
the entire ensemble. Other interpret of 
the data are possible. For example ing 


strategy that involved nothing n C- 
tively rehearsing a fewer number of i ld 
lead to the same results. 

The better immediate recall perfor: in 
the early lists of numbers and of word 
in Figure 1 is probably not the result of | tig 
Bartz (1969) found an overall increase the 
number of digits recalled as a function 
position, although there was an initial de 
in his first 3 lists. Melton (1963) also fou: ‘1 
practice effect in a serial recall task. An a! 
nate explanation is that first-list performa 
in immediate recall has a larger LTS c 
ponent than does performance on later list 
On the first list, S may be able to retrieve i: 
formation quickly from LTS, since informatio: 
from only one list is stored there, and th: 
problems of search and decision are easier anc 


therefore faster (Shiffrin, 1970). The interlist ~™ 


primacy effect which seems to have occurred in 
final free recall could be the result of the earliest 
lists having more temporally distinct associated 
cues and hence being more accessible in LTS 
(Shiffrin, 1970). However, the effect could also 
be the result of the better immediate recall of 
the early lists, since the recall of an item has the 
same effect as an additional presentation trial 
(Tulving, 1967). The mechanism of memory 
search developed by Shiffrin can account for 
the interlist recency effect found in the final 
free recall, because information from the more 
recent lists is more accessible in LTS. 

The results of the present experiment in- 
dicate that the concept of a storage-coding 
trade-off in STS could be a useful one in short- 
term memory experiments 
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In Experiment I, Ss were exposed to a 55° line angle training stimulus and 
then tested for generalization (recognition) with lines of 40°, 55°, and 70° 
(Group 1); 40°, 47.5°, 55°, 62.5°, and 70° (Group 2); or 25°, 40°, 55°, 70°, 
and 85° (Group 3). Groups 1 and 2 yielded similar decrements to 40° and 70° 
lines, indicating no units effects; however, Group 3 responded more to these 2 
stimuli, indicating a reliable range effect. 

In Experiment II, the training stimulus was a 530-nm. light, and the test 
values were 515, 530, and 545 nm. (Group 1); 515, 522.5, 530, 537.5, and 545 
nm. (Group 2); or 500, 515, 530, 545, and 560 nm. (Group 3). Group 3 re- 
sponded more to 515 and 545 nm. than did Groups 1 and 2, which did not 


differ, again indicating a range effect and no units effect. 


Prior evidence of a 


units effect was probably due to a confounding of number of units with range 


of test stimuli. 


The amount of stimulus generalization 
decrement in human Ss has been shown to 
be influenced by a number of factors in 
addition to the physical distance between 
training and test stimuli. Mednick and 
Freedman (1960) have hypothesized that 
one such factor is the number of stimulus 
units that separate the test stimulus from 
the training stimulus. Their "units hy- 
pothesis" generates 2 primary predictions. 
(a) The amount of generalization decre- 
ment between a training stimulus and a 
given test stimulus along the same dimen- 
sion is a positive function of the number of 
intervening test stimuli. (6) The generaliza- 
tion decrement obtained for a given number 
of units away from the training stimulus 
will be the same for units of different size. 
Note that Prediction b assumes that the 
number of units is the sole determinant of 

'This research was supported by National 
Institutes of Health Research Grant HD-03486 and 
Training Grant MH-10427 under the direction of 
the third author. Experiment I was performed by 
the first and second authors, Experiment II was 
performed by the third and fourth authors while 
on leave at the University of lowa. The authors 
wish to acknowledge the assistance of James L, 
Thomas in the analysis of the data of Experiment If. 

* Requests for reprints should be sent to David R. 
Thomas, Department of Psychology, University of 
Colorado, Boulder, Colorado 80302, 
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generalization decrement, whereas Predic- 
tion a assumes that it is 2 determinant, 
but not necessarily the on! 


Thomas and Hiss (1963) reporte udy 
designed to test these predictior ing 
wavelength generalization. Three groups 


of human Ss viewed a training stimulus of 
530 nm. followed by a 5-stimulus generali- 
zation test in which the training stimulus 
occupied the physical midpoint of the test 
range. The groups varied in the sizes of 
the test unit, i.e., the physical distance 
between adjacent stimuli, which were 5, 
10, or 20 nm., respectively. 

The results reported by Thomas and Hiss 
(1963) supported the first of the Mednick 
and Freedman (1960) predictions. Re- 
sponses to common test values (i.e., those 
experienced by 2 different groups) decreased 
as the number of test stimuli between 
them and the training stimulus increased. 
"This study, however, failed to support the 
second prediction. A comparison of gen- 
eralization gradients plotted in terms of 
number of units revealed a significant effect 
of unit size. Thomas and Hiss concluded, 
therefore, that both the number of units 
and the physical distance between stimuli 
influenced generalization slope. 

The apparent finding of a units effect in 
the Thomas and Hiss (1963) study is 


z 


RANGE OF TEST STIMULI AND GENERALIZATION 


subject to an alternative interpretation, 
however. Because the number of test 
stimuli was held constant (at 5) across 
groups, unit size was confounded with the 
range covered by the test series. An in- 
crease in unit size produced a proportional 
increase in the series range. This increase 
in range could have had an effect on gen- 
eralization decrement independent of the 
change in unit size. It is also possible that 
the units effect is dependent on an inter- 
action between unit size and test range. 

A recent experiment by Tomie (1972) 
suggests that the range of test stimuli used 
by Thomas and Hiss (1963) may have 
indeed been a significant factor. Two 
groups in the Tomie study were exposed to 
a training stimulus of 505 nm. and were 
then tested for generalization to values of 
both longer and shorter wavelengths than 
the standard. For one group, only a single 
test value of shorter wavelength (500 nm.) 
was used; for the other group, the test 
stimuli included 500, 495, 490, and 485 nm. 
Thus, for both groups, 500 nm. was one 
unit removed from the training value but, 
for one group, there were additional stim- 
uli still farther removed. This group 
showed significantly more responding to 
the 500-nm. stimulus, indicating that in- 
creasing the range of generalization test 
stimuli flattens the gradient. 

Two investigations of the units hy- 
pothesis have been conducted using pigeons 
and wavelength generalization tests. Fried- 
man (1963) employed a selection of test 
values similar to that used by Thomas and 
Hiss (1963), in that he varied unit size 
while holding the number of test stimuli 


constant. This produced a unit size by 
stimulus range confounding, as noted 
above. Despite this confounding, Fried- 


man found no support for either prediction 
based on Mednick and Freedman's (1960) 
units hypothesis. While no reliable dif- 
ferences were found in responding to com- 
mon test values, the group tested over the 
widest range did respond at the highest 
level in all 4 relevant comparisons. 

An experiment by Marsh (1967) is some- 
what difficult to interpret because of varia- 
tions in level of responding to the training 
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stimulus among the different groups. 
Marsh used varying numbers of test stim- 
uli, all placed to one side of the training 
value. He reported, in agreement with 
Thomas and Hiss (1963), that generaliza- 
tion decrement varied with both the physi- 
cal size and the number of units separating 
the training and test stimuli. Although 
Marsh’s experiment was designed in a way 
which permitted the investigation of a 
range effect, he found none. 

The purpose of the present study was to 
separate the potential influences of range 
and size of stimulus unit on generalization 
decrement in an experiment with human 
Ss. Three test conditions were employed 
involving (a) small units-small range, 
(b) large units-large range, and (c) large 
units-small range. The latter condition 
was achieved by the use of a reduced num- 
ber of different test stimuli. This produced 
a group difference in the total number of 
stimulus presentations, but subsequent 
analysis indicated that this difference did 
not affect the slope of the obtained gradient. 
In order to enhance the generality of the 
present investigation, both line angle and 
wavelength test dimensions were employed 
in 2 separate experiments. 


EXPERIMENT Í 
Method 


Subjects. The Ss were 30 male and 30 female 
undergraduate students enrolled in introductory 
psychology courses at the University of Colorado. 

Apparatus. The apparatus contained a 4.5-cm.- 
diam. circular aperture in a 122 X 76 cm. black 
panel. The aperture was covered with translucent 
plastic. Line angles were produced by backlighting 
(with a 7.5-w. lamp) a .3-em.-wide black plastic 
strip mounted immediately behind and bisecting 
the aperture cover, Thus, the stimuli were black 
lines at various angles viewed against a homogeneous 
white field. The line angles used in this experiment 
varied from 25° to 85° (counterclockwise from hori- 
zontal) in 7.5° steps. Angle adjustment and stim- 
ulus backlighting were performed manually by E, 
who was seated behind the panel. The Ss were 
seated. with their faces approximately 60 cm. from 
the panel, with the circular aperture approximately 
at eye level. 

Procedure. The Ss were randomly assigned to 
3 groups with the restriction that each group contain 
10 males and 10 females. Instructions were the 
same for all Ss and included a general explanation 
of the experimental task. The Ss were instructed 
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to study the sample stimulus carefully and to 

respond on each trial by judging the test stimulus as 

- same or different in relation to the sample. The 
instructions emphasized that S should respond on 
each trial and that responses should occur during 
the stimulus presentation. 

All Ss received the sume sample stimulus (a black 
line 58° counterclockwise from horizontal) for 15 sec. 
The generalization test consisted of 9 blocks of 
trials with the appropriate test series randomized 
within each block. During the generalization test, 
each stimulus presentation was 5 sec. in duration, 

7 with an intertrial interval of approximately 5 sec. 
The E supplied no feedback to 5 during the general- 
ization test, 

Group 1 received stimulus lines of 40°, 55*, and 
70° during each test block; Group 2 received 40°, 
A7,8*, 55°, 62.5", and 70° lines; and Group 3 re- 
ceived 25°, 40°, 55°, 70°, and BS lines. The selec- 

- tion of these stimulus values allows an evaluation 
‘of the units effect with range held constant (Groups 
1 and 2) and an evaluation of the effect of range 
with the unit size held constant (Groups 1 and 3). 


Results and Discussion 


! Initially a Group X Stimuli (40°, 55°, 
— and 70°) X Sex analysis of variance was 


| performed, Because the variable of sex of 
J eomm i 
© emot d 
nn 
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Feuer i, Mean generalization gradients for the 


5 groups of Ss in Experiment I. 


G. HANSEN, A. TOMIE, D. R. THOMAS, AND D. H. THOMAS 


S neither yielded a significant main effect 
nor entered into any significant interaction 
all subsequent comparisons were pool 
across sex. 

The mean numbers of same responses f: 
the 3 groups are shown in Figure 1. The 
data of primary interest are the number of 
same responses to the common stimuli- 
40°, 55°, and 70°, All gradients peaked 
essentially the same level, Mean resp 
to 55° were 6.3, 6.6, and 6.8, for Grou; 2, 
and 3, respectively, (F < 1). 

Same responses to 40? and 70° pr- 
the primary measure for comparing 
effects of range and unit size on stimu 
generalization. The gradients of Gro: 

1 and 2 provide a direct test of the f; 
prediction generatéd by the units hypo 
esis, If generalization decrement var 
positively with the number of units int 
vening between training and test value: 
Group 2 should show fewer same responses 
to 40° and 70° than Group 1. The mean 
total same responses to 40° and 7 

these 2 groups were 2.40 and 24 

tively (F <1). Inspection of Fi: 
shows the gradients produced by thi 
2 groups to be nearly identical. 

The second prediction generated by the 
units hypothesis is that the generalization 
decrement shown for a given number of 
units away from the teaining stimulus will 
be the same for units of different size. 1f 
this prediction were correct, the level of 
responding to 40° and 70° for Group 1 
should equal that to 47.5° and 62.5? for 
Group 2. This is clearly not the case 
Mean total responses to these respective 
sets of stimuli were 2.45 and 8.40, which 
differ significantly, — F (1, 38) = 56.11, 
p < 01. Another comparison relevant to 
this prediction involves Stimuli 25° and 
85° for Group 3 vs. 40° and 70° for Group 2. 
Here the mean total responses, .25 and 
1.15, respectively, again differ significantly, 
F (1, 38) = 5.11, p < .05, 

Group 3 shows a substantial level of 
responding to 40° and 70°, with a mean of 
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= 1018, p < .01, Separate Newman- 
Keuls comparisons between Group 3 and 
Groups 1 and 2 were also significant 
(p <.01), As is shown in Figure 1, the 
effect of series range is not d t on 
any substantial responding to the added 
end values, The mean total responses to 
25° and 85° were only .25. 

It should be noted that the generaliza- 
tion test for Group 1 contained 27 stimulus 
presentations, compared to 45 for Groups 
2and 3. This procedural difference can be 
shown to be inconsequential in 2 ways. 
First the gradients of Groups 1 and 2 were 
comparable, despite the difference in 
of the test, Second, the data for Groups 
2 and 3 were reanalyzed so as to include 
only the first 5 test series (25 stimulus 
presentations). The gradients were then 
computed, based on the percentage of 
opportunities on which a response actually 
occurred. All of the group comparisons 
reported for data for the entire test were 
also observed in this reanalysis, at the same 
levels of confidence, 

This t, then, offers no support 
for the units hypothesis and suggests that 
the data presented by Thomas and Hiss 
(1963) also failed to support it, It should 
be noted that our Groups 2 and 3 varied 
in the same way as the groups employed 
in their study. The gradients show the 
same direction of differences in responses 
to the common test values. The inclusion 
of our Group 1, however, suggests that 
such differences are the result of teat series 
range rather than the unit size employed 
in testing. 


Exreament II 


Because the results of Experiment | and 
of the Tomie (1972) study suggest the 
operation of a range effect in human wave- 
Jength generalization, it seemed desirable 


able, 
procedure were therefore inevitable. Pilot 
work revealed that when the monochro- 


E 
Ll. oa 


matic lights were matched for brightness — 
(which Thomas and Hiss did not do), à : 
greater difference between adjacent stimuli 
was required than the $ nm. they used. 
Another change was the use of verbal 
same-different responses instead of the 
finger-lift response used by Thomas and 
Hiss. Pilot work in our laboratory has 
shown the verbal and motor responses to — 
be functionally equivalent, We have also 
determined that a much shorter exposure 
to the training stimulus then the 60 sec. 
used by Thomas and Hiss produces com- 
parable data, 


Method 


Subjects. The Se were J0 male and 30 female 

undergraduate students enrolled iu introduetory 
courses at the University of lowa. 

Apparatus, The apparatus conta a 11bem.- 
diam, circular aperture in a 76 20«cm.«wide bry 63 40- 
cm.hiyh black panel. The aperture was covered 
with translucent plastic. The source, a Bausch 
and Lomb Model 33-86-02 monochromator, equipped. 
with at en light source, was mounted 22: 86 cm 
behind and at the same height as the circular aper: 
ture, A pair of circular. Kodak Type M carbon 
neutral density optical wedges was mounted on the 
monec hroma toe wren the light source and the 
diffraction grating. All stimuli were equated for 
brightness loy. adjusting the settings of the variable 
wedge on the basis of the judgments of V independent 
Os prior to tbe initiation of the experiment, Adjust: 
ment of both the wr and the wedge 
was done manually by Æ during the intertrial inter- 
vale The Js were seated with their Lanes appre: 
mately @ cm. from the panel, with the circolar 
aperture approximately at eye level. 

Procedure, 4 


The Se were anigoed to 
3 with the restriction that each group cow 
tain 10 males and 10 females. Instruction wer the 


of the 
to stady the sample mulus carelully amd to 
respond tame or different on each trial in judging the 
test stimulus ja relation to the ple The in 
structions emphasised that S should respon! on cach 
trial and that responses shoal qoour during the 
stimulus presentation. 

AM Se received the same sample stimulus, a 
510-nm. light, for 15 ee, The geoeralisation test 
consisted of 9 blocks of trials, with the appropriate 
test series randomized within each block. During 
the generalization test, each stimulus presentation 
was $ ce. in duration, with an intertrial interval of 
approximately 5 sec. The E supplied no feedback 
to 5 during the generalization test. 

Stimuli received were of 518, $30, and 545 om 
for Group 1; $15, $22.5, $30, 547.5, and 545 nm 
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for Group 2; and 500, 515, 530, 545, and 560 nm. 
for Group 3. The selection of these stimulus values 
allowed an evaluation of the units effect with range 
held constant (Groups 1 and 2) and an evaluation 
of the effect of range with unit size held constant 
(Groups 1 and 3). 


Results and Discussion 


Because the rationale and design of 
Experiment 2 parallel Experiment I, the 
presentation of results will also be parallel, 
and can therefore be somewhat condensed. 
Again, an initial Group X Stimuli (515, 
530, and 545 nm.) X Sex analysis of 
variance was performed. Because the 
variable of sex of S neither yielded a 
significant main effect nor entered into any 
significant interaction, all subsequent com- 
parisons were pooled across sex. The 
generalization gradients of the 3 groups 
are presented in Figure 2. The 3 groups 
responsed quite comparably to the 530-nm. 
training stimulus value, with 8.1, 7.1, and 
7.5 responses for Groups 1, 2, and 3, 
respectively (F <1). The units hypoth- 
esis generates the prediction that Group 2 
should show fewer same responses to 515 
and 545 nm. than Group 1. The mean 


* GROUP | 


9 GROUP 2 
a OROUP 3 


MEAN NUMBER OF RESPONSES 


500 SIS 5225 530 5375 545 560 


WAVELENGTH (nm) 


Mean generalization gradients for the 
3 groups of Ss in Experiment II. 
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total same responses to these 2 values were 
2.75 and 3.45 for Groups 1 and 2, rospi 

tively. This difference is insignificant 
(F < 1) and in the opposite direction from 
that predicted. A second prediction gen- 
erated by the units hypothesis is that 
Group 1 should respond as much to 515 and 
545 nm. as Group 2 does to 522.5 and 537.5 
nm. Mean responses to these respective 
sets of stimuli were 2.75 and 7.40, which 
differ significantly, F (1, 38) — 32.79, 
.01. 


Ls 
According to the same prin iple, 


Group 3 should respond as much to 500 
and 560 nm. as Group 2 does for 515 nd 
545 nm. However, the mean total re- 


sponses, .25 and 3.45, respectively, again 
differ, F (1, 38) = 21.53, p < .01. 

Group 3 showed a substantial level of 
responding to 515 and 545 nm., with a 
mean of 5.70 total responses to these 2 
stimuli, Analysis of variance including all 
3 groups showed a significant difference in 
summed responses to 515 and 545 nm., 
F (2, 57) = 4.85, p < .025. Separate New- 


man-Keuls comparisons between Group 3 
and Groups 4 and 2 were also significant 
(p < .05). As is shown in Figure 2, the 


effect of range of test stimuli is not depen- 
dent on any substantial responding to the 
added end values. Indeed, responses to 
these values almost never occur. 

Finally, a reanalysis of the data including 
only the first 5 test series of Groups 2 and 3 
yielded all of the same group differences and 
similarities reported above for gradients 
obtained over the entire course of testing. 
Thus, in all major respects, the results 
obtained with wavelength stimuli in F 
periment I] those 


X e parallel obtained in 
Experiment I with line angles. 
GENERAL DISCUSSION 


Several interpretations may be offered for 
the effect of the range of test stimulus values 
on the slope of stimulus generalization gradi- 
ents, Capehart, Tempone, and Hebert (1969) 
have proposed that Adaptation level (AL), 
rather than the absolute value of the training 
stimulus, serves as the reference value in 
stimulus generalization studies. This view 
assumes that the training stimulus becomes 
the AL value and, in subsequent generalization 
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testing manipulations which affect the AL 
value affect generalization accordingly. Such 
manipulations include the use of a test series 
asymmetrically spaced around the training 
value, the over representation of certain test 
values during testing, etc. In the present ex- 
periment, the range of test stimuli in a symmet- 
rical distribution around the training stimulus 
would presumably not affect the mean AL, but 
it would affect AL variance and thus S's degree 
of certainty about whether a particular test 
stimulus was at the AL value at that time. If 
S had a set to “respond when in doubt,” then 
increasing the variance of AL could result in a 
flattened gradient. 

An alternative interpretation involves an 
application of Parducci's: (1965) range-fre- 
quency hypothes originally developed to 
predict performance in magnitude estimation 
tasks. If Ss placed the line angle stimuli (for 
example) in 3 categories (sharper than the 
training stimulus, same as the training stim- 
ulus, or flatter than the training stimulus), 
then broadening the range of test stimuli might 
extend the boundaries of the same category, 
with a consequent flattening of the line angle 
gradient. The present study does not permit 
a rejection of either of these interpretations, 
and there may be others which are equally 
tenable. 

With the proposed reinterpretation of the 
Thomas and Hiss (1963) study, it appears 
that support for Mednick and Freedman's 
(1960) units hypothesi is meager at best. 
None of the studies explicitly designed to test 
the hypothesis have provided confirmatory 
evidence. Furthermore, the evidence cited in 
the original Mednick and Freedman article 
was entirely based on a comparison of experi- 
ments performed in different laboratories, 
such that innummerable procedural differences 
might have accounted for the results obtained. 

Evidence for a range effect is somewhat 
more substantial, having been found reliably 
in a wavelength generalization study by Tomie 
(1972) and in the present line angle and wave- 
length experiments. In neither of the wave- 
length generalization studies with pigeons by 
Marsh (1967) and Friedman (1963) was 
reliable evidence found for a range effect, 
although in the latter case, there was some 


supportive evidence, which failed to achieve 
statistical significance. The variables that 
determine the presence or magnitude of a 
range effect in stimulus generalization have yet 
to be determined. Capehart et al. (1969) con- 
sidered and rejected the possibility that infra- 
human Ss may not be subject to contextual 
effects in generalization testing. We agree 
with their judgment that, in principle, such 
effects should not be limited to humans. 
Giurintano (1972) has demonstrated in a study 
with pigeons that when training has been very 
brief, generalization testing with a series of 
stimuli with all values to one side of the train- 
ing stimulus may result in a shift of maximal 
responding to a more centrally located stim- 
ulus. Perhaps the amount of training ad- 
ministered to Ss will similarly prove to bea 
crucial variable in the investigation of the 
range effect in stimulus generalization, and 
what now appears to be a species difference 
will turn out to be only a procedural one. 
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SOURNESS OF ACID MIXTURES! 


HOWARD R. MOSKOWITZ? 


Pioneering Research Laboratory, U.S. Army Natick Laboratories, Natick, Massachusetts 


The sourness of 5 acids (citric, hydrochloric, phytic, succinic, and gluconolac- 
tone) was assessed at 4 concentrations of each acid and in 16 mixtures with 
citric acid by the method of magnitude estimation. Power functions fit to the 
sourness of unmixed acids were either added directly to predict mixture sour- 
ness, or both acids in the mixture were first converted to equivalent levels of 
citric acid and then exponentiated according to the appropriate power func- 
tion. Summation of sourness functions accounted best for mixtures of citric 
acid with gluconolactone. Summation of acid equivalents was better for 
mixtures with phytic acid. Both models predicted mixtures with succinic and 
hydrochloric acids equally well. Mixtures with hydrochloric acid were syner- 
gistic. The results are discussed in terms of possible mechanisms underlying 


taste mixtures, 


For the past 50 years, sensory additivity 
of taste and smell has concerned investi- 
gators, who have attempted to derive rela- 
tions between the perceived intensity of 
mixture components tasted or smelled in 
in the unmixed (pure) state and the inten- 
sity of the mixture. Three major classes 
of models have been proposed, two from 
studies of taste and one from studies of 
smell. The present article concerns these 
models as they apply to the sourness of 
acid mixtures. 

Model 1, the summation of concentra- 
tions, derives from the earliest studies of 
taste mixtures and can be best seen in 
historical perspective in the extensive 
Sweetness studies of Cameron (1947). 
Cameron noted that the sweetness of mix- 
tures containing 2 or more sugars could 
be predicted if each sugar component were 
first compared to a reference sweet sub- 
stance. In the unmixed state, each com- 
ponent was compared to glucose and the 
"glucose equivalent" determined experi- 
mentally, The sum of the glucose equiv- 
alents of the 2 (sometimes 3) components 


! The author wishes to thank Harry L. Jacobs for 
numerous helpful discussions about the models of 
additivity and for suggestions about relating 
psychophysical measurement to electrophysi- 
ological studies. 

? Requests for reprints should be sent to Howard 
R. Moskowitz, Pioneering Research Laboratory, 
U.S. Army Natick Laboratories, Natick, Massa- 
chusetts 01760. 


approximated the glucose equivalent of ti 
mixture. 

Cameron's. (1947) additivity model wa: 
limited to equal-sweetness matches, since, 
at that time, no appropriate measures were 
developed for the direct assessment of sen- 
sory intensity that would, simultaneously 
bermit subjective magnitudes to be added, 
subtracted, or multiplied by a constant 
With the development of magnitude esti 
mation, Ss have been given a tool to repor! 
sensory intensities along a ratio scale. In 
this procedure, Ss assign numbers to taste 
intensity (S), and these numbers are re- 
&ressed against concentration (C) (Mosko- 
witz, 1970). The results for acids, evalu- 
ated in the unmixed state, suggest that the 
simple power function S = ŁC” provides a 
reasonable description of the growth of 
sourness with concentration (Moskowitz, 
1971). For 2 acids in a mixture, 2 separate 
power functions may be obtained : 


Acid A: S, = kC,” 
cd pra ENTE 
Acid B: S, & Cy". 
The exponents m and n need not be equal, 
and for organic acids they typically lie be- 
tween .5 and 1.0, so that sourness is a 
decelerating function of concentration. The 
intercepts $a and k, may also vary, and 
reflect the sourness estimate at 1.0 gm. 
molecular weight (M) of each acid. 
Cameron's (1947) additivity model may 
be expanded to account for power functions 
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Equation 1 can be 


of sensory intensity. 
solved for equivalents of 1 acid (e.g. Acid 
A) 


S, = Sp implies &,C," = koCo", [2] 


Ca = (hCw/b) [3] 


Equation 3 presents the concentration. of 
Acid A that is as sour as the corresponding 
concentration of Acid B. In a mixture of 
the 2 acids (Ca + Cy) the sourness can be 
redicted by Equation 4: 


Say = C. (O/k) n. E] 


Model 2, the summation of perceived 
sournesses, suggests that the gustatory sys- 
tem treats the 2 acid concentrations as 
separate inputs, processes them separately, 
and adds the outputs ("sensory judg- 
ments") together. Equation 5 presents 
Model 2: 


Sa = CST kyCu™ [5] 


Model 2 is simpler than Model 1, since it 
does not require any cross terms in the 
mixture equation. It also suggests a sub- 
stantially different process by which the 
gustatory system adds together informa- 
tion from a compound source, 

The third model, the vector addition of 
sourness judgments, is derived from an ap- 
proach to odor mixtures reported by Ber- 
glund, Berglund, Lindvall, and Svensson 
(1972). They proposed that binary mix- 
tures of odors, eg» dimethyl disulfide + 
pyridine, hydrogen sulfide + pyridine, etc., 
may add together together in intensity as 
if they behaved like vectors separated by 
a constant angle Q. Their additivity equa- 
tion is shown in Equation 6: 


S, = [S2 + Ss? + 29,8 cos Q)J*. 6] 


When the 2 odors (or here, acids) are 
qualitatively identical, then the angle 
separating them, Q, becomes 0, and its 
cosine becomes 1.0. Note that the vector 
additivity is expressed in terms of sub- 
jectively perceived intensity. A slight 
modification is needed to represent Equa- 
tion 6 as the sum of power functions. 

A previous study with sweeteners and 
their mixtures failed to distinguish between 


Models 1 and 2, viz., taste summation of 
concentrations vs. summation of subjective 
intensities (Moskowitz, 1973). In that 
study, glucose was mixed with fructose, 
saccharin, or cyclamate, and S was asked 
to estimate perceived sweetness. The 2 
models were shown to differ maximally 
when 2 conditions prevailed : (a) the ex- 
ponents m and # differed widely from 1.0, 
and (b) as little "noise" as possible was 
present in the mixture estimates. The 2 
models become increasingly similar as the 
exponents approach 1.0 (at which point 
they become identical). 

The present study concerns acid mixtures 
and the 3 models of summation. Previous 
studies with the sourness of organic acids 
(Beebe-Center, 1949; Moskowitz, 1971) 
suggested that the rated sourhesses of 
tartaric acid, citric acid, and a number of 
other compounds grow according to a simple 
power function of concentration; S = kC”. 
The exponent 7 is usually lower than 1.0, 
lying around .7 for citric acid and perhaps 
slightly higher for tartaric acid. When the 
measure of concentration is defined in terms 
of pH (—log H+ ions in solution, à meas- f 
ure of hydrogen ion concentration), the $ 
exponent » becomes —1.7, suggesting that 
apparent sourness grows as an accelerating 
function. 


METHOD 


In 4 experiments, citric acid was evaluated both in 
unmixed solutions and when mixed with another 
acid (either hydrochloric, succinic, phytic, or glu 
conolactone). Four concentrations of citric acid and 
4 of the second acid were evaluated in each experi- 
ment, as well as the 16 mixtures of the 2 acids, 
Table 1 lists the concentrations used. The stimuli 
were prepared in 250-ml, quantities with distilled 
water (Hydro Service Supply, Inc.), At the time 
of preparation, the stimuli remained for 24 hr. under 
refrigeration (4°C), following which they were frozen 
until use 2 wk. later, The solutions were thawed, 
remixed thoroughly, and served to Ss at room tem- 
perature (21°C + 1). 

Each stimulus was presented to 5 in small (1 oz.) 
paper cups containing 15-20 ml. of the solution, 
The stimuli in each set were presented in random 
order of concentration, compound, and mixture, 
and within any session, each S sampled all 24 stimuli 
(8 unmixed stimuli and 16 mixtures). The entire 
session required about 30 min., with S taking ap- 
proximately 1 min. to sample the stimulus, record 


the sourness estimate, and then rinse with 50-100 
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TABLE 1 
PARAMETERS OF SIMPLE SOURNESS 
Functions 
(S = kC" AND S' = k'C7) 
ER SORTES 
Acid n | SE. aie] 7 | gression | 10g At 
Experiment Is 
Gluconolactone | .79 40 |136 |.97| 09 | 1134 
Citric 66 | .09 | 1.66 | 97} {09 | 173 
Experiment 11» 
Phytic = 88 | .09 | 2.76 |.98 | .06 |249 
2 | 02 |159|:9 | Ls | 473 
93 | .0 | 2.17] 98] .07 | 1.83 
toi | 11 | 2.27 |197| fos | 153 
Experiment IV4 
Myarochiore .35 | .03 .85 | .98 02 14 
Citric 92 | 15 | 2:15 95) .H- |.rgs 


Note. Concentrations (molarity) were .05, ,025, .0125, and 
»00625 for hydrochloric, succinic, citric, and phytic acids; and 
+80, 40, .20, and .10 for gluconolactone, 

“n= 8, 

bn = 7, 

cn = 6, 

dn =7, 


ml. of water provided for that purpose. The rinse 
water was room-temperature tap water. 

All Ss were military enlisted men (aged 18-24 yr.) 
who had had previous experience in. judging. the 
apparent sweetness of sugars by magnitude estima- 
tion, None were aware of the purpose of the study, 
and each had been previously trained on the use of 
magnitude estimation according to the procedure 
outlined by Moskowitz (1970). A pool of 10 Ss 
participated, but smaller groups of individuals 
participated in any one session. The experiments 
consisted of one test in the morning and a replication 
of that test in the afternoon. The same Ss partici- 
pated in each session. 

Analysis, The median magnitude estimate was 
computed for the 24 judgments for each set of 
stimuli, The median was selected as the preferred 
measure of central tendency because it is not as 
drastically affected by an erratic judgment as is the 
geometric mean, For each experiment, power func- 
tions in their linearized form (log S = n Log C 4- 
log $) were computed by a least squares procedure, 
both for the median judgments of citric acid and 
for the median judgments of the second acid (in 
unmixed form), The goodness of fit was assessed 
by (a) the Pearson 7*, (b) the standard error of the 
regression, and (c) the standard error of the 
exponent. 


The 2 theoretical models of Sourness summation 


MOSKOWITZ 


ness (Si) was related to the empirical estimate of 
mixture sourness (Se) by the following equations: 


LU] 


log (Se) = » log (S) + hy 


and 5 

log (Se) = 1.0 log (S) + ks. [8] 
Equation 7 states that the relation between predicted: 
and obtained sourness is a power function. If the 
exponent » is 1.0, (Equation 8), then both pre- 
dicted and obtained sourness differ only by a single 
value, ke (ie. by a constant ratio), If the ex- 
ponent n is higher than 1.0, then the mixture sour- 
grows more rapidly than the theo; etically pre- 
dicted sourness. As the sourness leve! increases, 
the mixture sourness will eventually be substantially 
higher than that predicted from the nodels. The 
Opposite occurs if the exponent is lower than 1.0, 
Finally, if a single index number is desired to repre- 
sent, on the average, how different the predicted 
and the empirical sourness values are, then Equation 
8 is useful. The exponent is set arbitrarily to 1.0, 
and the least squares solution for ky provides that 
index number, If ks is positive, then th empirical 
sourness is stronger than that predicted from the 
additivity models; whereas if k, is negative, then the 
empirical sourness is weaker than that predicted. 
The former corresponds to "synergism," whereas 
the latter corresponds to “suppression,” 


REsvLTS 


Figure 1 shows the 4 pairs of sourness 
functions obtained from the median mag- 
nitude estimates of Sourness, and Table 1 
lists their respective parameters and the 
goodness-of-fit statistics. The acids are 
not described by the same power functions, 
but differ both in the exponent » and in the 
intercept k. The exponent of phytic acid, 
for example, is higher than that of citric 
acid judged in the same session. This 
means that equal percentage changes in 
concentration for the 2 acids produce a 
greater increment in the judged sourness 
of phytic acid than that of citric acid. 
Citric acid, in turn, is governed by a higher 
exponent than hydrochloric (a factor of 
almost 2.5:1), so that hydrochloric acid 
grows more slowly in sourness than citric 
acid for commensurate increments in molar 
concentration. 

In the evaluation of simple, unmixed 
acids, the appropriate experimental pro- 
cedure may be to determine sourness func- 
tions for both acids judged in the same 
session. [n this way, the same S judges 
the 2 acids, using the same modulus, and 
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FIGURE 1, 


Sourness functions for 4 experiments. 


(In each experiment, 


citric acid and 1 other acid were each evaluated at 4 concentrations. The co- 
ordinates of the figure are log-log, and straight lines represent power functions 
whose exponents are given by the slope of the line. The concentrations are 
in terms of molarity, and the median magnitude estimate is plotted against 


concentration.) 


brings to bear in the experiment the same 
biases toward the operation of judgment. 
Were 2 acids to be evaluated in separate 
sessions by different Ss, one S might use a 
larger range of numbers than another. 
The limited number range adopted by some 
Ss can decrease the power function ex- 
ponent considerably and has been called 
the “regression effect" in psychophysical 
judgment (Stevens & Greenbaum, 1966). 
If the regression bias is multiplicative or 
belongs to a class of transformations known 
as power transformations, then the rela- 


tive exponents, determined under these 
conditions, would yield the same ratio, 
even though each exponent might be re- 
duced or increased considerably. Here, the 
different Ss may have influenced the vari- 
ability of the exponent for citric acid from 
a low of .62 to a high of 1.01, This range 
of values lies within the range of exponents 
previously reported for various organic 
acids (Moskowitz, 1968, 1971). 

In the evaluation of unmixed acid solu- 
tions, a single representative value for 
relative sourness is often desirable, if only 


TABLE 2 
PARAMETERS OF MIXTURE FUNCTIONS 


Mixture 
Parameter I" 
Citric + glucono- Citrie Citric Citric + hydro- 
lactone | + succinic + phytic chloric 
Model 1: summation of concentrations | 
n .94(17)^ | 1.04(14) 1.09(.09) 1.18(.16) 
hi A5 — .09 - 24 — .03 
SE estimate 44 42 10 16 
" .68 .19 -90 19: 
ka 10 — .06 — 41 .08 
Model 2: summation of sournesses 
n 1.01(.14) 1.0515) 1.19(.11) 1.15(.16) 
kı .02 = i — A8 — 01 
SE estimate A4 312 B AS 
r .69 19 .89 19 
ky .02 = 07 — 30 -09 


Note. (a) log (Sournessempirien!) 


= n log (Sournessyredicted) + Æi, (b) log (Sournessenpirieal) = 1,0 log (Sournesspredicted) + k2. 


^ Numbers in parentheses refer to the standard error of estimate of 1, 
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to serve as an index number for purposes of 
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Acid mixtures. Table 2 presents the 


f comparing different chemicals. The rela- summary statistics for the analysis of acid 
tive sourness for 2 acids may be immedi- mixtures, and Figure 2 presents the relation 
ately obtained by comparing 2 magnitude between the logarithms of the sourness esti- 
estimates of different acids (usually at 1 mates (empirically obtained judgments) of 
common concentration, e.g., 1.0 M). The mixtures and the values for mixture sour- 
ratio of estimates (corresponding to the ness predicted by the summation models. 
difference in logarithmic values) corre- The models were evaluated both by the 
sponds to the desired ratio of sensory summation of concentrations (expressed in 
magnitudes. In contrast, the traditional equivalents of citric acid) and by the sum- 
(although misleading and ambiguous) de- mation of sournesses (ie, addition of 

.. finition of relative sourness has been the power functions). In many instances, the 
ratio of concentrations of 2 acids that pro- mixture sourness exceeded the predicted 
duce an impression of the same sourness sourness, and as a result, the vector model 
level. This latter ratio might better be of additivity required a cosine greater than 
called taste potency rather than relative 1.0, an impossible case. Vector additivity 
taste intensity. could not account for synergistic effects, 

When exponents differ for acids, the rela- and was eliminated as a predictor model, 
tive sournesses tend to vary with concen- based upon the relatively large number of 
tration. In order to achieve a single index synergistic mixtures. 
for the relative value, the 2 sourness func- The computations for mixture sourness 
tions must be forced to conform to a single in Table 2 and Figure 2 result from the 
power function with fixed slope (any slope composition of individual sourness func- 

^ or exponent suffices). The least squares tions (ie, by adding together psycho- 


estimate of the intercepts then serves as a 
useful value for sourness and can be used 
across the entire continuum of concentra- 
tions. In Table 1, the exponent has been 
set arbitrarily at .7 for each acid (independ- 
ent of the actual least squares exponent) 
and the value k’ corresponds to the inter- 
cept (at 1.0 M) for the .7 exponent. In 
all cases, the value for citric acid is 1.73 
(in logarithmic units). Numbers lower 
than 1.73 represent lower sourness, so that 
73 means that the second acid is 1.0 log 
unit lower in sourness (equal to 10% as 
sour as citric acid at equal concentrations). 
Gluconolactone has a sourness of 1.34, so 
that relative to citric acid, its sourness is 
—.39 (ie., 1.34 — 1.73). That value cor- 
responds to approximately 40% relative 
sourness (10-9 = 40%). Phytic acid is 
the most sour and gluconolactone the least 
sour acid. 


physical power functions) rather than from 
the empirical sourness estimates them- 
selves, As a result, the models are capable 
of accounting for mixtures of concentrations 
that are not directly tested in the experi- 
ment, since a function is used to transform 
concentration to sourness. In addition, 
however, those discrepancies between pre- 
dicted and obtained sourness of mixtures 
may, in part, be traced to the choice of 
sourness functions for the unmixed acids. 
The increase in generality by appeal to a 
simple parsimonious function will be offset 
when that function provides only a modest 
fit to the sourness estimates of unmixed 
acids. 

In Figure 2, the coordinates are loga- 
rithmic, and the best fitting functions in 
Table 2 refer to those coordinates. The 
straight line tilted 45? represents the line 
of congruity. When the predictions exactly 


deviations from the 45? line—ideal congruence between predicted and ob- 
tained values—correspond to percentage deviations, not absolute deviations. 
Mixtures that are plotted above the 45? line represent pairs of acids whose 


predicted sourness is greater than the 
mixtures show suppression or partial additivity. 


empirically judged sourness. Those 
Mixtures plotted below the 


45? line represent stimulus pairs whose predicted sourness is less than the 
empirical sourness. Those mixtures show synergism.) 
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equal the empirical sourness estimates, then 
all points fall along the line. Points below 
the line correspond to synergism, so that 
the obtained sourness estimates exceed the 
predicted values. Points above the line 
correspond to suppression, so that the ob- 
tained sourness estimates are less than the 
predicted values. Deviations from straight 
lines are logarithmic distances, correspond- 
ing to ratios in linear coordinates. The 
conversion to logarithms allows a constant 
percentage discrepancy to appear as a con- 
stant difference. In linear coordinates, 
that constancy would be difficult to ascer- 
tain by visual inspection alone. In addi- 
tion, small deviations at the low end of the 
sourness scale are as important for the 
appropriateness of the models as are the 
larger deviations that would be expected 
at the higher end of the sourness scale. 
The small deviations would be masked sub- 
stantially in linear coordinates. 

A number of phenomena emerge from 
Figure 2. First, for the mixtures of citric 
acid with either gluconolactone or succinic 
acid, the mixture sourness grows as rapidly 
as that predicted from the mixture equa- 
tions. The parallel functions do not appear 
for mixtures of citric with phytic and hy- 
drochloric acids. Instead, when the sour- 
ness estimate is the dependent variable, 
the slope relating the 2 mixture sournesses 
exceeds 1.0, so that the 2 values depart in- 
creasingly from each other with increasing 
sourness. Second, departures from con- 
gruence, even when of a constant percen- 
tage nature (ie, mixtures of citric acid 
with succinic acid and gluconolactone, re- 
spectively) can be either synergism (with 
gluconolactone) or slight suppression (with 
succinic acid). It is also interesting to 
note that phytic acid (which shows most 
suppression in mixtures) is the strongest 
of the acids in terms of the sourness it pro- 
duces, whereas gluconolactone is the weak- 
est. It is possible that relative sourness 
is implicated in synergism vs. suppression. 
Third, the 2 models produce highly cor- 
related. predictions that differ principally 
by a constant percentage. That percen- 
tage, rather than any other goodness-of-fit 
statistic, is the critical parameter by which 
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the 2 models can be differentiated and 
evaluated. The percentage by which one 
model exceeds the other (Figure 2) not 
constant, however, but varies systemati- 
cally according to the component power 
functions that are used to generate the 2 
mixture equations. For mixtures of citric 
and succinic acids, the models produce 
virtually the same predictions, whereas for 
mixtures with gluconolactone, the models 
predict moderately different sourness levels. 


Disc 


SSION 


The models of additivity suggested pre- 
viously to account for the sweetness of sweet- 
ener mixtures (Cameron, 1947; Moskowitz, 
1973) appear to be useful for predicting mix- 
ture sourness of acids as well. The present 
results bear upon 2 areas of taste psychology: 
(a) the analyzability of taste impressions and 
its correlation with taste intensity of mixtures, 
and (b) potential sensory mechanisms sub- 
serving taste impressions and their elucidation 
from studies of taste mixtures. 

Analysability. None of the Ss in the present 
experiment appeared to be aware that they 
were sampling both simple stimuli and mix- 
tures. Rather, the tastes were reported to be 
uniformly sour, Each S was questioned at the 
completion of the series of experiments, but 
none reported any other taste but sourness. 
Similar failure to perceive a taste as a mixture 
when both components possess the same 
quality was reported previously when fructose 
was added to glucose (Moskowitz, 1973). In 
that study, as in this, no stimuli contained 
bitter or other side tastes that might have 
interfered with sweetness or sourness, respec 
tively. It may well turn out that like-tasting 
compounds add together according to a simple 
arithmetic manner, with synergism or sup- 
pression. simply phemomena that reveal a 
failure to account in an adequate way for the 
law of summation. 

In contrast to simple additivity of like-tast- 
ing compounds, mixtures with different tast 
suppress each other, A study by Beebe-Center, 
Atkinson, Rogers, and O'Connell (1959) in- 
vestigated the saltiness and sweetness of com- 
pound tastes containing NaCl and sucrose. 
The saltiness and sweetness were reduced by a 
total of 40% (in gusts, their unit of taste 
intensity), to 60% of the starting value. Ina 

second study (Moskowitz, 1972), mixtures 
of sweet substances (glucose and fructose, 2 
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simple sugars) with NaCl (salty), citric acid 
(sour) or quinine sulfate (bitter) also showed 
suppresion, $0 that the total magnitude esti- 
mate given to the mixture (i.e., the sum of the 
magnitude estimates given to the 2 compon- 
ents) was less than the arithmetic sum of the 
estimates given to the unmixed components, 

It is possible that the laws governing taste 
mixtures parallel Ss’ ability to analyze the 
mixture into its components, Those mixtures 
in which analyzability is high (e.g., sweet + 
salty mixtures) may exhibit substantial sup- 
pression. Intermediate mixtures in which the 
components are qualitatively similar and merge 
slightly (e.g., some odor mixtures) show more 
additivity and le suppression, Finally, mix- 
tures of like-tasting compounds may show 
total additivity, or at least a high degree 
thereof. It is also intriguing to note that an 
entire continuum of mixture phenomena is 
possible, ranging from synergism to complete 
(100%) additivity, through partial additivity, 
and down to suppression. Taste mixtures in 
general occupy 2 separate regions. Like-tast- 
ing compounds occupy the upper region, 
whereas different-tasting compounds occupy 
the lower region of suppression. 

Underlying sensory mechanisms and taste 
mixtures. A study by Halpern (1959) com- 
pared the neural response at the medulla 
(nucleus fasisculus solitarius) to responses of 
the chorda tympani (peripheral taste nerve). 
At the chorda tympani homogeneous mixtures 
of inorganic salts produced an additive inten- 
sity response, SO that the neural response to the 
mixture could be well approximated by sum- 
ming the responses to the components. Heter- 
ogeneous mixtures of sucrose and citric or acetic 
acids also produced additive neural response 
(Halpern, 1967). In contrast, the central 
neural response at the medulla did not show 
a parallel summation. Instead, the magni- 
tude was smaller than the expected sum, but 
greater than either response to the components 
separately. It would be instructive to com- 


pare central vs. peripheral neural responses to 
both homogeneous and heterogeneous mixtures 
and to then determine parallels between hu- 
man psychophysical judgments of mixtures 
and neural responses. 
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A SEMANTIC INTERPRETATION OF ENCODING SPECIFICITY! 
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University of Michigan 


Two experiments are presented to clarify possible interpretations of th 


Encoding Specificity Principle of Tulving and "Thomson. 


This principle 


states that a cue must have been studied with a word in order for the cue to 
be effective at testing. In the experiments reported here, recall and recogni- 
tion of words were impaired by a change in the accompanying cues only if 
the to-be-remembered (TBR) words were of high frequency; low-frequency 
words did not support the Encoding Specificity Principle. The data suggest 
that both recall and recognition of a TBR word depend upon recognition 
of a specific interpretation of the word originally encoded, rather than its 


physical representation. 


The considerable recent research on con- 
text effects in memory (e.g., Bobrow, 1970; 
Light & Carter-Sobell, 1970; "Thomson, 
1972; Thomson & Tulving, 1970; Tulving 
& Thomson, 1971; Winograd & Conn, 
1971) has demonstrated that the ability 
to recall or recognize a word can be severely 
impaired by changes from study to test 
of the context in which the word is pre- 
sented. Light and Carter-Sobell, for ex- 
ample, paired homographs with one ad- 
jective at the time of study (e.g., soda 
cracker) and later tested recognition of the 
homographs when paired with the same 
adjective or with an adjective that primed 
a different sense of the homograph (e.g., 
safe cracker). Recognition was signifi- 
cantly better when the same adjective 
was paired with the target word even 
though S knew that only the noun was 
to be judged for familiarity; thus, a word's 
context can greatly affect whether Ss can 
access information stored with that word. 
This result is significant, since many models 
of memory (e.g., Anderson & Bower, 1972; 
Kintsch, 1970) assume that accessing a 
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word's trace is an automatic 
tests of recognition memory. 
Tulving and Thomson (1971) and Thom- 
son (1972) have also found th 
tion memory for to-be-rememberc«! 
words is impaired by changes in the cue 
word that is paired with a TBR word. 
Also, they found that adding or subtracting 
the cue word at the time of test impaired 
the recognition of TBR words. In con- 
trast to the Light and Carter-Sobell (1970) 
procedure, however, Tulving and Thomson 
did not use transparent homographs, and 
they did not make a deliberate attempt to 
prime different meanings of the words. 
Thomson and Tulving (1970) also have 
found that cued recall of TBR words is 
lowered when the cue employed at the 
retrieval test is different from the encoding 
cue given at input, and that result holds 
even when the retrieval cue is a strong 
normative associate of the TBR word. 
Strongly associated, extralist cues (strong 
cues) enhance recall when no cues are 
presented with the TBR words at input 
(Bahrick, 1969; Thomson & Tulving, 
1970), but are inferior probes compared 
to weak cues that were paired with TBR 
words at the time of input. It is possible 
in the Thomson and Tulving cued-recall 
situation that the S implicitly generates 
the TBR word to the strongly associated 
cue, but does not recognize the TBR word 
as being from the list. This interpretation 
is supported by the fact that Ss can 
generate the TBR item in a free-associa- 
tion task to a strong cue, yet fail on a later 


process in 
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test to recognize that word as a TBR 
word (Tulving & Thomson, 1973). 

Tulving and Thomson (1973) argue that 
only their Encoding Specificity Principle is 
consistent with these results. In defining 
this principle, they state that: "In its 
broadest form the principle asserts that 
only that can be retrieved that has been 
stored, and that how it can be retrieved 
depends on how it was stored [p. 359]." 
They assert that TBR words are stored 
as higher order episodic units and only 
elements of the episodic as opposed to 
semanlic (see Tulving, 1972) higher order 
unit can access the TBR item. However, 
Light and Carter-Sobell (1970) reject the 
notion that their results could be due 
simply to “the effect of breaking up an 
encoded adjective-noun unit at recogni- 
tion [p. 8].” They point out that words 
were recognized less frequently when a new 
adjective changed the sense of the noun 
than when the original adjective was simply 
deleted, and that recognition was better 
when the original adjective (e.g. soda 
cracker) was replaced by one that did not 
drastically alter the semantic interpreta- 
tion of the target word (graham cracker) 
than when the new adjective did alter the 
semantic interpretation of the target word 
in a substantial way (safe cracker). ‘Thus, 
Light and Carter-Sobell interpret their 
results as evidence that recognition of a 
word in a verbal learning experiment may 
involve recognition of the appropriate 
meaning of the word. 

Work by Winograd and Conn (1971) 
supports the Light and Carter-Sobell 
(1970) interpretation. Words known to 
have multiple meanings were presented to 
Ss with no explicit context, that is, no 
encoding cue. Prior to the main experi- 
ment, the relative frequencies of the vari- 
ous interpretations of each word were de- 
termined. Recognition for these words 
was then tested by presenting a word in 1 
of 3 ways: (a) In a sentence that used a 
frequent meaning or interpretation of the 
word; (b) in a sentence that used an in- 
frequent meaning of the word; or (c) by 
itself, that is, with no context or meaning 
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imposed by E. Homographs tested in a 
context that utilized their high-frequency 
meanings and those tested without an im- 
posed context were recognized about 
equally well, and both were recognized far 
more often than the homographs forced 
into an uncommon encoding or meaning. 
Winograd and Conn concluded that words 
with multiple meanings are encoded with 
respect to a specific meaning even though 
no cue is presented at input and that, in 
the absence of an experimentally provided 
context, the most frequent meaning of a 
word tends to be the one encoded. 

It might appear that the Light and 
Carter-Sobell (1970) and the Winograd 
and Conn (1971) analyses do not extend 
to the Tulving and Thomson data be- 
cause Tulving and Thomson did not use 
explicit homographs. However, many 
words in the language that are not ex- 
plicit homographs still have multiple sen- 
ses or interpretations. For instance, water 
does not have 2 obviously different mean- 
ings, but the sense of water when presented 
with lake is somewhat different than the 
sense of water when presented with drink. 
In fact, Webster's Seventh New Collegiate 
Dictionary (1965) lists 8 main senses of 
the noun water and 16 subsenses and 19 
sub-subsenses Included are senses un- 
familiar to most people and some obscure 
senses—e g, "a capital stock not repre- 
senting assets of the issuing company 
and not backed by earning power [p. 
1006 ]"—that are unrelated to any of the 
normal senses of the word. It is also 
certainly the case that there are still other 
interpretations of the word that are not 
in the dictionary but are known to some Ss. 
Thus, many words not considered homo- 
graphs allow multiple semantic interpreta- 
tions, and a plausible explanation of the 
Tulving and Thomson data posits that a 
change in cue words can tap a different 
sense of the TBR word. 

The experiments reported here were de- 
signed to test this semantic interpretation 
of encoding specificity. The basic idea is 
that words with few senses should be less 
vulnerable to changes in context in the 
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Tulving and Thomson paradigm than 
should words with many senses. 

Although the idea behind this research 
is relatively straightforward, obtaining an 
adequate, objective measure of the number 
of senses associated with words is a non- 
trivial methodological problem. One 
cannot simply ask Ss to report how many 
senses they have for particular words; it is 
not easy for Ss to retrieve all the senses 
they have for a word (MacKay & Bever, 
1967), nor is it easy to convince them that 
subtle distinctions like those associated 
with waler are to be regarded as different 
senses. If it were easy to retrieve all the 
senses of a word, Ss would not suffer the 
difficulties they do in experiments like 
those of Light and Carter-Sobell (1970). 

For other reasons, number of dictionary 
meanings is also an inadequate measure. 
Many senses listed in a dictionary are not 
instantiated in most people's lexicon, while 
other colloquial and idiosyncratic senses 
are not in the dictionary. It is also dif- 
ficult in practice to be consistent in terms 
of the cutoff point one uses to decide that 
a dictionary distinction is in fact sub- 
stantial enough to denote separate senses 
of a word. 

Based on these and other considera- 
tions, word frequency was chosen as the 
index of number of word senses. Words 
of very low frequency (less than 
9/1,000,000 in Kučera & Francis, 1967) 
were contrasted with words of moderate 

to high frequency (more than 50/ 
1,000,000). Generally, low-frequency 
words (e.g., hippopotamus, aspirin) tend 
to have only one sense. This phenomenon 
is probably attributable to there being 
little opportunity for multiple senses to 
differentiate and be maintained. It may 
also reflect the fact that words with one 
unique sense are appropriate to only a few 
situations and as a result have low fre- 
quency of occurrence. Therefore, word 
frequency is a fairly sensitive correlate of 
number of senses instantiated in the 
typical S. Schnorr and Atkinson (1970) 
found a strong positive correlation be- 
tween word frequency and number of dic- 
tionary meanings, as did we for the words 
used in the experiments reported here, 
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r —.61; ¢ (118) = 8.3, p < .001? Both 
word frequency and number of dictionary 
meanings are undoubtedly imperfect reflec- 
tors of number of senses, but frequency 
seems to provide a somewhat more satis- 
factory and objective basis on which to 
partition words into those having few or 
many senses. Later in this report fre- 
quency and number of dictionary mean- 
ings are compared in terms of their ability 
to predict memory performance. 


EXPERIMENT I 
Method 


Design and procedure. Four lists of 30 TBR 
words, 15 high frequency and 15 low frequency, 
were presented by slide projector at a 3.5-sec. rate 
to Ss for study and subsequent recall. Each TBR 
word was initially paired with a weakly associated 
cue; the TBR word was shown in uppercase letters 
directly below the cue word, which was shown in 
lowercase letters. After presentation of 30 pairs, 
10 of the TBR words were tested in each of the 
3 following conditions: (a) The .5 saw the original 
weakly associated input cue and was asked to 
recall the TBR word that : ed with it; 
(b) S saw a new cue strongly associated with a 
TBR word and was asked to use it to help him 
think of a TBR word from the last list; or (c) S 
received no cue for the word and was simply asked 
to try to recall the item from memory. Within 
each list, the 2 levels of word Írequency were 
crossed with the 3 types of output conditions. 
There were 5 words in each Frequency X Output 
combination. "Three different sets of recall sheets 
were used for each list so that, across Ss, ail words 
appeared in all output conditions, Each set con- 
sisted of 2 recall sheets, one for cued recall and 
one for the recall of words for which no probe 
or cue was given. The sheet for cued recall was 
Presented before the free-recall sheet for half of 
the Ss and after the free-recall for the other Ss. 
The Ss had as much time as they wanted for both 
uncued and cued recall. When uncued recall 
preceded cued recall, Ss were asked to recall all 
the TBR words; when uncued recall followed cued 
recall, Ss were told that they only needed to recall 
those words from the list that they had not re- 
called on the cued-recall sheet, but that they need 
not worry about duplications. 

The word presentation order within a list and 
the order of the cue words on the cued-recall 


*We had a group of Ss rate all the words in 
our experiments for number of different meanings 
they could think of for each word. The Ss were 
only able to generate a mean of 1,93 meanings 
Per word whereas Webster's dictionary has 8.83 
meanings. The number of generated meanings 
correlates .64 with frequency and .67 with number 
of dictionary meanings. 
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sheets were randomly determined but were held 
constant across Ss, Half of the 20 cues were as- 
terisked to indicate that they were new cue words. 
'The various recall sheets and the order of list 
presentation were counterbalanced over Ss. Two 
priming lists, which were not scored, preceded the 
4 lists of interest. The priming lists, composed 
of medium-frequency words, were tested at recall 
with their original input cues. The purpose of the 
priming lists was to induce Ss to encode the TBR 
word with respect to the original cue presented ; 
Ss were not informed that the priming lists were 
practice lists. Forty-eight paid Ss at the Uni- 
versity of Michigan were tested in groups of 1-7. 

Materials. All TBR words were nouns judged 
to be high in both meaningfulness and imagery. 
Items were selected from norms of frequency of 
words in print (Kuéera & Francis, 1967); the 
high-frequency words came from the upper third 
of the rank ordering in frequency, and the low- 
frequency words came from the bottom third; 
words in the priming lists were taken from the 
middle range. (The high-frequency words all had 
A or AA Thorndike & Lorge, 1944, ratings, and 
the low-frequency words averaged 6.7/1,000,000 in 
the Thorndike and Lorge count.) Those nouns 
selected that were included in Paivio, Yuille, and 
Madigan's (1968) rating of meaningfulness and 
imagery of nouns received scores that confirmed 
the subjective criterion for high imagery and con- 
creteness: They all were above 6 on the two 7-point 
scales. Thomson and Tulving (1970) constructed 
their lists of TBR words by selecting response 
words from free-association norms in which each 
TBR word occurred twice in the norms: once as 
a high-frequency response to a strong cue stimulus 
word and once as à low-frequency response to à 
different weak cue stimulus word. Association 
norms could not be used to select the cues for 
this experiment due to the difficulty of finding 
norms that contain low-frequency nouns as rte- 
sponses to other words, particularly as strong 
associates. 

Because of these inherent difficulties, strong and 
weak cues were generated according to intuitive 
criteria. That is, the strong cues seemed likely to 
elicit the TBR words frequently, the weak cues 
infrequently. The fact that the cues were con- 
structed in an informal manner does not seem a 
serious problem in methodology. The basic re- 
quirement is simply that the strong cues for low- 
frequency words are not more effective than strong 
cues for high-frequency words. This requirement 
is important because the predicted results (namely, 
that recall of low-frequency words with their new, 
strong cues will be superior to recall of high-fre- 
quency words with their new, strong cues) might 
otherwise be an artifact of prior strength. Five 
colleagues, naive with respect to the study, gave 
free associations to each of the strong cues used 
in the experiment. High-frequency TBR words 
were generated as free associates to the strong 
cues 24% of the time; low-frequency TBR words 
were generated only 12% of the time. 
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TABLE 1 


PROPORTIONS OF HIGH- AND Low-Frequency To- 
BE-REMEMBERED Wonps RECALLED ON TEsTS 
or CUED AND FREE RECALL 


Cued recall 
Word Wed Am Free 
frequency Strong Weak recall 
extralist within- 
cue lists cue 
Cued recall before free 
recall 
High 31 .69 04 
Low .64 .12 05 
Cued recall after free 
recall 
High 36 .63 42 
Low 59 69 14 


The other cue-construction requirement used by 
Thomson and Tulving (1970) is that the strong 
and weak cues for a given TBR word should not 
be associatively related. This requirement was 
impossible to maintain with low-frequency words. 
For 2 words to be both assoc atively related to 
a third word and yet not be associatively related 
themselves tends to require that the third word 
have more than one interpretation, and low-fre- 
quency words often do not. The only requirement 
in the current experiment was that no 2 cues to 
a given word be synonyms. 


Results and Discussion 


In Table 1, recall probabilities are shown 
as a function of type of test, order of 
tests, and TBR word frequency. The 
statistical analyses reported in this section 
were carried out on arc sine transforma- 
tions of the proportions in Table 1. 

Cued recall. Since performance on tests 
of cued recall was not sensitive to testing 
order, F (1, 46) = 1.2, the other statisti 
analyses of the cued-recall data were car- 
ried out on the pooled data. The result 
of greatest interest in this experiment, 
that is, the frequency by cue (strong vs. 
weak) interaction, was highly significant, 
F (1, 46) = 18.3, p < .001, with perform- 
ance in the high-frequency strong-cue con- 
dition much worse than performance in 
the other conditions. For high-frequency 
words, strong extralist associates were 
much less effective as recall cues than were 
weak intralist associates, F (1, 46) = 71.44, 
p «.001. In the case of low-frequency 
words, there is a smaller but still signifi- 
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cant advantage in the same direction, 
F (1, 46) = 7.56, p < .01. The latter 
result is not too surprising since low-fre- 
quency words are sometimes polysemous, 
and in such cases a change of cue would 
have adverse effects. 

In order to rule out the possibility that 
the foregoing results might be attributable, 
in part, to item selection (cf. Clark, 1973), 
the data were also analyzed by collapsing 
over Ss and treating words as the random 
effect. The Critical Frequency X Cue 
interaction was still highly significant, 
F (1, 118) = 13.51, p < .001. Even with 
both significant Fs, however, the results 
may still be colored by S-item selection. 
Clark prescribes the use of a min F’ sta- 
tistic, (F1F3)/(Fi1 + Fs), a conservative 
test, the significance of which guarantees 
generalizability of a result over Ss and 
items. The min F' statistic is significant 
for both the Frequency X Cue interaction, 
F' (1, 149) = 7.77, p < .01, and for the 
effect of weak vs. strong cues for high- 
frequency words, F' (1, 135) — 37.91, 
p < .001. On the basis of this more con- 
servative test, however, the effect of weak 
vs. strong cues was not significant for low- 
frequency words, /” (1, 142) = 3.45. 

Free recall. The results of the free-recall 
test in Experiment I are relatively straight- 
forward. Overall, Ss were a great deal 
less likely to free recall the 10 TBR words 
not tested on the cued-recall sheets than 
they were to recall TBR words in response 
to either type of cue. In contrast to the 
cued-recall performance, there was a dis- 
tinct effect of test order, F (1, 46) = 41.9, 
b «.001. The test of cued recall appears 
to have provided substantial interference 
with subsequent free recall of TBR words 
not tested during the cued recall, whereas 
cued recall was independent of whether 
there was or was not a prior free recall. 


EXPERIMENT Il 


Our interpretation of the context effects 
on cued recall in Experiment I involves 
a generation-recognition model for recall 
(see Anderson & Bower, 1972; Bahrick, 
1969; Kintsch, 1970). However, the model 
we have in mind, unlike the prior genera- 
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tion-recognition models, assumes that § 
does not recognize the words per se, but 
rather, semantic interpretations of these 
words. The S implicitly generates words 
and attempts to recognize their senses, 
If he recognizes the senses as being from 
the TBR list, then he recalls the cor- 
responding word. The difficulty caused 
by a change in context is ributable to 


the recognition phase of ull; S can 
generate the TBR word : ` strong cue 
but he cannot recogniz: because he 
assigns a different interpretation to it than ` 
he assigned when the wor! was paired 
with the weak cue. In ! ment II, 
Ss were instructed to geneiatc free asso- 
ciations to strong cues, and n to recog- 
nize if any of the associates were TBR 


words. If the difficulty of recalling high- 
frequency words to new, strony cues results 
from difficulties in generation, then new, 
strong cues should have a greater propen- 
sity to elicit their low-frequency TBR 
words than would the corresponding new, 
strong cues to their high-frequency TBR 
words. However, we expected this not to 
be the case. Rather, we expected that 
high-frequency TBR words, when gener- 
ated to the cues, would not be recognized 
as well as low-frequency TBR words. 


Method 


Subjects, Thirty undergraduates at the Uni- 
versity of Michigan participated as paid . The 
Ss were tested in groups of approximately 5. 

Materials and apparatus. Four of the 6 lists 
used in Experiment I were used in Experiment II— 
the same 2 priming lists and 2 of the 4 critical 
lists. Each list consisted of 30 TBR words, again 
the priming lists being all medium-frequency words, 
while the critical lists were half high-frequency 
words and half low-frequency words. All words 
in each list were paired with a weak cue. Each 
list was presented by means of a Kodak Carousel 
slide projector. 

Procedure and design. The 2 priming lists pre- 
ceded the 2 critical lists. The procedure followed 
and the reason for including the priming lists in 
the design were the same as in Experiment I. 

After presentation of the first of the 2 critical 
lists, there was the following sequence of events. 

1. The Ss were first given a sheet of paper 
with 30 strong extralist cues listed in a column 
oa the left-hand side of the sheet. They were 
asked to generate exactly 4 free associates to each 
strong cue. It was pointed out to Ss that their 
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free associates to a given cue might contain a TBR 
word from the preceding list. They were asked 
to circle any such words they generated. 

2. After the free-association and recognition 
tasks, Ss were given a 4-alternative forced-choice 
(4-AFC) recognition test consisting of 30 sets of 
4 words, each set containing 1 of the 30 TBR 
words presented in the first critical list. The 3 dis- 
tractor words in each set were drawn from words 
generated by several individuals asked to free 
associate to word pairs cor ing of a TBR word 
and its strong cue. The distractors were selected 
to have about the same frequency as the TBR 
word, 

3, Finally, in order to motivate 5s to attend 
to the weak within-list cues presented in the 
second critical list, there a test of cued recall 
involving the weak within-list cues used in the 
first critical list. 

The presentation of the second critical list was 
followed by the same sequence of events, except 
that when Ss free associated, they were asked to 
think of ings as possible for each 
word generated. were asked to gen- 
erate 4 words to the cue, but then to try to examine 
each of these words for meanings not related to 
the cue that elicited them. This instructional 
manipulation was motivated by the idea that it 
might reduce the inhibiting effect of the semantic 
context induced by the strong cue on the recogni- 
tion of TBR words generated by Ss. 


Results 


The data from Experiment II are pre- 
sented in Table 2. The effect of instruc- 
tions, i.e, to think of multiple senses of 
the generated words during the free-asso- 
ciation task, was insignificant with respect 
to all 3 dependent variables so the data 
are pooled over the 2 critical lists. 

As expected, the probability of gener- 
ating a high-frequency word to the new, 
strong cue was not lower than that for 
a low-frequency word; if anything the 
reverse was true: High-frequency TBR 
words were generated with probability .59 
to their respective cues, and low-frequency 
TBR words were generated with proba- 
bility .55 to their strong cues. The 
probability that a low-frequency TBR 
word was recognized (.84), however, given 
that it was generated during the free- 
association task, was more than twice the 
probability that a high-frequency word 
was recognized (.38) given it was gen- 
erated. The probability of recognizing 
low-frequency TBR words in the 4-AFC 
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TABLE 2 


RECOGNITION AND CuED RECALL PROBABILITIES AS A 
FUNCTION or TEST TYPE AND WORD FREQUENCY 


Type of test 


Word = L fer 
Met oes 
frequency | Generation- 4-AFC Cued 
| Secognition | recognition | recall 
Me a 
High | 38 (.59) | 72 | 61 
Low | 84 (.55) 85 | 71 
| 


Note. The values in parentheses are the probabilitie 
a to-bc-remembered word was generated in the free-assets 
At. Abbreviation: 4-AFC = 4-alternative forced-choice 
recognition test. 


recognition test (.85) was also higher than 
the recognition of high-frequency words 
(.72), although the difference was much 
smaller. 

The recognition proportions for indi- 
vidual Ss were converted via the arc sine 
transformation and the transformed scores 
were subjected to an analysis of variance 
with the independent variables being word 
frequency and method of test. The Fre- 
quency X Test Type interaction was 
highly significant, (1, 29) = 30.34, p 
< 001, as was the difference between 
recognition of S-generated high- and low- 
frequency TBR words, F (1, 29) = 117.56, 
p < 001, and the difference between the 
recognition of high- and low-frequency 
TBR words on the 4-AFC test was also 
significant, F (1, 29) = 9.39, p < .01. 
Again, in consideration of possible selec- 
tion artifacts, min F's were computed for 
the Frequency X Type of Recognition 
Test interaction, /” (1, 13) = 16.25, f$ 
< .001, the difference between recognition 
of high- and low-frequency generated TBR 
words, F' (1, 87) — 41.03, p < .001, and 
the difference between frequency levels in 
the 4-AFC, F' (1, 50) = 7.02, p < .025. 

The fact that the discrepancy between 
the recognition of high- and low-frequency 
TBR words is less on the 4-AFC test than 
it is for the recognition of S-generated 
TBR words is probably attributable to 
differential effectiveness of the context in- 
duced by the test. The meanings of 
S-generated TBR words are clearly de- 
termined by the strong cue from which 
they are generated. However, in the 
C test, Ss seemed better able to judge 
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à word without regard to the other alter- 
natives. Since the strong cue was not 
actually present on the 4-AFC test, it was, 
presumably, easier to think of the word 
with a meaning different from that im- 
posed by the strong cue and more like 
that imposed by the original cue. Tulving 
and Thomson (1973) also found improved 
recognition of TBR words when the alter- 
natives in the 4-AFC test were not gen- 
erated by the S* to the strong extralist cue. 
Recall to the original cues yielded results 
very similar to the corresponding condi- 
tions in Experiment I: The cued recall 
probability for high-frequency words was 
-61 and the corresponding probability for 
low-frequency words was .71. 


GENERAL DISCUSSION 


The present results necessitate some refine- 
ment and clarification of Tulving and Thom- 
son's (1973) Encoding Specificity Principle. 
It seems plausible, at least for retention in- 
tervals on the order of those in the present 
paradigm, that what is recognized in a rec- 
ognition test is a particular sense of a word 
rather than phonemic or orthographic infor- 
mation. A cue may generate the same word 
presented earlier, but if the semantic inter- 
pretation imposed by the cue in this genera- 
tion process is different from the meaning 
originally encoded, the word is unlikely to 
be recognized or recalled. It may be that 
if the word is tested after a brief retention 
interval, then phonemic cues will improve 
recognition. However, in experiments such 
as these in which Ss must use long-term 
memory at testing, the information retrieved 
would probably be a sense of the word, not 
its spelling pattern or phonemic character- 
istics per se. Past studies have shown that 
long-term memory confusions are semantic in 
nature rather than acoustic as they are in 
short-term memory (e.g., Craik, 1968). Fur- 
thermore, since the relation between a TBR 
word and its cue is a semantic one, it is 
even more likely in cuing experiments than 


*The difference between recognition of words 
generated by S and recognition of words that 
were generated by a yoked S was in the same 
direction as our results although the difference 
was smaller. The smaller difference may be due 
to the fact that in their experiment the strong cue 
words were presented along with the distractors, 
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in other experiments that an S wouk 
to the meaning of the TBR word. 
Tulving and Thomson (1973) ass« that 
the generation-recognition models for recall 
(Anderson & Bower, 1972; Bahrick. 1969; 
Kintsch, 1968, 1970) are incompatible with 
the findings that support their Encoding Speci- 
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ficity Principle. They argue that ge rating 
a word does not guarantee that it will be 
recognized (which is the requirement for r« call) 


even though the “event information,” a they 
call it, is available. Previous descriptions of 
the generation-recognition model (e.g, An- 
derson & Bower, 1972) stipulated that if the 
TBR item was encoded as occurring in the 
list, only implicit generation of the appro- 
priate spelling pattern (or phonemic cues) 
was required for recall, Tulving and Thomson 
showed that this was not the case. How ever, 
if the generation-recognition models are modi- 
fied by the assumption that word senses are 
the basis of recognition, then these models 
become compatible with their principle. 
Words, per se, are not generated to a cue, 
but rather senses of a word are generated. 
The generated meaning must match a meaning 
encoded during the experimental t: before 
recall can occur. 


Effect of Strong versus Weak Cues 


We proposed that many words have multi- 
ple senses and frequency of the word in print 
is a good index of how many senses words 
have for a particular S. Some senses of a 
word will unavoidably be used and en- 
countered more often than others. A strong 
cue, as manifested by its high propensity to 
elicit the TBR words, probably selects a 
salient sense of the TBR word. On the other 
hand, weak associations tend to be bizzarre 
and thus tap less likely interpretations of the 
TBR words. When S sees à strong cue and 
generates the TBR word, the sense or inter- 
pretation which comes to mind is totally 
different from the obscure one studied, and 
S frequently cannot think of all the possible 
meanings related to a word generated and 
thus does not "notice" the studied sense. 
Therefore, it would seem that the more ob- 
Scure or removed the old interpretation, the 
less likely that it will be noticed, and con- 
sequently, the less likely the word would be 
recognized, 

Tulving and Thomson noted that their 
theory could not account for the finding that 
weak input cues paired with strong output 
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cues produced higher recall than strong input 
cues and weak output cues, as they expected 
no difference (Thomson & Tulving, 1970). 
If one assumes, however, that recall requires 
both generation and recognition, then one 
should expect better recall when the output 
cues are strongly associated rather than weakly 
associated because, in the task of recall, the 
candidates must be thought of before they can 
be recognized as words from the list. There is 
simply a much higher probability of generaling 
the TBR word with the strong cue, regardless 
of the 2 cues' relative effects on recognition. 
Therefore, one would have to predict that 
weak input cues with strong recall cues give 
better performance than the converse. 

Given the generation-recognition distinction 
and the assumption that it is senses that are 
recognized, the predictions for recognition are 
the opposite of those made for recall: A TBR 
word presented with a strong cue and later 
tested with a weak cue should be better 
recognized than a word originally presented 
with a weak cue and later tested with a strong 
cue. This prediction follows because the TBR 
word in the presence of a strong associate is 
not likely to suggest the obscure alternate 
sense of the word studied during its presenta- 
tion with a weak cue. On the other hand, 
if the TBR word were originally presented 
with a strong cue, a frequent interpretation 
of the word would be studied. Because this 
sense is so salient, S would be likely to revive 
that same sense even when the word is pre- 
sented in the context of a new, weak associate. 
This prediction was also confirmed in Tulving 
and Thomson's (1971) data, but no explana- 
tion was offered for that outcome. 

Thomson (1972) also did not explain a result 
from one of his recognition experiments in 
which context was deleted. He found that 
presenting only 1 of 2 words originally pre- 
sented as a pair had a less deleterious effect 
when the original pair had a strong asso- 
ciative relation than when it had a weak one. 
Also, the most harmful effect on recognition 
performance occurred on tests of single words 
initially presented on the right side of weakly 
associated input pairs. One possible explana- 
tion is that strongly related pairs tend to be 
encoded with the common sense of each word, 
so if either of the words is presented separately, 
the original sense is very likely to be noticed. 
On the other hand, when 2 words are only 
weakly related, at least 1 of the 2 words 
must be encoded in terms of a less frequently 
used sense of the word. It seems reasonable 


that the first word encountered when reading 
the pair imposes its dominant sense on the 
second word (unless, for some reason, a more 
obvious interpretation employs a common 
sense of the second and an obscure sense of 
the first). Thus, it seems plausible that a 
word on the right in a weakly associated pair 
would be likely to have an infrequent and 
unusual meaning encoded, which, in turn, 
would produce poor subsequent recognition of 
the word. 


Effects of Word Frequency 


As stated earlier, we think that word fre- 
quency is a better index of the average num- 
ber of senses of a given word than is number 
of dictionary meanings. Although the 2 in- 
dices are strongly correlated for the words 
in this experiment (r = .61), the frequency 
measure correlated better with the obtained 
results, Using the data of Experiment 1, 
a difference score was computed for each 
TBR word by taking the number of correct 
recalls to the original weak cue and sub- 
tracting the number of correct recalls to the 
new, strong cue. This difference score cor- 
related .37 with frequency and .17 with 
number of dictionary meanings. The cor- 
relation between number of dictionary mean- 
ings and the difference score when frequency 
was partialed out was —.06 and frequency 
correlated .33 with the difference score when 
the number of dictionary meanings was par- 
tialed out. The first partial correlation is 
negative and not significant, / (117) — .65, 
p< .60. The second is significant, £ (117) 
= 3.78, p < .001. 

Rubenstein, Garfield, and Millikan (1970) 
provide incidental support for the notion that 
high-frequency words are basically quite simi- 
lar to homographs and that frequency may 
be a better indicator of number of senses in 
a person's lexicon than dictionary entries, 
They administered a task that showed that 
decisions concerning whether letter strings 
are words had shorter latencies for homo- 
graphs than nonhomographs. Their explana- 
tion was that homographs have more lexical 
entries and hence the probability of matching 
the stimulus with an entry is greater in any 
period of time, thus making reaction times 
shorter for homographs. Rubenstein et al. 
also found a highly significant effect for fre- 
quency of the word, high-frequency words 
being faster than lower frequency words; in 
fact, in a reanalysis of their data, Clark (1973) 
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found that the homography effect per se was 
insignificant, but that the strong frequency 
effect remained. Neither Rubenstein et al. 
nor Clark considered the notion that fre- 
quency might be a very sensitive measure of 
the number of meanings for a given word in 
à particular S. 


REFERENCES 


ANDERSON, J. R., & Bower, G. H. Recognition 
and retrieval processes in free recall. Psycho- 
logical Review, 1972, 79, 97-123. 

Banrick, H. P. Measurement of memory by 
prompted recall. Journal of Experimental Psy- 
chology, 1969, 79, 213-219. 

BoBRow, S. A. Memory for words in sentences. 
Journal of Verbal Learning and. Verbal Behavior, 
1970, 9, 363-372. 

CLARK, H. H. The language-as-fixed-effect fallacy: 
A critique of language sampling in psychological 
research. Journal of Verbal Learning and Verbal 
Behavior, 1973, 12, 335-359, 

Craik, F. I. M. Types of error in free recall. 
Psychonomic Science, 1968, 10, 353-354 

Kintscu, W. Recognition and free recall of 
organized lists. Journal of Experimental Psy- 
chology, 1968, 78, 481—487. 

KiTscH, W. Models for free recall and recogni- 
tion. In D. A. Norman (Ed.), Models of 
human memory. New York: Academic Press, 
1970. 

Kučera, H., & Francis, W. N, Computational 
analysis of present-da y American English. Proy- 
idence; Brown University Press, 1967. 

Licut, L. L, & CARTER-SOBELL, L. Effects of 
changed semantic context on recognition memory. 
Journal of Verbal Learning and. Verbal Behavior, 
1970, 9, 1—11. 

MacKay, D. G., & Bever, T. A. 
ambiguity. 
193-200, 


In search of 
Perception & Psychophysics, 1967, 2, 


L. M. REDER, J. R. ANDERSON, AND R. A. BJORK 


Parvio, A., YuiLLE. J. C, & Manica: 5. AN 


Concreteness, imagery, and meaningfuln: ilues 
for 925 nouns. Journal of Experimey Psy- 
chology Monograph Supplement, 1968, 76 ( t. 2), 
RUBENSTEIN, H., GARFIELD, L., & MILLIKA J. A. 
Homographic entries in the internal icon. 


Journal of Verbal Learning and Verbal B 
1970, 9, 487-494. 


iior, 


Scunorr, J. A, & ATKINSON, R. C. Stud posi- 
tion and item differences in the short- and onge 


term retention of paired associates lear by 


imagery. Journal of Verbal Learning am bal 
Behavior, 1970, 9, 614-622. 

Tuomson, D. M. Context effects in reco, tion 
memory, Journal of Verbal Learning and 
Behavior, 1972, 11, 497-511. 

Tnowsow, D. M., & TutvinG, E. Asso ive 
encoding and retrieval: Weak and strony ues. 
Journal of Experimental Psychology, 197 — 86, 
255-262. 

THORNDIKE, E. L., & Lore, I The ic her's 
wordbook of 30,000 words. New York: Ce mbia 
University, Teachers College, Bureau of lica- 
tions, 1944, 

TuLviNG, E. Episodic and semantic memory: In 


E. Tulving & W. Donaldson (Eds.), Organisation 
of memory. New York: Academic Press, 1972. 

TurviNG, E., & THomson, D. M. Retrieval pro- 
cesses in recognition memory: Effects of associa- 
tive context. Journal of Expe rn 
ogy, 1971, 87, 116-124. 


TurviG, E, & THOMSON, D. M 4 
specificity and retrieval processes in ic 
memory. Psychological Review, 1973, 80, 3. 

Webster's seventh new collegiate dictionary. (vo <d.) 


Springfield, Mass.: G. & C. Merriam, 1965. 

WiNOGRAD, E, & Conn, C. P. Evidence irom 
recognition memory for specific encoding of un- 
modified homographs. Journal of Verbal Learn- 
ing and Verbal Behavior, 1971, 10, 702-706. 


(Received July 14, 1973) 


bal 


Journal of Experimental Psychology 
1974, Vol. 102, No. 4, 657-669 


RECALL AND RECOGNITION OF PICTURES BY CHILDREN 


AS A FUNCTION OF ORGANIZATION AND 
DISTRACTOR SIMILARITY ' 
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Verbal description, recall, and recognition of complex meaningful pictures by 
children were studied, varying amount of organization in the stimuli and simi- 
larity of distractors. Across Ss (sex, ethnic group, and grade level), verbal 
measures were poor predictors of recognition accuracy. Across stimuli, amount 
recalled and recognition accuracy were both related to amount of organization. 
Recognition was also a function of the type of transformations on the target 
that were used as distractors. For all transformations there was a close match 
between ability to recognize a transformation and judgment of dissimilarity of 
the transformation to the target. The TORSCA multidimensional scaling 
technique was applied to the similarity judgments to obtain a representation 
of a memory space for the targets and their transformations. The structure 
of this space was highly consistent across Ss and indicated that transformations 
on meaningful pictures can be related to each other in stable ways. Location 
of transformations within the space was related to type of picture and amount 


of organization in the picture. 


Investigators exploring the visual storage 
systems of both adults and children (Brown 
& Scott, 1971; Shepard, 1967; Standing, 
Conezio, & Haber, 1970) have shown that 
once Ss are exposed to pictorial material, 
their ability to recognize previously seen 
pictures is extremely good. Standing et al. 
presented 2,560 pictures and found a hit 
rate for old pictures of .95 or better. 
Haber (1970) suggested that picture recog- 
nition may be essentially unlimited, and 
has contrasted it with recognition for verbal 
materials. "Unlimited" is perhaps too 
strong a term, however, since several vari- 
ables are known to affect accuracy of pic- 
ture recognition. 

1. Similarity of distractors to target: 
Recognition of pictures of single objects is 
directly related to similarity of the distrac- 
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tors, when the latter are scaled along a 
single "similarity" dimension (Bahrick, 
Clark, & Bahrick, 1967). When more com- 
plex pictures have been studied, however, 
few experiments have systematically manip- 
ulated this variable. Most studies have 
used a random selection of distractors, 
chosen from the same modality as the 
target items (e.g., photographs) but with 
content uncontrolled. One study which 
used similar distractors (Dallett, Wilcox, & 
D'Andrea, 1968) found somewhat lower 
recognition rates, ranging .69-.85, but no 
attempt was made to define or scale degree 
of similarity. Brown and Campione (1972), 
working with children, studied a single 
dimension of similarity by using distractors 
that varied from the targets only in the 
type of action a central character per- 
formed. Immediate recognition averaged 
around 95%. Thus, for adults, recognition 
accuracy is affected by similarity of dis- 
tractors, but there are not enough data to 
know whether this is as true for children. 
In addition, little is known about the di- 
mensions along which children scale simi- 
larity of remembered pictures to distractor 
items, 

2. Organization of the stimulus materials: 
Organization of a picture is defined here 


657 


658 


by the relations among its various parts 
and is equated with the concept of meaning 
(see Garner, 1962, p. 141). Adults recog- 
nize pictures of faces better than inkblots 
(Goldstein & Chance, 1971), and children 
recognize meaningful pictures better than 
abstract or nonsense pictures (Nelson, 
1971). However, little is known about the 
effects of higher levels of organization on 
recognition, namely, the relationships 
within a picture among parts that are 
themselves meaningful, such as are usually 
found in complex pictures. 

It seems likely that judgments of simi- 
larity will vary with the amount of organi- 
zation (i.e., number of meaningful relations 
among objects) in a picture. There is little 
information on this problem, yet it seems 
reasonable to assume that the more infor- 
mation that is organized into a single chunk 
(or picture) the less likely it is to be con- 
fused with another chunk containing an 
equal amount of different information, 
Thus, if complex pictures do contain a very 
large amount of both visual and semantic 
information, it is probably the case that 
the pictorial distractors used in experiments 
such as that of Standing et al. (1970) were 
grossly dissimilar from the target items. 
This brings us to the third variable known 
to affect accuracy of picture recognition. 

3. Verbal or semantic information: Sey- 
eral studies have assumed that using pic- 
torial distractors that have the same verbal 
label as the target pictures will increase 
distractor similarity and therefore decrease 
recognition. In 2 studies with children 
this was not the case; accuracy increased 
under this condition (Rosinski, 1970) A 
decrease in recognition accuracy has been 
reported with adults, however (Bahrick & 
Boucher, 1968). Further, verbal labels at- 
tached to a geometric form have been 
shown to produce Systematic changes in 
recognition accuracy along a continuum of 


vestigator who is intereste 
of meaningful pictorial materials by chil- 
dren or adults is the relation of purely 
visual dimensions, such as Size or shape, to 


* The second study is an 


unpublished report by 
L. R. Brooks written in 1971 
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the semantic dimensions of pic tures, Fi 
example, we do not know whether chan 
in orientation, deletions, or 
affect recognition of meaningf 
of objects in picture form in the 
as they do for geometric shapes 
The present experiment investigated sev- 
eral of these aspects of picture recognitio 
in children. The degree « ganization 
of complex pictures was va 
the similarity of distractor 
tors were constructed by 


ing trans 
formations on the original res. Tw 
recognition tests were used measuriny 


recognition accuracy and th 
judgments of similarity of t! 
formations to the remem 
Multidimensional scaling t 


r requiring 
ious trans- 
pictures. 


jues were 
used to construct dimensi similarity 
of the various transformations to the orig- 
inals. In addition, verbal description. 
and verbal recall of th: pictures were 
studied. 


One further problem was investigated in 
this experiment. It has frequently been 
Suggested in the literature that black chil- 
dren may have more difficulty in handling 
pictorial material than white children 
(Farnham-Diggory, 1970; Pettigrew, 1964; 
Tyler, 1956). In Spite of its widespread 
currency the hypothesis has never been 
adequately tested. Most of the studies 
used to support the hypothesis have con- 
trasted performance on vis; il and verbal 
tasks in IQ test batteries. Such test bat- 
teries are not well-suited for measuring 
perceptual functioning, since each subtest 
usually involves a number of cognitive 
factors (e.g., Cohen, 1959). 'The present 
experiment provided an opportunity to 
test the hypothesis more directly, since 2 
aspects of recognition of. pictorial materials 
were studied, Using measures relatively 
uncontaminated by verba] requirements. 
For this Teason, as well as to insure gener- 
ality of results, children from black, Chi- 


cano, and white (Anglo) ethnic groups were 
used as Ss, 


METHOD 
Subjects 


Because extensive verbal and recognition data 
were collected, a large number of Ss was used, each 
working with only 2 target pictures, rather than a 


^ 


FIGURE 1, 


smaller number of 5s, each viewing many pictures. 
The Ss were 168 children from the San Diego 
Unified School District, divided into groups by sex, 
ethnic group (black, Chicano, and white), and 
grade level (Grades 1 and 2). Chronological age 
in the first grade ranged 6.2-7.1 yr. (M = 6.75) 
and in the second grade, 7.2-8.3 yr. (M = 7.8). 
Almost the entire first and second grade English- 
speaking classes of one school, located in an upper- 
lower socioeconomic status area, were used as Ss. 
The area was composed of a fairly even mix of 
the 3 ethnic groups. United States Census Tract 
data indicated a comparable socioeconomic status 
for the 3 groups in terms of employment. Median 
family income in the area was $7,075, with Chicanos 
having slightly higher income ($7,415) than whites, 
and blacks slightly lower incomes ($6,680). Median 
amount of schooling for adults in the area was 
11.9 yr. Black adults were above the median 
(12.3) and Chicano adults slightly below (10.6 yr.). 
There were not enough black children to complete 
the sample, and slightly less than half of the second- 
grade black sample was drawn from another school 
from a somewhat higher socioeconomic status area, 
with a higher median family income of $10,150, 
but the same median schooling for adults of 12.4 yr. 


Stimuli 


Two sets of 21.6 X 27.9 cm. black-and-white 
line drawings were used; each set contained one 
“organized” and one “unorganized” picture. Within 
a set, the organized and unorganized pictures con- 
tained the same objects. In the organized version, 
a naturalistic scene was created, using familiar 
relationships among objects. In the unorganized 
version, the same objects were placed in an unrelated 
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Organized and unorganized versions of 2 pictures. 


array. Because of some overlapping of objects in 
the organized pictures, there was more empty space 
at the edge of the pictures than in the unorganized 
versions. In the latter pictures there was more 
space between the objects. It was hoped that this 
difference would emphasize differences in processing 
of relationships among objects in the 2 types of 
picture, but it should be noted that the relationship 
of this variable to pictorial organization in general 
has not yet been systematically explored. The 4 
pictures are shown in Figure 1. 

The pictures were constructed during pilot testing. 
Easily recognizable objects with a considerable 
amount of detail were drawn by 2 amateur artists. 
Final selection was made on the basis that the 
various objects could fit into a "realistic" scene and 
that in the pilot group children from the same ages 
and ethnic groups used in the study could appro- 
priately label all objects and details. A rough 
attempt was made to equate number of objects in 
the 2 sets of pictures. However, it is recognized 
that the pictures vary in a number of ways, such as 
saliency and location of items, likely focus of atten- 
tion, etc. 

The following 5 transformations were used for 
each of the pictures: 

l. Reversal: A left-right mirror image of the 
original. 

2, Deletion: Three items were deleted from the 
original, ranging in size from a small detail to a 
small object. In the bus pictures (see Figure 1), 
the girl's purse, the chimney on one house, and one 
of the books were deleted. In the lady pictures, the 
pendulum of the clock, one of the flowers on the 
table, and the pair of shoes were deleted. 

3. Size change: One item was made larger or 
smaller. In the bus pictures, the bus was made 
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50% sma 
flowers 


ler. In the lady pictures, the vase with 
made 50% larger. 

4. Rearrangement: The location of one item was 
changed. In the organized bus picture, the jacket 
on the hanger was moved from the right to the left 
of the table; in the unorganized version, the posi- 
tions of the jacket and the dog were exchanged. 
In the organized lady picture, the shoes were moved 
from under the table to the right of the table; in 
the unorganized version, the positions of the shoes 
and the car were exchanged. 

5. Substitution: One object was replaced by a 
conceptually similar object. In the bus pictures, 
the bowl of fruit was replaced by a basket of fruit. 
In the lady pictures, a different type of clock was 
used, 


Design 


The 168 Ss were divided into the 3 ethnic groups 
of 56 Ss each. The ethnic groups were further 
subdivided into equal numbers of boys and girls in 
the first and second grades. Within each of these 
12 groups, each child worked with 2 pictures, an 
organized picture from one set and an unorganized 
picture from the other. Order of prese ntation 
(organized first or unorganized first; bus first or 
lady first) was counterbalanced across groups. 
Presentation order was not a significant variable 
in any of the analyses and will not be discussed 
further. 

For each of the pictur: 
following 4 tasks were gi 


shown to a child, the 
iven in a constant order: 


1. Verbal description of the picture while viewing 
the stimulus. 

2. Verbal recall of the picture. 

3. Same-different recognition test, in which Ss 
were presented with the 5 transformations and the 
original in a random sequential order, The Ss were 
required to respond to each presentation by saying 
whether the instance was exactly the same as the 
original or whether it was different. If the child 
responded different he was asked to tell how it was 
different. 

4. The paired-comparison recognition test con- 
sisted of a 2-alternative forced-choice procedure in 
which each transformation and the original were 
paired with all other transformations to make 15 
pairs of pictures. The S was asked to pick one pic- 
ture from each pair which looked most like the 
original picture he had seen and described. 


Procedure 


Raven's Coloured Progressive Matrices were ad- 
ministered individually by testers of each S's own 
ethnic group before the experimental procedures 
began. For this test and the experimental sessions, 
Chicano Ss were given the choice of speaking in 
English or Spanish. All of them chose to use 
English, and no language difficulties were observed. 
Following administration of the Coloured Progres- 
sive Matrices, each S was tested individually in 
2 experimental sessions 1 wk. apart. In each ses- 
sion one target picture was used, At the beginning 
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of the first testing session S was told that 
would be recorded and that he would be 
listen to his voice. Then he was told he v 
to see some pictures and that E wanted t« 
children looked at pictures and remember 
The Ss were assured that the testing proce 
nothing to do with grades or school work 
E felt that some rapport had been gained id 
was reasonably relaxed, the experimental p: 
began. 

Description. The S was given a practice 
which was rich in detail and similar to the 
mental pictures, The S was asked to tell 
thing he saw in the picture, no matter how 
the detail, and to leave nothing out. He was 
to point to each object or detail as he mentio: 
If S had any difficulty with the describing 
pointing procedure, E demonstrated what w 
quired. The practice picture was kept in fr« 
5 until he described most of the details i; 
picture, 

After this preexperimental procedure, .5 
given the first experimental picture and the de 
tion and pointing instructions were repeated. Pre- 
sentation time was 3 min. If S responded income 
pletely or stopped too soon, E prompted him by 
asking if he saw anything else. At the end of 3 min., 
E flipped the picture over and talked to S for 20 sec. 

Recall. The S was next asked to recall as many 
objects and details from the picture as he could 


the 


was 
rip- 


remember. When he stopped the first time, E 
prompted him once. If he could not re pond, the 
recall task was ended ; otherwise, it continued until 


he stopped the second time. 
task. 


Same-Different 


as the 
, and 
If a picture was 
exactly the same, S was to say same. If different, 
5 was to say different and tell exactly how it w 
different, A different sequential order of pictures was 
used for each child in a Latin square design, 

Paired-comparison task, After a short break of 
approximately 30 sec., 5 was told he was now going 
to see pictures 2 at a time, Each time à pair was 
presented he was to choose the picture which looked 
most like the original picture he had described and 
recalled and which was still lying face down on the 
table. He was told further that if neither picture 
looked exactly like the original picture he was to 
choose the one which looked most like the original. 
The S was then presented all combinations of 
target and transformations, 2 at a time. Presenta- 
tion sequence was randomly chosen for each S. 

The second session was identical to the first, 
except that the practice picture was not presented. 


RESULTS 


Analysis of variance on the Coloured 
Progressive Matrices showed that Grade 2 
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"TABLE 1 
VERBAL DESCRIPTION AND VERBAL RECALL MEASURES 
Description Recall 
xS No.of | Inferential | No, ofrep-| No.of | Inferential, | wo ofrep-| fóofde- | % of recall 
Heme | 4 relational |"etitions | items | dates | “etitions | eenid | ‘scription 
Total (N = 336) | 32.4 1.8 44 19.4 1.2 2.7 53.5 12.6 

Grade | 

1 31.1 [SETS 4.8 18.1 1.3 2.9 534 11.8 

2 33:8. Re Se: | 3.9 20.7 1,0 2.5 53.6 13.4 
Sex 

Male 34.3 24 47 21.0 14 3.0 53.4 13.6 

Female 1/:30:699 7] rico 4.0 17:8 9* 2.5 53.5 115 

-Ethnic 

White 36.1 2.2 44 21.1 14 2.9 53.1 12.2 

Black Speirs 2.1 6:196*2119.4* 1.4 312" 53.0 13.0 

Chicano 29.0*** 1.1*** 2.6814. Na Ted Eri 2" 54.3 12.5 
Organization 

Organized 32.0 1.8 44 20.8 1.4 3.0 57.0 13.7 

Unorganized 32.9 1.8 44 18.0** 1.0 24 49.9*** 11,5* 
Set 

Lady 30.8 2.2 4.5 19.1 S 3.0 55.4 11.3 

Bus 34,0* ier 4.2 19.8 ‘ou 2:5 51.6* 13.8* 


* Group values differ si 
^»* Group values diffe nificantly, p < .01. 
* Group values differ significantly, p < .001. 


scored higher than Grade 1, F (1, 156) 
= 19.25, p < .001, and males scored higher 
than females, F (1, 156) = 22.60, p < .001. 
There were no main effects due to ethnic 
group, but there was a Grade Level X Eth- 
nic Group interaction, F (2, 156) = 8.27, 
p <.01, Newman-Keuls tests showed 
that black children in the first grade scored 
lower than the other groups, which did not 
differ from each other. 


Verbal Description and Recall 


Verbal description and recall measures 
are summarized in Table 1. The first 
column shows the main description score, 
consisting of the total number of items 
mentioned (either whole objects or details). 
Inferential statements (e.g., the girl ds 
happy) and relational statements (e.g., the 
bowl is on the table) as well as repetitions 
were scored separately and not counted 
in the main description score. These mea- 
sures are shown in Columns 2 and 3, respec- 
tively. Similar analyses were carried out 
for verbal recall, shown in Columns 4-6. 
Two other recall measures were computed : 
the percentage of items_in each S's descrip- 
tion that also appeared in his recall, and 
the percentage of items in recall that had 


not been mentioned in that S's description. 
These measures are shown in the last 2 
columns. Analyses of variance (3 Subject 
Variables X 2 Stimulus Variables) were 
carried out on the various verbal measures. 
Significance levels for the analyses are also 
shown in Table 1. 

Subject variables. (a) Grade—there were 
no main effects on any verbal measure due 
to grade level. (b) Sex—males had higher 
description and recall scores than females. 
Interactions with grade level on number of 
items in description (p < .05) and in recall 
(p < .01) showed that this effect was due 
primarily to the second-grade boys, who 
described and recalled significantly more 
than the other groups. (c) Ethnic Group— 
whites both described and recalled more 
than blacks, who in turn described and 
recalled more than Chicanos. 

Stimulus variables. (a) Organization— 
degree of organization of the pictures did 
not affect amount of description but did 
affect amount of recall. Organized pictures 
produced a greater number of items recalled 
than unorganized pictures. (b) Picture set : 
The bus pictures produced more items in 
description than the lady pictures, but 
there wasnosignificant difference in amount 
of recall of the 2 kinds of picture. 
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Number of inferential and relational 
statements was small in comparison with 
number of items mentioned, perhaps be- 
cause of the type of instructions given, 
and the pointing method of description 
used. The 2 types of statements were 
combined into a single measure for pur- 
poses of analysis. In general, scores on 
this measure paralleled the total descrip- 
tion and recall scores, males making more 
inferential and relational comments than 
females, and Chicanos making fewer than 
blacks and whites, who did not differ from 
each other. 

Repetition of items occurred fairly fre- 
quently in both description and recall. 
There were no instructions that Ss should 
not repeat themselves, but these data may 
be contrasted with the lack of repetitions, 
also without instructions, found in adult 
recall of verbal material (Borges, 1972). 
'There were no significant differences in 
amount of repetition in description and 
recall as a function of stimulus variable, 
and the only S variable affecting repetition 
was ethnic group. Blacks repeated items 
more than whites, and Chicanos had very 
low scores on this measure. 

'There are several ways in which recall 
might be expressed in relation to descrip- 
tion. Column 7 of Table 1 shows the 
percentage of items described that were also 
recalled. The mean for the total sample on 
this measure was 54%, a score high in com- 
parison with immediate recall of well- 
organized lists of words (Mandler, Pearl- 
stone, & Koopmans, 1969) However, 
recall can also be represented as a percent- 
age of total number of items described by 
the S pool. lt can be argued that the total 
number of items in a picture is better rep- 
resented by the "total possible" set of 
items, rather than the subset actually men- 
tioned by ,S. Such an assumption is bol- 
stered by the finding that an average of 
13% of an S's recall score consisted of items 
not mentioned in his description of the 
picture. An inventory of all objects and 
details mentioned by any S (excluding 
different labels for the same items) was 
constructed for each picture, resulting in 
a total of 129 items in the lady pictures and 
152 items in the bus pictures. Recall, 
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expressed as a percent of this inventory 
drops to a mean of 14% correct. Neithe 
of the above procedures seems an entirel 
itisfactory method of estimating amoun 
of recall; at best the 2 estimates provid 
upper and lower limits on amount of verba 
recall of complex pictures. 

Finally, it should be noted that ther 
was a high S correlation between number 
of items mentioned in description and in 
recall, r = .73, p < .01. Both verbal mea: 
sures had a low correlation with Coloure 
Progressive Matrices, r = .10, ns, and .16, 
p < .05, respectively. 


Same-Different Recognition Test 


Table 2 shows the proportion of correct; 
recognition of the original picture, each of 
the transformations, and the total propor. 
tion correct. A rough correction for possi- 
ble response bias in saying same or different 
was calculated by multiplying overall rate 
of saying same or different by actual occur: 
rence of same or different pictures. This 
method resulted in an estimate of chance 
probability of a hit on the transformations! 
of .48, and chance probability of a hit on 
the original of .07. It can be seen from 
Table 2 that recognition of the substitution: 
transformation was essentially at chance, 
with size and rearrangement resulting im 
somewhat higher scores. Recognition was 
best for deletion, reversal, and the original 
picture. Overall, probability of correct re- 
cognition was .70. 

Analyses of variance (3 Subject Vari- 
ables X 2 Stimulus Variables) were car- 
ried out on a total recognition score, con- 
sisting of the total number of correct recog- 
nitions, and separately on the number 
correct on each transformation. Signifi- 
cance levels for differences among the 
various groups are also shown in Table 2. 

Subject variables. (a) Grade—Grade 2 
had a significantly higher total recognition 
score than Grade 1. This difference was 
primarily due to Grade 2 having higher 
scores on the reversal and deletion trans- 
formations. Thus, Grade 2 showed superior 
recognition even though there were no 
differences between the grades on verbal. 
description or recall. (b) Sex—there were 
no significant differences between total 
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TABLE 2 
Proportion CORRECT ON ORIGINAL PICTURES AND FIVE TRANSFORMATIONS 


IN Same-Different RECOGNITION TEST 


Transformations Original | Substitution Size Rearrangement Deletion Reversal Total 
Total (N = 336) .89 AD 58 .59 BAKE 91 0 
Grade .92 45 54 55 | -68 87 67 
1 
2 86 53 61 .62 .86** .95* T4* 
Sex 
Male .87 .50 58 Ey 41 .92 70. 
Female .91 A8 57 61 41 90 Sl 
Ethnic | 
White .88 252 58 | 62 76 .96 12 
Black 87 .50 59 59 S .86** .69 
Chicano .92 AS .56 .54 .79 .91** 70 
Organization 
Organized .91 1 62 64 Ni .93 72 
Unorganized .87 AT sane .54** .84** .89 .69*e* 
Set 
Lady .91 -61 52 73 .80 193 SA] 
Bus .87* E Y aa 64 Seer .74* 88 166***. 
* Group values differ significantly, p < .05. 5 


** Group values differ significantly, $ < .01. 
et Group values differ significantly, p < .001. 


correct or on any transformation between 
males and females. Again, this result may 
be constrasted with the results on the 
verbal measures, in which males showed 
higher description and recall scores. (c) 
Ethnic group—there were no significant 
differences in total number correct. This 
result may also be contrasted with the dif- 
ferences found among ethnic groups on 
description and recall. The only signifi- 
cant difference in recognition occurred on 
the reversal transformation, on which 
blacks were slightly less accurate. A 
Grade X Ethnic Group interaction on this 
transformation (p < .01) indicated that 
the difference in recognition occured only 
in the first grade. Lower scores on the 
Progressive Matrices were also found for 
the first-grade black sample. Although 
this test is usually used as a nonverbal 
IQ test, it has a strong spatial component. 
For all 3 S variables, verbal description 
and recall measures were poor predictors 
of accuracy of recognition. An S correla- 
tion between number of items described 
and total correct recognition resulted in 
r = .21, p < .01, and between number of 
items recalled and total correct recognition, 
r= .25,p «.01. The correlation between 
Coloured Progressive Matrices and total 
correct recognition was .16, p < .05. 


Stimulus variables. (a) Organization— 
total correct recognition was significantly 
higher for organized pictures. The main 
transformations involved in this difference 
were rearrangement and size. These 2 
transformations are most directly relevant 
to the degree of organization of a picture 
as it has been defined here, namely, as the 
number of meaningful relationships among 
the items in the picture. Both the rear- 
rangement and size transformations involve 
a change in relationships among objects, 
and these changes were better recognized 
in the more organized pictures. 

The other major difference occurred on 
the deletion transformation. Deletion of 
detail was better recognized in the unor- 
ganized pictures. Since there were no 
meaningful relationships among the ob- 
jects in the unorganized pictures, it is 
possible that more attention was paid to 
details of the objects represented. In the 
organized pictures, on the other hand, more 
processing of relationships among objects 
took place. This does not appear to be a 
question of how Ss spent a limited process- 
ing time, since presentation of each picture 
lasted 3 min., but rather a question of what 
aspects of the organized and unorganized 
pictures tended to be coded for storage. 
When a deletion transformation was recog- 
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nized as different, on the average only 1 of 
the 3 deleted items was mentioned; in 
almost all cases this was the book in the bus 
pictures and the shoes in the lady pictures. 
In both cases these items were the largest 
of the 3 deletions. Apparently the 2 small 
deletions of detail were rarely noticed, 
although more often in the unorganized 
pictures. 

(b) Picture set— The lady pictures were 
recognized significantly more often than 
the bus pictures. It will be recalled that 
the lady pictures contained less informa- 
tion, in the sense that they produced less 
verbal description and had a smaller total 
inventory of items. Although it has been 
assumed that high information pictures 
are easy to recognize because distractors 
tend to be very different, when distractors 
consist of small changes on the target pic- 
tures it is possible that recognition is 
hindered by a greater amount of informa- 
tion. Amount of information would have 
to be systematically varied to test this 
hypothesis. 

'There were significant interactions be- 
tween the set and organization variables on 
the substitution, rearrangement, and size 
transformations. These differences ap- 
peared to reflect uncontrolled differences 
in saliency of various objects in the different 
pictures, perhaps in part as a function of 
their location in the picture as well as 
degree of organization in the picture as a 
whole. Since parametric variation in trans- 
formations of objects was not carried out 
in this experiment, no assessment of rela- 
tive importance of, for example, size changes 
vs. substitutions, in recognition of meaning- 
ful pictures in general can be made at this 
time. It is likely, in fact, that accuracy of 
recognition of any given transformation 
can be systematically varied by manipulat- 
ing the saliency of the object which is 
transformed and the extent of the trans- 
formation (e.g., amount of size change, 
number of rearrangements, etc.). 

Concerning the relationship between ver- 
bal recall and recognition as a function of 
stimulus variables, only degree of organiza- 
tion affected both measures. More was 
recalled about the organized pictures, and 
they were better recognized. These differ- 
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ted 
iese 


ences in memory were not, however, 
to number of words used to describe 
types of picture. 


Paired-Comparison Test 


The same-different test showed v ich 
transformations were most difficult to ~ec- 
ognize, i.e., which were more similar or 
confusable with the original picture he 
question then becomes whether it is po: ble 
to scale distances along a dissimi ty 
dimension(s) to aid our understand of 


the structure of what is stored in men’. ry. 
If it is assumed that memory for pict. res 
can be represented as a multidimens 
Euclidean space, and that the judged de- 
gree of similarity of any transformatie to 
the original or to other transformatic +s is 
inversely related to distances within ihat 
space, then the paired-comparison test can 
be used to estimate where in that space 
each transformation of the original picture 
is located.* 

The mean percent times that the original 
and each transformation was 
comparison with every other was computed 
separately for the various groups and cou- 
verted to d' scores (a measure of discrimi- 
nability). Thus, matrices of d’ scores were 
formed from the 15 paired comparisons of 
each original picture and its 5 transform: 
tions. These matrices give an estimate of. 
the distance each transformation is from 
each other and from the original in the 
hypothetical memory space. To the ex- 
tent that the original can be considered as 
an anchor point in the space (since each 
paired comparison was made on the basis 
of similarity to the original), the distances 
from the original to the transformations 
can also be considered a measure of dis- 
tortion of the original by the particular 
transformation made. 

Independent matrices, each based on 168 
Observations, were constructed for the 2 
sets of original pictures, for the organized, 
and unorganized versions of the pictures, 
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* An example of a similar type of analysis used to 
explore the structure of semantic, rather than 
pictorial, space can be found in Henley (1969), and 
for discriminability of letterlike forms in Harrington: 
and Brown (1972). 


rd for Grades 1 and 2. Matrices for 
she 3 ethnic groups were based on 112 
observations each. The breakdown by sex 
was not included because of the lack of 
differences found on the same-different test. 
of the matrices was analyzed sepa- 
y by the TORSCA multidimensional 
scaling program (Young & Torgerson, 
1967). 

The Kruskal stress index for each analy- 
sis (giving an estimate of goodness of fit 
of the various points in a given number of 
dimensions) for 1- and 2-dimensional solu- 
tions, as well as percentage of variance 
accounted for by the dimensions are shown 
in Table 3. In all cases except the Chicano 
group, the 2-dimensional solution gave close 
to perfect fit (stress varying .01%-.6%), 
and even in the Chicano group the fit was 
excellent (1.6%). For the 1-dimensional 
solutions, stress varied .01%-10.4%. Since 
only 6 items were being compared, the 
excellent fit in 2 dimensions might have 
occurred by chance, although the high de- 
gree of similarity among solutions obtained 
from independent groups of Ss and be- 
tween different pictures makes such a 
possibility unlikely. Based on Klahr's 
(1969) analyses, the probability of obtain- 
ing by chance stress indices as low as 
those in the 1-dimensional solutions is less 
than .05 in each case. 

The 1-dimensional solutions were highly 
similar to the first dimension of the 2- 
dimensional solutions, presumably because 
in both cases one dimension accounted for 
most of the variance. Although the 1-di- 
mensional solutions were adequate descrip- 
tions of the data, the 2-dimensional solu- 
tions are presented here because they give 
a clearer picture of the data, and because 
work in progress with adults suggests that 
2-dimensional solutions will be necessary. 
Figure 2 shows the solutions obtained from 
the various subgroups. For each break- 
down the groups being compared are shown 
in the same space, although they were de- 
rived independently. Inspection of Figure 
2 shows that in all cases original is located 
at one end of the main dimension, and 
that reversal is located at the greatest 
distance from the original, with deletion 
the next farthest from the original. The 
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TABLE 3 


PERCENT STRESS AND PERCENTAC 
ACCOUNTED FOR BY EACH DIME 
AND TWO-DIMENSIONAL SOL 


OF VARIANCE 
IN ONE- 
TIONS 


E 


Solution 


Two-dimensional One-dimen 


sional 
Variable E 
Variance 
remm Stress | Vari- 
Stress Second Stress | ance 
dimen- 
sion 
01 | 79.8 | 2 .O1 | 100.0 


01 | 100.0 


2 
4 
6 | 10.40 
9 


Organi. 
Organized .56 | 89.3 9. 87.0 
Unorganized | .05 | 89.1 | 10. 5.07| 91.4 
Grade 
1 01 | 93.8 6.2 .01 | 100.0 
2 01 | 858 | 14.2 | 9.52] 83.1 
Ethnic 
White .01 | 90.4 9.6 .01 | 100.0 
Black .03 | 93.6 6,4 6.10| 90.2 
Chicano 1.61 | 89.5 8.2 | 667| 92.2 


other 3 transformations are located in be- 
tween, show the most overlap, and the 
most movement from analysis to analysis. 
No attempt was made to conceptualize 
the second dimension since it accounted 
for relatively little of the variance, al- 
though in all cases deletion was at one end 
of the dimension and rearrangement at the 
other. 

Panel 1 shows the comparison between 
the 2 types of original picture. There 
should be a close correspondence between 
these 2 TORSCA analyses if we wish to 
conclude that the pictorial memory space 
thus constructed is general and not idio- 
syncratic to a particular picture. Looking 
at Panel 1 we can see that original, reversal, 
and deletion occupy roughly the same 
spaces. The greatest change Írom one 
picture to the other occurs on size and 
rearrangement, followed by substitution. 
Comparing the movement of these 3 trans- 
formations (toward or away from the 
original as a different picture is used) with 
the proportion of correct recognition for 
these transformations in the same-different 
test, shown in Table 2, we can see that in 
each case in which a transformation was 
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FIGURE 2. Two-dimensional TORSCA multidimensional scaling analyses 
derived from independent groups of Ss and different types of stimuli. (Panel 1 
compares the 2 types of picture, Panel 2 compares organized vs. unorganized 
versions of the pictures, Panel 3 compares 2 grade levels, and Panel 4 compares 
3 ethnic groups. St indices are given in Table 3. Abbreviations: REA - 
rearrangement, SIZ = size, SUB = substitution, REV = reversal, DEL 
deletion, and ORIG = original.) i 
recognized less well it is closer to the of the variance, more work ded tol 
original in the TORSCA analyses. As conclude that there is a de entaly 
mentioned above, these 3 transformations difference in complexity of pictorial memory} 
seem to have different effects on recogni- space. | 


tion depending upon the saliency and 
location of the particular item transformed 
in a given picture. 

Turning to the effects of organization, 
Panel 2 indicates that the rearrangement 
transformation was most affected by differ- 
encesin degree of organization. Rearrange- 
ment was less frequently recognized in the 
unorganized pictures in the same-different 
test, and in the paired-comparison test 
this transformation was judged to be closer 
to the original in the unorganized pictures. 
Thus, there is good agreement between 
the 2 tests on the greater importance of 
relationships among objects in organized 
pictures, 

Comparisons between Grades 1 and 2 
are shown in Panel 3. They are very 
similar to each other on the first dimension. 
The first graders in fact show essentially 
a 1-dimensional solution, while the second 
graders' scores have begun to move into 
the second dimension. Although current 
work with adults indicates that the second 
dimension accounts for a still greater part 


Panel 4 shows comparisons among the 
3 ethnic groups. The chief conclusion is 
that the pictorial memory space is highly 
similar for the 3 groups. This result con- 
firms the lack of difference among ethnic 
groups on the same-different test. 

Overall, there was excellent agreement 
between the 2 types of recognition test. 
The advantage of the paired-comparison 
test is that it gives a more detailed picture 
of differences between the original and the 
transformations. In addition it eliminates 
response biases such as the tendency to sa | 
same or different to most pictures, a facto. | 
which might well vary among Ss of differenr 
ages. Although a left- or right-choice bic 
could exist in the paired-comparison tesg 
analyses of the number of times Ss chos? 
the left or right picture indicated that ni 
such bias was operating. 3, 

Because the 2 recognition tests wer 
given in a constant order, it is possible tha» 
performance on the same-different test in 
fluenced choices on the paired-compau ss: 
test. For example, if an S correctly recog: 
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3 4 5 6 
el 89 96 37 


NUMBER CORRECT ON SAME-DIFFERENT TEST 


FicunE 3. Percent choice of each transformation in the paired-comparison 
f test plotted as a function of number of correct responses on the same-different 
test. (Number of Ss involved in each point is shown below the abscissa.) 


nized a transformation as different, he 
might be less likely to choose it in the 
paired-comparison test. To check on this 
Ņ possibility, percent choice of each trans- 
"formation in the paired-comparison test 
was plotted as a function of the number of 
| correct responses on the same—different test. 
} Two Ss who made 6 out of 6 errors on the 
same-different test were excluded from this 
analysis. The remaining Ss, grouped ac- 
cording to number correct, are shown in 
— Figure 3. It can be seen that performance 
| on the paired-comparison test did not differ 
tas a function of accuracy on the same- 
. elifferent test, indicating at least some degree 
| yof independence of the 2 tasks. 
| 2: 
F 
P th 
d' The present data offer little support for a 
inonception of unlimited pictorial recognition. 
rin spite of long presentation times, overall 
2ognition rate was fairly low (70%) and 
a/aried from chance responding to 91% correct 
qiepending upon the nature of the distractor. 
eo the distractors used in this experiment 
(Ponsisted of transformations on the original 


DISCUSSION 


pictures and thus were similar to the target 
pictures. Conflicting results have been re- 
ported when similar distractors have been 
used, some studies finding lower recognition 
rates, some not. The question then becomes 
how to define and scale similarity of complex 
pictures which contain many visual and seman- 
tic dimensions. 

The multidimensional scaling method used 
in this experiment appears to be a useful 
technique for scaling the similarity of distrac- 
tors to the picture stored in memory, and in 
addition to discover which aspects of pictures 
are most likely to be retained. Not only were 
transformations on the target pictures stably 
located in a memory space, but changes within 
that space were related to type of picture and 
the degree of organization within pictures. 
For example, the judged similarity of a given 
distractor differed as a function of amount of 
organization in the stimulus materials. 

There was a very close correspondence be- 
tween the 2 types of recognition tests, even 
though they asked for different judgments. 
In the same-different test, S wos asked whether 
he had seen a picture before, and in the paired- 
comparison test he was required to make a 
similarity judgment. Although it has been 
assumed here that similarity of distractor to 
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target is a major factor in recognition 
accuracy, it should be pointed out that this 
assumption needs further testing. In the 
first place, similarity judgments may differ 
when they are made in the presence of the 
target items than when they are made about 
remembered targets. Second, it could be the 
case that reversal of a picture, for example, 
is easy to recognize as different yet be judged 
most similar to the original on the grounds 
that all the relationships among the objects 
are the same. The fact that recognition and 
similarity judgments were highly correlated 
for children does not necessarily mean that 
the same relationships will hold for adults. 
This problem is currently being investigated. 

Although the present data do not allow an 
assessment of the relative importance of 
various transformations in recognition of pic- 
tures in general, the results for 2 of the trans- 
formations should be mentioned. Two trans- 
formations were consistently recognized across 
pictures and also judged as most dissimilar, 
namely, reversal and deletion of detail. Al- 
though experiments with adult 5s have typi- 
cally found only slight loss in recognition 
accuracy when reversals were used (Dallett 
et al, 1968; Standing et al., 1970), the .91 
rate of recognition of reversals in this experi- 
ment seems surprising in light of a number of 
studies showing that children have difficulty 
in discriminating among reversals of pictorial 
geometric forms (e.g., Rudel & Teuber, 1963). 
Ease in recognizing reversals in more complex 
pictures may be a function of the total num- 
ber of items whose location is changed. In 
the present experiment, reversal involved a 
greater number of locational changes than any 
other transformation. On the other hand, 
recognition of reversals may be a function of 
the meaningfulness of the material used. Cur- 
rent work is being done to evaluate these 
possibilities, 

The other transformation most consistently 
recognized as different was deletion of detail. 
"This result is consistent with experiments re 
ported by Vurpillot (1972), showing that chil- 
dren of age 7 are better at detecting differences 
in detail in meaningful pictures than size 
changes or rearrangement. On the other 
hand, her experiments indicated that children 
of this age are also good at detecting small 
substitutions, a result at variance with the 
present data. It should be pointed out, how- 
ever, that it is difficult to compare results on 
transformations from different studies because 
of the lack of appropriate scaling of the magni- 
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tude of the changes involved. In additionjl 
Vurpillot's work, like that of Gibson, Gibsor 
Pick, and Osser (1962), involved detection 
differences in a  matching-to-sample task? 
Little is known about attentional or search” 
patterns in detection vs. memory tasks, or 
whether the differences easiest to detect are 
those which are most often retained in memory. 

Concerning the relationship between verbal 
measures and recognition in the present experi- 
ment, for the .S variables, verbal measures 
were poor predictors of accuracy of recogni- 
tion. This finding is of particular interest 
in light of the heavy verbal loading found in 
the test batteries most frequently used to 
assess perceptual functioning in minority chil- 
dren. In the present study, black and Chicano 
children said less than white children, but the 
volume of words bore no relation to the visual 
recognition measures. In terms of accuracy of 
recognition, responsiveness to the organization 
of the pictorial materials, and types of mistakes 
made on transformations, the 3 ethnic groups 
were highly similar. The multidimensional 
scaling of the similarity judgments produced 
a pattern of responding that was virtually 
identical for the 3 ethnic groups 

The stimulus variables showed 2 types of 
relationship between verbal and visual mea- 
sures. The pictures that contained more in- 
formation (had a higher inventory of items)? 
called forth more words in description and 
were also more difficult to recognize. "Thus, 
there was a negative relationship between 
amount contained in a picture and amount 
remembered. On the other hand, the orga- 
nized and unorganized versions of the pictures, 
which contained equal numbers of items and 
called forth the same number of words in 
description, nevertheless showed differences in 
bóth recall and recognition as a function of 
degree of organization. Thus, 
organizational or structural informatic 
a positive relationship with amount remem 
bered. It should be noted that an hy pothesis 
that equates amount of organization with 
number of words used in description, such a 
the verbal-loop hypothesis (Glanzer & Clar 
1964), is not a useful equation in the type c 
pictorial recognition studied in this experimen 

In summary, the data of this study indicat 
that children are responsive to the organize 
tion of pictorial materials and that the natur 
of the stored representations reflects th 
amount of organization in the material viewe 
There was little verbal indication of suc 
responsiveness; that is, very few comments| 


vere made in description or recall about rela- 
ionships among the objects in the pictures, 
nor were more or different words used to 
describe organized scenes than unorganized 
scenes. However, information about relation- 
ships among the objects in the organized scenes 
was processed and retained for storage. When 
unorganized pictures were presented, which 
did not include meaningful relationships among 
the objects, other types of information, such 
as characteristics of small items, were processed 
and retained. Although we are still far from 
being able to specify the nature of the stored 
representations of pictorial material, the results 
of the multidimensional scaling technique sug- 
gest that useful models of memory representa- 
tions can be constructed that will contribute 
to the development of a theory of pictorial 
memory. 
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INSTRUCTIONAL CONTROL OF SERIAL-LEARNING STRATEGIES! 


WESLEY A. KAYSON # WILMA A. WINNICK 


Iona College 


‘Two sets of instructions were used in a serial-learning task to induce one group 
to form sequential associations and a second to use the position strate: 
Performance was contrasted with control groups, and half of the Ss in each 
of the 3 groups learned lists of high-association trigrams, with the other half 
learning low-association lists. 
responded, after learning, with the trigram that followed the presented o 
or with the trigram at the serial position presented provided evidence that t 
instructed groups had used the suggested strategies, but gave no indica! 
of the consistent use of either strategy by control Ss. 


data found a significant effect of instruction, with positional instructio: 
superior, and a significant interaction between instructions and association 


value, with position instructions highly 


Recent attempts to delineate the effective 
stimulus underlying serial verbal learning 
have focused upon 3 possible cues for the 
anticipatory response required in learning 
of this type: (4) the immediately preceding 
item, (b) a combination of 2 or more 
preceding items (an elaboration of the first), 
or (c) the identification of the ordinal posi- 
tion of the item. Young (1962) has viewed 
these as possible hypotheses that ,S might 
adopt in serial learning (SL). 

Most attempts to identify the stimultis 
in serial learning have proceeded by utiliz- 
ing transfer techniques to seek out indica- 
tions of the cues used in serial learning. In 
such studies, learning of a serial list is 
typically followed by a second learning 
task, the aim of which is to detect the 
indicators of the cues used in SL. Thus, 
the amount of transfer has been assessed 

from the learning of one serial list to the 
learning of a second list in which the items 
from the original list occupy the same or 
different positions; this technique was used 
by Young (1962) to evaluate the positional 
hypothesis. More commonly used is trans- 
fer from SL to paired-associate learning 
(PAL) where, for example, items that were 
contiguous in previous SL are used to form 


1 This article is based upon Experiment I of a 
dissertation submitted by the first author in June 
1970 to the City University of New York in partial 
fulfillment of the requirements for the PhD degree, 

? Requests for reprints should be sent to Wesley A. 
Kayson, Department of Psychology, lona College, 
New Rochelle, New York 10801. 
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Queens College, City University of New Vork 
Examination of the latencies with which 
Analysis of the erro 
effective for the low-association list 
pairs for PAL: learning of such items i 


compared with the learning o! ite: 
had not been in sequence in SL (Young, 
1962). In another variation e transfer, 
technique, SL is the second task, and the 
first learning encourages or suggests cues 
(usually positional) that might aid or 
hinder subsequent SL. In a test of the 
efficacy of the positional cue, Ebenholtz 
(1963) studied transfer from PAL, where 
spatial position cues were paired with ver- 
bal items, to SL, in which items occupied 
the same or different spatial positions. In 
another PAL-SL transfer study, Winnick 
and Dornbush (1968) followed PAL, in 
which syllables were either in the positions 
indicated by the numerals or in other posi- 
tions. These experiments and others sum- 
marized by Young (1968) lead one to agree 
with Young that the issue of effective 
stimulus for serial learning has not yet been 
resolved. 

The present study proposed to view the 
cues mentioned as providing the basis for 
a plan or strategy of learning that might 
be consciously chosen by S, suggested by: 
the experimental materials, or dictated by. 
the instructions employed in the experi- 
ment. In contrast with previous studies— 
most of which have searched, after learning 
has taken place, for evidence indicating thi 
use of one or the other of these cues—th 
present study has attempted, by the in 
structions used, to induce Ss to employ on 
or the other cue as a strategy for the learn- 
ing of the serial list. 


INSTRUCTIONAL CONTROL OF SERIAL-LEARNING STRATEGIES 


In an attempt to determine the relative 
effectiveness of such strategies, the present 
study has used instructions to suggest the 
strategy of tying successive items together 
(Young's 1962 serial hypothesis) or of 
identifying the ordinal position of the 
items (the positional hypothesis). The 
possibility that the effectiveness of a par- 
ticular approach might be functionally 
related to the particular materials learned 
was examined by the use, in separate 
groups, of lists composed of either low- 
association (LA) or high-association (HA) 
nonsense syllables. It seemed reasonable 
to expect that HA syllables would provide 
associative bases for linkages, and that 
the serial strategy would therefore prove 
more feasible for these materials. By con- 
trast, the lack of potential in LA nonsense 
syllables for associative linkages suggested 
that the positional strategy might be more 
effective for these materials. 


METHOD 


Ninety volunteer Ss from introductory 
; in the Queens College summer 
in the experiment, They were 
| to 1 of the 6 groups in the 
order of their appearance in the laboratory, such 


Subjects. 


every sixth S. 
periment, one for failing to follow instructions and 
one due to E error. 
the HA-position group and the other in the LA 
position group. 

Apparatus and materials. 


The stimulus items 
consisted of 2 sets of consonant vowel-consonant 
(CVC) trigrams selected from Archer's (1960) list. 
The LA syllables were between 9% and 41% 
association value, and the HA syllables were be- 
tween 73% and 95%. To control for any possible 
ellects due to item ordering within the lists, 3 
different orders of these syllables were formed, so 
that pairs of such HA and LA lists provided 3 
replications of the experiment, The syllables were 
typed in capital letters on a standard memory-drum 
tape and presented on a Lafayette memory drum 
at a 2-sec. rate. The same interval was used be- 
tween movements of the meinory drum to pace re- 
call. There was an intertrial interval of approxi- 
mately 30 sec. 

Experimental. design. Six independent groups of 
Ss were individually tested. One group was 
instructed to learn the list by associating each non- 
sense syllable with the following one (Group 5). 
The second was told to learn the list by associating 
each nonsense syllable with the number of its posi- 
tion in the list (Group P). The third (control) 
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group was given only standard instructions, with no 
suggestions about how to learn the list (Group C). 
Half of the Ss in each group were presented with a 
list composed of HA syllables, and half learned lists 
composed of LA syllables. Randomly selected pairs 
of HA and LA lists formed 3 replications of the basic 
experiment. Thus, a 3X3 X2 factorial design 
was employed, with 3 levels of instruction, 3 
replications, and 2 levels of associative value. The 
6 distinct groups were designated as S-HA, S-LA, 
P-HA, P-LA, C-HA, and C-LA. 

Procedure. AM Ss served individually, learning 
the list to a criterion of one perfect trial by the serial 
recall method for “the identification and assessment 
of processes in serial learning [Battig & Lawrence, 
1967, p. 1787.” In this method, learning trials 
which exposed the items alternated with blank 
exposures during which Ss were to attempt recall. 
Before the first learning trial, each S was given the 
following modified standard learning instructions: 


This is a memory test. You are to learn the non- 
sense syllables that appear in the window of the 
machine in the right order as quickly as you can. 
When I start the machine the syllables will appear 
in the window one after another until 12 have 
been shown. Try to learn them as they appear. 


After the twelfth syllable was shown and before 
the first recall trial, the following instructions were 
given: 


Now, right after an asterisk has appeared in the 
window a blank line will appear. Then you are 
to say what the first syllable Then another 
blank line will appear in the window. While that 
line is there, you are to say what the second syl- 
lable is. You are to continue in this manner until 
12 lines have appeared in the window. Guess even 
if you are not sure. 


After the first recall trial, S was told: 


Now the 12 syllables will be shown to you again 
for you to learn. This will again be followed by 
12 blank lines. ‘The procedure will continue in 
this way until you have learned all 12 syllables. 


In addition, Group S was told: 


It has been shown that people learn this list faster 
if they form associations between the syllables. 
In other words they ciate the first syllable 
with the second, the second with the third, and 
soon. For example, if the first syllable is “zop” 
and the second “kip,” try to make an ociation 
between “zop” and “kip.” If the third syllable is 
“buf,” also try to make an association between 
“kip” and "buf." Remember, try to form as- 
sociations between the syllables. 


Group P was told: 


It has been shown that people learn this list faster 
if they remember the number of the syllable on 
the list. In other words, if “zop” is the first 
syllable, “kip” the second and “buf” the third, 


say to yourself “first is ‘zop,’ second is | ip,’ third 
is ‘buf,’ ” and learn the entire list in this manner. 
Remember, learn the entire list by remembering 
the number of the position that the syllable 
occupies on the list. 


The Ss in Group C were given only the revision of 
the standard serial instructions. In addition, every 
few trials, Ss in Groups S and P were reminded 
during the intertrial interval of the method that they 
were to use to learn the list. The CVC syllables 
used in the instructions were all different from the 
syllables used during the learning task. 

After the list was learned, each experimental S 
was given 2 kinds of reaction time tests to determine 
whether he had followed the instructions while learn- 
ing the list. Each control S was also given both 
reaction time tests in an attempt to infer the strategy 
used. The equipment consisted of a standard re- 
action time key attached to a Hunter timer. The 
reaction time for each trigram was measured to the 
nearest hundredth of a second. When Æ read the 
appropriate instructions, he depressed the key and 
held it down until .S gave a response, at which time 
he removed his finger and recorded the reaction 
time. The 2 sets of instructions that were given to 
each S were as follows: (a) “Now I am going to 
test you on how well you have learned the list. 
Would you please tell me which syllable is number 
dE ?" These instructions were repeated for each 
serial position, with 3 different orders of randomly 
chosen numbers. (b) “Now I am going to test you 
on how well you have learned the list. Would you 
please tell me which syllable comes after .— — 2" 
These instructions were repeated for each trigram. 
Three different orders of randomly chosen trigr 
were used as stimuli. 

Half of the Ss in each group received Instruction 
a first until 12 numbers were presented and then 
Instruction 6 until the 12 trigrams were presented. 
The order of instruction presentation was reversed 
for the remaining Ss. All Ss were given both sets of 
reaction time instructions. 

It would be expected that Ss given P instructions 
would make a stronger association between the 
number presented and the appropriate trigram than 
between successive trigrams. This stronger associa- 
tion should be manifested by a faster reaction time 

. for Reaction Time Instruction a than for Reaction 
Time Instruction b, and Group P should therefore 
respond to Instruction a more rapidly than to 
Instruction b. Conve sely, it would be expected that 
Group S would make stronger associations between 
successive trigrams than between the number of 
the trigram on the list and the trigram. This 
stronger association should result in more rapid 
responding to Reaction Time Instruction b than to 
Reaction Time Instruction a. The same reasoning 
should help infer which strategy control Ss had used. 
The above should hold true for trigrams that are 
recalled correctly. When S answers incorrectly to 
either set of reaction time instructions, there is no 
way of knowing how strong the association is. 
Therefore, only trigrams which were responded to 


ams 
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correctly after both Instructions a and b 
in this analy 


used 


RESULTS 


Reaction times. The first step iv th 
analysis of data was a comparison ©’ th 
reaction times obtained from the 6 gi 
It was reasoned that if the instructions weri 
followed, strong associations would be 
up between successive trigrams for ‘ie 
groups and between the number den ‘in: 
position of the trigram on the list anc. th 
trigram for the P groups. Such stron 
associations should result in faster reac ioi 
times after learning when the “appropriate” 
rather than the "inappropriate" reaction 
time instructions were given. j 

Table 1 shows the mean reaction times 
for the 6 groups for the preceding stimulus! 
items and the position numerals. The data 
are broken down into types of reaction 
time and the order of recall for HA and LA 
trigrams. Of the 30 HA experimental Ss, 
23 had faster reaction times when reaction 
time instructions were appropri for the 
experimental instructions tha: they: 
were inappropriate. The probability of 
this occurring by chance according to the 
sign test is less than .004. The apparently 
slower reaction time for Ss given the 
position (appropriate) reaction time in- 
struction first (X = 4.13 sec.) as compared 
to the serial (inappropriate) instructions 
(X = 3.17 sec.) is due to the extreme delay 
in responding of one S in this group who 
took an average of 17.21 sec. to respond to 
serial reaction time instructions. In spite 
of this S, 7 out of 9 Ss responded more 
rapidly to position reaction time instruc- 
tions than to serial reaction time instruc- 
tions. When reaction time instructions 
were appropriate for the experimental in- 
structions, 22 of 30 LA experimental Ss 
had faster reaction times than when the 
reaction time instructions were inappro- 
priate. The sign test indicated that the 
probability of this occurrence was less than 
-008. It would appear that experimental 
instructions were followed by most Ss. 

It was also argued that it would be 
possible to infer the Strategy used by con- 
trol Ss by comparing their reaction times 
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for position and serial recall instructions. 
Ten of the 15 LA control Ss had faster 
reaction times when position reaction time 
instructions were used. The probability of 
this occurrence is equal to .15. Eight of 
the 15 HA control Ss had faster reaction 
times when serial reaction time instructions 
were used. The probability of this occur- 
rence is .50. Therefore, no assumptions 
regarding the preference of strategies for 
control Ss are warranted. 

Finally, it can be seen that the second 
time Ss were asked to recall the syllables, 
their reaction times were faster than the 
first time they were asked to recall the 
syllables. It seems that Ss did not expect 
to be tested on their recall of the syllables, 
and it took some time before they became 
adapted to the reaction time task. The 
finding that it takes S some time to adapt 
a set to respond rapidly is quite common in 
reaction time studies. 

Since these data indicate that Ss followed 
instructions, it is appropriate to evaluate 
the effect of differential instructions and 
association values by analyzing the trials 
to criterion, errors, and number of correct 
choices on the first-quarter trials by sepa- 
rate 3 X 3 X 2 analyses of variance and 
Newman-Keuls Studentized range tests. 
To further analyze the data, serial position 
curves were compiled. 

Trials to crilerion. Mean trials to cri- 
terion and standard deviations for the 6 
groups are shown in Table 2. A3X3X2 


TABLE 1 


Instruc- I " 
: Low i 
tions zh 


Serial RT | Position RT | Serial RT 


Position RT 


Serial | 4680) | 3.89(2) | 2.69(1) | 2.98(2) 
3310) | £771) | 2.592) | 3.91(1) 
Position | 3.74(2) | 41301) | 2422) | 2.80(1) 
3580) | 1.8502) | 3.04(1) | 1.952) 
Control | 416(0) | 2.919). | 3.13(1) | 4.25) 
3.50Q) | 3.740) | $180) 3.52(1) 


Note. Numbers in parentheses indicate order of testing. Ab- 
breviation: RT = reaction time. 


TABLE 2 
MEAN TRIALS TO CRITERION (AND STANDARD 
I ATIONS) FOR THE SIX 
EXPERIMENTAL GROUPS 


| Instructions 
Trigramas- | 


sociati | 
sar | Serial | Position Control 
Low 11.13 S 11.93 
(4.12) | (2.05) (5.80) 
High 7.53 8.33 8.87 
(3.62) | (3.56) (2.67) 


analysis of variance, with 3 types of in- 
structions, 3 replications, and 2 association 
values, found a significant main effect of 
association value, F (1, 72) = 6.64, p < .05. 
This indicates that, as expected, the HA tri- 
grams were learned in less trials (X — 8.24) 
than were the LA trigrams R i: 
The main effect of instructions approached 
significance, F (2, 72) = 2.74,.05 <p « .10. 
'This suggests that there is a tendency for 
P instructions to induce the most effective 
learning strategy and the fastest learning 
(X = 8.10 trials), while the control in- 
structions, where S must find his own 
tegy, resulted in the slowest learning 
— 10.40 trials). 

In order to determine the significance of 
the specific group differences, an internal 
comparison of the cell means was made by 
the Newman-Keuls Studentized range test. 
This found only that Group C LA took 
significantly more trials to learn the list 
than Group P-LA (X = 11.93 and 7.87, 
respectively, p < .05). Thus, the main 
reason for the borderline significance of in- 
structions is the difference between Groups 
C-LA and P-LA. 

Errors. Mean errors (intrusions and 
omissions) and standard deviations for the 
3 types of instructions and 2 association 
levels are presented in Table 3. The re- 
sults of the 3 X 3 X 2 analysis of variance 
of the error data found significance in the 
main effect of instructions, F (2, 72) = 6.90, 
p «.025; the Instructions X Association 
Value interaction F (2, 72) = 4.12, p < .05; 
and the Instructions X Lists interaction, 
F (2, 72) = 2.82, p < .05. Again, the sig- 
nificant main effect of association value 


$ 


674 


WES 


TABLE 3 
MEAN ERRORS (AND STANDARD DEVIATIONS) FOR 


THE SIX PERIMENTAL GROUPS 
E | Instructions 
Trigramas- |. TEE £ 
sociation 

Position Control 
SR 38.87 65.40 
T (30.44) 
High 50.07 
(18.43) (20.14) 


indicates that, as expected, more errors were 
made while learning LA trigrams (X — 
55.82) than HA trigrams (X = 44.00). 
That the main effect of instructions is sig- 
nificant indicates that the P groups made 
the fewest errors (X — 41.60), while the C 
group made the most (X — 57.74). Again 
S groups were intermediate (X — 50.40) 
between P and C groups. Overall, the P 
instructions induced the most efficient 
strategy (in terms of errors), while the con- 
trol instructions, leaving S to search for a 
strategy by himself, were least efficient. 
The significant Instructions X Association 
Value interaction indicates that this was 
especially true for the LA trigrams, whereas 
for the HA trigrams, the S instructions in- 
duced a more effective strategy, resulting in 
fewer errors, than either of the other in- 
structions. A Newman-Keuls Studentized 
range test was performed to determine the 
exact locus of the experimental effects. 
This analysis indicated that the significant 
Instructions X Association Value interac- 
tion was due to the fact that in the LA 
groups, the P instructions resulted in fewer 
errors than either S instructions or C in- 
structions (X = 38.87, 63.20, and 65.40, 
respectively, p < .05), while in the HA 
groups there were no differential effects 
due to instructions. Thus, it seems that 
for LA trigrams, the P instructions induced 
the most efficient Strategy, while for HA 
trigrams, the type of instructions had little 
effect. 

Number correct on Jirst-quarter trials. An 
analysis of the number of correct responses 
on trials during the first quarter of learning 
was suggested by results of an experiment 
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by Postman and Stark (1967), vhiel 
evidence supporting the serial | hesi 
was found when the first few tr wert 
analyzed, but not in the analysis of t 
number of correct responses on the enti 
list. Mean numbers of correct trigrz: s an 
standard deviations for the first qua er 
learning are found in Table 4. An a: :lysi 
of variance found no significant diffe: one 
among any of the means. The mos! oro 
able reason for the discrepancy be wee 


these results and those found by Po 
and Stark lies in the different proce.jur 
used. Postman and Stark used a P.\—S} 
transfer procedure, while this experime 
used a procedure which induced variou 
learning strategies by means of diff: 
instructions. Apparently, any strategy im 
duced by a PA task can be tapped early i 
serial learning, whereas the differential 
effectiveness of strategies suggested b 
varying instructions does not appear unti 
later in learning. 

Serial position curves. 


Serial position 


curves for HA and LA gr hown in 
Figures 1 and 2 respectively | be 
seen from Figure 1 that the sha; the 
curve for Group S-HA is somewhat differ- 
ent from that normally encountered, in that 


there is no major drop in the percentage of 
errors at the end of the curve. While this 
is a rather sharp departure from the typical 
serial position curve, it is exactly what 
would be expected if Ss learned the list by 
the serial instructions; that is, if each S 
made a connection between each pair of 
syllables, one would expect the beginning 
of the list to be learned most rapidly, with 


TABLE 4 
MEAN NUMBER Correct (AND STANDARD DEVIA- 


TIONS) ON THE FIRST-QUARTER TRIALS FOR THE 
Six EXPERIMENTAL GROUPS 


Instructions 


Trigram as- 4 
sociation 
Serial Position Control 
Low 2.02 2.89 2.83 
: (1.15) (1.37) ( .91) 
High 3.14 2.78 2.70 
(1.18) (1.06) (1.52) 
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1 TP Map NI ET IE G 8 9 


Serial Position 
Ficure 1. Serial position curves, showing percentage of errors for 
each serial position, for the 3 high-association (HA) groups. (Abbrevia- 
S = groups given serial instructions; P = groups given position 
instructions; and C = control groups.) 


more and more difficulty occurring as 
gets further and further into the list. The 
serial position curves for all low-association 


12 


10 


Percent Errors 


Mean 


105119512 


13:52 93 71:4175«505:6- 7:2 7 BU L9 


Serial Position 


FiGURE 2. Serial position curves, showing percentage of errors for 
each serial position, for the 3 low-association (LA) groups. (Abbrevia- 
tions: S = groups given serial instructions; P = groups given position 
instructions; and C = control groups.) 


S syllables, on the other hand, are more like 
the typical inverted-U-shaped serial posi- 
tion curves, with the ends showing fewer 
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errors, and most errors occurring in the 
middle of the list. 


DISCUSSION 


Two findings from this study seem worthy of 

further note. First is the effectiveness of in- 
structions in influencing S's learning strategy, 
especially for the error data. Most of the 
experimentation performed up to the present 
has been concerned with determining the 
strategy that Ss prefer to use, rather than com- 
paring the relative efficiency of various strat- 
egies (e.g, Brown & Rubin, 1967). The 
present experiment, on the other hand, is an 
attempt to compare the effectiveness of the 
specificity and position hypotheses when they 
are induced by instructions prior to learning. 
Although previous experiments have employed 
variations in instructions as an experimental 
variable, most have done so for other purposes, 
e.g., to contrast intentional (instructed) and 
incidental (noninstructed) learning (e.g., Dorn- 
bush & Winnick, 1967), to provide a basis for 
differences in motivation (e.g., Baron, 1967), 
or to provide information or misinformation 
about the learning situation (e.g., Holden & 
Rotter, 1962). The present experiment, by 
contrast, has provided instructions about spe- 
cific procedures that are to be used to accomplish 
learning. That the instructions were acted 
upon by Ss in this experiment seems to be 
indicated by the reaction time data, which 
showed that S groups built up stronger associa- 
tions between one trigram and the next one on 
the list, and that P groups built up stronger 
associations between the number of the trigram 
on the list and the trigram than between 
successive trigrams. Furthermore, the signifi- 
cance of the instructional effect in the error 
data, as well as its borderline significance on 
trials to criterion, indicates that the specific 
suggestion of a strategy provides these Ss with 
something of an advantage over control 

One possible explanation for these findings is 

that giving S instructions about how to learn 
the list allows him to start learning the list 
without having to search for a strategy. He 
would then learn the list more rapidly than if 
he had searched for a Strategy, as long as the 
instructed strategy was not one which would 
hinder learning. 

In the search for differences between the 2 
instructed groups, the Newman-Keuls analysis 
found that Group P-LA made significantly 
fewer errors than either Group S-LA or Group 
C-LA. The latter 2 groups did not differ 
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significantly from each other, indicati: peri- 
ority of position instructions for th list.) 
However, the Newman-Keuls analysi the 
trials-to-criterion data showed that oup 
C-LA took significantly more trials t arn 


the list than Group P-LA, but fou: no 


significant difference between Group P-L and 
Group S-LA. This is not meant to imply *hat 
5 used only the strategy he was instruc: | to 
use. It is possible and not contradict: to 
the present results that some Ss used ome 
complex strategy based on the instruc ons. 

The second finding of note is that the 


effectiveness of a learning strategy (as ind ced | 
by instructions) is functionally related to the 
characteristics of the materials to be lear od. 
That this was the case in the present ex; ri- 
ment is indicated by the significant interac on 
between instructions and materials for <he 
error data. This significant interaction re- 
vealed, first, that position instructions were 
more effective than specificity instructions 
with low-association trigrams, and second, that 
neither strategy had an advantage for high- 
association trigrams. This finding is quite 
understandable. Low-association  trigrams, 
by their very nature, provide little basis for the 
interitem linkages demanded 1 yt y 
strategy. The existence of numerica! i: 1 
in the list is therefore a feasible aid. l- 
grams increase in association value, it should 
become easier to form interitem link: zes, SQ 
that when high-association trigrams are used, 
the advantage of the position strategy is lost, 
and the strategies are of equal efficiency. It 
seems reasonable to predict that interitem 
linkages would be easier to form with word. 
than with trigrams, and the position strategy 
should therefore have little advantage when 
words are used. 

While the above reasoning is logical, at 
least one other possibility must be considered. 
It is feasible that, although Ss were given 
instructions, and although reaction time data 
give a clue that those instructions were used, 
Ss may have been using other cues in addition 
to and/or instead of the one suggested by the 
instructions. 

The results from the present study indicate 
that the effectiveness of a particular strategy 
of learning may be functionally related to the 
characteristics of the materials to be learned. 
Caution suggests that there may also be a 
relationship to the method of learning em- 
ployed. The present study, rather than using 
the more common method of anticipation, 
followed the Suggestion of Battig and Lawrence 
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(1967) that the serial recall procedure is 
superior because it reduces the confusion in- 
duced by the anticipation method's ‘‘require- 
ment that Ss simultaneously try to learn each 
presented item and anticipate the following 
item . ... [p. deta = Their conclusion that 
the recall procedure is more sensitive than the 
anticipation procedure for identifying the pro- 
cess in serial learning seems to be corroborated 
by the present results. 
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* SEQUENCE OF RESPONSE DEVELOPMEN' 
IN HUMAN EYELID CONDITIONING 


ARCHIE B. LEVEY? 


Clinical Psychiatry Unit, Medical Research Council, 
Graylingwell Hospital, Chichester, England. 


The acquisition of the conditioned eyelid response was examined fro 
standpoint of response patterning to determine whether response 
develops in an orderly sequence independent of response frequenc 
confined to unreinforced trials in order to avoid masking 
characteristic response shapes by the unconditioned response (UCR 
influence of response frequency was assessed by 
The 4 res 


analysi: 


and interstimulus interval. 
conditioned response alone, (b) 


return to the earlier response types. 


A lively interest in the analysis of 
specific response features and patterns of 
responding characterized the earlier ex- 
perimental literature on conditioning (e.g., 
Bernstein, 1934; Hilgard, 1931; Hum- 
phreys, 1943; Marquis & Porter, 1939; 
Wendt, 1930). Attention was also focused 
during this period on the analysis of rela- 
tionships between the conditioned response 
(CR) and the unconditioned response 
(UCR) (e.g., Grant, 1939; Hilgard, 1936; 
Miller & Cole, 1936). In subsequent 
years, the response itself tended to be 
taken for granted as interest in condi- 
tioning turned towards theory testing and 
the analysis of stimulus effects. The last 
decade, however, has seen a revival of 
interest in the description and analysis of 
response characteristics and of response 
changes during conditioning (e.g., Grevert 
& Moore, 1970; Grings & Schell, 1969; 
Kimble & Ost, 1961; Kimmel, 1966; 
Lockhart, 1972), including changes in 
overall response topography (e.g., Ebel & 
Prokasy, 1963; Gormezano, 1972; Grings, 
Lockhart, & Dameron, 1962; Prokasy, 
1965; Prokasy, Ebel, & "Thompson, 1963; 
Smith, 1968). At the same time, new 


* Requests for reprints should be sent to Archie 
B. Levey, Medical Research Council, Clinical 
Psychiatry Unit, Graylingwell Hospital, Chichester, 
Sussex PO19 4PQ, England. 
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an unaccompanied response in the lat 
range of the UCR, (c) a double response combining a and b, and (d) a resp 
in which a and 6 were indistinguishably blended. 
eyelid response exhibits an orderly sequence of development culmina: 
the blended response, though this development is ch. 


IREN 


È MARTIN 


University of London 


varying air-puff i; 
onse shapes identified were (a 


It was found th. 


aracterized by fr 


measures have been dex 
scribe topographic aspect 
with increasing accuracy 
1972; Pennypacker, 1964). 

Objective measures of response efficiency? 
have been reported for the conditioned 
eyelid response which provide a precise, 
quantitative description of the degree to 
which the developing CR anticipates the 
occurrence of the UCR (Levey & Martin, 
1968; Martin & Levey, 1969). Investiga- 
tion of these measures has shown that (a) 
they increase linearly toward a stable end 
point in the course of acquisition, (b) the 
increase is characterized by random fluc- 
tuations, (c) rate of increase is largely 
independent of response frequency, and (d) 
the end point may or may not 
optimal attenuation of the 
pending upon stimulus and .5 variables. 
The last finding is in accord with the 
Observation that use of paraorbital shock 
produces efficient placing of the CR, though 
attenuation is not involved (Coleman & 
Gormezano, 1971). In the classical cone 
ditioning paradigm, response shaping is 
determined by S rather than by E (cf) 
Prokasy, 1965), and the increase in re- 
sponse efficiency may be regarded as a 
form of adaptive responding. 

A tentative formulation that attempts 
to account for these findings has been 


! which dé 
Í the responsi 
(Edelberg, 1970) 


involve 
air puff, de- 
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offered elsewhere (Martin & Levey, 1969, 
pp. 119-136) and will not be described in 
detail. Briefly, it is suggested that the 
conditioned stimulus (CS) and the un- 
conditioned stimulus (UCS) are combined, 
from the beginning of acquisition, in an 
internally represented CS/UCS complex, 
to which the organism responds as a single 
stimulus entity, and for which a neuronal 
model is formed, analogous to Sokolov's 
(1963) model of the orienting stimulus. 
It is assumed that the earliest response to 
this stimulus complex is the formation of 
an internal response model. In early trials 
the stimulus complex evokes a UCR replica 
in the temporal range of the CS, which is 
incorporated in the response model and 
becomes the basis of the anticipatory CR. 
In the course of acquisition, the internal 
model of the response complex is progres- 
sively adjusted, through comparator mech- 
anisms, to the combined characteristics of 
the stimulus complex. The point in ac- 
quisition at which an overt response is 
elicited depends on performance factors, 
and the response form is governed by the 
current state of the response model. The 
final response is an integrated complex 
that represents efficient use of all the 
information coded in the stimulus com- 
plex, and hence produces — efficient 
responding. 

The present study was undertaken to 
test some of the observable consequences 
of the formulation outlined above. (a) 
Presentation of the CS alone in early 
trials should be capable of eliciting an 
attenuated response in either the CR or 
the UCR latency range or both. (b) On 
intermediate trials the CS should elicit a 
combined response in which both CR and 
UCR components are identifiable. (c) On 
trials late in the acquisition series the CS 
should be able to elicit a response in which 
CR and UCR components are combined 
in a single entity. (d) The Ss who re- 
spond late in the acquisition series, as- 
suming response probability to be a func- 
tion of various performance factors, should 
produce responses that reflect the develop- 
ment of the internal response model. 
A corollary of these inferences is that the 
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developing response should exhibit an 
orderly sequence, independent of the level 
of response frequency. 

The test of these propositions must rest 
with unpaired presentation of the CS, since 
occurrence of the UCS would mask the 
response shape by forcing a UCR. Ac- 
cordingly, the study used the method of 
unreinforced test trials, though this car- 
ries the unavoidable limitation that the 
effect of partial reinforcement as a variable 
cannot be independently assessed. This 
procedure has the sanction of other recent 
investigations, however (e.g., Coleman & 
Gormezano, 1971), and in the present 
context the test trials can be regarded as 
a kind of “window” on the current state 
of the developing response complex. 


METHOD 


The Ss were 72 naive adult males, screened for 
sensory defects, who were recruited as paid volun- 
teers. The conditioning procedure involved Ss being 
conditioned to a 1,000-cps 65-db. tone CS presented 
through headphones, and an air-puff UCS of 60- 
msec. duration, delivered to the cornea through 
1-mm. tubing and terminated with CS termination, 
Stimulus presentations and durations were controlled 
by a Devices Digitimer. ‘The eyelid response was 
recorded by means of an ORP-12 Cadmium Sul- 
phide photocell whose output was fed, with the 
marker signals, to a 2-channel Elema Schónander 
Mingograph with a chart speed of 50 mm/sec. 
The Ss were seated in a dimly illuminated cubicle, 
screened from the apparatus and from E. Thirty 
acquisition trials were given with 18 unreinforced 
test trials interspersed randomly throughout the 
ies, with the limitation that no 2 test trials occur 
in succession. The same randomization order was 
used for every S. Immediately following the ac- 
quisition trials, extinction trials were run to à 
criterion of 5 successive nonresponse trials or a limit 
of 20 trials. 

In order to manipulate response frequency, 2 
experimental factors known to affect this variable 
were accounted for in the design. Half of the Ss 
were conditioned at short (400 msec.) and half 
at long (800 msec.) CS durations, crossed with 2 
levels of. UCS intensity, weak (3 psi) and strong 
(6 psi) air puffs. Since personality variables, notably 
extraversion and neuroticism, are known to affect 
response frequency, Ss were assigned to the 4 cells 
of the design on the basis of these 2 personality 
factors and their interaction, using pretest scores 
on the Eysenck Personality Inventory (Eysenck & 
Eysenck, 1964). 

Responses were scored on each of the 18 test 
trials for all deflections of 1 mm. or more above the 
stable baseline. A response was scored as a CR if 
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FIGURE 1. Chart tracings of the 4 response types 
discussed in text: conditioned response (CR) only 
(c), unreinforced responses falling in unconditioned 
response range (UCR) only (u), double CR/UCR 
(d), and blended CR/UCR (b). (The figures on 
the left represent early trials, while those on the 
right are taken from later trials. The marker in- 
dicates conditioned stimulus [CS] onset, duration, 
and offset. The arrow indicates the point at which 
the unconditioned stimulus would have occurred. 
The CS duration in all figures is 800 msec.) 


it fell between an onset. latency of 120 msec, and the 
offset of CS, and as a UCR if it occurred within 
120 msec, following CS offset, (The italicized label 
will be used throughout this article to refer to un- 
reinforced responses falling in the UCR latency 
range.) It should be emphasized that the latter 
responses, though defined by the UCR latency 
Tange, were not simply duplicates of the UCR, 
either in shape or amplitude. Many of them, 
particularly in earlier trials, were low in amplitude 
and did not show the characteristic Sharp rise of the 
UCR to UCS presentation. It should also be noted 
that the findings of this study do not apply to 
voluntary-form responding, which did not occur in 
the sample studied, 
Responses were then categorized into the 4 types 
ilustrated in Figure 1. This categorization was 
done by a research assistant thoroughly familiar 
with conditioning records, but unaware of the pur- 
pose of the analysis.2 The types identified were: 


* The authors are most grateful to Mary Acock, 
Medical Research Council, for her cheerful and 
competent performance of this task, 
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CR only (C), UCR only (U), double response ( 
and blended response (B). The categorization 
objectively based on the latency criteria outlin 
and on the locus of inflection points in the wat 
forms in the case of double responses, On this 
a small number of multiple CRs was inc luded in tl 
CR-only category. Obvious spontaneous blini 
defined by their characteristic rise time, amplitu 
and duration, were separately identified and 
cluded from the analysis. Coding reliability 
assessed by independently rescoring one third 
the records, resulting in a 95% overall agreemei 
While this type of categorization is crude, by coi 
parison with the Objective measures of respon 
efficiency mentioned earlier, it served to identify 
basic response shapes with which the study 
concerned, 

The sequence of response types was ic 
each S by the following method: each 
ance of any response type was recorded 
in order of occurrence. This yielded a c 
ing 4 successive elements, For example, if 5 gave tl 
following sequence: U, C, C, U, D,t 3 
C, D, it was coded as U-C-D-B, indicati: 
progression of response types occurred in that ordei 
This method of summary yielded 24 possible se 
quences representing the permutations of 4 elements 
An appreciable number of Ss showed only 3 of th 
response types. In this situation, it follows that the 
fourth element of the code can only be the remaining 
type, and these codes were therefor ipleted with 
the missing element, A few Ss gave only 2 of the 
response types, and were randomly assigned to those 
codes in which the 2 relevant response types ap- 
peared first in the order Observed. It is important 
to note that this method of coding takes no account 
either of the absolute number of responses of a 
given type, or of the order in which any of the 
response types, having first appeared, may recur 
prior to the emergence of the next type. The main 
hypothesis to be tested was that the 4 types occur 
in ordered sequence. A Sequence such as C- U-D-B, 
should occur more frequently, for example, than the 
sequence D-U-C-B, while the sequence B-D- U.C 
should be unlikely. In other Words, it was postu- 
lated that the developmental sequence begins with 
the single responses, progresses through the double 


response, and ends with the integrated blend 
response, 


o 


REsuLts 


The total number of responses cate- 
gorized was 499, divided into 240 C, 91 U, 
79 D, and 87 B responses. The observed 
base level of responding immediately prior 
to the acquisition series was .069 for the 
shorter interval and .120 for the longer. 
These are average values based on 3 sam- 
ples from each S; allowing for the 120- 
msec. UCR criterion, and are within the ` 
ranges usually observed in our laboratories. 


SEQUENCE OF RESPONSE DEVELOPMENT IN EYELID CONDITIONING 


Only one double response occurred and 
this is consistent with chance expectation 
at the observed levels. Analysis of vari- 
ance on the 2 X 2 sampling model of the 
design, summing over 18 trials, showed 
that total response frequency was signifi- 
cantly lower under the interaction of a 
short interstimulus interval and a weak 
air puff, F (1, 68) = 4.19, p < .05. Sepa- 
rate analyses showed the U response to be 
slightly more frequent under the stronger 
air puff, F (1, 68) = 4.98, p < .05, while 
the D response was significantly more fre- 
quent with the longer interval, F (1, 68) 
= 4,02, p < .05. The C and B responses 
were not differentially affected by experi- 
mental conditions. Twelve Ss failed to 
produce responses to the unreinforced 
trials and were excluded from further 
analysis. 

The fact that the 4 response types could 
be identified in substantial numbers lends 
immediate support to the notions discussed 
in the previous section. These results 
indicate that the unpaired CS is capable 
of eliciting responses in both the CR and 
UCR latency ranges, singly and in com- 
bination, including a completely blended 
composite of both responses. The question 
of interest, however, is the order in which 
these response types occurred. If they 
merely represent 4 discrete response shapes 
or patterns, it might be expected that each 
of the sequences would be equally probable. 
In fact, of the 24 possible codes, the 2 ex- 
pected sequences, C-U-D-B and U-C-D-B, 
accounted for over 50% of all responders. 

The assumption that each of the se- 
quences is equally probable is open to 
several objections. In particular, any 
factors that govern the proportions of the 
response types would also dictate the rela- 
tive frequencies of the various sequences. 
Thus, if chance alone determined the first 
response type, the predominance of C, 
noted earlier, would lead to a higher fre- 
quency of all codes beginning with C. 
Using the same argument for each position 
of the elements in the codes, the most 
probable sequence would be C-U-B-D, 
and the next most probable would be 
C-U-D-B, representing à spurious advan- 
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tage to the sequence expected by the 
theoretical model. In the following anal- 
yses, therefore, the calculation of expected 
frequencies for each of the sequence codes 
used the joint probabilities of the response 
types, based on their overall frequencies, 
assuming independence. This afforded an 
effective control for base-level responding 
in relation to the asymmetric latency cri- 
teria for the C and U responses, for ex- 
perimental factors, and for any unknown 
factors which may have determined the 
distribution of response likelihoods. 

No hypothesis was offered to predict 
whether the CR would precede the UCR, 
or vice versa. In order to summarize the 
data, therefore, and reduce the number of 
codes to manageable proportions, the CR 
and UCR responses were considered sepa- 
rately. This yielded 6 sequence codes 
based on the permutations of 3 elements. 
It should be noted that this procedure 
offered the further advantage of account- 
ing for all Ss, since for those few Ss who 
gave only 2 response types, the code could 
be completed with the missing element. 

Table 1 summarizes the data for 3 anal- 
yses: CR omitting UCR, UCR omitting 
CR, and CR combined with’ UCR. The 
symbol CU in the code label on the left 
indicates the position of these elements in 
the sequence, while the symbol at the 
head of each pair of columns designates 
the element itself. Thus, the first entry 
in the top row of the table is the number 
of Ss who gave the sequence C-D-B, the 
second is the expected frequency for that 
sequence, the third is the number producing 


TABLE 1 


OnskRvED (O) ANp EXPECTED (E) FREQUENCIES OF 
Ss RESPONDING WITHI x SEQUEN 


Cc CU 
Code m. 
0 E 0 | E 
CU-D-B | 42 | 16.94 | 34 | 10.11 | 44 | 19.05 
CU-B-D 6 | 18.64 6 | 11.13 8 | 20.97 
D-CU-B 6 8.57 7| 941 5 7.52 
D-B-CU 3 3.07 | 10 | 9.01 2 1.96 
B-CU-D 2 9.64 1 | 10.88 1 8.48 
B-D-CU 1 3.14 2| 946] 0| 2.02 


Note. See text for code explanation, 
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the sequence U-D-B, and so on. With one 
exception, to be discussed later, the ex- 
perimental conditions had no important 
effect on the sequences and will not be 
considered separately in these analyses. 

Inspection of the table shows that the 
predicted sequences, C-D-B and U-D-B, 
were overwhelmingly highest in propor- 
tion, while B-D-C and B-D-U were scarce- 
ley represented, X? (5) = 53.95 and 74.40, 
respectively, p < .001. The joint func- 
tion of CR and UCR, shown in the third 
pair of columns, follows the same pattern, 
X? (5) = 50.16, p < .001, with greater sym- 
metry. Further clarification of the rela- 
tionship between CR and UCR was 
afforded by examining the first response of 
each S. The observed frequencies of the 
response types C, U, D, and B were 36, 
16, 7, and 1, respectively, and yielded 
X*(3) = 13.24, p < .005. Interestingly, 
the observed frequencies of the CR and 
UCR, ignoring the remaining categories, 
lie close to the observed proportions of the 
2 responses overall, suggesting that the 
CR is more likely as a first response 
simply because it is more available, 

The combination of a short interstimulus 
interval and weak UCS not only reduced 
the level of responding, but virtually ex- 
cluded the unexpected response sequences. 
For example, 90% of the Ss responding 
under these conditions showed the se- 
quence C-D-B. In order to explore this 
unexpected finding, the 60 responding Ss 
were divided into 2 equal groups of 30 
early responders and 30 late responders, 
both across experimental conditions and 


TABLE 2 


FREQUENCIES oF RESPONSE SEQUENCES FOR 25 Ss 
GiviNG THREE or Four RESPONSE TYPES IN 
EXTINCTION, COMBINING CR anD UCR 
ONLY RESPONSES 


w 
$ 
3 
Ei 
H 
g 
w 


4 Total Expected 

CU-D-B 5 0 5 
CU-B-D 0 0 0 202 
D-CU-B 2 2 4 2.65 
B-CU-D 4 0 4 6.38 
D-B-CU 3 1 4 245 
B-D-CU 3 5 8 3.14 

Total 17 8 25 


Note. See text for code explanation, 
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separately within experimental co 
and the data were reanalyzed 

analyses showed the predicted 

sequences to be undifferentiated b 
or late responding. By contra 
deviant sequences were virtually 

to late responders and in the casc 
within-conditions comparison, enti: 
This result is consistent with 2 expl: 
factors. Where the experimental 


tions are such as to suppress respo (ling, 
late responses represent the beginn: \g of 
response development. Where the : ondi- 
tions facilitate responding, late responding 
represents those Ss who withold res; onses 
until late in their own developme: se- 
quence, but whose patterns of respo ding 
show an internal sequence of development. 

No hypothesis was offered as the 


behavior of response sequences du 
tinction. The following analysis 
pirical and is offered merely as a descrip- 
tion of the typical sequences. It should 
be noted that the categorization of re- 
sponses is made difficult dui 

because the response complex lo 
shift on its time base, and this mə 

in misclassification of some resjx 
Fifty-four Ss produced a total of 
responses in extinction, of which 235 
either blended or double responses. 


ex- 


were 
In 
these trials, 45% of all Ss responding gave 


response sequences that fell in those 
categories beginning with the blended re- 
sponse. The next most frequent category 
comprised those sequences beginning with 
the double response. These results. sug- 
gest that the response complex is in some 
sense breaking down, though not neces- 
sarily regressing in an orderly fashion 
through the reverse of the response-type 
sequences. The results must be viewed 
with some caution, since the appearance 
and disappearance of responses late in the 
sequence may produce artifacts that seem 
to resemble disintegration. It is note- 
worthy, however, that those Ss giving a 
large number of responses in extinction 
Were those more likely to follow the 
graded sequences. Table 2 shows the dis- 
tribution of response sequences for the 
25 Ss giving either 3 or 4 response types 
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in extinction, combining CR and UCR in 
one category. Expected frequencies were 
calculated on the basis of overall response 
proportions in extinction. The average 
number of extinction trials for this group 
was 17.72. Itis evident that the reversed 
sequences B-D-CU and D-B-CU account 
for a disproportionately high number of 
responders, X? (5) = 17.36, p < .005. This 
suggests the intriguing possibility that if Ss 
go on responding long enough in extinc- 
tion, the response complex tends to be 
dismantled in reverse order to its 
development. 


DISCUSSION 


To summarize the results of this study 
briefly, it has been shown that the emergence 
of response types categorized as CR, UCR, 
double CR/U CR, and blended or integrated 
CR/UCR occurs for the most part during 
acquisition in an orderly sequence of response 
development. The sequence as such is inde- 
pendent of the experimental conditions. ex- 
amined, although the absolute numbers of a 
given response category are affected. Those 
experimental conditions that suppress respond- 
ing appear to delay the onset of the sequence, 
while those that facilitate responding show 
evidence that internal response development 
occurs early in acquisition. The sequence does 
not exclude a return to earlier modes of re- 
sponding and is probably best described as an 
overall emergent trend in a setting of ongoing 
random variation consistent with the fluctua- 
tion of efficiency levels mentioned earlier. 
However, in the present study, which was 
limited to partial reinforcement, it is possible 
that extinction processes also contributed to 
the return to earlier response types. 

These results were anticipated in a sur- 
prisingly neglected early study by Grant 
(1939), though differences in experimental 
conditions, procedures, and methods of record- 
ing make it impossible to compare the 2 studies 
directly. Grant identified 6 response patterns, 
of which 2, labeled the “unimodal CR” and 
the “completed slope CR,” resemble the re- 
sponses labeled double and blended responses 
in the present study. He noted of the latter 
response that "this very efficient pattern gen- 
erally occurs later in the conditioning series 
[p. 452]," and commented both on the implied 
sequence of patterns and on the tendency to- 
ward “regressive shifts to patterns which had 
been present in an earlier stage of the experi- 
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ment [p. 460].” Grant's study, of which the 
authors were unaware at the time the present 
study was designed and analyzed, was specifi- 
cally aimed at describing response patterns 
and their shifts in relation to the question of 
CR/UCR relationships. Though the mea- 
surements were made entirely on reinforced 
trials, the similarity of results is striking. 

The view that response efficiency is inde- 
pendent of response frequency is. consistent 
with the recent finding of Coleman and Gor- 
mezano (1971) that the precision and vari- 
ability with which the CR peak is placed in 
relation to UCS onset are uncorrelated with 
response frequency. In the present study, CR 
frequency accounted on the average for only 
one quarter of the variance associated with the 
4 response types, and, in general, correlations 
among response types were nonsignificant. 
Nevertheless, the finding that response sup- 
pression may delay the onset of response de- 
velopment requires some modification of this 
view. 

The development by Prokasy (1972) of a 
two-phase model that successfully predicts ac- 
quisition curves in human eyelid conditioning 
may offer a lead. One of the parameters (K) 
of this model reflects a factor of delay during 
early acquisition trials which Prokasy (p. 143) 
attributes in part to variables that affect 
orienting behavior. It seems entirely possible 
that the low saliency of the weak UCS and 
short interstimulus interval in the present 
study may have resulted in failure to attend 
during the early trials. It is reasonable to 
suppose that if the stimulus processing mechan- 
ism is “idling” during this period, the postu- 
lated stimulus and response models will be 
delayed in formation. 

The interest of the present results, apart 
from their general support of the theoretical 
model outlined earlier, centers on the eyelid re- 
sponse at UCR latency. The occurrence of 
170 of these responses, either alone or in com- 
bination with the CR, is well above the base- 
rate expectation of 28.4 responses and suggests 
that they are not trivial. A similar response, 
occurring on unreinforced trials, has been 
described by Zeiner (1968) for several auto- 
nomic measures (galvanic skin response, heart 
rate, salivation, and pupil dilatation). In 
electrodermal conditioning, it is commonly 
referred to as the second interval response 
(SIR), first described by Grings et al. (1962). 
Several explanatory concepts were examined 
by these authors, viz., response substitution, 
inhibition of delay, temporal conditioning, CS 
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offset response, and disparity responding. 
Reasons were given to show that without 
modification, none of these explanations is 
entirely satisfactory. Part of the interest of 
identifying the UCR or SIR in eyelid condition- 
ing is the fact that the orienting component, 
which is prominent in the electrodermal sys- 
tem, can be largely ignored. The behavior of 
the alpha blink, or CS reflex blink, has been 
shown to resemble that of the orienting re- 
sponse (Levey & Martin, 1967; McAllister, 
1953). This blink occurs typically to onset of 
the CS, but rarely to offset at the stimulus 
intervals employed in the present study. (We 
have informally observed CS-offset alpha 
blinks, though only to tone stimuli of very 
long duration, e.g., 120 sec.) Thus, the 
mechanisms of CS offset and disparity re- 
sponding do not apply to the eyelid response. 
Zeiner (1968) has shown empirically, in an 
elegant experiment comparing a trace paradigm 
in which the CS extended beyond the UCS, 
with a delay paradigm, that the SIR in elec- 
trodermal conditioning is not merely a response 
either to offset of the CS or to contiguity be- 
tween CS offset and UCS onset. He has also 
presented cogent reasons, which are equally 
applicable to the present results, suggesting 
that the SIR is not due either to simple re- 
sponse substitution or to inhibition of delay. 
His conclusion was that the SIR. phenomenon 
represents one part of a dual process, in which 
the occurrence of 2 Separate responses is 
attributed to separate mechanisms of prepara- 
tory responding, mediated by learned fear, 
and a modified form of Stimulus substitution. 
It is our view that in eyelid conditioning at 
least, a two-process model does not readily ac- 
count for the eventual blending and integra- 
tion of the 2 responses, CR and UCR, and the 
present results suggest that both are intrinsic 


components of the classical conditioning 
process, 


Indeed, the most interesting aspect of the 
response sequences described is not the nature 
of the 2 response components but the fact that 
they combine into an integrated whole. The 
fact that a blend of CR and UCR occurs in 
eyelid conditioning is not in doubt (Spence, 
1956). What the present study shows is that 
the blended response, as well as the composite 
double response, is elicited by the CS in the 
absence of the UCS. This of itself indicates 
that they are in some sense integrated into a 
single response, It is important to note that 
the blended response does not necessarily re- 
sult in optimum avoidance of the air puff. 


ARCHIE B. LEVEY AND IRENE MARTIN 


Doubt has recently been thrown on the abili 
of simple reinforcement mechanisms to at 
count for response placement (Gormezam 
1972) as well as response probability (Prokas 
1972). The view proposed he 
response integration to the modeling of th 
CR/UCR response complex on the CS/UG 
stimulus complex, and this is consistent wit 
our present knowledge, though it cannot b 


proven. 

If the CR and UCR are equa parts of th 
classical conditioning process re remain: 
the consideration of why th CR in thi 
present study should have bee: frequem 
than the CR. The probabilit, 'ccurrenc 


on the CR or the UCR as a firs 
been shown to be entirely consi 
overall probability. It has also 
that with a stronger UCS the 


more frequently, indicating ti e CS 

actively conveying this inform ES CC 
suggested, the CS and UCS m a single 
stimulus complex, sharing the information 


content of both, the ability of the CS to elicit 
the UCR, or the UCS to elicit the CR, might 
be expected to be weaker than when the 2 
stimuli are presented together. The second 
half of this proposition cannot b: tested, since 
the UCS alone cannot give rise to an observable 
CR. In the absence of this unobtainable evi- 
dence, the difference in frequency of the CR 
and UCR offers no basis for rejection of the 
conclusion that the CR and the UCR are 
equally parts of the classical conditioning se- 
quence, and that their integration into the | 
double or composite response and finally into a 
blended response is an intrinsic part of the 
conditioning process. 

Finally, though the results lend support to 
the theoretical model outlined, they also raise 
a very puzzling question. The data clearly 
show that on very early trials the CS is capable 
of eliciting a CR or a UCR, but not both. It 
might be noted that this situation is equally 
problematic for a two-process model. Nothing 
in the proposed model or in any current theory 
of classical conditioning predicts this pheno- 
menon. It is not the purpose of this article 
to enter into discursive theorizing. The data 
seem to suggest 3 possibilities, First, the early 
trials may recruit insufficient response energy, 
in some sense, to Support 2 responses, In this 
event, the UCR would be given only on a trial 
on which the CR had for some reason not | 
occurred, Second, if 2 Separate processes are 
involved, they may for unknown reasons be 
mutually exclusive or may interfere in the 
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early trials. Third, and most attractive in our 
view, the phenomenon can be handled within 
the confines of the proposed model. If it is 
assumed, following Estes (1950), that the 
comparator process mentioned earlier, by 
which it was suggested the integrated response 
complex is gradually built up, operates on each 
trial by scanning only a random subset of the 
information in the stimulus complex, the result 
is at least partially explained. If this were the 
case, the occurrence of a CR or a UCR or of 
no response in early trials would be a function 
of the particular subset sampled. On succes- 
sive trials, further information would lead to 
progressive shaping and integration of the 
complete response model. This view is con- 
sistent with current views of information 
processing and is able to account for the 
gradual integration of the response. 

No final answer can be given to the problems 
raised by the occurrence of the UCR and the 
CR/UCR complex in response to the CS. 
While, at this stage, the results lend support 
to the view of conditioning outlined earlier, 
they do not, of course, confirm it. The purpose 
of the present study was to establish firmly the 
empirical basis of response development se- 
quences in eyelid conditioning. Further work 
will be required to elucidate the underlying 
mechanisms and to arrive at a satisfactory 
explanatory formula. 
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HAPTIC EQUIVALENCE MATCHING OF CURVATURE 
BY BLIND AND SIGHTED HUMANS * 


PHILIP W. 


The relations between active handling, accurate haptic curvature matching, 
and task difficulty were studied using increases in the number of stimuli con- 
tained in comparison arrays to vary 
s task difficu 


accurate than sighted 
exploratory search wei 


Compa 
more similar was the scanni 
blind. 


like curvature, 


Skill in haptically judging form proper- 
ties such as length, proportion, or curva- 
ture seems to depend upon the particular 
pattern of active finger, hand, or arm 
movements used to explore the stimulus. 
However, little is known of the role of such 
movements in perception. 

A recent study by Davidson (1972a) 
attacked one aspect of this question, the 
relation. between the stimulus property 
being judged and the style of search. The 
experiment compared blind and sighted 
Ss in discriminating curved stimuli by 
hand. The results showed that blind Ss 
(a) were more accurate than the sighted 
with the most difficult curves, and (5) 
employed a scanning strategy infrequently 
used by the sighted, one leading to im- 
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task difficulty. 
Ity increased and marked differences in 
> noted between blind and sighted Ss. 
blind's characteristic search strategy, infrequently used by the sighted, led to 
improved matching by sighted Ss restricted to its use. 
ison-Array conditions, the more stimuli in the comparison arrays the 
ng of sighted Ss in each condition to that of the 
It was concluded that an interaction between task requirements and 
search style may in part determine accuracy in judgments of form properties 


Blind Ss were more 
Moreover, the 


Also, in the larger 


proved judgments by sighted Ss rest ricted 
to its use. 

The blind’s strategy, called gripping, en- 
tailed using three or four fingers to simul- 
taneously explore numerous points along 
the curve. Sighted strategies, although 
similar to gripping in hand and arm motion, 
involved successive search through pinch- 
ing, or sweeping, the curve with one or 
two fingers. Davidson (1972a) concluded 
that gripping probably was suitable for 
prehending curves of low discriminability 
since it provided more stimulus information 
than other methods. 

The present experiment examined 
another related aspect of this question, 
asking if the effectiveness of search strate- 
gies is influenced by the difficulty of the 
task itself, apart from the nature of the 
stimulus attribute. We used curvature 
equivalence matching, varying task diffi- 
culty by systematically changing the num- 
ber of alternatives to be compared to the 
standard. The larger the number of stimuli 
in the comparison array, the greater the 
burden placed on immediate recognition 
memory and the more difficult the task 
(Goodnow, 1971). Part I compared the 
unrestricted exploratory search of blind and 
sighted Ss while Part II tested the effective- 
ness of these search strategies. 


METHOD 


Subjects, Twenty-four Ss with normal vision 
(Mago = 15 yr, 5 mo.) and 23 congenitally totally 
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blind Ss (Mag = 19 yr., 0 mo.) participated in 
Part I. The blind and sighted Ss were matched 
approximately for IQ based on Verbal Scale scores 
on the Wechsler Intelligence Scalé for Children 
(Mro = 102, SD = 9). Ninety paid undergraduate 
volunteers participated in Part Il. 

Stimuli. Seven different concave curves (as 
illustrated in Davidson, 1972a) were used, with arc 
heights of 2, 4, 6, 8, 10, 12, and 14 mm. 

The task. The blindfolded S felt first the standard 
stimulus for 8 sec., then successively presented com- 
parison stimuli for 8 sec. each to match the standard. 
The 2-, 4«, and 6-mm. curves served as standards 
(the greatest differential scanning effects reported 
by Davidson, 1972a, occurred with these three 
stimuli), each appearing 8 times in a random order 
for a total of 24 trials. 

Task difficulty was varied by changing the number 
of curves appearing in comparison arrays and the 
same three difficulty conditions were used in both 
Parts I and II. In the first condition, each compari- 
son array contained one curve (i.e., a same-different 
task), in the second, comparison arrays contained 
three curves (the standard plus two randomly 
selected others), and in the third, comparison arrays 
contained five curves (the standard plus four ran- 
domly selected others). "The standard and compari- 
son stimulus were the same 50% of the time in the 
One-Comparison condition and stimulus combina- 
tions and orders were randomized for all Compari- 
son-Array conditions. No stimulus appeared twice 

in the same comparison array. Stimuli were pre- 
sented following Davidson's (1972a) procedure. 

Part I procedure. Seven blind and eight sighted 
‘Ss were tested in the One-Comparison condition, 
and groups of eight blind and eight sighted Ss were 
tested in the Three- and Five-Comparison condi- 
tions. Each S's exploratory activity was videotape 
recorded and the recordings analyzed by two inde- 

pendent scorers following Davidson's earlier pro- 

cedures (19724). The scorers categorized an S's 
scanning each time he felt a curve, noting only 
whether particular predefined methods occurred, 

The tapes were scored for gripping, pinching, and 

top sweeping, the three scanning strategies differ- 

entiating blind and sighted Ss in Davidson's 

(1972a) study and defined and illustrated there, 

Part II procedure. In each of the three task 
conditions, 10 5s were restricted to using only grip- 
ping, 10 5s, to only pinching, and 10 Ss, only top 

Sweeping. Each S was trained in using the appro- 

priate scanning method before testing began. 


RESULTS 


Part I matching errors. The upper part 
of Table 1 gives the mean error scores 
from Part I. A two-way analysis of vari- 
ance revealed effects of Vision, F (2, 41) = 
11.71, p «.01, and Comparison Array 
Size, F (2, 41) = 48.78, b < .01. Sub- 
sequent ¢ tests indicated that blind and 
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sighted Ss’ errors did not differ in One- 
Comparison condition, / (41) = 1 b» 
-05, but that the blind made fewer rrors 
in the Three-Comparison, ¢ (41) 2.46, 
b < .01, and the Five-Comparison ondi- 
tion, ¢ (41) = 1.99, 5 < .05. ENev nan 
Keuls tests showed that errors ino ased 
for both blind and sighted Ss wit! cach 
increase in comparison array size (; .0t 
for all comparisons). No change in .rors 
occurred over trials, and IQ had : ) in- 
fluence on errors, regardless of Visi a or 
Comparison-Array condition. 

Part II matching errors. The mean rror 
scores from Part LI are shown in the ^ wer 
portion of Table 1. A two-way an lysis 
of variance revealed effects of both S ann- 
ing, F (2, 81) = 403, p< .01, and :'om- 
parison Array Size, F (2, 81) = 53.31, 
b <.01. Newman-Keuls tests revealed 


that (a) gripping led to fewer errors than 
both pinching, p < -01, and top sweeping, 
p < .01, but pinching and top sweeping 
did not differ from one another, > > .05, 
and (6) errors increased with si 
increases in comparison array size (; 
for all comparisons). 

Comparing the Grip groups with the 
combined errors of the Pinch and Top 
Sweep groups showed that the three 
strategies led to similar errors in both the 
One-Comparison, ¢ (28) = 1.31, p.05, 
and the Three-Comparison condition, / (28) 
=1.07, p > .05. However, gripping led to 
fewer errors than other methods in the 
Five-Comparison condition, ¢ (28) = 2.36, 
b < .02. This outcome departs from our 
findings in Part | where, with scanning free 
to vary, the between-groups difference in 
the Five-Comparison condition was some- 
what less than the Three-Comparison 
difference, 

Analysis of scanning in Part I. After a 
check for interscorer agreement (the scorers 
agreed 83.9% of the time), following 
Davidson's (1972a) procedure we calcu- 
lated the percentage of an .S's judgments in 
which the scorers reported seeing each 
Scanning strategy and then obtained group 
averages. The resulting distributions for 
blind and sighted Ss, shown in Figure 1, 
were compared with separate Mann-Whit- 
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TABLE 1 
MEAN CURVATURE MATCHING ERRORS 


| Number of Stimuli in Comparison Arrays 
| + 
| Three | 


Five 


Group | 


| x | sp 


Part I: exploratory 


slind | 6.57 
Sighted | 8.62 


1.76 | 
2.68 | 


Part I1: exploratory 


| 
Grip 8.20 | 2.4 
Pinch 9,40 | 1.74 
Top sweep | 940 2.57 


ney U tests for each scanning strategy 
within each Comparison-Array condition. 
In the One-Comparison condition, 
ighted Ss pinched more often than the 
blind, U (7, 8 = 7.0, p < .01, while the 
blind Ss gripped more often than the 
sighted, U (7, 8) — 13.0, p < .05, results 
paralleling Davidson's (1972a) previous 
study. The outcome with larger compari- 
son arrays, however, differs from earlier 
findings for sighted Ss. In the Three- 
Comparison condition, sighted Ss’ gripping 
showed a large increase compared to One- 
Comparison condition sighted Ss, although 
still used less often than by Three-Compari- 
son condition blind Ss, U (7, 8) = 12.0, 
p <.05. At the same time, pinch usage 
by these sighted Ss was similar to that of 
Three-Comparison blind Ss, U (7, 8) = 23.5, 
p> 10. In the Five-Comparison con- 
dition, no significant scanning differences 
vere indicated, a result perhaps accounting 
for the diminished error difference between 
blind and sighted Ss in this condition com- 
pared to the Three-Comparison condition. 
(‘Top sweeping by all groups was low with 
no consistent changes as à function of 
Comparison-Array condition). 

We summarized the percentage of judg- 
ments within successive four-trial blocks 
in which each strategy occurred to get a 
better view of the changes that took place 
in sighted Ss’ scanning as comparison 
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array size increased. The variability of 
these scores made any meaningful statis- 
tical analysis difficult. However, the data 
for sighted Ss did suggest a decrease in 
gripping over trials and an increase in 
pinching over trials in the One-Comparison 
condition, trends which did not occur in 
either the Three- or Five-Comparison con- 
ditions. There were no consistent effects of 
trials on top sweeping by sighted Ss, or 
upon usage of any of the three strategies by 
blind Ss. 


DISCUSSION 


The present study extends Davidson's 
(19722) paper in several ways. First, it appears 
that the contribution to perceptual accuracy 
made by exploratory search may be based upon 
an interaction between the task requirements 
and the style of search. This conclusion was 
suggested by Davidson (1972a) with respect 
to the nature of the stimulus attribute being 
judged, and seems extended by our data to 
also include the nature of the task paradigm. 

Second, a scanning strategy that permits 
simultaneous prehension of a stimulus attri- 
bute seems to withstand increasing task 
difficulty better than methods that sample 
more successively (at least with regard to rela- 
tional attributes such as curvature). I vidson 
(1972a, 1972b) has speculated that simulta- 


e——e nuno 9---0 SIGHTED 


"TOP SWEEP" 


"oni" 


PER CENT JUDGMENTS 
s 


NUMBER OF STIMULI IN COMPARISON ARRAYS 


Ficure 1. The percentage of judgments in 
which blind and sighted Ss in Part I used the scann- 
ing strategies. (More than one strategy could occur 
in each of an S's 24 trials. Mann-Whitney U tests 
revealed that blind Ss gripped more than the sighted 
in the One-Comparison condition, P < 05, and 
Three-Comparison condition, P < .05, and the 
sighted pinched more than the blind in the One- 
Comparison condition, p < .01.) 


neous prehension may help to pattern the rela- 
tionship, reducing the requirement for integrat- 
ing successive inputs in memory. If the 
increases in task difficulty in the present study 
are viewed as increases in task immediate 
recognition memory demand (as has been 
argued by Goodnow, 1971), then our results 
suggest that such scanning may also contribute 
to better recognition memory for form. 

The greater accuracy of blind Ss compared 
to the sighted is consistent with other studies 
of curvature categorization by the blind 
(Davidson, 1972a; Hunter, 1954), extending 
these previous results to the equivalence 
matching situation. This finding suggests that 
curvature is one form property that requires 
some haptic perceptual experience before it 
can be accurately perceived. 

The link between perceptual accuracy and 
scanning style, found initially by Davidson 
(1972a) and replicated here, is a straight- 
forward extension of previous findings for 
judgments of other form properties involving 
comparisons either of blind and sighted hu- 
mans (Davidson, Barnes, & Mullen, in press; 
Hatwell, 1959) or young children (Abravanel, 
1968). Moreover, the Scanning differences 
between sighted Ss depending on comparison 
array size suggests that modification of ex- 
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ploratory activity may be one result of pe 
ceptual experience, 
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JOEL R. LEVIN? 


University of Wisconsin—Madison 


ELIZABETH S. GHATALA 
Weber State College 


LARRY WILDER 


University of Wisconsin—Madison 


A discrimination-learning task was administered to a sample of sixth graders. 
Stimulus materials consisted of either pictures or words for which frequency 
judgments had been obtained in a previous experiment. Picture and word 
items were selected which differed either substantially or minimally in ap- 


parent frequency measures. 


While the usual picture-over-word effect was 


found when picture-word apparent frequency differences were large, it vanished 
when pictures and words were equated in terms of apparent frequency. These 
results, coupled with some recent data, provide for a fairly complete account 
of picture-word differences in discrimination learning. 


It has recently been demonstrated that 
pictures are more easily learned than 
words in a discrimination task, for both 
children (Rowe, 1972; Wilder & Levin, 
1973) and adults (Rowe, 1972; Rowe & 
Paivio, 1971; Wilder & Levin, 1973). To 
date, one of the more popular explana- 
tions of this effect has been that pictures 
are more likely than words to produce 
a simultaneous verbal-imaginal encoding, 
and that dually coded materials are better 
recalled (Paivio, 1971). However, guided 
by the frequency theory of verbal dis- 
crimination learning originally proposed by 
Ekstrand, Wallace, and Underwood (1966), 
the present authors conducted an experi- 
ment which suggested that picture-word 
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effects in discrimination learning may be 
attributable to differences in the subjec- 
tive (apparent) frequency associated with 
the two types of material (Ghatala, Levin, 
& Wilder, 1973). 

In that experiment, sixth graders were 
administered either a picture or word 
frequency judgment task in which line 
drawings of familiar objects or their 
printed verbal labels were presented for 
varying numbers of exposures, with Ss 
later asked to estimate the number of 
times a particular picture or word had 
been previously shown. 1t was found that 
the mean judgments for pictures were 
higher than the mean judgments for words, 
even though the actual presentation fre- 
quencies of the two were the same. In 
addition, the average variability associated 
with items of the same presentation fre- 
quency was lower for pictures than words. 

According to frequency theory, dis- 
crimination between the "correct" and 
"incorrect" members of a pair in a verbal 
discrimination task is based on the fre- 
quency differential between them, with 
more subjective frequency "units" accruing 
to the correct member of the pair than 
to the incorrect member through such fac- 
tors as differential rehearsal favoring the 
former. The Ghatala et al. (1973) results 
are consistent with a frequency theory 
interpretation of picture-word differences 


691 


692 


J. R. LEVIN, E. S. GHATALA, AND L. WILDER 


in discrimination learning since: (a) the ¥ apparent frequency (small-difference items). The 


finding that pictures were higher in mean 
apparent frequency than words suggests 
that a subjective frequency unit produced 
by a picture is larger than a frequency 
unit produced by a word; and (b) the 
finding that Ss were less variable in 
estimating the presentation frequencies of 
pictures as compared to words suggests 
that differences in presentation frequency 
are more likely to be discriminated when 
the items are pictures than when they are 
words. Both findings lead to the predic- 
tion of superior discrimination learning 
when the materials are pictures—the first, 
because the absolute frequency difference 
between correct and incorrect items is 
larger for picture pairs than for word 
pairs; the second, because lower variability 
associated with correct and incorrect items 
in picture pairs renders them more dis- 
criminable than word pairs. 

The present experiment was conducted 
to test the hypothesis that picture-word 
differences in discrimination learning are 
a function of apparent frequency differences 
associated with the two types of material. 
How apparent frequency judgments of 
pictures and words were obtained and 
manipulated is discussed in the following 
section. 


METHOD 


In the Ghatala et al. (1973) study, 19 sixth- 
grade Ss judged the frequencies of 39 pictures, 
20 of which had been presented once, 10 twice, 
6 three times, and 3 four times. Another 19 Ss gave 
frequency judgments for the printed labels which 
occurred with the same presentation frequencies 
as their corresponding pictures. 

Manipulation of apparent frequency means. 
A comparison of the mean apparent frequencies 
of individual picture and word items revealed that 
the mean of pictures was reliably (p < .001) higher. 
However, while the mean judgments of pictures 
(at each presentation frequency) were, in general, 
higher than those of words, there was some overlap 
in the distributions. In particular, from among 
the items presented once, 8 of the pictures could 
be matched fairly closely with 8 of the words. 
Similarly, 4 pictures and 4 words were selected 
from among the items presented twice, and 2 pic- 
tures and 2 words were selected from among the 
items presented 3 times. Pairing items within 
presentation frequencies resulted in 7 picture pairs 
and 7 word pairs which differed minimally in mean 


mean apparent frequency of the pictures used in 
the small-difference items (averaged over the 14 
pictures) was 1.42. The mean apparent frequency 
of the 14 words also averaged 1.42. 

Next, 14 pictures and 14 words were selected 
to form pairs which differed considerably in mean 
apparent frequency (large-difference items). Four 
picture pairs and 4 word pairs were formed from 
items presented once, 2 pairs of each stimulus 
type were formed from items presented twice, and 
1 pair apiece were formed from the items presented 
3 times. The mean apparent frequency of the 
pictures was 1.74, and that of the words was 1.16. 

The 14 picture pairs (7 small-difference items 
plus 7 large-ditference items) comprised 1 discrimi- 
nation list, and the 14 word pairs comprised another. 
In forming each list, an attempt was made to 
equate a pair's correct and incorrect members in 
apparent frequency. This was done in order to 
ensure that any between-materials (pictures and 
words) apparent frequency differences were manipu- 
lated independently of within-materials (correct 
and incorrect members) apparent frequency 
differences. 

"Three presentation orders of each list were con- 
structed such that: (a) within each order, the 
occurrence of small- and large-difference items was 
random; (b) within an order, the correct member 
was located on the right for half of the pairs and 
on the left for the other half; and (c) across orders, 
the spatial location of the correct member of each 
pair was varied randomly with the restriction that 
not occur in the same position across all 3 orders. 
The randomization scheme for constructing the 3 
presentation orders was the same for picture and 
word lists, i.e., each order's sequence of small- and 
large-dillerence items, as well as its spatial location 
of correct members, was the same for the 2 lists. 

The pictures were line drawings (3 X 3 in.) 
which were pasted side by side on sheets of 8} 
X1 in. paper. The sheets were then placed in 
a ring notebook binder. The words were typed 
in primary type, side by side, on sheets of paper 
which were placed in a separate 3-ring binder. 
Asterisks were used to indicate the correct member 
in each pair. 

Manipulation of apparent Srequency variances, 
From the same pool of items used to construct 
lists varying in mean apparent frequency, picture 
and word items were selected on the basis of the 
variance in apparent frequency associated with 
each item, The variance of these judgments dif- 
fered for pictures and words, being statistically 
smaller (p < .001) for pictures. Once again, there 
was some overlap between the distributions. 
Small-difference items were constructed from 14 
pictures (8 selected from among the "one" items; 
4 from among the “two” items, and 2 from among 
the "three" items) and 14 words (selected in the 
same numbers as the pictures from the 3 presenta- 
tion frequencies) which differed little in variability. 
Within presentation frequencies, picture pairs and 
word pairs were formed which differed minimally 
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in variance. The average variance of the pictures 
was .54, and that of the words was .52. 

Fourteen pictures and 14 words which differed 
widely in variance (lower for pictures) were used 
to construct the large-difference items. The pic- 
tures and words were selected from presentation 
frequency levels in the same numbers given above. 
As in ail previous cases, pairs were constructed 
from items within a presentation frequency. The 
average variance for the 7 picture pairs so con- 
structed was .39. The mean variance for the word 
pairs was .80. 

As before, the 7 small-difference and 7 large- 
dillerence items within each stimulus type were 
combined to form a 14-pair picture list and a 
14-pair word list. The procedures used in creating 
the li were the same as those previously 
described. 

Finally, it should be noted that in selecting 
items to investigate one of the measures of ap- 
parent frequency (either means or variances) an 
attempt was made to control the other measure, 
even though this could not be done perfectly. 
For example, the average variances of picture and 
word items selected for a small difference in mean 
judgments were .45 and .65, respectively. For 
large-difference items, these figures were .55 and .61. 
With regard to items selected for small differences 
in variance, the mean apparent frequencies were 

.1.64 for picture pairs and 1.40 for word pairs. 
For the items with large variance differences, the 
corresponding means were 1.49 and 1.41. 

Subjects. The Ss were 80 sixth-grade children 
(mean age = 12.4 yr.) randomly selected from two 
elementary schools located in middle-class neighbor- 
hoods in Ogden, Utah. Twenty Ss were assigned 
(according to a block-randomized procedure) to 
each of the 4 conditions formed by the combina- 
tion of 2 materials (pictures or words) and 2 mea- 
sures (means or variances). Following Walster 
and Cleary's (1970) suggestions for equating 
"statistical" significance with "practical" signifi- 
cance, the number of Ss required was based directly 
on the simultaneous specification of effects con- 
sidered important (differences in excess of 1 within- 
cells standard deviation for the large-difference 
items) and those considered trivial (differences less 
than } of a within-cells standard deviation for the 
small-difference items). 


RESULTS 


Performance on the discrimination task 
is summarized in Table 1. For both mean 
and variance manipulations, the results 
are clear-cut. With large-difference items, 
pictures and words differ substantially: 
means, F (1, 38) = 13.58, p < .001; vari- 
ances, F (1, 38) = 6.48, p < .05; whereas 
with small-difference items, they do not 
(F <1 for both means and variances). 
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TABLE 1 


Mean NUMBER OF CORRECT DISCRIMINATIONS IN 
Two TRIALS ON ITEMS WITH LARGE OR SMALL 
MEAN AND VARIANCE DIFFERENCES 
(Maximum Score = 14) 


| Large difference | Small difference 
Measure 

| "pictures | Words | Pictures | Words 

| m 
Means 10.20 11.70 11.45 
Variances | 10.45 11.80 11.60 


While there was marked improvement over 
trials, in no case did this interact with the 
above picture-word effects (all ps > .05). 


DISCUSSION 


Based on the assumption that discrimina- 
tion learning consists of discriminations of 
subjective frequency differences between items 
¿strand et al, 1966), and given the em- 
cal finding that apparent frequencies as- 
sociated with pictures and words differ 
(Ghatala et al., 1973), we were able to anti- 
cipate the usual picture-over-word superiority 
in discrimination learning (Rowe, 1972; Rowe 
& Paivio, 1971; Wilder & Levin, 1973) when 
apparent frequency differences between pic- 
tures and words were maintained. On the 
other hand, when pictures and words were 
equated in terms of either apparent frequency 
means or variances (as determined from an 
earlier study), no discrimination-learning dif- 
ferences between the two types of material 
were detected. 

When one considers the (almost incredibly) 
straightforward pattern of results, it is tempt- 
ing to pit plausible rival constructs against 
the authors’ preferred account of the effects 
in terms of apparent frequency differences. 
Thus, it might be argued that when apparent 
frequencies were manipulated to create large- 
and small-difference items, other attributes 
were inadvertently manipulated concomi- 
tantly. Normative word frequency (f) as 
indexed by the  Thorndike-Lorge (1944) 
norms, and measures of meaningfulness (m) 
and concreteness (C)—all of which have been 
found to influence the ease of verbal discrimi- 
nation learning—may have been differentially 
represented in the large- and small-difference 
word pairs. However, based on an inspection 
of the present words for which norms were 
available (Paivio, Yuille, & Madigan, 1968), 
it was found that all the words were uniformly 
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high in f, m, and C, with no discrepancies 
between large- and small-difference items 
evident. Of course this finding was to be 
expected given that the present stimuli were 
selected to be picturable (i.e, concrete) and 
familiar to elementary school children (i.e., 
frequent and meaningful). 

While there are potentially a multitude of 
uncontrolled factors which may have con- 
tributed to the observed effects, a reviewer 
of this journal has pointed out one in par- 
ticular which will be considered in detail. 
lf it is assumed that apparent frequencies of 
pictures are correlated with those of their 
verbal labels (the corresponding words), aud 
if it is further assumed that the words cor- 
responding to, say, large-difference pictures 
are not equally divided between large- and 
small-difference items (due to nonrandom a: 
signment of items to conditions), then it is 
possible in the present study that apparent 
frequency has been confounded with the 
specific items appearing in the various condi- 
tions. For example, if the large-difference 
pictures (which tend to produce higher and 
less variable judgments than the small-dif- 
ference pictures) correspond to words with 
high means and low variances, and if more 
of these words happened to be assigned to 
the small-difference than to the large-dif- 
ference condition, then the pattern of results 
in Table 1 may be easily explained in terms 
of item-specific peculiarities; that is, large- 
difference pictures would be expected to be 
best, large-difference words worst, with the 
two other conditions in between. The same 
argument can be made if words corresponding 
to small-difference pictures ended up more 
frequently in the large-difference than in the 
small-difference condition. 

Fortunately, a thorough examination of the 
present materials suggests that this account 
of the results is not satisfactory. In the first 
place, for the items equated on the variance 
measure, the words corresponding to large- 
difference pictures were almost equally divided 
between the large- and small-difference condi- 
tions, as were the words corresponding to 
small-difference pictures (a total of 10 vs. 12 
cases, in the direction opposite to the hy- 
pothesis). Second, although an inspection of 
the items equated on means indicated a tend- 
ency toward unequal division in the direction 
specified by the hypothesis (a total of 15 vs. 
6 cases, p < .10), further scrutiny of the 
Ghatala et al. (1973) data revealed that the 
correlation between picture and word items 
presented once was low (r — .14) and statis- 
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tically nonsignificant, thereby mitigating 
against the reviewer's first assumption, Thus, 
the specific-item hypothesis, with its assump- 
tions of picture-word correlations and selective 
item assignment, cannot account for the pat- 
tern of results observed here: In the case of 
the variances, there was no differential item 
selection; and in the case of the means, the 
picture-word correlation was low. 

While it is not possible to state with cer- 
tainty which of the two measures, means or 
variances, has the more predominant influence 
on picture-word differences in discrimination 
learning (since the two measures covary, and 
this problem was not completely controlled 
for here), speculations may be made on the 
basis of some previous research. In one 
published study (Ghatala & Levin, in press-a) 
and in other unpublished work, it has been 
found that pictures and words differ more 
reliably with respect to variability in apparent 
frequency (pictures being lower) than with 
respect to mean apparent frequency. If this 
result continues to be replicated in further 
controlled experiments, it would indicate that 
pictures are easier to discriminate than words 
because the subjective frequency units 
ciated with them are more stable. 

Even though a frequency theory (Ekstrand 
et al., 1966) explanation of the present effects 
is appealing—in particular, with regard to 
the operationalization of the manner and 
contexts in which frequency units accrue—it 
must be realized that the validity of an 
apparent-frequency construct is not enhanced 
unless one can explain why pictures and words 
differ in apparent frequency. However, such 
an attempt has been made by. Ghatala et al. 
(1973). Essentially, the argument is that 
pictures and words differ substantially in 
terms of S's preexperimental encounters with 
them. Thus, while Ss have likely read the 
experimental words on numerous preexperi- 
mental occasions, they have never encoun- 
tered (preexperimentally) the particular ex- 
perimental line drawings. By adapting 
Weber's law to this type of situation, it is 
readily understandable why the presentation 
of a picture (initially low in preexperimental 
exposure) should constitute a larger—and, 
evidently, more stable—subjective frequency 
unit than the presentation of a word (initially. 
high in preexperimental exposure). 

Based on this rationale, some interesting 
extensions have been made (cf. Ghatala & 
Levin, in pressa, in press-b). However, there 
are additional verbal discrimination and recog- 
nition memory phenomena with which the 
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“background-frequency” explanation must deal 
before it can be considered viable. Whether 
or not it can negotiate them successfully re- 
mains to be seen. 
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A PREVIEW-CONSTRAINT MODEL OF ROTARY ARM CONTROL 


AS AN EXTENSION OF FITTS'S LAW! 


TARALD O. KVALSETH ? 


School of Industrial and Systems Engineering, Georgia Institute of Technology 


A first-order linear model relating movement time to Fitts's movement variable 
and a preview variable was formulated and validated experimentally using 15 
undergraduate students s. Without preview constraint, the rotary arm 
movement task produced a maximum movement information or index of task 
difficulty (ID) rate of 4.7 bits/sec, which, when preview constraints were 
imposed, was reduced to 3.9 bits/sec as compared to a marginal preview ID 
rate of 12.5. The movement variable was found to account for about 70% 
of the total contribution to movement time, The error rates were determined 
to be highest in the no-preview-constraint case and were significantly (p < .05) 


affected by both movement and preview ID. 


The interrelation between movement 
time (Y), amplitude (4), and accuracy or 
target width (W) for terminating the 
movement for serial motor tasks was first 
formulated within an information-theoretic 
context by Fitts (1954).  Arguing by 
analogy from Shannon's channel capacity 
Theorem 17 (Shannon & Weaver, 1949), 
Fitts proposed the simple relationship 


V=at+oXit+e; 


Il 


[1] 
X; = log:(24/W) bits, 


where X; is considered to be a measure 
of the amount of entropy or information 
in each movement, e is the error term, 
and «a and b are constant parameters. 
The validity of this linear model has been 
clearly confirmed by Fitts and other in- 
vestigators (cf. Welford, 1968, Chap. 5) 
through a number of experiments such as 
those involving tasks for which Ss were 
required to move a stylus or pen quickly 
between 2 targets of width W and distance 
apart A. 

Fitts's model was concerned with linear 
arm movements without any constraint on 
the movement path or the preview. 
A linear second-order model for hand 
movements with path constraints was 


Il 


1 The author wishes to thank W. B. Cross for 
assisting with data collection. 

2 Requests for reprints should be sent to Tarald 
O. Kválseth, School of Industrial and Systems 
Engineering, Georgia Institute of Technology, 
Atlanta, Georgia 30332. 


recently formulated by Kválseth (1973); 
however, as far as this author is aware, 
no quantitative model has so far been 
formulated and validated that incorporate 
preview constraints for serial motor mov 
ments involving, specifically, arm rot 
tions (Crossman, 1960, studied the effe t 
of preview on continuous tasks involviny 
pursuit tracking). The present investiga 
tion was designed to study the effect oí 
such variable preview in laboratory 
experiments. 

The basic hypothesis to be tested was 
that the movement time Y is linearly 
related to a new index of task difficulty 
(ID) variable 


M CIN 


where P is the preview (distance) and a, 8, 
and y are unknown constant parameters. 
This model may be placed within an in- 
formation-theoretic context that represents 
essentially a generalized extension of Fitts's 
interpretation of the variable X, (cf. Fitts 
& Posner, 1967, pp. 112-117) without 
proposing that it be considered as a nec- 
essarily true paradigm of human behavior 
relevant to the motor task considered in 
this study. Thus, the quantity X may be 
interpreted as the information generated 
by or the entropy associated with the 
selection of any one movement from an 
ensemble of equiprobable movements; the 
term in the square bracket of Equation 2 
being the number of elements in the en- 
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semble. The unit of X may then be 
considered as being "bits" with the con- 
straint imposed that all values of the dif- 
ferent variables and parameters considered 
are such that X > 0. 

Effectively, the variable P is such that 
0 < P< (A + W/2). For the case of no 
preview constraint, P = (A + W/2) or 
alternatively, it may be considered that 
B = 0, so that the resulting expression for 
X is simply a generalization of X; in 
Equation 1; the identical linear model of 
Equation 1 is still seen to apply. For the 
preview-constraint case, the linear rela- 
tionship between Y and X can, after a 
few simple algebraic manipulations, be 
expressed alternatively in terms of the 
following first-order linear model, 


Y-2adbXi-c cXa + €i 
[3] 


X; = logs(1/P) 

where the ID variable X», which may as- 
sume negative values and did so exclu- 
sively in the present study, is not to be 
interpreted as a direct information or en- 
tropy measure in the information-theory 
sense. The coefficients a, b, and c are 
unknown constant parameters to be esti- 
mated, and e is the experimental error. 


METHOD 


Apparatus and task. The experimental apparatus 
designed for this experiment is Illustrated in Figure 1. 
It consisted of a black box containing a 11-v. battery 
that provided the input voltage to a linear poten- 
tiometer whose output terminals were connected to 
a single-channel strip chart recorder. The po- 
tentiometer was fixed to the inside of the box with 
its rotary wiper or shaft emerging through a hole 
in the center of one side of the box. This shaft 
was attached to a round control knob 21 in. in 
diameter so that any angular displacement of the 
control knob produced a proportional variation in 
the output voltage of the potentiometer, which was 
thus used as a displacement transducer. A 2}-in.- 
long pointer (measured from the center of the con- 
trol knob and the shaft) was mounted onto the knob 
such that it barely cleared a scale attached to the 
box, Thus, a control-display ratio of 1 was used. 
Such an easily replaceable scale, which was placed 
such that the end of the black pointer reached ap- 
proximately halfway across it throughout its length, 
had a marked target area of width W degrees; the 
centers of the target areas were separated by a 
distance A degrees. 
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Ficure 1. A plan view of the apparatus. 


Preview constraint was incorporated into the 
experiment by attaching a black circular cardboard 
plate to the bottom of the control knob. The 
diameter of the plate was slightly larger than that 
of the outer side of the scale measured from the 
center of the control knob (see Figure 1). i 
plate, which was easily removable, had a radial slit 
in it of 2P degrees and was placed such that the 
pointer was in the center of the slit. Several such 
plates and scales were designed to provide different 
experimental conditions involving different values 
of A, W, and P. 

The apparatus was placed on the top of a table, 
and Ss stood in an upright position. 'The experi- 
mental task required S to move the pointer back 
and forth between the 2 target areas of a scale by 
turning the control knob; this movement had 
negligible friction. Because of the preview con- 
straint, S could only see the scale a distance of P 
degrees ahead of the pointer throughout a complete 
movement between targets. For such serial move- 
ments, which were simultaneously recorded on the 
strip chart, Ss were told to minimize their movement 
times, while at the same time trying to maintain low 
error rates. An error was considered as being com- 
mitted whenever a pointer movement terminated 
outside 1 of the 2 targets. 

Subjects. A total of 15 unpaid Ss were used in the 
experiment, all of whom were right-handed male 
undergraduate students. Five of these performed 
the experiment without any preview constraint, 
while the remaining 10 Ss were subjected to preview 
constraints. 

Experimental procedure. The 12 experimental 
conditions used consisted of all combinations of 
angular movement à nplitudes A = 30°, 60°, 90° 
and target widths W = 5°, 10°, 15°, 20°, with pre- 
view P = W. The order of these conditions was 
randomized and different for each S. Prior to each 
run, the pointer was lined up with each of the 4 
boundary lines of the target pair, and the strip chart 
was run a short distance for each of these 4 pointer 
settings so as to identify the equivalent target 
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boundaries on the strip chart. Then, with the 
strip-chart recorder turned off, S was given a 10-sec. 
practice session. Following a practice session and 
an approximate 5-sec. rest period, S performed the 
experiment for 15 sec. while the strip-chart recorder 
was turned on. From this chart, E was able to 
obtain the total number of moves made by S and 
hence the mean movement time during the sec. 
interval. The chart also provided information about 
the error rate by simply observing the fraction of 
moves by the recorder pen that fell short or far of 
the target boundaries. Movements that resulted in 
errors were included in the determination of the 
mean movement time. 


RESULTS AND DISCUSSION 


No-preview-constraint case. On the basis 
of least squares regression, the relationship 
between rotary arm movement time and 
movement ID was estimated as 


P = —21.02 + 21.31 X1, [4] 


where the unit of Ê is 10-? sec. The 
parameter estimates are based on average 
movement time over each 15-sec. period 
and then over Ss. The coefficient of 
multiple determination for this relation- 
ship was determined to be A? — .96, in- 
dicating an excellent fit for this regression 
or a highly significant (p < .0005) cor- 
relation between Y and Xi of r = .98 
(summary data are given in Table 1). 
On the basis of these data, the maximum 
information rate for this type of movement 
task is seen to be constant for different 
X; values at approximately 4.7 bits/sec; 
1.58 € X, < 5.17 in this experiment. This 
information rate is slightly lower than that 
for linear serial hand movements (deter- 
mined by Kválseth, 1973, to be 5.5 bits/ 
sec) and .4 times that for arm movements 
in reciprocal tapping tasks (cf. Fitts, 
1954). 

Preview-constraint case. Least squares 
estimation for Equation 3 based on aver- 
age movement time in 10-2 sec. (averaged 
for each S during 15-sec. period and then 
averaged across the 10 Ss) resulted in the 
following relationship, 


Y = 8.97 + 26.06 X; + 8.03 X». [5] 
This first-order model gives an excellent 
fit to the experimental data with R? = ,98, 
The regressor X; is by far the most "sig- 
nificant” of the 2 regressors in terms of its 
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contribution to (the calculated value 
Specifically, on the basis of the | 


determination coefficients, which 
found to be .93 and .41 for X; and 
respectively, X; accounts for appr 


mately 70% of the total contribution to 
by the 2 regressors. However, as is se 
from the data in Table 1, the addition 
effect of X» is significant at the .05 | 

From Equation 5, it seems plausib! :o 
test the statistical hypothesis that c = 7/3. 
The appropriate ¢ statistic for this test 
was found to take on the value .20 with 
9 degrees of freedom so that this hy- 
pothesis is clearly accepted at the .05 level. 
Thus, Equation 3 may be replaced by 
the model 


Y = Bo + Xs + 6; 


[6] 
X; = log.(24/WP), 


where 8) and 8; are unknown constan: 
parameters and e denotes the error term. 
The ID variable X; is then the equivaler: í 
of Xı when preview constraint is imposed 
and, although it is seen that X; may pos- 
sibly assume negative values, only positive 
X; values were used in the present study. 

On the basis of a least squares regression 
of Y on X;, employing the same values of 
the variables Y, 4, W, and P as those 
used in arriving at Equation 5, the esti- 
mates for 8) and £, were determined to be 


Bo = 12.00, ĝi = 25.72, 


providing an R? of .97; the analysis of 
variance data corresponding to the test of 
the hypothesis that 6; = 0 are given in 
Table 1. The resulting rate dX,/dP = 3.9 
=~ 0X,/dY is seen to be 17% lower than 
the maximum information rate for the no- 
preview-constraint case. Furthermore, the 
marginal ID rate 0X,/90Y = 12.5 is ob- 
served to be comparable to the maximum 
ID rate obtained by Fitts (1954) for linear 
arm movements. 

Error rates. The percentage of move- 
ments that missed the indicated targets 
tended to be consistently higher in the 
no-preview-constraint case than when pre- 
view constraints were imposed, although 
the instructions to Ss were the same in 
the 2 cases. In the former case, the 
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average error rate (average across Ss for 
each of the 12 experimental conditions) 
ranged approximately 505-1595 and was 
highest for large X; values without ap- 
pearing to be entirely systematically related 
to Xj. The average of this error rate 
across the X; values used was found to 
be about 1095, which was twice that of 
the preview-constraint case. In general, 
the errors appeared to be about equally 
distributed between overshoot and under- 
shoot errors. For the preview-constraint 
case, it was established that task difficulty, 
as reflected by values of Xi and X», had 
a significant effect on the error rate. 
The average error rates for each S were 
paired according to low index of difficulty 
level (1.58 € X; < 3.58 and —4.32 < Xs 
< —3.32) or, equivalently (14X Xs < 
2.48) and high difficulty level (3.58 € Xi 
«X 517 and —3.32<5 XS —2.32) or, 
equivalently (2.48 < Xs< 4.40) and, on 
the basis of the Wilcoxon signed-ranks test, 
the hypothesis of no difference between 
these levels was rejected at the .05 level. 


CONCLUSION 


This study has generalized Fitts's law for 
simple linear and serial arm movements to 
incorporate temporal motor responses involv- 
ing rotary arm movements with preview con- 
straints, It was demonstrated that the im- 
position of preview constraints on such 
peripheral motor tasks had a significant effect 
on movement time, error rates, and maximum 
information rate. The simple first-order linear 
models relating movement time to movement 
variable X;, the preview variable X», and the 
composite variable X; were clearly appropriate 
formulations providing excellent fits to the 
experimental data generated. Of the 2 vari- 
ables, X1 and X», the latter was found to be 
the least "important" one in explaining the 
variation in movement time; its contribution 
to movement time was determined to be less 
than half that of Xi. Further observations 
resulting from imposing preview constraints 
were that the average error rate was reduced 
by some 50% while the maximum information 
rate was 17% below that of the no-preview- 
constraint case. 

No attempt was made in the present study 
to determine if practice had any influence on 
the degree to which the preview constraint 
effected movement time. It is conceivable 
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TABLE 1 
ANALYSIS OF VARIANCE 
Source df T: 
No-preview-constraint case 
(Equation 4) 
Xi 1| 237.43** 
Residual (M.S) 10 | (22.78) 
Total 11 
Preview-constraint case 
(Equation 5) 
Xi 1| 243.06** 
Addition of Xs 1 6.21* 
Xi X: 2 
Residual (MS) 9 | (30.24) 
"Total 11 
Preview-constraint case 
(Equation 6) 
Xs 416.82** 
Residual (M.S) 10 | (27.27) 
Total 1l 
*p «.05. 
p c 001. 


that there is an interaction between practice 
and preview so that the preview constraint has 
a greater effect on movement time early during 
the task performance than it has later when 5 
may not have to rely as much on preview in- 
formation. This point ought to be worth 
pursuing in another study. 

The effect of preview constraints on the task 
parameters as determined in this study applies 
to simple, serial, and 1-dimensional rotary arm 
movements and may be quite different for 
other types of motor tasks. A further investi- 
gation is planned with objectiv similar to the 
present study but involving linear movement 
tasks with and without different types of move- 
ment path constraints. 
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Changes in finger volume (FV) and finger pulse volume (PV) were measured by 
two pneumoplethysmographic methods, Eleven identical randomly spaced 
auditory stimuli were given. Five different habituation estimates were calcu- 
lated; H-score, parametric beta, parametric correlation, Kendall's tau, and 
nonparametric beta. Both FV and PV responses habituated, PV responses 
faster than FV. The correlations between FV and PV responses were signifi- 
cant and moderately high. Some surprisingly low correlations were obtained 
among the different habituation estimates, indicating that the choice of 
habituation estimate is of great importance, and might influence the result 


profoundly, —Differen 


s in response latency found between FV and PV 


responses were discussed in terms of measurement technique. 


Much psychophysiological research in 
recent years has been directed toward ex- 
ploring empirical and theoretical aspects 
of habituation of orienting responses in 
different autonomic systems (Franken- 
haeuser, Fróberg, Hagdahl, Rissler, Bjórk- 
vall, & Wolff, 1967; Groves & Thompson, 
1970; Lader & Wing, 1966; McDonald, 
Johnson, & Hord, 1964). Speed of habitua- 
tion of electrodermal responses has been 
found to be a highly stable individual 
characteristic (Crider & Lunn, 1971) and 
there is evidence for at least moderate 
genetic determination (Lader & Wing, 
1966). 

Habituation is in general rapid for skin 
conductance (SC) responses. In regard to 
habituation of the digital vasoconstrictive 
response the evidence is equivocal. Davis, 
Buchwald, and Frankmann (1955) found 
significant habituation for pulse volume 
(PV) but not for finger volume (FV) re- 
sponses. This dissociation was inter- 
preted to indicate that PV and FV changes 
are dependent on different physiological 
systems. The difference in response latency 
found between the two measures was re- 
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garded by these authors as further evi- 
dence for this hypothesis. Their conclu- 
sions have been cited in subsequent review, 
e.g, Martin (1961). Burch (1961) also 
found habituation for PV responses. Unger 
(1964) found that 12 of 20 subjects reached 
à habituation criterion of three successive 
PV nonresponses to successive number of 
stimuli up to 36 stimulus presentations. 
Using the same habituation criterion Koe- 
pke and Pribram (1967) found that about 
half of the subjects habituated in PV, also 
with a variable number of tone stimuli up 
to 40. However, in a study by Lader 
(1965) there was no significant difference 
between the first four and the last four 
PV responses to repeated tone stimuli. 
Hare (1968) found indications of great 
intraindividual variability and resistance 
to habituation for PV responses in criminal 
subjects. Further, he noted that it was 
not unusual for several nonresponses to be 
followed by a;number of responses of ap- 
preciable size. 

The purpose of the present study was 
to investigate simultaneously the habitua- 
tion of FV and PV responses to repeated 
auditory stimuli by a direct pneumo- 
plethysmographic technique. Since there 
is a tendency to dishabituation in the 
vasoconstrictive orienting response, the use 
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of the simple habituation criterion of three 
successive nonresponses is questionable. 
Therefore different methods of estimating 
habituation were applied. 


METHOD 
Procedure 


Subjects. Nineteen conscripts, 19-21 yr. old, 
served as Ss. Participation was voluntary and Ss 
were paid, The Ss were in good physical and mental 
health. 

Auditory stimuli. Eleven sine-wave tones of 
1,000 Hz., approximately 100-db. intensity, and 
t-sec. duration were administered through a loud- 
speaker. This was placed 1 m. to the right of the 
subject. The intervals between the tones were 
randomized and varied between 35 and 60 sec. 

Measurement of finger volume, A pressure re- 
cording system (Elema EMT 32) with a pressure 
transducer of capacitor microphone type and ampli- 
fier with a stepwise variable time constant was used. 
The transducer was connected toa plastic oncometer 
cuff via a short rigid plastic tube. The cuff was 
mounted airtight on the left long finger of the sub- 
ject. To eliminate the fast pulse volume deflections, 
the recordings were made with a high time constant 
(3 db, down at .3 Hz.). 

Measurement of pulse volume. A pressure trans- 
ducer of piezoelectric type (Elema EMT 510, see 
Lund, 1964) was connected to a plastic oncometer 
cuff via a short rigid plastic tube. The cuff was 
mounted airtight on the left index finger of the sub- 
ject. Time constant for the outfit was 2 sec. 
(transducer connected directly to the galvanometer 
amplifier of the Mingograph). 

Recording instrument. An inkwriter of the Elm- 
qvist fast-response type (Mingograph 81) was used. 
As a control variable the left-hand long-finger tem- 
perature was continuously monitored by a ther- 
mistor device. 

Procedural detail. The experiment was performed 
in September, a. period with stable weather condi- 
tions. The barometric pressure varied between 
747- and 763-mm. Hg. The Ss came to the labora- 
tory in the afternoon, after their normal duties. 
They were placed in an examination chair in a quiet 
room, in which the temperature varied between 
21.5? and 23.5? C. The finger temperature of the 
subjects varied between 21.5? and 34.8? C. at the 
beginning of the stimulation period. A constant, 
faint, indirect light was used. The recording ap- 
paratus was kept in an adjoining room, from which 
the experimenter could observe the subject through 
a small window. 

The left arm and hand of the subject was placed 
on a soft support so that the limb was in a position 
of rest, and so that the fingers were at the level of the 
sternal angle. 

The procedure was explained to the subject, and 
several deep inspirations were practiced. Then the 
recordings were started. Thereafter all instructions 
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to the subject were given through a tape recorder 
via a loudspeaker. 

The experiment started with 10 min. of rest. 
Then the 11 tone stimuli were administered. 


Treatment of Data 


Of the measures obtained, only data pertaining 
to habituation of responses are reported in this 
paper (cf. Lidberg, Schalling, & Levander, 1972). 

Finger volume, Any observable decrease of the 
finger volume occurring between 1.6 and 10 sec. 
from the onset of the stimulus was considered as a 
response. Responses within 1.6 sec. were dis- 
regarded. (There were eight such responses, 506 
of the total number of responses.) 

A response was quantified as the difference, in 
mm. recording paper, between the maximal deflec- 
tion during 10 sec. after the stimulus and the mean 
level during 10 sec. before the stimulus. This value 
was then corrected for transducer sensitivity, 
electrical amplification, and the volume of the 
finger tip inserted into the cuff, 

Pulse volume. Any observable decrease of the 
pulse volume occurring between 1.6 and 10 sec. 
from the onset of the stimulus was considered as à 
response, Responses within 1.6 sec. were disregarded. 
(There were six such responses, 4% of the total 
number of respon: 

A response was quantified as the difference, in 
mm. recording paper, between the mean of the three 
smallest consecutive deflections during 10 sec. be- 
fore the stimulus, and the mean of the three smallest 
deflections during 10 sec. after the stimulus. The 
difference was expressed in percent of the mean of 
the three smallest deflections during 10 sec. before 
the stimulus. 

Response latency. As the time constant of the 
two recordings differed, no meaningful comparisons 
between response latencies of FV and PV were 
possible. 

Habituation. Five estimates of habituation were 
calculated. Parametric correlations and regressions 
were calculated for response size on the logarithm 
of stimulus numbers. The third of a series of 3 
successive nonresponses were regarded as the end 
of a response sequence. Most subjects, however, 
did not reach that criterion, and thus their response 
sequence consists of 11 responses. As there was a 
significant positive correlation between slope and 
intercept (estimated first response) for the para- 
metric regression equations, a habituation estimate 
(H score) according to the procedure described by 
Montagu (1963) was calculated, in which correc- 
tion for differences in estimated first response is 
applied. 

As the use of parametric statistics requires normal 
distributions, homoscedasticity, and linearity of 
regression equations, and as these requirements are 
not strictly satisfied in the present data, as in most 
other studies, two nonparametric methods were 
also applied. Kendall’s tau is a nonparametric 
correlation. coefficient, which is relatively insensi- 
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tive to as compared with the Spearman rank 
correlation coefficient. However, a correlation 
coefficient estimates not only the slope of an as- 
sumed regression curve, but also deviations of 
responses from the curve. Sen (1968) has described 
à nonparametric regression statistic, nonparametric 
beta, based on the theoretical work of Kendall. 
Calculation of individual confidence limits is possible, 

Thus five main habituation estimates were calcu- 
lated on the present data, three parametric esti- 
mates based on the plot of response 
stimulus number— product-moment correlation (r), 
linear regression (beta and H score)— and two non. 
parametric estimates—Kendall's tau and nonpara 
metric beta. Further, some other simple estimates 
of habituation were calculated to allow for a direct 
comparison with the findings in some earlier studies 
(t test between means of first four and last four 
responses, the Wilcoxon test, and number of responses 
to the criterion of three successive nonresponses). 


RESULTS 


Mean responses to tones in FV and PV, 
expressed as percent of the first response 
of each subject are shown in Figure 1. 
The trend appears to be linear and de- 
creasing for both measures, though it ap- 


3 


Log stimulus number 
FIGURE 1. 


Mean responses to tones in finger volume 
expressed as percent of the first response of. ea: 
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pears more pronounced for PV 1 for 
FV. The slopes of the regression li: ere 
—.40 for FV and —.55 for PV. 

The correlations between mean res) ses 
in FV and PV were .62 over subjects and 
44 over stimuli. For individual sub cts 


the correlations varied between —.22 .nd 
98, 25% of the subjects h fing corr la- 
tions above .90. Two subjects who | ad 
many spontaneous fluctuations obtai: od 
very low correlations, —.22 and .11. 

The distributions of H Score, parametric 
beta, tau, and nonparametric beta justified 
the use of the ¢ test to estimate if there was 
à significant habituation trend for the 
group in FV and PV. The / values for 
mean habituation are seen in Table 1. 
For PV, all £ values were highly significant, 
whereas for FV two estimates (tau and H 
Score) were significant, one was almost 
significant (parametric beta), and one was 
nonsignificant (nonparametric beta). Fox 
FV, no subject reached the habituation 
criterion of three successive nonresponses. 


OO Finger volume 


aao Puhe voiyme 


and pulse volume, 
ch subject. 
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TABLE 1 


t VALUES FOR M 


s or ESTIMATES OF HABITUATION FOR FINC 


R VOLUME AND 


Purse VoLume (N = 19) 
Finger volume | Pulse volume 
Measure = 
M | | sD t 
H score —1.41 24.38 
Parametric beta —1.41 35.26 
tau 1.192 .309 
Nonparametric beta — 420 | 5.47 
S ~ — 4 — = =, — — = 
*p «.10. 
*»* p <05. 
sek p < 01. 


HED < .001. 


There was a significant difference between 
the mean of Responses 2-5 and 8-11 
(t = 3.43), and also significant habituation 
estimated by the Wilcoxon test (T = 28) 
for FV. 

Of the five habituation estimates used 
only tau can be used for direct compari- 
sons between FV and PV in regard to 
difference in habituation rate. A / test 
for correlated means showed that PV re- 
sponses habituated significantly faster than 
FV responses, / = 18.1, p € .001. 

Correlations among the five habituation 
estimates, first response obtained, and 
estimated first response (from the H-score 
procedure) are shown in Table 2 (FV) and 
Table 3 (PV). The correlations ranged 
from .66-.95 for FV and from .49-.92 for 
PV. 

In Table 4 intercorrelations between FV 
and PV measures including estimates of 
habituation and obtained first response are 
given. The correlations ranged from .38- 
.73, with the correlations obtained between 


'TABLE 2 
-MOMENT CORRELATIONS AMONG ESTI- 


identical habituation estimates being gen- 
erally slightly higher. 


DISCUSSION 


Mean habituation was highly significant for 
PV for all habituation estimates. For FV the 
result of the statistical analysis was inconclu- 
sive. As judged from the statistically most 
appropriate habituation estimate, nonpara- 
metric beta, FV responses did not habituate in 
tudy. However, s gnificant habituation 
for FV was obtained for all the other habitua- 
tion estimates used except for the simple criter- 
ion of three successive nonresponses. In line 
with the findings of Davis et al. (1955) there 
was in the present data a significantly faster 
habituation for PV than for FV. However, 
in contrast to Davis et al., FV responses 
habituate as judged from most habituation 
estimates including that used by Davis et al. 
(Wilcoxon test). This discrepancy might be 
due to differences in the recording system. 
The transducer system in the present study 
ensures optimal reproduction of the human 
pulse wave (Cronvich & Burch, 1969). The 


TABLE 3 
Propuct-MoMENT CORRELATIONS AMONG ESTI- 


MATES OF HABITUATION AND MEASURES OF 


MATES OF HABITUATION AND MEASURES OF 
First RESPONSE FOR FINGER VOLUME First RESPO; FOR PULSE VOLUME 
(N = 19) (N = 19) 
Measure ZHBHESE Measure ZEBBESS 
A yE 3 | 
1. H score 47 82 . H score 69 55 49 J4 —.05 0 
2. Parametric beta -80 Parametric beta 86 90 .75 —.03 —.72 
3. Product-moment 7 Product-moment r 92.70) —.59 —.07 
4. Kendall's tau Kendall's tau. i88 —60 —.77 
5. Nonparametric beta onparametric beta —47 —.58 
6. First response response ES 
7. Estimated first re- mated first re- = 
onse (from H- 


(from H- 


sponse 
procedure) 


score 


score procedure) 


TABLE 4 


INTERCORRELATIONS AMONG ESTIMATES OF HABI- 
TUATION FOR FINGER VOLUME AND PULSE 
VoLUME 


Finger volume 
Pulse volume TS hi ae A 


1 2 | 3 4 
1, H-score .66| .38| AL 
2. Kendall's tau | 4439]. .73| .60 
3. Nonparametric beta. .65 .59 .60 
4. First response —.16 | —.36 | —.30 


disadvantages of the pneumoplethysmographic 
technique, e.g., sensitivity for movements of 
the hand and for leakage in the pneumatic 
system, can easily be overcome as such arti- 
facts are readily identified in the records 
(Lader, 1967). Impedance plethysmography 
as used by Davis et al. is an indirect method. 

Lader (1965) did not find significant habitua- 
tion for PV using / test between mean size of 
first four and last four responses as habitua- 
tion estimate. This finding might in part be 
explained as an effect of the type of response 
measure used by this author, A PV response 
was defined as the difference between the 
smallest pulse volume deflections occurring 
within 10 sec. of the onset of the stimulus and 
the average of six consecutive beats immedi- 
ately before the stimulus. In that way, 
statisically, some of the responses obtained 
may be artifacts, 

Increased drowsiness as a monotonous ex- 
perimental situation proceeds may result in 
dishabituation and an uneven course of the 
curves. In the study by Lader (1965), which 
included a rest and a tone stimulation period, 
the pulse volume responses diminished at 
first, but they tended to increase again; this 
increase was accelerated by administration of 
barbiturates, Habituated and small cardio- 
vascular responses tend to reappear and be- 


come large during drowsiness and sleep 
(McDonald et al., 1964; Pampiglione & 
Ackner, 1958). In the Present study, the 


subjects were tested in the late afternoon, 


after a day's work, which might have made 
them prone to drowsiness, It is noteworthy 
that despite this the estimates of mean habi- 
tuation in PV used in the present study were 
highly significant. 

Davis et al. (1955) found that response 
latency was greater for FV than for PV. 
Operationally the two variables are defined in 
terms of different frequency Spectrum, i.e., 
bandwidth, regardless of the fact that the time 
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derivate (v) of very large FV response: ht 
approach the corresponding value for ord vy 
PV deflections. Electronically the two a- 
sures are separated by means of low pass | V) 


and high pass (PV) filters. Such filters i: v- 
duce delays of transient signals, different (or 
different frequency bands. Thus a recor «d 
difference in response latency between 
and PV, is, at least in part, an artifact, | 
duced in the electronic system. For 
large responses and the high frequency |i 
used in the present study this delay was 2 s 
Thus, although a highly significant differen: 
in response latency was found between F\ 
and PV, FV response latency being greater 
this finding cannot be interpreted in phys- 
iological terms. Davis et al. (1955), who 
found a similar difference between FV and 
PV, have interpreted their findings as a phy- 
siological difference. 

Physiologically it is reasonable to regard 
FV as reflecting the tone in the venules and 
small veins (capacitance vessels) and PV as 
reflecting the tone in the arterioles and pre- 
capillaries (resistance vessels), One might 
expect a dependency between these two mea 
sures both on a blood flow dynamics and a 
neural basis. In the finger skin vessels, both 
for capacitance and resistance vi s, there 
are exclusively vasoconstrictive sympathetic 
fibers with noradrenaline as the transmitter 
(Shepherd, 1963). 

The correlation between FV and PV re- 
Sponses was significant but moderately high. 
There were great interindividual differences 
with regard to intraindividual correlations, 
ranging from — .22-.98. The interpretation 
of these differences is uncertain, The strictly 
defined mechanical method of evaluation of 
responses used, which did not allow for adjust- 
ments due to, e.g., spontaneous fluctuations 
may be a source of error variance. However, 
à more flexible evaluation of the responses 
would probably have diminished the relia- 
bility of the measures, 

Within each system the correlations among 
the five different habituation estimates are 
sometimes surprisingly low, indicating that 
the choice of habituation estimate profoundly 
influences the conclusion as to whether habitua- 
tion has or has not occurred in the system. 

Three of the habituation estimates used in 
the present study are of special interest: H 
score, because this measure has been widely 
accepted and used as a habituation estimate, 
and tau and nonparametric beta, because 
these two measures have certain statistical 
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advantages. Intercorrelations between FV 
and PV for these habituation estimates are 
somewhat lower than the corresponding inter- 
correlations within PV and FV. There is thus 
a certain independence, intraindividually, be- 
tween the FV and PV response sequences 
with regard to habituation, This might be 
interpreted partly as an effect of the measure- 
ment procedure and partly as a physiological 
difference, the relative importance of which is 
uncertain. 

It might be concluded that both FV and 
PV responses habituate, that PV responses 
habituate faster than FV responses, and that 
there is a certain independence between the 
two physiological variables, in regard to 
both the individual response sequence and 
habituation. 
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MEMORIAL STRATEGY AND IMAGERY: 


AN INTERACTION BETWEEN INSTRUCTIONS AND RATED IMAGERY? 


DONALD ROBBINS, JAMES F. BRAY, JAMES R. IRVIN, Asp PHILLIP S, WISE 


Emory University 


Two studies are reported in which memory instructions (interactive imagery 
separation imagery, and rote repetition) and rated imagery values of the 
stimulus and response terms (high and low) were manipulated in a paired 


associate task. The results indicated that interactiv 


imagery instruction 


led to superior performance in contrast to separation-im gery or rote-repetition 
instruction only with high-imagery cues. This interaction between memoria! 
strategy and rated imagery is a confirmation of the “elusive” interaction 
expected by Paivio's conceptual-peg hypothesis. 


Mental imagery has been shown to lead 
to very high levels of recall performance in 
paired-associate memory tasks (e.g., Paivio, 
1969, 1971). In these studies, when rated 
imagery (Paivio, Yuille, & Madigan, 1968) 
was varied, Paivio (1969) found imagery 
effects for both the stimulus and response 
members of a noun pair, although the 
stimulus effect is apparently larger. In 
particular, Paivio (1969) and Humphreys 
and Yuille (1973), among others, have 
constructed noun-noun pairs representing 
a factorial combination of high and low 
imagery values for each member. They 
have found the following ordering of paired- 
associate performance for the 4 pair types: 
high-high (HH) > high-low (HL) > low- 
high (LH) > low-low (LL). 

Studies have also been conducted where 
Ss have been given specific instructions on 
how to memorize, using high-imagery noun— 
noun pairs. Substantially better recall 
performance has been found for Ss given 
interactive-imagery instructions in con- 
trast to those given rote-repetition (Bower 
& Winzenz, 1970; Schnorr & Atkinson, 
1969; see also Paivio, 1971, for an excellent 
review) or separation-imagery instructions 
(Bower, 1970). 

"Thus, we have 2 classes of studies: those 
varying rated imagery of the stimulus and 
response members of noun-noun pairs and 
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those manipulating the instructional 
(or memorial strategy) using pairs of nouns 
high in rated imagery. In order to deter- 
mine if the same processes are operating 
in both types of studies, there is a need to 
study the possible nteraction between the 
two. Asa result, imagery effects may be 
better understood if we manipulate memo- 
rial strategy instructions as well as rated 
imagery values. Paivio (1971) studied the 
interaction. between rated imagery and 
imaginal and verbal mediators. He 
ported mixed success in finding a statisti- 
cally significant interaction between in- 
structional set (imaginal, verbal mediation, 
and rote repetition) and imagery values 
(noun pairs of high or low rated imagery) 


set 


ir 


EXPERIMENT I 


In Experiment I, 3 groups of Ss were 
given either interactive-imagery, separa- 
tion-imagery, or rote-repetition instruc- 
tions in memorizing a list of paired as- 
sociates. The list contained pairs in which 
the stimulus and response members were 
a factorial combination of high and low 
imagery values. 


Method 


Subjects. Sixty Ss, obtained from the introductory 
psychology S pool, partially fulfilled a course re- 
quirement by participating in a 45-60-min. session. 

Material and design. Each S was given a list of 
20 noun-noun pairs using a paired-associate study— 
test procedure, Four types of pairs were used, in 
which the stimulus and response members of a pair 
were of HH, HL, LH, or LL imagery values, re- 
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spectively, based on the Paivio et al. (1968) norms. 
Five pairs of each type were used, yielding a 20- 
pair list. The mean values on the imagery scale 
were 6.72 and 2.16 for high- and low-imagery nouns, 
respectively. It should be noted that for the ma- 
terial used in this study the correlation between 
rated imagery and rated concreteness was .97, and 
that between rated imagery and rated meaningful- 
ness was .90. 

The Ss were randomly assigned to 1 of 3 groups 
and were instructed to memorize the noun pairs by 
either overt rote repetition of the noun pairs (RR), 
constructing an interactive visual image (ID, or 
constructing an image of the objects denoted by 
the nouns in a noninteracting and separated “visual 
ce" (SI), after Bower (1970). There were 20 
s in each group. 

Procedure. Each S was tested individually. A 
paired. "iate study-test procedure was used. On 
each study trial, 5 was shown a sequence of 20 noun 
pairs at a 6-sec. rate with a Kodak Carousel projec- 
tor. On these study trials, S was to memorize the 
noun pairs in accordance with the instructions given 
at the beginning of the study. On study trials, the 
nouns were adjacent to each other, separated by a 
dash. A study trial was then followed by a test 
trial, on which the stimulus member of each pair 
was presented at a 6-s rate on the left-hand side 
of the slide, followed by a dash. 'The responses 
were made vocally by S. No formation given 
to S on the test trials. Study and test trials 
alternated until S met a criterion of 3 consecutive 
successful recalls of the entire list. For each S on 
each trial, the list was presented in a different 
random order on both study and test trials with the 
restriction that 3-5 items intervene between a 
study trial of a particular pair and a test of that 
pair (and vice versa). After reaching criterion, S 

vas engaged for the next minute; generally, Ss were 
asked how they memorized the noun pairs. They 
were then told to recall all of the words in the study 
in any order. 


Results 


Since much of the research investigating 
instruction or strategy effects are single- 
trial studies, we will report first-test-trial 
data primarily. The mean proportions of 
correct responses on the first test trial for 
all 3 groups for the 4 pair types are shown 
in Figure 1. For the HH and HL pairs, 
Group IL showed higher levels of recall 
than either Group RR or SI, with little 
differences between the 3 groups for LH 
and LL pairs. In addition, recall perform- 
ance for Group II was ordered HH » HL 
> LH > LL, while that for Groups SI and 
RR was ordered HH > HL = LH > I 
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These observations were confirmed by 
an analysis of variance (as shown in Table 
1) that revealed significant effects due to 
stimulus and response imagery and the 
Group X Stimulus Imagery interaction. 
This overall analysis was partitioned into 
2 orthogonal components—one comparing 
Groups SI and RR, and the other compar- 
ing Group II with Groups SI and RR 
combined. The analysis comparing Groups 
SI and RR revealed significant effects of 
stimulus imagery, F (1, 38) = 27.30, p < 
.05, and response imagery, F (1, 38) = 
13.82, p < .05, but not of group, F (1, 38) 
= 1.65, p > .05, or any other comparison. 
The analysis comparing Group II to 
Groups SI and RR combined revealed that 
the group effect was not significant, F 
(1, 58) = 2.93, .05 < p < .10, although the 
3 effects found for the overall analysis com- 
paring the 3 groups including the interac- 
tion were. The analysis of the mean total 
number of errors per subject item to cri- 
terion performance was similar to that of 
the first test trial. Since the statistical 
analyses of the mean total errors resulted 
in the same statistical conclusions as those 
for the first test trial, as shown in Table 1, 
these data are not presented, in order to 
avoid redundancy. In addition, high levels 
of recall of the pairs were found for all con- 
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Ficure 1. Mean proportion of correct recalls 
as a function of pair type (H = high imagery, L= 
low imagery) for the groups given interactive- 
imagery (II), separation-imagery (SI), or rote- 
repetition (RR) instructions in Experiment I. 
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TABLE 1 
ANALYSIS OF VARIANCE SUMMARY 
: j M no. recalled on first E 
"^ test trial M total 
Source aer REESE | es x 
af MS F | MS | 
pd e 2| 763| 2.34 56) «1 
Ss within groups 57 3.26 KiS | 
thins, E 
A DTE imagery 1 | 60.00 | 86.96** | 21 96 | 85.1 a 
Groups X Stimulus Imagery í 2 4.59 6.65** 385 3.27 
Stimulus Imagery X Ss Within Groups. 57 .69 zs 26 ie. 
Response imagery 1 | 32.27 | 31.95** | 10,00 | 75.93* 
Group X Response Imagery 2 .70 «1 .09 «1 
Response Imagery X Ss Within Groups 57 1.01 ETÀ. 
Stimulus Imagery X Response Imagery 1 2.82 2.78 10 «1 
Groups X Stimulus Imagery X Response Imagry PR 2 93 <1 06 «1 
Stimulus Imagery X Response Imagery X Ss Within Groups ay 1.01 A3 
Total 2: 


*p «.05. 
**$ €.01. 


ditions on the free-recall task, with little 
difference between groups. 


Discussion 


The major results of this study revealed 
that Ss given interactive-imagery instructions 
showed superior recall performance on the first 
test trial only for HH and HL pair types in 
contrast to Ss given either separation-imagery 
or rote-repetition instructions. These results 
confirm previous findings where only HH pairs 
have been used (Bower, 1970; Bower & 
Winzenz, 1970; Schnorr & Atkinson, 1969). 
The results of the present study also indicate 
that the superiority resulting from interactive- 
imagery instructions depends on the rated 
imagery value of the stimulus term. Specifi- 
cally, Group II was superior in recall perform- 
ance only for pairs with high-imagery stimuli. 
No difference due to instructions was found 
for pairs with low-imagery stimuli. Further- 
more, the recall performance on the first test 
trial for Ss given interactive-imagery instruc- 
tions was ordered HH > HL > BEAD: 
The results also revealed no difference between 
Groups SI and RR. In addition, in contrast 
to Group II, the recall performance on the 
first test trial for Groups SI and RR was 
ordered HH > -HL—— LHeS- Tp. Since a 
number of investigators have reported HL > 
LH when no specific instructions are given 
(e.g., see Paivio, 1969, 1971) we attempted to 
replicate the interaction found between in- 
structions and rated imagery. 


EXPERIMENT IT 


In an attempt to replicate the major 
results of Experiment I, 2 groups of Ss 
were given either interactive- or separation- 
imagery instructions in a paired-associate 
study-test procedure. Half of the noun- 
noun pairs were HL and the other half LH 
in rated imagery values. 


Method. 


Subjects. Forty Ss, obtained from the introduc- 
tory psychology S pool, partially fulfilled a course 
requirement by participating in a 15-20-min. session. 
None of these Ss had participated in Experiment I. 

Procedure. The Ss were tested in small groups 
of up to 5 people. These small groups were ran- 
domly assigned to 1 of 2 groups (n = 20) and were 
instructed to memorize the noun pairs by either 
constructing an interactive visual image (II) or by 
using separation imagery (SI), as in Experiment I. 
A 46-pair list was used. On each slide, one word 
was at the top and the other at the bottom. Half of 
the noun-noun pairs were HL, while the other half 
were LH. On the study trial, the pairs were pre- 
sented at a 6-sec. rate with a Kodak Carousel 
projector. The first 3 and last 3 pairs represented 
a primacy and a recency buffer, respectively. These 
Pairs were not tested. The test trials were pre- 
sented at a 6-sec. rate. For each pair type, half of 
the cue words in the test trial were high in rated 
imagery, and the remaining half were low in rated 
imagery. 

As a result, the HL and LH conditions may be 
defined in terms of the cue word used in a test trial. 
Since no effect was found due to location of the 
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cue word (top or bottom), no further mention of 
this variable will be made. Each S recorded his 
responses to the cue words on a separate sheet. In 
addition, each small group was given a different 
random order of the list in both the study and test 
phases, and the particular cue used during testing 
was balanced across Ss. 


Results and Discussion 


For Ss given interactive-imagery instruc- 
tions, high-imagery cues led to .64 propor- 
tion correct recalls in contrast to .45 pro- 
portion for low-imagery cues. For Ss given 
separation-imagery instructions, these pro- 
portions were .33 and .29 for high- and 
low-imagery cues, respectively. These re- 
sults were subjected to an analysis of 
variance which revealed significant effects 
of instruction, F (1, 38) = 16.30, p < .05, 
rated imagery, F (1, 38) 18.44, p < .05, 
and a significant interaction between these 
effects, F (1, 38) = 8.46, p < .05. Thus, 
interactive-imagery instructions led to a 
large difference in recall performance for 
high- in contrast to low-imagery cues, while 
separation-imagery instructions did not. 
Thus, we replicated the interaction be- 
tween instructions and rated imagery found 
in Experiment I. However, in Experiment 
II, Group II was superior to Group SI for 
both HL and LH cues, although for Group 
Il, performance was ordered HL > LH, 
while for Group SI, HL = LH. 


CONCLUSION 


The finding of an interaction between in- 
structions (memorial strategy) and rated im- 
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agery values is precisely the interaction that 
would be expected by the conceptual-peg hy- 
pothesis (Paivio, 1971). In particular, it 
appears to be a confirmation of the interaction 
that Paivio (1971, p. 366) has predicted but 
has found to be elusive. It also appears to 
support the "integrated structure" notion of 
Begg (1972), as well as the “relational associa- 
tion" of Bower (1970, 1973), when taken 
together with a retrieval axiom, namely the 
conceptual-peg hypothesis. 
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EYELID CONDITIONING PERFORMANCE WHEN THE MODI 
REINFORCEMENT IS CHANGED FROM CLASSICAL TO 
INSTRUMENTAL AVOIDANCE AND VICE VERSA ! 
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Eyelid conditioning performance during Stage II instrumental 
reinforcement was examined after 0, 5, 10, or 20 Stage I 


avoidance 
reinforcements 


and eyelid conditioning performance during Stage II classical reinforcement 
was examined after 0, 5, 10, or 20 Stage I instrumental avoidance reinforce- 


ments. 


Response rate on the initial Stage 


I trial block of instrumental 


avoidance conditioning was a progressively increasing function of the number 


of Stage I cl "al reinforcements. 


Response rate on the initial Stage II trial 


block of classical conditioning was greater if 5, 10, or 20 Stage I instrumental 
avoidance reinforcements had been given than if no Stage I instrumental 


avoidance reinforcement had been given. 


Response efficiency on the initial 


trial block of Stage II also indicated positive response transfer from instru- 


mental avoidance reinforcement to cl 


sical reinforcement, 


The results were 


discussed in terms of the complexities of the reinforcement and extinction 
processes involved in classical and instrumental avoidance conditioning of 


eyelid responses. 


Some of the interrelationships among the 
responses learned under classical and in- 
strumental avoidance reinforcement were 
investigated by shifting the mode of rein- 
forcement from classical to instrumental 
avoidance and vice versa. As Kimble 
(1961, chap. 4) has pointed out, the experi- 
mental investigation of the interrelation- 
ships among the responses learned under 
different modes of reinforcement has proved 
to be extremely difficult. Distinguishing 
between paradigms in strictly operational 
terms is only an initial step in a scientific 
investigation of the various modes of rein- 
forcement. Increasing attention must be 
given to the psychological processes in- 
volved. When note is taken of the positive 
and negative response transfer effects that 
may be observed in two-stage experiments 
in which the modes of reinforcement. are 
shifted, more light may be shed on the 
similarities and differences of the psycho- 
logical processes involved in each mode of 
reinforcement. 


1 This research was supported in part by funds 
provided to the second author by the U.S. Public 
Health Service in Research Grant MH-06792. 

* Requests for reprints should be sent to David 
A. Grant, Psychology Building, Charter at Johnson, 
University of Wisconsin— Madison, Madison, Wis- 
consin 53706. 


If a response learned under one mode of 
reinforcement can be transferred to another 
mode, then initial response frequency. and 
efficiency after a shift in the mode of rein- 
forcement should be a positive function of 
the number of preshift reinforcements 
Grant, Kroll, Kantowitz, Zajano, and Sol- 
berg (1969) reviewed several studies which 
have demonstrated positive transfer from 
classical to instrumental conditioning and 
positive transfer of differential classical to 
differential instrumental conditioning. but 
found no reports of positive transfer from 
instrumental to classical conditioning. 

Grant et al. (1969) investigated transfer 
of responses from Stage I classical rein- 
forcement to Stage II instrumental re- 
ward reinforcement and vice versa. They 
found that response rate during subsequent 
instrumental reward conditioning was a 
Progressively increasing function of the 
number of Stage I classical reinforcements. 
In addition, the V or C classification of an 
S's eyelid responses using the Hartman- 
Ross (1961) criterion was usually the same 
in Stage I classical conditioning and Stage 
I instrumental reward conditioning. How- 
ever, following Stage I instrumental reward 
training, the response rate during Stage II 
classical conditioning was not an increasing 
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monotonic function of the number of Stage 
I instrumental reward reinforcements. In 
fact, response rate during Stage II classical 
conditioning was slightly higher if no pre- 
vious instrumental reward reinforcements 
had been given than if 20 had been given. 
Further, there was no significant tendency 
for the V or C classification of conditioned 
responses (CRs) in Stage II classical con- 
ditioning to correspond to the V or C 
classification of the responses in Stage I 
instrumental reward conditioning. 

The classical conditioning reinforcement 
used by Grant et al. (1969) was a corneal 
airpuff—a mildly aversive unconditioned 
stimulus (UCS)—while the instrumental 
reinforcement was a symbolic reward. Be- 
fore Stage I classical conditioning, their Ss 
were given neutral instructions asking them 
to adopt a passive attitude, but before in- 
strumental reward conditioning the Ss were 
asked to adopt what amounted to an active, 
problem-solving attitude. The Ss shifted 
from classical to instrumental reward con- 
ditioning were initially given neutral in- 
structions and were later given the new 
set of instrumental reward instructions be- 
fore the shift. However, the Ss shifted 
from instrumental reward to classical con- 
ditioning received only the Stage I instru- 
mental reward instructions and were not 
reinstructed before the shift. The asym- 
metry of response transfer found by Grant 
et al. may have been caused, at least in 
part, by (a) the asymmetry of response 
transfer from paradigms using aversive 
reinforcement to paradigms using reward 
reinforcement and (b) the asymmetry of 
preshift reinstruction. 

Grant et al. (1969) suggested that 
another possible reason for the asymmetry 
of response transfer found by them was 
that the CR produced by classical condi- 
tioning was adequate to trigger the instru- 
mental reward devices, but the response 
typically learned during their instrumental 
reward phase was of low amplitude and of 
a topography that would be ineffective in 
protecting the cornea in the subsequent 
classical conditioning. In one case, then, 
the initial classical conditioning made 
available a response which would be effec- 
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tive in the subsequent instrumental reward 
phase and positive, systemat ic transfer was 
obtained. In the opposite case, the initial 
instrumental reward training made avail- 
able a response which would not be effec- 
tive in the subsequent classical conditioning 
phase and nonsystematic, and perhaps neg- 
ative, transfer performance was obtained. 

Using the same transfer paradigm and 
avoiding some procedural problems en- 
countered by Grant et al. (1969), the 
present experiment was designed to com- 
pare the transfer of responses conditioned 
by classical and instrumental avoidance re- 
inforcement. The corneal airpuff can 
be used in both classical conditioning 
and instrumental avoidance training (e.g., 
Hansche & Grant, 1965; Kimble, Mann, 
& Dufort, 1955; Logan, 1951; Moore & 
Gormezano, 1961), so that the need for pre- 
shift reinstruction and problems with re- 
warding versus aversive stimuli can be 
eliminated. In addition, Massaro and 
Moore (1967) have examined the latency 
and recruitment of classically conditioned 
CRs and avoidance responses. Their 
analyses showed that the instrumental 
avoidance procedure produced a response 
topography similar to that of the classically 
conditioned CR. The implication is that 
the response learned during Stage I in- 
strumental avoidance reinforcement should 
be effective in protecting the cornea during 
Stage II classical conditioning. 


METHOD 


Apparatus. The apparatus and laboratory were 
similar to those described by Grant et al. (1969). 
The S was placed in a soundproofed chamber. He 
sat in an ophthalmological examination chair facing 
a rectangular wooden enclosure painted flat white 
and illuminated to approximately 1 mL. At the 
back of the wooden enclosure were two ground-glass 
disks, approximately 10 cm. in diameter and sepa- 
rated from center to center by 66 cm. The disks 
were at eye level and approximately 120 cm. in 
front of the S. Between the ground-glass disks was 
a 45 X 30 cm. ground-glass screen which was not 
used in the experiment. The intertrial illumination 
of the two disks was approximately 1.5 mL. The 
S wore a plastic headset that was designed originally 
to support a welder's face mask. To it was attached 
a potentiometer to pick up the eyelid movement 
and a tubular jet from which the corneal airpuff 
could be delivered. The potentiometer was coupled 
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to the right eyelid of the .S by means of a piano wire 
that was taped to the eyelid with plastic electrician's 
tape and inserted into a length of hypodermic 
tubing attached to the shaft of the potentiometer by 
a hinged coupling. 

Procedure. The conditioned stimulus (CS) was 
the simultaneous illumination of the two ground- 
glass disks to a luminance of 1.9 mL. for a duration 
of 700 msec. The UCS, when given, was a 2 psi 
corneal airpuff lasting 200 msec. and delivered 500 
msec. after CS onset. The intertrial interval varied 
from 15 to 35 sec., with a mean of 25 sec, During 
the instrumental avoidance training an eyelid re- 
sponse that produced a 1 mm. or greater deflection 
of the oscillograph first time-derivative pen be- 
tween 200 and 500 msec. after CS onset was defined 
as an avoidance response (AR). An AR caused the 
airpuff to be omitted. A 1 mm. or greater deflection 
of the derivative pen corresponded to an eyelid 
closure of approximately 1 mm. Detection of an 
AR was accomplished by means of a Schmitt trigger 
and associated timing and logic circuitry. The puff 
was delivered on every trial during the classical 
conditioning sections of the experiment. The defini- 
tion of a classically conditioned response (CR) was 
similar to the definition of an AR except that the 
scoring interval extended to 540 msec. after CS onset. 

The Ss were divided into eight experimental 
groups and were given neutral instructions in which 
they were asked not to aid or inhibit their natural 
eyelid responses, Four of the experimental groups 

(CL-AV groups) received, respectively, 0, 5, 10, or 
20 classically reinforced pairings of the CS and UCS 
during Stage I of the experiment. They then re- 
ceived 60 instrumental avoidance conditioning trials 
during Stage II followed by 30 extinction trials 
with the CS alone. These four groups will be 
referred to as CLO-AV60, CL5-AV60, CL10-A V60, 
and CL20-AV60, respectively. The Ss in the re- 
maining four groups (AV-CL groups) received 0, 
5, 10, or 20 instrumental avoidance conditioning 
reinforcements during Stage I of the experiment and 
then received 60 classical conditioning trials during 
Stage II followed by 30 extinction trials. These 
four groups will be referred to as AVO0-CL60, AV5- 
CL 60, AV10-CL60, and AV20-CL60, respectively, 

A maximum of 60 trials was permitted during 
Stage I for Ss in the AV-CL groups. To match 
groups, data from Ss in the AVS-, AV10-, and AV20- 
CL60 groups were retained only if the § had made 
at least five ARs in 25 trials, because pilot experi- 
ments indicated that only Ss who do so will typically 
go on to make 20 or more ARs in 60 trials, This 
criterion resulted in 38 rejections, approximately 
equally distributed among these three groups. 
Twelve additional Ss were run in Group AV0-CL60 
so that the data from the 12 Ss with the lowest 
response rates could be rejected, resulting in an 
approximately equal number of rejections in each 
of the four AV-CL groups. 

At the conclusion of the experimental session Ss 
were given a questionnaire asking for their impres- 
sions of the experiment, how noxious they felt the 
airpuff to be, whether or not their responses were 
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"voluntary," and whether they thought a 
that their behavior would cause the airr 
omitted. 


time 
to be 


Subjects, The Ss were 80 men and & men 
student volunteers from introductory p logy 
courses at the University of Wisconsin—? son, 
Assignment of 10 men and 10 women to cac the 


eight experimental groups was random unt 
came necessary to balance for sex, 


be- 


RESULTS 


Summary of Stage I data. Table 1 gi 2s 
(A) the average number of CRs or A Sy 
(B) the average number of trials, and ( ) 
the average number of airpuff reinforc- - 
ments during Stage I for each of the fou + 
CL-AV and the four AV-CL groups. Dur- 
ing Stage I, the AV-CL groups received, 
on the average, more trials and gave mori 
responses than the corresponding CL-A\ 
groups. The differences between the four 
CL-AV groups in the mean number of Stage 
I airpuffs are larger than the corresponding 
differences between the four AV-CL groups 

Response form and transfer. All Ss who 
gave one or more responses during $ tage | 
were classified as V-form or C-form m 
sponders during Stage I, using the Hart 
man-Ross (1961) response-slope criterion, 
and were independently reclassified during 
Stage II. Of the 46 Ss in the CL-AV groups 
who could be so classified 35 Ss had the 
same V or C ‘classification during both 
stages and 11 Ss did not. Of the 60 Ss in 
the AV-CL groups for whom the Stage I 
classification was available 50 Ss had the 
same V or C classification during both 
stages and 10 Ss did not. Two significant 
chi squares (p < -001), one for the CL-AV. 
groups and one for the AV-CL groups, 
Showed that in both cases there was a 
significant tendency for Ss to have the 
same V or C classification in both Stage | 
and Stage II. 

Stage IT avoidance response frequency. 
Figure 1 shows the percent ARs given dur- 
ing each of the six 10-trial blocks of instru- 
mental avoidance conditioning trials with 
the number of previous classical reinforce- 
ments as the parameter. The data of Vs 
and Cs are shown in the upper and lower 

panels, respectively. The final training 
Point given for each group in Figure 1 is 
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the percent CRs for the final five trials of 
Stage I classical training. For Group 
CLO-AV60 the final training point is the 
percent ARs given on the first trial of in- 
strumental avoidance conditioning. Since 
approximately 40% of the Ss run in the 
AV-CL conditions were rejected on the 
basis of low response frequency, the 12 
highest responders in the CL-AV groups 
are most likely to be similar to the AV-CL 
Ss in terms of base response rate and con- 
ditionability. Although the results based 
on all CL-AV Ss are presented, it is im- 
portant to note that the pattern is identical 
when the data of only the high-frequency 
responders from the CL-AV groups are 
considered. 

During the first 10-trial block of Stage Il 
instrumental avoidance transfer response 
frequency was a progressively increasing 
function of the number of previous classical 
reinforcements, F (3, 76) = 4.95, p < .005. 
In general, during the first trial block, hav- 
ing had 5 or 10 previous classical reinforce- 
ments produced more responding than hav- 
ing had no previous classical reinforce- 
ment, and having had 20 previous classical 
reinforcements produced more responding 
than having had 5 or 10 previous classical 
reinforcements. Possible exceptions to this 
pattern are the Cs in Group CL5-AV60, but 
there were only four Cs in that group, so 
that the reliability of those data is ques- 
tionable. The effect of the number of 
previous classical trials diminished quickly, 
and during the last three 10-trial blocks 
there was no significant effect of the num- 
ber of previous classical reinforcements, 
Vo SEE yes s 

Averaged over all AV trial blocks, the 
response rate of Vs was significantly higher 
than the response rate of Cs, F (1, 72) = 
9.74, p < .005. In addition, although the 
initial effect of the number of previous 
classical reinforcements was greater for Vs 
than for Cs, the effect persisted longer for 
Cs, as indicated in a significant Number of 
Stage I Reinforcements X Response Form 
x Trial Blocks interaction, F (15, 360) — 
1.75, 5 < .05. 

Stage II classical conditioned response 
frequency. Figure 2 shows the percent 
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TABLE 1 


STAGE I Summary DATA 


C. Mean 
A.Mean | B.Mean | numberof 
Group numberof | numberof | airpuff re- 
CRsorARs| trials inforce- 
ments 
CLO -AV60 0 0 0 
CL5 -AV60 .90 5 5 
CL10-AV60 2.70 10 10 
CL20-AV60 9.50 20 20 
AVO -CL60 0 0 0 
AV5 -CL60 $ 14.30 9.30 
AV10-CL60 10 23.35 13.35 
AV20-CL60 20 36.80 16.80 


Note. Abbreviations: CR = conditioned response; AR 
= avoidance response, See text for explanation of group 
names. 


CRs given during each of the six 10-trial 
blocks of classical conditioning with the 
number of previous instrumental avoid- 
ance reinforcements as the parameter. 
The data of Vs and Cs are shown in the 
upper and lower panels, respectively. The 
final training point is the percent ARs for 
the final. five trials of Stage I, except for 
Group AV0-CL60 where the final training 
point is the percent CRs given on the first 
trial of Stage IT. 

The frequency of CRs during the first 
10-trial block of Stage II was greater heap 
10, or 20 ARs had been given during 
Stage I than if no ARs had been given, 
but there were no systematic differences in 
CR frequency between Ss in the AVS-, 
AV10-, and AV20-CL60 groups during any 
of the 10-trial blocks. These observations 
were supported by a significant difference 
in the percent CRs on the first. 10-trial 
block during Stage II as a function of the 
number of previous instrumental avoidance 
reinforcements, F (3, 76) = 29.26, p < 
001, and the fact that the corresponding 
differences among the AV5-, AV10-, and 
AV20-CL60 groups were not statistically 
significant, F (2, 57) < 1. The response 
rate of the Ss in Group AVO-CL60 reached 
an asymptotic level quickly, so that the 
number of previous instrumental avoidance 
reinforcements produced no significant dif- 
ferences in classical CR frequency during 
the final three 10-trial blocks of Stage II. 
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As Figure 2 shows, averaged over all 
Stage II trial blocks, Vs gave more CRs 
than Cs, F (1, 72) = 12.38, p « .001. In 
addition, the trend over Stage II trial 
blocks as a function of the number of 
Stage I avoidance reinforcements was dif- 
ferent for Vs and Cs, as indicated by a 
significant Trial Blocks X Number of Stage 
I Reinforcements X Response Form inter- 
action, F (15, 360) = 1.75, p < .05. Figure 
2 also shows that only for the Ss in Group 
AV0-CL60 did CR frequency increase over 
the successive trial blocks of classical 
conditioning. This is reflected in a signifi- 
cant Trial Blocks X Number of Stage I 
Reinforcements interaction, F (15, 360) = 
8.07, p < .001. 

Stage II response efficiency. Response 
Efficiency (PUFAMP/MAXAM P) was de- 
fined as the amplitude of eyelid closure at 
500 msec. after CS onset (PUFAMP) 
divided by the maximum amplitude of 
eyelid closure between 200 and 700 msec. 
after CS onset (MAXAMP). During 
classical conditioning, MAXAMP typi- 
cally occurred between 500 and 700 msec. 
after CS onset, making PUFAMP/ 
MAXAMP similar to some of the effici- 
eney measures used by Martin and Levey 

(1965; 1969, pp. 63-67). The mean 
PUFAMP/MAXAMP score was computed 
for each S on each 15-trial block of Stage IH. 

The mean PUFAMP/MAXAMP scores 
increased monotonically over the four 
15-trial blocks of Stage 1I classical condi- 
tioning in the AV-CL groups, but did not 
increase over the 15-trial blocks of Stage 


TABLE 2 
XAMP Score: 


MEAN PUFAMP/M 
15-TRIAL 


ON THE First 
Il 


Number of Stage I reinforcements 


Groups | 
| 0 5 10 20 
r | 
CL-AV | | 
Vs(n 16) 204 813 .685 | .796 
Csín 34) .683 B38 619 | 749 
AV-CL | 
Vs(n = 64) 571 -671 747 809 
Csín = 16) Mal A70 ATS 658 


Note. Abbreviations: PUFAMP = amplitude of eyelid closure 
at 500 msec. after onset of conditioned stimulus (CS); MAX. 
AMP = maximum amplitude of eyelid closure between 200 and 
700 msec, after CS onset, 
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II instrumental avoidance conditioni + in 
the CL-AV groups. This difference be- 
tween the linear trends of the A% CL 
groups and the CL-AV groups was s 
tically significant, F (1, 148) = i! 92, 


p «.001. Analysis of variance furi er 
indicated that the within-groups e or 
variance of the PUFAMP/MAX^' P 


scores was significantly smaller du: m 
classical conditioning (AV-CL groups, er or 
variance = .012) than during instrumen: |l 
avoidance conditioning (CL-AV groups, 
error variance = .021), p < .01. 

Table 2 presents the mean PUFAM 
MAXAMP scores on the first 15-tri:! 
block of Stage II for Vs and Cs in the 
CL-AV groups and the AV-CL groups. 
For the CL-AV groups the mean PUFAMP, 
MAXAMP scores on the first trial block 
of Stage II were not a monotonically in- 
creasing function of the number of Stage | 
reinforcements, but for the AV-CL groups 
the progressive increase in PUFAM P/ 
MAXAMP scores was statistically signifi- 
cant, F (3, 76) = 3.91, p < .025, 

An analysis of variance of the mean 
PUFAMP/MAXAMP scores for the AV- 
CL groups, with number of Stage I rein- 
forcements (R), V or C response form (F) 
and trial blocks (B) as variables, yielded 
several interesting results. The Vs gave 
significantly higher PUFAMP/MAXAMP 
scores than the Cs, F (1, 72) = 27.96, 
b < 001, and their CRs showed an ini- 
tially more rapid rate of PUFAMP/ 
MAXAMP increase than the Cs FX B 
interaction, F (3, 216) = 3.59, b «.025. 
The overall increase in PUFAMP/ 
MAXAMP over. trial blocks was also 
significant, F (3, 216) — 7.57, p < .001. 

Extinction. 1n neither the CL-AV groups 
nor the AV-CL Eroups was response fre- 
quency during extinction a function of the 
number of Stage I reinforcements. 

Questionnaire. The answers to the post- 
experimental questionnaire were astonish- 
ing in one respect. Of the 140 Ss receiving 
some instrumental avoidance conditioning, 
not one reported that any of his responses 
would cause omission of the airpuff. The 
postexperimental inquiry thus confirms the 
reports of Massaro and Moore (1967). 


DISCUSSION 


mary results of the present study and 

onship to the results of Grant et al. 

ill be discussed in terms of their 

regarding the nature of the pro- 
cesse ived in classical and instrumental 

e | tioning. 
ige LI performance. In the CL-AV 

frequency during the first 10-trial 
istrumental avoidance conditioning 

tage IT was a progressively increasing 

ion of the number of previous classical 

aloreements, In addition, the Ss tended 
io retain the same V-C classification in Stage 
L and Stage II. These results essentially 
parallel the results on transfer from classical 
to instrumental reward reinforcement reported 
by Grant et al. (1969) and are consistent with 
the position that the exact CR learned under 
classical reinforcement is retained when the 
mode of reinforcement is changed, 

Avoidance response efficiency, as defined by 
\1P/MAXAMP measure, was not à 
dy increasing function of the num- 

ber of Stage I classical reinforcements, The ab- 

sence of an increasing PUFAMP/MAXAMP 
the beginning of Stage II is not 
because, as Table 1 shows, the 

iber of CRs given during Stage I 

LO-, CLS-, and CL10-AV60 was 0, 
igi , respectively, When groups do 
not diler appreciably in the small number of 

CRs that have been given, systematic differ- 

ences in topography measures should not be 

expected. Perhaps maximization of the 

PUFAMP/MAXAMP ratio depends upon the 

occurrence of enough CRs so that processes 

such as exteroceptive and proprioceptive feed- 
back can influence response timing mechanisms 

(Grant, 1972), although Martin and Levey 

(1969, pp. 93-98) argue that the occurrence 

of overt responses may not be necessary for the 

development of response topography. 

In the AV-CL groups, CR frequency during 

the first 10-trial block of Stage II was greater 

if 5, 10, or 20 instrumental avoidance re- 
sponses had been given during Stage I than if 
no avoidance trials had been given, but CR 
frequency was approximately equal in the 

AVS-, AV10-, and AV20-CL60 groups during 

all six trial blocks of Stage II classical con- 

ditioning. Table 1 reveals a possible reason 
for these surprisingly equal response levels 
during Stage If. The CL-AV groups received 

0, 5, 10, or 20 airpuff classical reinforcements 

during Stage I classical training. During 


ta 
69) w 


implicati 


function at 
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Stage I, the AV-CL groups also received air- F 
puffs which have long been regarded as inter- — 
mittent classical reinforcement (Logan, 1951), — 
In fact, several investigators have suggested - 


that the occurrence of the airpuff is the pri- | 


mary reinforcing event in instrumental avoid- 
ance eyelid conditioning (e.g, Gormezano, 
1965; Kimble, 1961, 1964; Logan, 1951). The 
AVO-, AVS-, AVIO-, and AV20-CL60 groups 
received on the average 0, 9.3, 13.35, and 16.8 


airpuff reinforcements, respectively, during — 


their instrumental avoidance training. The 
differences in mean number of Stage I airpuffs 
among the AV5-, AV10-, and AV20-CL60 
groups were, therefore, far smaller than the 
corresponding differences in the CL-AV groups. 
The smaller differences between the AV-CL 
groups in the number of such Stage I classical 
reinforcements could readily account for the 
smaller differences in the Stage IT response 
frequencies of these groups, 

The finding that Ss in the AV-CL groups 
tended to retain the same V-C classification 
when the mode of reinforcement was changed 
indicates that the same response learned dur- 
ing Stage I instrumental avoidance condition 
ing continued to be performed during Stage 
II classical conditioning. Positive response 
transfer was also indicated by the finding 
that the efficiency of CRs, as defined by the 
PUFAMP/MAXAMP measure, on the first 
trial block of Stage II class conditioning 
was a monotonically increasing function of 
the number of Stage I ARs, This result also 
suggests that the respon opography learned 
during instrumental avoidance training was 
very effective in attenuating the noxiousness 
of the airpuff on subsequent classical condition- 
ing trials which may explain why the same 
response was retained, 

In many important respects, the results in 
the AV-CL groups of the present experiment 
do not parallel the results on transfer from 
instrumental reward to classical reinforcement 
reported by Grant et al. (1969). In the Grant 
et al. experiment there was no indication that 
the yonse learned during instrumental re- 
ward reinforcement was the same as the CR 
that occurred during subsequent classical con- 
ditioning. Some or all of the differences noted 
in the introduction between the two experi- 
ments may account for this difference in results. 
The most important difference may be that 
the response topography learned during in- 
strumental avoidance conditioning is effective 
in attenuating the noxiousness of the corneal 
airpuff in classical conditioning, whereas the 
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response topography learned during instru- 
mental reward conditioning is less effective in 
this respect (Grant et al., 1969). The com- 
bined results of the present experiment and 
the experiments of Grant et al. suggest that 
when the response learned under one mode of 
reinforcement is "effective" under a second 
mode of reinforcement the response will con- 
tinue to be given when the mode of reinforce- 
ment is changed from the first to the second. 

Instrumental reward and. avoidance reinforce- 
ment compared. In the present experiment, the 
AR frequency curves for the CL-AV groups 
converged to a common asymptote during the 
last three 10-trial blocks of instrumental avoid- 
ance conditioning. Grant et al. (1969) found 
that the instrumental response frequency 
curves for their four classical-to-instrumental 
reward groups remained parallel throughout 
the 60 trials of Stage II instrumental reward 
reinforcement and, therefore, reached different 
asymptotes. Differences between the relation- 
ship of Stage II reinforcement to the occurrence 
of a conditioned eyeblink in the two studies 
may account for the different patterns of re- 
sults on the later Stage II trial blocks. 

Grant (1973) has noted that opposing rein- 
forcement processes occur during instrumental 
avoidance conditioning. The occurrence of the 
airpuff on no-response trials constitutes a 
classical reinforcement, which increases the 
probability of a CR on subsequent trials. 
The omission of the airpuff on the occurrence 
of an AR constitutes both an instrumental 
avoidance reinforcement, which increases re- 
sponse probability, and a classical extinction 
trial, which decreases response probability. 
Therefore, as the response rate increases, the 
number of classical extinction trials increases 
which should be expected to reduce the re- 
sponse rate. As a conditioning session pro- 
gresses, these conditioning-extinction pro- 
cesses tend to produce a stable equilibrium in 
response rate. Therefore, even though the 
four CL-AV groups of the present experiment 
differed in initial Stage II response rate, the 
interaction of the processes just noted acted 
to produce convergence toward a common 
asymptote. 

In instrumental reward conditioning, the 
number of reinforcements is a positive function 
of response rate, The higher the initial Stage 
II response rate, the higher the number of in- 
strumental reward reinforcements. In general, 
higher proportions of reinforced trials should 
tend to produce higher asymptotic response 
rates (e.g, Moore & Gormezano, 1961). 
Therefore, the Stage II instrumental response 
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frequency curves of Grant et al. (1969) in- 
crease in a more parallel fashion as a function. 
of the reinforcements of the Stage II trial 
blocks, Thus, the differences between the 
results of the present experiment and that of 
Grant et al. can readily be understood in terms 
of differences between the reinforcement 
processes involved in instrumental reward and 
instrumental avoidance training of the two 
experiments. 

Response efficiency changes over trial blocks. 
During Stage II the mean PUFAMP/ 
MAXAMP ratio increased monotonically over 
the trial blocks of classical conditioning in the 
AV-CL groups. This increase is consistent 
with the reports of Martin and Levey (1969) 
and Hickok (1968), who found that CR effici- 
ency, defined by various measures of puff 
avoidance, increased monotonically over trials 
in classical conditioning. However, during the 
Stage II instrumental avoidance conditioning 
trial blocks of the CL-AV groups, progressive 
increases in AR efficiency were not obtained. 
Maximizing the PUFAMP/MAXAMP ratio 
is biologically adaptive during c cal con- 
ditioning. Nothing can be gained, however, 
during instrumental avoidance conditioning by 
synchronizing the point of maximum AR 
amplitude with the time that the airpuff 
would occur on a no-response trial. Therefore, 
one might expect that more individual varia 
tion in the CR topography would be found 
under instrumental avoidance training than 
under classical reinforcement. That the topog- 
raphy was actually more variable 
instrumental avoidance reinforcement than 
under classical reinforcement was revealed 
by the fact that the error variance of the 
PUFAMP/MAXAMP measure was signifi- 
cantly greater with instrumental avoidance 
reinforcement than with classical reinforce- 
ment. The implications of the results on re- 
sponse efficiency can be summarized by stating 
that they are consistent with the suggestion of 
Martin and Levey (1965, 1969) that only those 
components of response topography that are 
biologically adaptive are likely to be maxi- 
mized during the conditioning process. 

Conclusion. Experimenters have tradition- 
ally been forced to distinguish among various 
forms of conditioning in terms of the experi- 
menter's operations, rather than in terms of 
the S’s psychological processes, a very un- 
satisfactory situation (Grant, 1964). The 
findings of the present experiment and those of 
Grant et al. (1969), however, provide some evi- 
dence for functional differences as opposed to 
Operationally defined differences among class- 


under 
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ical conditioning and two forms of instrumental 
conditioning. Differences in response topog- 
raphy obtained under the different training 
procedures make it increasingly clear that it is 
inaccurate and misleading to speak of the eye- 
lid CR becoming conditioned under these three 
modes of reinforcement. In addition, differ- 
ences in the consequences of shifting from 
classical to instrumental avoidance or classical 
to instrumental reward reinforcement, etc., 
reveal some of the variety and complexity of 
the reinforcement processes that characterize 
these three modes of reinforcement. It is 
likely that further experiments using the trans- 
fer paradigm and further analyses of changes in 
response topography, as urged by Martin and 
Levey (1969), will provide a more incisive 
analysis of conditioning and will yield more 
precise specification of functional as opposed to 
operational differences among the various 
modes of reinforcement by which the eyelid 
response can be conditioned. 
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NONLEARNING: THE COMPLETENESS OF THE BLINDNESS! 


PAUL FINGERMAN asp. MARVIN LEVINE * 


State Unitersity of New York at Stony Brook 


College students fail to learn a simple discrimination (e.g., choosing A from 


the pair A~B), if they are set for a different type of solution. 


It is possible, 


however, that these Ss learn enough to recognize the solution when stated. 
In the present study, therefore, Ss who failed to learn this discrimination were 
given a multiple-choice test containing the statement of the solution as one of 
the alternatives. None of the Ss selected this alternative, 


Exploring one facet of his theory of learning, 
Levine (1971) derived conditions for the occurrence 
of nonlearning, the absence of learning, in the 
simplest of tasks. The conditions of failure were 
specified for even a well-motivated, intelligent adult 
who perceived all the relevant stimuli. Before 
reviewing the derivation, it is worth stressing that 
the task, simultaneous discrimination, is almost 
irreducible in its simplicity. Two letters, A and B, 
are both printed on each of a long series of cards, 
varying only in which of the letters is on the right 
(R) side of the card and which on the left (L). The 
5 turns the cards 1 at a time and is instructed to 
state 1 of the 2 letters at each card. The E says 
correct whenever S says A and incorrect whenever 
Ssays B. The Ss are instructed to try to be correct 
as often as possible. College students normally 
learn this task quickly, making their final error at 
about the second card. 

The theory, which specifies the conditions under 
which the choice of A will not increase in frequency, 

may be summarized as follows: The S samples from 
a set of hypotheses in trying to solve this problem. 
If he samples from an incorrect set (one which does 
not contain the solution) he will not solve the prob- 
lem. In such a circumstance, he will "learn," i.e., 
solve the problem, only by first exhausting the in- 
Correct set. When it becomes empty he samples a 
new set, one which (hopefully) contains the solu- 
ton. An S, then, who starts with an incorrect 
set of hypotheses can, nevertheless, solve the prob- 
lem, The eventual emptying of this set serves as 
a signal to try something new. 

Suppose, however, that this initial incorrect set 
of hypotheses is infinitely large—it never becomes 
empty, and 5 never receives a signal to try another 
set of hypotheses. For the specific task described 
above, suppose S’s initial hypothesis set is that the 
solution is a complex position sequence (eg., it 
double alternates [LLRR] Jor 4 trials, then single 
alternates [LRLR] for 4 trials, then repeais). The 
set of such position sequences is virtually infinite, 


‘This work was supported by Research Grant MH-11857 
from the National Institute of Mental Health to the second 
author. 

? Requests for reprints should be sent to Marvin Levine, 
Department of Psychology, State University of New York: 
Stony Brook, New York 11790. 


yet does not contain the hypothesis say A. Acc 
ing to the theory, S should learn nothing about | ¢ 
contingency between A-B and correct-wrong. 

One can plausibly cause S to sample this incorr 
but vast hypothesis set by presenting him fir 
with preliminary problems having just such posi 
tion-sequence solutions. Levine (1971) performed 
2 experiments in which 6 preliminary problems 
with various position-sequence solutions preceded 
the critical problem with the say-A solution. He 
found 80% of the Ss failed to solve the final problem 
in 100 trials, and were saying A about 50% of the 
time throughout all these trials, The prediction 
seemed confirmed, then, that S learns nothing about 
the simplest contingencies if the appropriate hy- 
pothesis is not in the set. 

The question arises, however, whether S really 
learns nothing about the relationship between 4 
and correct. The influence of this invariant repeti- 
tion might be detected by some more sensitive 
probe. One possibility is that S, though unlikely 
to formulate the correct hypothesis, might recognize 
it when stated. If, for example, the correct rule 
statement were included among a small set of alter- 
natives, S might be able to consult his memory of a 
few of the trials and recognize the existence of this 
invariant relationship. An S, therefore, responding 
A at chance level (i.e., failing by the normal criter- 
ion) might nevertheless select the correct alterna- 
tive from a simple multiple-choice test. The fol- 
lowing experiment was performed to investigate 
this possibility. 

Method. Forty-one Stony Brook undergraduates 
served as Ss. The data from 9 Ss were discarded for 
reasons described below. Of the remaining 32 Ss, 
half were fulfilling a requirement in the introductory 
psychology course, and half were paid volunteers. 

Two decks of 110 3 X 5 in. cards each were 
developed. Every card had the letters A and B 
printed on the center horizontal line. Within each 
deck, half of the cards had A on the left and B on 
the right; the other half had the reverse, The 2 
types of cards appeared randomly within each 
deck, and the 2 decks differed only in the sequence 
of these cards, 

Four additional cards were prepared, each of 
Which contained the following statement and 6 
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alternatives: 


I said “correct” on this last problem: 

(a) whenever you said "A". 

(b) whenever you paused for more than two 
seconds before responding. 

(c) whenever you responded according to the 
sequence LLRLRRLLRLLL. 

(d) whenever you responded according to 
some other sequence. 

(e) whenever you responded according to a 

sequence, The correct sequence, however, 

ept changing. 

(f) None of the above. 


The sequence of the alternatives was counter- 
balanced over the 4 cards, with the restriction 
that (using the above letter labels) Alternative d 
always followed c, and f was always last. Also, 
each of the 4 cards was replicated except that “B” 
appeared in Alternative a. 

The S, seated across a table from E, was pre- 
sented with a deck and instructed to go through 
the deck 1 card at a time, choosing on each card 1 
of the 2 letters. He was further told that E would 
say correct or wrong according to some rule, and 
that he could be correct all of the time if he dis- 
covered the rule, This routine was then followed 
as S went through the deck. Seven problems were 
presented in this way. The first 6 problems had 
position-sequence solutions beginning with double al- 
ternation (RRLL)and ending with LLRLRRRLRR. 
The S was run in each of these 6 problems until he 
either made 15 successive correct responses or 
completed 95 trials. The E announced the solution 
at the end of each of these problems and then said, 
“Here is another problem,” and presented the 
other deck. The final, critical problem also began 
in this style. For half of the Ss, the solution rule 
was say A; for the other half, it was say B. If S 
made an error after Trial 60, E said: “You seem to 
be having some trouble with this problem. A lot 
of subjects seem to have trouble with it, so I’ve 
prepared a multiple-choice question I would like 
you to answer to help me to figure out why." He 
then presented the appropriate multiple-choice 
card and said, "What do you think is the best 
answer?" After S's response, E said, "Now I'd 
like you to rank the answers from best to worst." 

On the critical problem, 2 different decks and 2 
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solutions (say A or say B) defined 4 conditions. 
These, combined with the 2 conditions of payment 
and with the 4 multiple-choice sequences, produced 
32 unique conditions, The Ss were run until there 
was 1 S in each condition who (a) solved at least 
1 of the first 6 problems, and (b) failed to solve 
the critical problem. 

Results. Since the theory is best tested if S is 
motivated to be correct and if he is indeed sampling 
among sequence hypotheses, the protocols of any 
S who solved none of the 6 preliminary problems 
were eliminated from consideration. Five of the 
41 Ss were dropped for this reason. None of these 
5 Ss, incidentally, solved the critical problem, 

There were, then, 36 Ss who solved at least 1 
sequence problem. Only 4 of these solved the 
final problem. This repeats the previous finding 
that the substantial majority of college students 
fail to learn this simple problem. For the 32 failing 
Ss, the percentages of correct responses on each of 
the 6 successive blocks of 10 trials were 48%, 52%, 
50%, 53%, 51%, and 47%. 

The multiple-choice test was presented to these 
32 Ss in the search for the merest modicum of 
learning, The results were decisive. None of the 
Ss selected the correct alternative as the solution. 
Twenty-seven Ss picked 1 of the 3 sequence- 
solution alternatives. The remainder picked "none 
of the above.” The Ss had also been asked to 
rank order the alternatives. The mean rank of the 
correct alternative was 4.5. It tied for last place 
with the alternative concerning waiting for 2 sec. 
before responding, the alternative most irrelevant 
to the experiment. Several Ss remarked on the 
unlikelihood of either of those 2 alternatives and 
said that they could not decide which to put last. 

The conclusion seems clear, Although Ss were 
forced, by stating the letter, to attend to the critical 
stimuli, and although the feedback was desirable, 
perceivable, and unambiguous, Ss failed to register 
the simple invariant relationship between the two. 
We can assert with greater confidence that Ss 
learn nothing about hypotheses not in their hy- 
pothesis set. 
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To determine whether both attended and unattended items are held in a com- 


mon short-term memory (STM) store, a Sternberg 
s employed, Four w , 
nultaneous dichotic presentation of a second 4-word list 
Pift 


(RT) paradigm 
aurally or with 
which Ss were instructed to ignore, F 
sisted of nonlist items, while the 


à no response, was 20%, indicating that 
unattended items. The RT for the posi 


varied set reaction time 
ord lists were presented either mon- 


y per cent of the negative probes con- 


remaining 50% were selected from the un- 
attended channel, The error rate for the latter probe words, 


which required 
there was some STM storage for the 
itive probes in the dichotic condition 


increased by 40 msec, These results were interpreted as suggesting that at- 


tended and unattended items are differentially tagged or in some 
kept separate in memory, but that random sampling of the 


takes place. 


While early investigators found no evidence of 
long-term memory (LTM) for verbal items which 
were not attended (Cherry, 1953; Moray, 1959), 
several recent studies have indicated that there is 
some short-term memory (STM) for such un- 
attended inputs (Glucksberg & Cowen, 1970; 
Norman, 1969; Peterson & Kroener, 1964). As yet 
little is known about the storage properties of un- 
attended materials. One question of interest is 
whether attended and unattended items are kept in 
a common memory store. 

To study this problem, Davis and Smith (1972) 
examined the serial position curves for lists of at- 
tended words as a function of whether words that 
Ss were instructed to ignore were presented in an 
unattended channel. It was argued that if all items 
are held in a common store, the presence of the 

unattended materials should result in an attenuation 
of the recency effect observed for attended materials, 
making it extend over fewer serial positions. No 
such change in serial position was found, suggesting 
that information from the 2 channels was not keptin 
common storage, 

There were, however, 2 
that despite the absence of 
there was some memor 
First, there was an overall 
the unattended material 
than nonsense syllables. 
that, to be analyzed for me 
contact with LTM store. 
was employed some intrusi 
the unattended channel. 
were plotted as a functio 
position, 
intrusions 


findings which suggested 
à change in serial position 
y for unattended input. 
decrease in accuracy when 
consisted of words rather 
Norman (1969) argued 
aning, items must make 
Second, when free recall 
on errors were found from 
When these intrusions 
n of their input serial 
a roughly inverse exponential decay of 
over serial position was found, suggesting 
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other way 
unattended channel 


an STM decay function for the unattended items. 
Further, in plotting these intrusions as a function 
of output serial position, it was found that they 
were offered only in the later output serial positions, 
i.e., after the recall of several other items, This 
latter finding suggested that attended and unat- 
tended inputs were differentially “tagged” (Yntema 
& Trask, 1963), and that unattended items were 
offered only after their “ear tag" had decayed. 
These results suggest that while there is some 
memory for unattended items, they appear either to 
be differentially tagged or in some other way stored 
separately from the attended materials, 

A different paradigm for examining the storage of 
unattended verbal materials is presented here. 
Two lists of words, each 4 items in length, were 
presented dichotically. The Ss were instructed to 
attend to one ear and to ignore the materials pre- 
sented to the other. Following presentation of the 
lists, a probe word was presented. A yes response 
was required if the probe was one of the words in the 
Previously presented attended list, and a no response 
if it was not, Sternberg (1966), using single list 
Presentation, suggested that when such a procedure 
is employed, Ss do a high-speed memory scan to 
determine if the probe was a member of the pre- 
viously presented list. 

In this study, to learn something about the storage 
of unattended materials, Ss were occasionally pre- 
sented with probe words that had in fact been 
presented to the unattended ear. A negative re- 
sponse to such probe words was required. If these 
words had no representation in STM, Ss should 
treat them just as they would words which had not 
been presented at all—i.e., error rate and reaction 
time (RT) should be no greater than for other 
negative probes. If, on the other hand, all items 
from both attended and unattended lists are held in 
common store in an undifferentiated form, Ss should 
always respond yes—i.e., there should be an error 
rate of 100%. Following Sternberg's (1966) as- 
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sumption that a serial scan of the memory set is 
made in order to reach a decision, twice as many 
items would have to be scanned in this condition. 
Consequently, RTs for both positive and negative 
probes should be considerably greater than RT in a 
control condition where only a single attended list 
is presented. A third possibility, suggested by the 
Davis and Smith (1972) study, is that the 2 lists 
may be tagged according to ear of arrival, or in some 
other way kept in a separate store in memory. If 
the tagging were perfect, and if Ss were able to limit 
their scan only to the attended list, then even 
though the items were held in STM they should be 
handled in the same manner as regular negative 
probes. To the extent that the tagging was not 
perfect, however, or that there was a fading of tag 
information with time, some false positives would be 
expected, giving a greater error rate for negative 
probes from the unattended list. To test this 
possibility, word lists were presented at both fast and 
slow rates, with the expectation that there would 
be less fading of tag information, and hence fewer 
errors, under a fast presentation rate. 

Method. The Ss were 8 male secondary school 
students, all right-handed according to self-report. 
They were paid $2.00 per session for serving in the 
experiment. 

The words used for each trial were sampled from 
a pool of 720 one-syllable words that had a rating 
of 45 or higher on the Thorndike-Lorge (1944) G 
count, and both began and ended with a spoken 
consonant. In cases where 2 or more words were 
homophones or syntactic variants of each other 
(e.g., give-gave), only one form of the word was 
retained in the pool. Only nouns, verbs, and ad- 
jectives were used. 

"The lists used in the experiment were generated by 
computer program and recorded by a male voice 
using a Sony TC-355 2-channel tape recorder. The 
Ss heard the lists through Sharpe HA-10 headphones 
and responded by pressing one of 2 push buttons. 
The S held one button in each hand. The RT for 
each response was measured to the nearest milli- 
second by a Hunter Klockounter. Each S was run 
individually in a quiet room. A plywood partition 
separated S and E. 

On each trial, S heard a warning tap, a list of 4 
memory words, another warning tap, and a probe 
word. The S was required to decide whether or not 
the probe word was present in the list and to make a 
response as quickly as possible. In the monaural 
condition, S heard all stimuli through one ear only. 
For any given S, this was the same ear throughout 
the experiment, and was designated as the “releva nt" 
ear for that S. In the dichotic condition, words 
were also presented to the irrelevant ear. Each 
irrelevant word was presented at approximately the 
same time as a memory word. Nothing was pre- 
sented to the irrelevant ear during presentation of 
warning signals or probe word. 

In the dichotic condition there were 3 types of 
probe words: (a) Positive probes were words that had 
been presented to the relevant ear as part of the 
memory set. The Ss were instructed to press the 
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yes button in response to positive probes. These 
probes were taken from each of the 4 serial positions 
equally often. Fifty per cent of all probes were 
positive. (b) Negative intralist probes were words 
that had been presented to the irrelevant ear while 
the memory set was being presented to the relevant 
ear. The Ss were instructed to press the no button 
in response to these words. Twenty-five per cent of 
the probes were of this type. They were drawn 
from each serial position of the irrelevant word set 
equally often. (c) Negative extralist probes were 
words that had never been presented to either ear. 
They were always treated as no words. The Ss 
were made fully aware of the nature of the no words 
and were especially encouraged to block out the 
irrelevant ear in the dichotic condition. 

In the fast condition, the memory set, warning 
nd probe words were presented at the rate 
In the slow condition, the 
rate was 1 item per second. Each S served for 1 
practice day and 4 test days. Each test day con- 
sisted of a monaural tape and a dichotic tape at one 
of the 2 rates. Each tape consisted of 4 practice 
trials followed by 80 test trials (40 yes, 40 no) pre- 
sented in a random order. There were a total of 
8 tapes, all dichotic. The monaural conditions were 
created by turning off the switch for the irrelevant 
channel. Half of the Ss heard each tape as a 
monaural tape, and half heard each as a dichotic 
tape. Assignment of response type to button and 
order of serving in each Mode X Rate combination 
was balanced across Ss. 

Results. The error rates are shown in Figure 1. 
In the monaural condition, error rates for both yes 
and no responses are a relatively low 4% (Figure 1, 
Section a). In the dichotic condition, the error rate 
for the yes and external no responses does not differ 
from that found in the monaural condition. How- 
ever, when the probe was a word that had been 
presented in the unattended ear (internal probe), the 
error rate was very high, jumping to 20%. Most of 
these errors were due to S's failure to make a 
correct rejection—i.e., a false positive response was 
made (Figure 1, Section b). In Figure 1 Section c, 
these internal errors are plotted as a function of their 
serial position in the unattended ear. Most of the 
errors occurred at the middle 2 positions. 

All RT analyses were based on correct responses 
only. Figure 2 shows mean RT for dichotic and 
monaural conditions as a function of serial position. 
Dichotic trials were on the average 42 msec. slower 
than monaural trials, F (1, 7) = 5.79, p < .05. 
There was also a significant effect of serial position, 
F (3, 21) = 4.48, p < .05. For both monaural and 
dichotic conditions, the serial position curve is 
bowed. The shortest RTs were for probes of the last 
position, followed by probes of the second-to-last 
position. 

A repeated measures analysis for the RT data on 
negative trials revealed a significant effect of mon- 
aural vs. dichotic conditions, F (1, 7) = 5.76, 
p < .05, and no other significant effects. 

Discussion. This study was conducted to deter- 
mine whether unattended words are placed in the 


signals, 
of 2 items per second. 


STM between attended and unattended inputs. If 
both materials were simply placed in a common 
store, with S being required to scan all the items in 
order to reach a decision about the probe, all probes 
from the unattended channel would have been 
treated as positive items. This would have yielded 
an error rate of 100% for intralist negative probes. 
However, the error rate found was 20%. Second, 
given Sternberg's (1966) estimate of 38 msec. to scan 
an item in memory, adding 4 items in the dichotic 
condition would have increased RT by about 150 
msec. Instead, an increase of approximately 40 
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Why, then, is there such a high error rate for 
intralist negative probes? One possibility is tha 
although the items in the 2 channels are differential! ù 
tagged according to ear of input, there is a fadi: 
o° the tag information with time. Consequently, 
when a probe is presented which was an item in the 
unattended channel, Ss will respond yes if the item 
is still available in memory but the tag information 
has faded away. However, this hypothesis would 
have predicted fewer errors with a fast presentation 
rate, since there is less time for fading. Yet no effect 
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of rate was found, Second, we would have expected 
the greatest number of errors when the probe was 
an item from the beginning of the list, with the 
number declining as the end of the list is ap- 
proached. Instead, a bow-shaped serial position 
curve was found for errors, with the maximum 
number of errors occurring in the middle 2 positions. 

A second possibility is that all presented items are 
placed in a common store, but that due to attenua- 
tion of the unattended input (Treisman, 1964), the 
trace strength of their memorial representations is 
very weak. We must, therefore, assume that the 
decision as to whether the probe was presented in 
the memory set is not based upon a serial scan but 
instead depends on trace strength. If the trace 
strength is not fixed, but varies about some mean, 
then on some trials it will be sufficiently high for Ss 
to make a positive response to the probe. To ac- 
count for the data, we must assume that trace 
strength for unattended materials does not vary 
greatly over serial position and is not sensitive to 
the different rate presentations employed here. 
This would account for the higher error rate for 
intralist than extralist negative probes, To explain 
the overall latency increase in the dichotic condition, 
we must postulate that the addition of these low- 
strength. traces in memory adds noise, thereby 
causing a decrease in efficiency. 

Finally, a different mechanism which could handle 
these data is one in which an attention switch to the 
unattended channel occurs during the list presenta- 
tions, thereby allowing for one of the unattended 
items to be stored with the attended items. Time of 
occurrence for such a switch would be random, but 
would have increased likelihood of occurring in the 
middle of the list, If such a switch occurs once, 
permitting one additional item to enter the attended 
store, Ss in the dichotic condition would have one 
more item to scan. The value of the increase in RT 
of 42 msec. found in the dichotic condition is just 
what would be expected for the scanning of an 
additional item. Further, if one item from the list 
of 4 does get tagged as an attended item, an error 
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rate of 25% would be expected—a value which is 
very close to the 20% which was in fact found. 

It isimportant to note that if this attention switch- 
ing did take place, it must have occurred involun- 
tarily, for Ss were instructed to ignore the material in 
the irrelevant ear, and there was no reason for S to 
sample the unattended input in order to perform 
the task. On the contrary, his performance would 
be better if this switching did not occur at all. 

Such an interpretation has several difficulties. 
It makes predictions opposite to the findings of 
Davis and Smith (1972), for such a mechanism must 
postulate a change in serial position data for the 
attended input. Second, it would bring into ques- 
tion the hypothesis of STM for unattended verbal 
items: it would suggest that items in the unattended 
channel reach STM only when an involuntary at- 
tention switch occurs, allowing them to pass into 
STM. This would seem unlikely, in view of the 
many recent studies in which evidence for STM 
for unattended materials has been found. 
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A set of target words from the same category, embedded in lists also contain- 
ing other items, was repeated across all trials for free recall of all items pre- 
sented on each trial. Retrieval of the repeated items from long-term storage 


was less when other items, from a different categc 
rather than also repeated across trials; and retrie 
other new items were from the same category as the repeated items. 
term storage of the items in the repeated set w: 


were new on each trial 
was still less when the 
Long- 
equivalent in all 3 conditions, 


confirming that retrieval of items within a list from long-term storage is 
affected by the context provided by the rest of the list, 


It has been pointed out (e.g, Bower, 1972; 
Kintsch, 1970) that the properties of a list of items 
as a whole affect the learning of individual items in 
the list. This view can be examined by presenting 
a set of words as part of larger lists containing other 
words, so that the target words would be learned 
in different contexts, i.e., as parts of different lists. 
Accordingly, half of the items in a multitrial free- 
recall experiment were repeated from trial to trial, 
while the rest of the items were new on each trial. 
Since Ss were required to recall all of the items 
presented on any given trial, their task was to learn 
the repeated items across trials, while also recalling 
the new items for only 1 trial. 

The set of repeated items were all from the same 
category (BIRDS), the new items being drawn from 
the same category (BIRDS) for one group of Ss and 
from a different category (TREES) for another group 
of Ss. In addition, a third group of Ss was given 
only repeated items, so that all items could be 

learned over trials; half of the items were the re- 
peated BIRDS given to the other 2 groups and the 
rest were a set of TREES also repeated on all trials. 
The context in which the repeated subset of items 
(BIRDS) was presented to all 3 groups, therefore, 
was varied in 2 ways: (a) the balance of the items 
were from either the same or different categories, 
and (b) with same-category items, the context was 
nonrepeated, whereas with different-category items, 
the context was either repeated or nonrepeated, 
Finally, all 3 experimental groups were given an 
additional trial in which the repeated BIRDS were 
presented by themselves for free recall, and a con- 
trol group was also given the repeated BIRDS alone, 
but without any previous trials. 
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Method. Eight successive lists of 16 items cach 
were read aloud at the rate of 1 item/sec for ‘ree 
recall of each list immediately after presentation. 
There were 3 conditions, and the same 8 BIRDS 
were presented on all trials in each. On each trial, 
the repeated BIRDS were alternated with the other 
8 items to distribute the 2 classes of items evenly | 
over the list. From trial to trial, the order of the 
repeated BIRDS was randomized over their designated 
serial positions. In Condition B-b, the 8 repeated 
BIRDS alternated with the names of 8 BIRDS which 
were new on each trial, In Condition B-t, the 
repeated BIRDS alternated with a new set of TREES 
on each trial. In Condition B-T, the repeated 
BIRDS alternated with 8 TREES, which were repeated 
across trials. These repeated TREES were ran- 
domized within their designated serial positions 
across trials, 

On each free-recall trial, Ss wrote the items that 
were recalled on a separate sheet of paper. The Ss 
were told about the structure and content of their 
lists, and were given 90 sec. for recall after each list 
was read. The 8 repeated BIRDS were presented 
alone on Trial 9 to all experimental Ss and for only 
1 trial to 20 control Ss, who recalled them for the 
first time. 

The Ss were undergraduate students between the 
ages of 19 and 25 enrolled in psychology courses at 
Lehman and Brooklyn Colleges. A total of 80 Ss 
participated in the experiment, 20 Ss being randomly: 
assigned to each of the 3 experimental conditions 
and to the control group. 

The relative contributions of short-term storage 
(STS) and long-term storage (LTS) to total recall 
were estimated by the method of Tulving and 
Colotla (1970). Recall Írom short-term storage 
includes those items with 7 or less other items 
intervening between presentation and recall, count- 
ing both presented and recalled items, Retrieval 
from LTS includes those items with more than 7 
intervening items between presentation and recall. 
The number of items retained in LTS is the cumula- 
tive number of items retrieved from LTS at least 
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Ficure |. The left panel presents the number of repeated BIRDS recalled when 
presented for recall together with new BIRDS, new TREES, and repeated TREES, with 


repeated TRE 


once, on the assumption that once an item has been 
retrieved from LTS, it remains in LTS, In support 
of this assumption, recent work has shown that 
when items on a list are no longer presented after 
they have been recalled just once (i.e., only items 
on the list never recalled are presented on subse- 
quent trials), 85% will be spontaneously retrieved 
subsequently, even after several intervening trials 
where they have not been recalled (Buschke, in 


press). 
Results, The left panel of Figure 1 shows the 
recall across trials of the repeated BIRDS embedded 


in the lists of the 3 conditions, and the right panel 
shows the recall of the rest of the items presented 
for recall on each trial in each of the 3 conditions. 
Recall of the repeated BIRDS increased most over 
trials (sum of Trials 1-4 sum of Trials 5-8) in 
Condition B-T, ¢ (19) = 4.98, p « .001; increased 
moderately in Condition B-t, ¢ (19) 52, 
p <.01; but did not increase at all in Condition 
B-b. There was a significant effect of conditions 
on the recall of the repeated BIRDS on Trial 8, 
F (2, 57) = 3.40, p < .05. The Ss in Condition 
B-t had a greater overall recall of the repeated than 
of the new items, ¢ (19) = 4.58, p < .001, whereas 
Ss in Condition B-b did not differ in their overall 
recall of the repeated vs. the new items. 

Evidence that the failure of the B-b group to 
increase in recall of the repeated BIRDS across trials 
was due to difficulty in retrieval from, rather than 
storage in, LTS was obtained on Trial 9, where the 
repeated BIRDS were presented alone to all groups. 
While there was no significant difference in recall 
among the 3 experimental conditions, recall by the 
control group, who were recalling the repeated 
BIRDS for the first time, was significantly lower, 
F (2, 76) = 8.54, p < .01. If the B-b group had 
not benefited from exposure to the repeated items 
during Trials 1-8, then they should not have done 
better on Trial 9 than the control group did on their 
single trial. Any benefit from previous trials pre- 
sumably must be mediated by LTS. 


ts for Trial 9 representing recall of repeated BIRDS when presented by them- 
; the right panel represents concurrent recall of new BIRDS, new TREES, and 


To further substantiate the contention that the 
lower recall scores of the B-b group were due to less 
effective retrieval from LTS rather than less effec- 
tive storage in LTS, estimates of retention in LTS, 
retrieval from LTS and recall from STS across 
trials are presented separately in Figure 2 for the 
repeated BIRDS for the 3 conditions. Recall from . 
STS remained constant across trials in all.3 con- 
ditions, and analy of variance indicated no 
nificant differences among the conditions on 


Trial 8. Long-term storage clearly increased across 
trials for all 3 conditions, with no significant dif- 
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storage STR) for the repeated preps in Conditions B-b, B-t, 
and B-T. 
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ferences among the conditions on Trial 8. Retrieval 
from LTS, however, was significantly different 
among the conditions on Trial 8, F (2, 57) = 3,82, 
p <.05. This indicates that retrieval from LTS 
accounted for the differences in recall among the 
conditions shown in the right panel of Figure 1. 


DISCUSSION 


The results indicate that the context in which the 
repeated BIRDS were embedded affected their recall. 
When the rest of the items were from a different 
category (TREES), changing the TREES to be recalled 
on each trial resulted in somewhat poorer recall of 
the repeated sirs than when the TREES also were 
repeated across trials (Condition B-t vs. Condition 
B-T). Recall of the repeated BrRDS was markedly 
reduced when the balance of the items, which 
changed from trial to trial, were from the same 
category (Condition B-b). In contrast to the other 
2 conditions, Condition B-b showed no increase at 
all across trials in the recall of the repeated BIRDS, 
even though the repeated BIRDS were presented an 
equal number of times to each group and in identical 
serial positions (Figure 1). 

The analysis of LTS, retrieval from LTS, and re- 
call from STS (Figure 2) yielded the surprising result 
that the differences in recall of the repeated BIRDS 

» among the experimental conditions were not due to 
differences in recall from STS of the repeated BIRDS. 
We had expected that STS would have been needed 
for the new items in Conditions B-b and B-t 
compared to Condition B-T, where Ss were not 
confronted with new items on each trial. Even 
though the new items could have been discriminated 
for preferential STS in Condition B-t on the basis 
of category membership, recall from STS of the 
repeated items was the same as in Condition B-b, 
where new items could not be distinguished on the 
basis of category membership, Long-term storage 
of the repeated BIRDS also was the same in all 
conditions. Since LTS and recall from STS of 
the repeated items were identical in all conditions, 
the differences in recall of these items must be 
accounted for solely on the basis of differences in 
retrieval from LTS. 

The differences in retrieval from LTS of the 
repeated BIRDS among conditions reflect the dif- 
ferences in the total composition of the lists pre- 
sented in the 3 conditions. In Condition B-T, 
where all of the items were the same from trial to 
trial, Ss could learn the items as one coherent list. 
In Condition B-t, the items could not be treated as 
part of one coherent list, because half of the list 
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changed on cach trial. Vincent cur dicated 
that as the trials progressed, Ss in ( on Bet 
reported an increasingly “x prop of the 
new items in the first half of recall and peated 
items in the second half of recall. It s that: 
Ss could treat the entire list (B-t) as 2 sts of 
repeated (B) and new (t) items. The vewhat 
poorer retrieval of the repeated BIRDS fr: TS in 
Condition B-t compared to Conditio: may, 


have been related to the extra proces 
in constructing 2 lists, 

In Condition B-b, where the new ite: 
trial were from the same category as tht 
items, the possibility of constructing 2 
impeded by the difficulty in discrimina 
repeated from the new items, The Vincen 
also indicated that Ss in Condition B-b h 
culty in discriminating repeated and new ite: 
the same category beca while they reported 
more new items in the first half of recall and nore 
of the repeated items in the second half of call, 
this was less successful and did occur not until later 
trials than for Ss in Condition B-t. The items in 
Condition B-b could neither be treated as a single 
coherent list nor as 2 distinct sublists. The S» in 
Condition B-b in effect faced an entirely new /ist 
on each trial (even though half of the items were the 
same on each trial). This is reflected by the failure 
of the recall of the repeated items to increase across 


Taken together, the results of this experiment 
confirm that the context provided by a list as a 
whole has a significant effect on the recall of indi- 
vidual items in that list, Recall of the repeated 
items did not increase over trials when other items 
from the same category changed from trial to trial. 
These results also indicate that the differences in 
recall among the 3 conditions were due to differences 
in retrieval of the repeated items from LTS, rather 
than to differences in storage and retention of the 
repeated items in LTS. Retrieval from LTS appears 
to depend on the properties of the total list within 

which the items are presented. ' 


urves 
dif- 
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THE EFFECT OF NUMBER OF STIMULI ON THE STEADY-STATE GENERALIZATION 
GRADIENT: 


A TEST OF THE SUM 


MATION HYPOTHESIS ! 


JOHN C. MALONE, JR? 


University of Tennessee, Knoxville 


The interacting-gradient model for d. 


iscrimination learning predicts changes 


in the slope of the generalization gradient as a function of the number of 
stimuli comprising it. Attempts to verify this prediction, using the method 


of extinction testing after reinforced 


In this experiment, three pigeons prod 
dients over a prolonged training period. 


pretraining, have proven inconclusive. 
ced steady-state generalization gra- 
The number of stimuli forming the 


gradient was varied, while the temporal appearance of stimuli and the pattern 


of reinforcement availability remaine 


d constant. Variations in the number 


of stimuli present did not affect the slope of the gradient. 


An intuitively reasonable way to view discrimina- 
tion learning is in terms of excitatory and inhibitory 
response tendencies; such a position was taken by 
Pavlov (1927) and was adopted in various forms by 
others. Probably the clearest formulation of this 
view was that proposed by Spence (e.g., 1937). 
According to this interpretation, discrimination 
learning is a secondary process, dependent upon the 
formation and interaction of generalization gradients 
of excitation and inhibition. During the formation 
of a discrimination between two stimuli on the same 
dimension (e.g., size, hue), a gradient of excitatory 
response tendency is presumed generated around 
S+, as the result of reinforcement (e.g., food) re- 
ceived for responding in its presence, while lack of 
reinforcement for responding in S— results in an 
inhibitory gradient around it. The height of these 
gradients is presumed to decrease with distance from 
S+ and S—, A simple algebraic summation of such 
gradients determines the tendency to respond to 
S+, S—, and to other stimuli which lie within the 
range of generalization of either. 

Despite a good deal of criticism, this model has 
remained the most popular general account for dis- 
crimination learning. In theory, it can account for 
the learning of a discrimination, for transposition 
and transposition reversal, and for most of the 
features of the shifted peak of the postdiscrimination 
generalization gradient ,(Hearst, 1968; Hilgard & 
Bower, 1966). The model has been limited in useful- 
ness, however, due to difficulty in empirically verify- 
ing its predictions. 

These difficulties must be due in large part to the 
extreme variability of empirical generalization 
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gradients obtained with present methods, i.e., in the 
manner popularized by Guttman and Kalish (1956). 
Animals are usually trained in the presence of one or 
more stimuli, under conditions of intermittent re- 
inforcement, so that resistance to extinction is de- 
veloped. Transfer tests are then carried out in 
extinction; reinforcement is discontinued and the 
animal is presented a number of stimuli varying 
along some dimension of the training stimulus. 

Two characteristics of empirical gradients which 
seem essential if they are to be used to explain the 
phenomena of discrimination learning are stability 
and recoverability. Gradients obtained in extinction 
are variable, both among and within subjects (cf. 
Blough, 1965). Such gradients are not recoverable 
if testing is repeated after additional single-stimulus 
training (Zeiler, 1969). The form of these gradients 
is influenced by a multitude of factors, such as the 
schedule of reinforcement used during training 
(Hearst, Koresko, & Poppen, 1964) and the length 
of the transfer test in extinction (e.g., Guttman & 
Kalish, 1956). 

Several investigators (e.g., Blough, 1965; Zeiler, 
1969) have suggested that gradients be obtained in 
the steady state, i.e., with reinforcement main- 
tained in S+ and with presentation of S+ and 
“test” stimuli over many sessions. There are several 
advantages in following this procedure, Steady- 
state gradients reach quite stable asymptotes (e. 
Malone & Staddon, 1973); they are relatively inde- 
pendent (at asymptote) of the reinforcement 
schedule in force in S+ (Zeiler, 1969) and, perhaps 
most important, they are recoverable after inter- 
polated training conditions (Zeiler, 1969). 

The interacting-gradient model predicts summa- 
tion of excitatory and inhibitory response tendencies 
which should affect the slope of the generalization 
gradient obtained during extinction testing, as well 
as during steady-state training. Excitatory gradients 
summating around reinforced stimuli should pro- 
duce increased overall responding and a shallower 
gradient. The addition of nonreinforced stimuli 
should act to lower overall responding and to in- 
crease the slope of the gradient due to the addition 
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of inhibitory gradients around these stimuli. Evi- 
dence for summation of this sort is equivocal; Kalish 
and Guttman (e.g., 1957), using the method of 
extinction testing after reinforced pretraining, did 
not find convincing evidence for summation of 
excitatory gradients. ^ Others, using the same 
method, have found evidence for excitatory-in- 
hibitory summation following pretraining with re- 
inforced and nonreinforced stimuli; the gradient 
obtained in extinction was shifted toward S+, as 
would be expected if an inhibitory gradient around 
S- is subtracted from an excitatory gradient cen- 
tered around S— (e.g., Hearst, 1968). 

Changes in the number of test stimuli presented 
during extinction should similarly affect gradient 
form, due to the summation of inhibitory gradients 
around these stimuli; such effects have been re- 
ported (e.g, Marsh, 1967), but in other cases no 
effect appears (e.g, Guttman & Kalish, 1956). 
Others, using steady-state procedures, have varied 
the number of stimuli comprising the gradient (e.g., 
Pierrel & Sherman, 1962). However, these studies 
were not attempts to definitively test the interacting- 
gradient model; in addition, their procedures did not 
ensure that factors other than the number of stimuli 
comprising the gradient remained constant. 

The interacting-gradient model is influential 
enough to warrant testing using the best means 
available; this experiment varies the number of 
stimuli comprising a steady-state gradient, while 
holding other factors constant. 

Method. Three white Carneaux pigeons were 
maintained at 80% of their free-feeding weights 
throughout the experiment. One bird (119) was ex- 
perimentally naive; the other two birds (48 and 72) 
were experienced with multiple fixed-interval 
schedules (in which key colors served as discrimi- 
native stimuli). 

The experimental chamber was a 33 X 33 X 33 
cm. Plexiglas box with a 1.9-cm. translucent key 
mounted on one wall, A line orientation produced 

by a Tektronix (Model 503) oscilloscope was 
focused onto the key by a small lens (focal length 
1 in.) mounted between the oscilloscope face and 
the key. The image on the key was a line orienta- 
tion approximately 1.3 X .5 cm., always presented 
on a blank background. A 24-v. battery connected 
to the horizontal input of the oscilloscope served to 
eradicate the trace left on the oscilloscope face be- 
tween stimulus changes and. during reinforcement. 
A 25-w. red lamp was mounted approximately 8 cm. 
behind the response key; when lit, the lamp trans- 
illuminated the key, providing a red stimulus which 
could be presented instead of the line-orientation 
stimuli provided by the oscilloscope. The animal 
chamber and # of the oscilloscope were enclosed by 
a larger box (51 X 51 X 122 cm.) comprised of 
hollow 5-cm. thick walls filled with soundproofing 
materials. Reinforcement (3-sec. access to mixed 
grain) was provided by a food magazine; the 5-em.2 
aperture was located 5 cm. below the key. The key 
display was extinguished during reinforcement but 
the 15-w, fluorescent houselight remained lit. White 
noise and a ventilation fan masked extraneous 
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noises; programming and recording eq nt were = 
located in an adjacent room. 

A daily session comprised 90 1-m 
presentations; the sequence of presen 
sisted of 10 blocks of nine present 
Each block included three presentati 
(90°; vertical line) and one presentation 
six other orientations (66°, 54°, 42°, 30 
6° from the horizontal), randomized w 
During S+ (90°) a variable-interv: 
reinforcement schedule was in effect; VI 
in effect at all other times. The selectio: 
reinforcement intervals for both schedules 
on the progression suggested by Fleshler : 
man (1962); this was done to ensure steady 
rates, by making reinforcement availability 
predictable as possible. 

The sequence of stimulus presentations was fixed 
throughout the experiment; although seven orienta- 
tion stimuli were included in the sequence, a i 
mum of four appeared during any one conc 
Orientations which were not scheduled to appear 
during a given condition were replaced by presenta- 
tions of a red key light; VI 3-min. remained in 
effect during this stimulus. In this way, the tem- 
poral appearance of stimuli and the pattern of re- 
inforcement availability during the session remained 
constant over conditions; only the presence or ab- 
sence of orientation stimuli was varied. 

To facilitate control by stimuli on the key, "no- 
stimulus” periods were inserted at the end of odd- 
numbered blocks of stimuli, During these periods 
no stimulus was present on the key and reinforce- 
ment was never obtained. The length of the period 
was determined by a timer which reset with each 
key peck, prolonging the period. The timer was, 
initially set at 5 sec.; with improved performance 
(absence of key pecking), the setting was increased 
to 20 sec. 

Each bird received preliminary magazine and key 
training in the presence of S-- (909). 'This was 
followed by a period of initial training consisting of 
11 sessions of 30 1-min. presentations of S-+, rein- 
forced according to VI 30 sec., alternated with 30. 
"no-stimulus" periods. 

The experiment comprised the following three 
conditions: (a) S + (90°) vs. 6°; (b) S + (90°) vs. 
30°, 6°; and (c) S + (90°) vs. 54°, 30°, 6°. Each 
bird was trained on each condition and returned to 
the original condition, as a test for recoverability. 
The order of conditions and number of sessions for 
each bird appear in Table 1. 

: Results. Withina week after the onset of training, 
virtually no responding occurred during "no- 
stimulus” periods. Extended training was carried 
out in order to assess any changes in gradient form 
due to length of training; little change occurred after 
the first few sessions, 

, There was no change in gradient form as a func- 
tion of the number of orientation stimuli present. 
Table 2 shows average response rates per minute 
and standard deviations during the last four days 
of each condition for individual birds. Gradients 
Were extremely stable over these periods, as they 
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TABLE 1 100 
SEQUENCE OF CONDITIONS AND NUMBER OF 
SESSIONS OF TRAINING FOR Each BIRD BIRD 72. BIRD 48 
Bird 72 Bird 48 Bird 119 po 
z o. 
CO | Sessions | COMI | Sessions | Conde | See — —— 60 .. 
——— W 4 4 
a 3t c 36 a 38 g 1 
b 22 a 37 b 27 $ " e 
c 16 b 5 c 11 & 40 2 o4 - 
a 15 c 5 a 5 a dr 
ETD TSR Ca 3 
o1 
: " ^ ad H x ag 90*,6* 
were during the entire period of training. Bird 72's 9053076*. | 
gradients were virtually indistinguishable over 69 4—a 90*,54*,50*,6 
sessions (Conditions a, b, and c), in spite of changes (lee el IU 
in the number of orientation stimuli present. Eo) PEERS) CONTE) Eek Ace SERER, 
Similarly, gradients for Bird 48 varied little across ORIENTATION (deg) 
the first three conditions. However, attempts to Ficure 1. Least squares approximations of data averaged 


recover the performance shown in the original con- 
dition led to an increase in absolute response rates 
in all stimuli for Bird 72, and an increase in 54° for 
Bird 48. 

Bird 119 responded at extremely high rates 
throughout the experiment. During the first two 
conditions (65 sessions), 119's gradients remained 
approximately flat (Table 2), reflecting a lack of 
control by orientation stimuli. During Condition 
c, the gradient steepened, an effect which increased 
with the shift back to the initial Condition a. Dur- 
ing a subsequent experiment, this animal showed a 
similar insensitivity to line-orientation stimuli; thus, 
the possibility that the lack of previous discrimina- 
tion experience was responsible for his performance is 
unlikely. His data will not be discussed further. 

The effect of alterations in the number of orienta- 
tion stimuli on the slope of the gradients appears in 
Figure 1. The curves in this figure represent linear 
least squares approximations of the data in Table 1. 
Gradient slope remained virtually unchanged over 
conditions. 

Discussion. The interacting-gradient model 
predicts that the slope of the steady-state generali- 
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over the last four days of each condition for Birds 72 and 48 
(see Table 1). (Gradients are labeled according to the order in 
which conditions appeared in training for each bird, rather than 
according to the number of stimuli appearing in that condition.) 


zation gradient will change with variation in the 
number of stimuli comprising it. However, the 
model may be interpreted to predict two outcomes 
for the present experiment. 

Since responding was reinforced in all stimuli, 
though stimuli other than S+ were reinforced at a 
lower rate, the model would predict that excitatory 
gradients would be generated around all stimuli. 
Response rate to a particular stimulus depends upon 
the excitatory gradient associated with it, as well as 
upon summation of this gradient with gradients 
centered around other stimuli on the same dimension 
(cf. Spence, 1937). Thus, when the number of 
orientation stimuli forming the gradient is increased, 
the gradient should flatten and overall responding 
in all stimuli should increase, due to the summation 
of additional excitatory gradients. 

An alternative prediction comes from Guttman's 
(1959) finding that a stimulus associated with 
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MEAN RESPONSE RATES/MINUTE DURING THE Last Four Days or EACH CONDITION 


90° 54° 30° e Red 
Condition || ——— — -|- — a bei m 
M SD M SD M SD M SD M SD 
Bird 72 See Deme a n 
a 62.0 4.5 41.1 3 23 24 
b 65.8 4,7 41.1 4.8 42.9 1.3 33,4 2.0 
€ 64.4 4.3 52.3 4.9 40.9 2.5 40.5 3.1 32.4 2.6 
a 70.1 6.9 52.0 48 37.8 3.5 
Bird 48 

[4 69.8 5.0 64.6 79 518 5.4 51.8 30.9 

a 70.6 4.2 48.5 41.4 

b 58.3 3.4 50.6 5.1 37.7 39.8 

€ 62.0 3.6 70.5 49 57.3 44 46,4 36.4 

Bird 119 

a 160.7 4.8 141.4 96.7 74 
153.6 17.1 146.4 10.2 139.6 71.5 47 
€ 152.6 6.4 151.4 14.6 127.3 15.7 112.5 82.4 10.6 
a 161,1 6.5 78.1 85.4 7.5 
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relatively less frequent reinforcement can become 
“functionally” inhibitory. Thus, the gradients 
centered around orientation stimuli other than S+ 
might be considered inhibitory, rather than excita- 
tory. The summation of such gradients with an 
excitatory gradient centered around S+ predicts 
that an increase in the number of orientation stimuli 
should cause the gradient to steepen and response 
rates in all stimuli to decrease. 

Neither of these effects occurred; an increase or 
decrease in the number of orientation stimuli pro- 
duced no discernible change in gradient slope or in 
absolute response rates. The only major change in 
gradient slope during the experiment appeared in the 
data of Bird 119; this change cannot be attributed to 
changes in the number of stimuli forming the 
gradient. 

‘The procedure used here seems optimal for testing 
the interacting-gradient model ; predictions made by 
either interpretation of the model regarding changes 
in gradient slope as a function of variations in the 
number of stimuli present were not confirmed. 
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The time course of recovery from 
2 X 4 completely randomized factor 
of habituation were 2 or 5 successive 
intervals were 1, 2, 4, or 6 min. 
increa 


criteria of habituation did not produce significantl 
covery. When both criterion groups were combined 
the median percentages of recovery for the 1- 
17%, 55%, 81%, and 84%, respectively. 


The term spontaneous recovery refers to the re- 
sponse increment that occurs after a “rest” period 


1 The results of this research were presented at the meeti 
of the Rocky Mountain Psychological Association, Albus 
querque, May 1972. 
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2 levels of habituation was examined. In a 
ial design, with 24 Ss per cell, the criteria 
trials without a response, and the recovery 


. Recovery of the orienting response was an 
ing negatively accelerated function of the ri 
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ecovery interval. The 2 
y different curves of re- 
at each recovery interval, 


4-, and 6-min, intervals were 


which follows either extinction of a learned response 
Or habituation of an orienting response (OR), © 
Despite the fact that spontaneous recovery of a 
habituated OR is considered a major parametric. 
feature of habituation (Thompson & Spencer, 1966), — 


little systematic research with human Ss has been 4 
reported. F 
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‘The purpose of the present study was to investi- 
gate the spontaneous recovery of the galvanic skin 
response (GSR) component of the OR. Specifically, 
an attempt was made to examine the time course of 
recovery from 2 levels of habituation. 

Method. The Ss were 192 male and female 
students enrolled in a general psychology course. 
Participation in the experiment partially fulfilled 
a course requirement. 

The apparatus consisted of a Hunter GSR ampli- 
fier, modified to detect resistances from 0 to 150,000 
ohms. Fels 20-mm.-diam. zinc electrode cups, filled 
with Fels electrode jelly, were used. Tones from a 
Grason-Stadler 950-D generator were presented 
through Roberts headphones. Stimulus durations 
were timed by a Huhter interval timer, The inter- 
trial interval (ITI) was timed by E with a Standard 
timer, and the recovery interval was timed with a 
stopwatch. 

The electrodes were attached to the palm and 
back of S's right hand. The S was told that he/she 
would hear several short tones through the head- 
phone and that he/she was to relax and remain as 
still as possible. The Æ next assigned each S ran- 
domly to 1 of 8 groups based on a completely 
randomized factorial arrangement of 2 habituation 
levels and 4 recovery intervals, The 2 habituation 
levels were 2 and 5 successive trials without a 
response, as defined later. The 4 recovery intervals 
were 1, 2, 4, and 6 min. following the final trial of 
habituation. These intervals were defined as the 
time between the onsets of the final habituation 
stimulus and the test stimulus, 

After the baseline resistance became relatively 
stable, 1,000-Hz. tones of 2-sec. duration were pre- 
sented at random intervals of 30, 40, or 50 sec. 
(M = 40 sec.). The tonal intensity was 62 d 
measured at the headphone. Stimulus presenta- 
tions continued until the appropriate criterion of 
habituation was met. On the test trial, the measure 
of recovery was the GSR magnitude elicited by the 
same standard stimulus. 

A response was defined as a decrease in skin 
resistance of at least 200 ohms occurring 1-5 sec. 
after stimulus onset. These defined changes in 
skin resistance were transformed to log conductance 
change scores. A base-level resistance score was 


the skin resistance in ohms recorded at stimulus 
onset, 
Resulls. On the first trial of habituation, the 8 


groups did not differ in either mean GSR magnitude 
(F <1) or base-level resistance (F <1). On the 
final trial of habituation, the groups did not differ 
in base-level resistance, F (7, 184) = 1.16, p > .20. 

Inter-S variability in rate of habituation was 
high. For the 2- and 5-trial groups, the ranges of 
trials required to meet the criterion of habituation 
were 3-30 and 6-27, respectively. The 2-trial 
groups averaged 7.29 trials to meet criterion; the 
5-trial groups required a mean of 10.66 trials. Trials 
to criterion of habituation did not differ significantly 
within the 2-trial (F < 1) or the 5-trial, F (3, 92) 
= 1.58, p > .10, groups. 
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Figure 1. Median galvanic skin response magnitude across the 
recovery interval as a function of 2 criteria of habituation. 


In Figure 1, median GSR magnitudes for the 2 
criteria of habituation are plotted across the re- 
covery intervals. Each data point represents a 
single group’s median GSR magnitude, Because of 
several zero scores, with the resulting deviance from 
normality, nonparametric stati: s were used for 
data analysis. By Kruskal-Wallis tests (corrected 
for ties), both 2-trial, /7 (3) = 9.88, p < .( and 
5-trial, H (3) = 29.31, p < .001, groups exhibited 
significant increases in response magnitude across 
the recovery interval. Although differential rates 
of recovery are suggested in Figure 1 by the ap- 
parent difference between the groups at the 1-min. 
interval, a Mann-Whit U test did not yield an 
acceptable level of significance (Z = 1.28, p = .10, 
1-tailed test). 

The percentage recovery was calculated for each S 
by dividing the GSR magnitude of the recovery test 
trial by the equivalent score for the first habituation 
trial. When the 2 criterion groups were then com- 
bined at each recovery interval, the median percent- 
ages of recovery for the 1-, 2-, 4-, and 6-min. intervals 
were 17%, 55%, 81%, and 84%, respectively. 

Discussion. The results suggest that the GSR 
component of the OR recovers a considerable por- 
tion of its prehabituation response magnitude, Fa 
recovery to the 2-min. posthabituation point 
followed by slower recovery to 6 min. This in- 
creasing negatively accelerated curve also describes 
recovery from habituation of the withdrawal and 
hooking response in earthworms (Gardner, 1968), 
the head-shake response in rats (Leibrecht & Askew, 
1969), and a variety of responses in several other 
species of vertebrates and invertebrates (cf, Hinde, 
1966). 

The 2 criteria of habituation used here did not 
produce significantly different curves of OR re- 
covery, a finding consistent with those of Gardner 
(1968). However, it is clear that the present study 
and that of Gardner can be compared only in 
general terms, because of the differences in response 
systems and species investigated. 
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The present results raise the question of how steeper recovery curves as a function of « V :bitua- 
many habituation trials beyond a certain point, a — tion trials, but may be limited to th: where 3 
zero response being an arbitrary point, are neces- relatively few habituation trials are ad tered, 7 
sary to produce differential recovery of the OR, If A steeper recovery curve for the groups th eived 
the ITI is thought of as one of the stimulus pa- the greater number of habituation trials i sested 
rameters, any lengthening of the ITI, such as occurs by the present data. 
during the recovery interval, would represent a bc j 
change in stimulus that may serve to evoke a REFERENCES 
generalized OR. Sokolov's (1960) neuronal model — G eps, p, E. Retention and overhabituati 

5 ituati edic a a point, component response in Lumbricus terrestris 
theory of habituation predicts that up to a point CR Physiological Easton | 
the greater the number of habituation trials, the HINDE, R. A. Animal behaviour: 4 syni 
e stablis cha comparative psychology, New Vork: M 

more completely established the model. A ch: ange ime Sua NEN Xok: Meor i 
in any stimulus parameter would then result in a shake response in the rat: Recovery, transier, » anges 

ii sti sa odel. in topography. Journal of Comparative and P. »gical 
greater mismatch between stimulus and a del HORT MA TR AR 
Since overhabituation necessarily involves addi- SokoLov, E. N. Neuronal models and the Srienti des 

i abi i la increased stimulu: In M, A. B. Brazier (Ed.), The centra nervous sy an ; 
tional habituation trials, the incre. ised stim: Us Behavior. New York: Jonah Meet TEL rows s M. 
novelty produced by the recovery interval may Tuowrsow, R, P. & Spence, W. A. Habituation: / model 
evoke a response increment that would tend to phenomenon for the study of neuronal substrates of b«!\.vior, 

A hol l Review, 1966, 73, 16-43. 
counteract the decremental effects of these addi- Se at rate he | 
tional habituation trials. This speculation predicts (Received April 21, 1973) | 
Journal oj Experimental Psychology 
1974, Vol. 102, No. 4, 734-736 
ASSOCIATIVE CONFUSIONS IN MENTAL ARITHMETIC ! 
JOHN H. WINKELMAN? AND JANET SCHMIDT 
University of Oregon 
The effects of associative interference between single-digit addition and multi- 
plication problems were investigated. Associative confusions (e.g, 4+5 
— 20) were about 50 msec. slower than comparable nonassociative confusions 
(e.g, 4 +5 = 15), and produced many more errors. The result indicates the 
presence of associative processes in simple mental arithmetic and casts doubt 


on the counting theory of addition. 


Previous work on single-digit addition problems 
has focused on reaction times for verifying sums as a 
function of the size of the addends (Parkman, 1971; 
Parkman & Groen, 1971). Parkman and Groen 
found a strong linear relationship between the size 
of the minimum of the two addends and the reaction 
time required for verifying the sum, They ac- 
counted for this finding by positing an unconscious 
counting process: first, the maximum of the addends 
is determined, and then it is incremented by ones 
for a number of times equal to the minimum addend. 
Since each increment presumably uses up the same 
amount of time, the decision time should be a linear 
function of the minimum of the two addends. 

The counting model is not the only model con- 
sistent with the linear increase of reaction time with 
minimum addend. It is possible that addition is 


1 This research was supported by the following grants: 
National Institute of Education rant 3-0097; National 
Institute of Mental Health Grant MH-17958-04; and Advanced 
Research Projects Agency Grant 50-262-5304. 

? Requests for reprints should be sent to John Winkelman, 
Department of Psychology, University of Oregon, Eugene, 
Oregon 97403, 


accomplished by an association between a digit 
pair (the addends) and the sum. As the size of the 
minimum digit increases, the strength of association 
between the digit pair and the sum decreases (be- 
cause larger sums are practiced less often), and hence 
the verification reaction time increases, The associa- 
tive model would predict a monotone increase of 
reaction time with minimum digit, rather than a 
linear increase; but most monotone functions can 
be fit fairly well by linear functions, so the Parkman 
and Groen (1971) result would be accounted for. 
The present study will not argue that associative 
processes are the only ones involved in addition and 
multiplication, but it will attempt to establish that 
associative processes do occur in addition and multi- 
plication. It is proposed that there exist associa- 
tions between pairs of digits and both their sums 
and products; for example, the digit pair (3, 3) 
would have associations with both 6 and 9. The 
obvious prediction is that stimuli of the type 3 +3 
z9and 3 X35 will produce a tendency to re- 
Spond yes because of associative interference, and 
hence that stimuli of this type will produce longer 
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reaction times and more errors than comparable 
stimuli not exhibiting interfering associations. 
thod, Six University of Oregon students served 
Ss, each for 1 hr. on three separate days. 
»e stimuli were constructed from the pairs (3, 3), 
(4, 3), (3, 5), (4, 5), and (5, 5). Each pair could 
serve as part of an addition or a multiplication 
stimulus, The stimulus answer (the right-hand side 
of the equation) was one of the following four types: 
(a) associative confusion items (e.g, 3 +5 = 15; 
3 8); (b) nonassociative confusion items, 
; from the set of possible answers to the other 
operation (eg. 3 +5 = 25, because 25 is in the 
€ answers to multiplication problems); (c) 
confusion items, chosen from the set of pos- 
answers for the actual operation (e.g., 3 +5 
10, because 10 is the answer to one of the addi- 
jon problems); or (d) the correct answer (e.g., 
3 +3 — 6). The complete set of stimuli is given 
in Table 1, Each number which occurs on the right 
hand side of an equation is the correct answer for 
some problem in the set. 

Subjects were run under three modes of presenta- 
iion. In the blocked mode, Ss received 30 addition 
problems, then a “ready” signal, and then 30 multi- 
ion problems; the 60 problems together con- 
stituted one block in the design. In the triad mode, 

Ss received three addition problems, then three 

ultplication problems, then three addition prob- 

‘lems, ete., for 60 trials; the 60 problems constituted 
a block. 


In the random mode, Ss received 30 addi- 

iow problems and 30 multiplication problems in 
random order, so that Ss did not know whether to 
expect an addition or a multiplication problem. In 
all three modes, the stimuli were selected by a 
random process, without replacement. 

Within each block, half of the stimuli were correct 
and half were incorrect. Each stimulus of the three 
confusion types occurred exactly once per block, 
and each correct stimulus occurred exactly three 
times per block, Subjects were given a practice 
block, which was not recorded, and then eight 
blocks of trials under a single mode within a f 
Mode of presentation was balanced against days in 
a Latin square design, so that each mode occurred 
an equal number of times on the first, second, and 
third days, and so that each S received each mode. 

A visual warning signal preceded each test stimu- 
lus by 500 msec. Subjects were required to push 
the left of two keys if the stimulus was correct, and 
the right key if it was not correct; the test stimulus 
remained on until the S made a response. Feedback 
was given after each response. 

The stimuli were exposed on a cathode ray tube, 
under control of the PDP-15 computer of the 
University of Oregon cognitive laboratory. 

Results. Only data from the three confusion types 
were analyzed; the stimuli for which the correct 
answer was yes did not bear on the hypothesis. 

The experimental design is given by SAMCB, 
where S = Subjects, A = Operation (addition or 
multiplication), M = Modeof presentation (blocked, 
triad, or random), C = Confusion type (associative, 
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TABLE 1 
STIMULI USED IN THE EXPERIMENT 


Associ- Nonas- 


s Nonas | Target | c. 
: ative sociative Correct. 
Operation confu- | confu- | Oniu- | stimuli 
sions sions mone 
343 9 n 7 6 
443 12 9 6 7 
345 15 25 10 8 
445 20 15 8 9 
545 25 20 9 10 
3x 6 7 12 9 
4X: 7 6 9 12 
3X5 8 10 25 15 
1X5 9 8 15 20 
5X5 10 9 20 25 


nonassociative, or target), and B = Block (eight 
levels). 

For the reaction time data, medians were taken 
over the five stimuli occurring within a block, con- 
fusion type, and level of A. Thus, each block 
yielded six medians. An analysis of variance, was 
performed on these medians, 

There was a significant effect of Confusion type, 
F (2, 10) = 12.96, p < .005, and of Block, F (7, 35) 
= 2.37, p < .05. Two interactions were signifi- 
cant: the Operation X Confusion type interaction, 
F (2, 10) = 18.27, p «.001, and the Mode of 
Presentation X Confusion type interaction, F (4, 20) 
= 420, p < .05. 

The Scheffé procedure (Meyers, 1966, p. 333) 
was used to test whether the differences between 
particular confusion types were significant. The 
comparison of chief interest was between associative 
and nonassociative confusions. An experiment-wide 
error rate of .10 was used, giving an F critical of 
5.48, The associative-nonassociative comparison 
gave an F of 25.90; the associative-target com- 
parison gave an F of 6.90; and the nonassociative- 
target comparison gave an F of 6.90. 

Associative confusions are consistently about 50 
msec, slower than nonassociative confusions, across 
both addition and multiplication, and in all three 
modes of presentation. Every S showed the effect. 
The error data show 221 errors for associative con- 
fusions, as opposed to 46 errors for the nonassocia- 
tive confusions. 

The main effect of Blocks was due to a fairly 
small but consistent decrease in reaction times over 
a practice effect. The difference between 
associative and nonassociative confusions did not 
show any consistent decrease over blocks. 

"The absence of a main effect for Mode of Presenta- 
tion is surprising: one would have expected the 
Blocked mode to be faster than the Random mode. 
The cell means show a tendency in this direction, 
but the differences are not significant (due to a 
large Subject X Mode of Presentation error term, 
which includes variation arising from the differing 
orders in which Ss received the Blocked, Triad, and 
Random conditions). The error data show the 
Blocked and Random modes to be approximately 
equal (mean errors = 3.06 and 3.17, respectively), 
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TABLE 2 
TABLE OF MEAN REACTION Times (1 MSEC.) 
Confusion type 
Item -— 
Nonassociative Target \ 

Operation H 1 
Addition 749.5(5.2) | 691.5(5.2) 113 
Multiplication | 742.1(5.2) | 739.9( | 722.4 
M | 745.5(7.7) | 692407.7) | 7122( | 

Mode of presentation | 
Blocked 709.1(4.3) 645.8(4.3) | | 687 
Triad ) | 700,8(4.3) | 736 
Random ) | 7 | 729.8 
A | | | 

Note. Estimates of standard error, derived from appropriate analysis of variance error term, ate included in parent) 
with the Triad mode producing somewhat more and more errors than nonassociative coni ionse 
errors than either (4.03). This result cannot be attributed solely to m: "ead 


The interaction of Operation with Confusion type 
results solely from the different behavior of Target 
confusions under addition and multiplication. The 
weak interaction between Mode of Presentation and 
Confusion type is also due to the behavior of Target 
confusions under the three modes of presentation, 

Discussion. The main result of interest is that 
associative confusions produce slower reaction times 


TABLE 3 
MEAN Errors PER SUR, 


Statistic 
ltem Eo uaa me 

M S En 

Mode of presentation QU Pi 
locke | Ses 
Triad | 30 
Random i0 

Confusion type | 

Associative 6.03 ET 
Nonassociative 128 24 
Target 2:94 38 


ings by because the effect occurs under all -hree 
modes of presentation: in both the blocke: anm 
triad modes the Ss knew which operation to epee 


in the next problem. 

The result forces us to reject an explanation. ol 
addition or multiplication based solely on intern 
computations since such a model would predict nog 
difference between associative and nonassociative” 
confusions. The result strongly suggests that the 
processes of addition and multiplication contain as- 
sociative components, and is consistent with the 
theory that only associative processes are 
in addition and multiplication. 


involved 
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CUE EFFECTIVENESS IN CUED RECALL ! 


MARION QUINN LEWIS? 


State University of New York College at Platisburgh 


Two experiments investigated effect 


blocked and unblocked category presentation. 


s of cues for categorized lists with both 


Category labels and extralist 


category items were effective recall cues for both blocked and unblocked 


category lists, 
recalled per category. 


With categorized lists, providing the category 
labels or items from the categories as cues has been 
found to facilitate recall if category members were 
presented contiguously (blocked), but cuing has not 
been found to increase recall if category items were 
mixed (unblocked) in the lists (Dong, 1972; Lewis, 
1971; Slamecka, 1968, 1972; Tulving & Pearlstone, 
1966). 

Facilitation of recall as a function of cuing has 
been interpreted as evidence that more information 
is available in memory than is immediately accessible 
for recall. Tulving and Pearlstone (1966) inter- 
preted recall of categorized lists as a product of two 
independent processes, recall of “higher order mem- 
ory units" (the categories) and the recall of items 
within the higher order units (category items). 
Category recall is seen as depending on the number 
of categories and retrieval cues, but recall of items 
within a category is apparently related only to 
whether the category itself is recalled and not to the 
experimental cues. Cuing increased the number of 
categories recalled, but did not affect the number of 
items recalled per category. 

The failure of cuing to facilitate recall with u 
blocked category presentation is puzzling becaus 
is not apparent that the two-level process described 
by Tulving and Pearlstone (1966) should be specific 
to blocked lists. 

It should be noted that most investigators report- 
ing no cued facilitation with unblocked lists used 
items from the list as cues (Lewis, 1971; Slamecka, 
1968). List-item cues do not appear to be without 
effect, as Lewis (1971) reported such cues increased 
However, when list items 
are used as cues, scoring of both cued and noncued 
conditions has been on the basis of "critical" words 
That is, cue words are chosen before the list is pre- 
sented, and the other list items are regarded as 
critical or target items. Thus, to the extent that Ss 
recall noncritical items which are not scored, total 
recall may be underestimated. It may be possible 
to devise scoring procedures or experimental manip- 
ulations which are unbiased in this respect. For 
example, this bias would not occur if cues were 


1The author thanks C. J. Wallace, Loyola University, and 
Earl E. Larre, St. Mary's Dominican College, for their coopera- 
tion in a preliminary study. An earlier version of this paper 
was presented at the meeting of the Psychonomic Society, St. 
Louis, November 1972. , f 

2 Requests for reprints should be addressed to Marion Quinn 
Lewis, State University of New York College, Plattsburgh, 
New York 12901. 


Cuing increased recall of categories but did not affect words 


category labels (e.g., Tulving & Pearlstone, 1966) 
or other exi t items. 

In the present investigation of cuing effects with 
blocked and unblocked category presentation, two 
types of cues were used: category labels and category 
members which were not presented as list items. 


EXPERIMENT I: CATEGORY LA Curs 


Method. Fight categorized lists were prepared. 
Each 42-word list consisted of six Battig and 
Montague (1969) categories, with seven words per 
category. Category items were the words ranked 
2-8 in the association norms, excluding obvious 
multiple-category items (for example, orange 
color and fruit). Categories were assigned to lis 
at random. Both a blocked and an unblocked ver- 
sion of each list were prepared. For the blocked 
version, all category members were presented con- 
tiguously ; category orders and within-category item 
orders w andomized, For the unblocked version, 
category items were mixed throughout the list with 
the restriction that no more than two items from the 
same category were presented contiguously; 

"There were thus four treatments representing lists 
and cuing conditions; blocked and unblocked list 
versions, and cued and noncued recall. Each treat- 
ment combination was presented twice; four list— 
treatment combinations and orders were prepared. 
For Cued conditions, the cue for each category was 
the category label. 

Lists were presented on audio tape at a 2-sec. rate 
for written free recall. After the last word on each 
list, the Ss heard a three-digit number. They wrote 
that number and subtracted from it for 6 sec., until 
signaled to begin recall. This procedure was used 
to equate cued and noncued recall in terms of 
possible forgetting from short-term memory while 
examining cues under cued recall (Lewis, 1971; 
Slamecka, 1968). 

When the recall signal was given, the Æ turned 
over a card. For cued conditions, cues were printed 
on the card; for noncued recall, the card simply 
said to begin recall. For any given list, the Ss did 
not know whether recall would be cued or noncued 
until they saw the card. The recall interval was 2 
min. The Ss were tested in small groups of two or 
three. 

The Ss were 40 students enrolled in psychology 
classes at the State University of New York College 
at Plattsburgh. They received extra course credit 
for participation. 
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TABLE 1 


Percent TOTAL RECALL, CATEGORY RECALL, AND WORDS 
PER CATEGORY: EXPERIMENT I (CATEGORY 
LABEL Cues) 


Presentation 
Blocked | Unblocked 
Recall con- | ? 
dition mr r 
| Words | Words 
Total | Cate- | per | Total | Cate- | per 
recall | gories | cate- | recall | gories | cate- 
gory gory 
Cued | 534 | 958 | 55.7 | aaa | 71 | 49.3 
Noncued | 40.6 | 70.2 | 57.5 | 42.7 | 827 | 513 


eo) cg MB 


"Total recall, number of categories recalled (de- 
fined as recall of at least one item from the category) 
and mean words per recalled category were examined 
by separate 2 X 2 X 40 (Presentation condition x 
Cuing condition X Ss) analyses of variance. New- 
man-Keuls comparisons among individual treat- 
ment combinations were made. 

Results. "Total recall, category recall and mean 
words per recalled category (WPC) are shown by 
percentages in Table 1. 

Total recall was higher for cued recall (CR) than 
for Noncued recall (NCR), F (1, 39) = 50.48, 
p < .005. For both Blocked and Unblocked presen- 
tations, CR was higher than NCR, p «.01. The 
difference between Cuing conditions was larger for 
Blocked than for Unblocked presentation, F (1, 39) 
—10.46, p < .005. 

More categories were recalled with CR than NCR, 
F (1, 39) = 12045, p < :005; and category recall 
was higher for Unblocked than for Blocked presenta- 
tion, F (1, 39) = 22.22, b «.005. The effect was 
clear in the interaction, F (1, 39) = 10.49, p < .005, 
With CR there was no difference between Blocked 
and Unblocked presentation, both being nearly 
perfect; but with NCR, Unblocked category recall 
was higher than Blocked, b «.0t. 

More words per category (WPC) were recalled 
with Blocked than with Unblocked presentation, 
F (1, 39) = 30.50, b < .005. No other effects or 
interactions were significant. 

The results are clear. Category label cues facili- 
tate recall for both Blocked and Unblocked lists. 
The facilitation derives from higher category recall 
with cues. There is no difference between Cuing 
conditions in mean words per category, 

It is perhaps possible that there is a qualitative 
difference between category items and category 
labels as cues, Intuitively, the label seems a more 
potent cue. In Experiment II, the cue for each 
category was an item which had not been presented 
on the list. That is, the cues were extralist members 
of list categories. 


EXPERIMENT II: EXTRALIST CATEGORY Trem CUES 


Method. The lists prepared for Experiment I were 
used. For Cued conditions, cues were the highest 
ranking word in each category in the word associa- 
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tion norms. 
Experiment I. 

The Ss were 26 students enrolled in I 
classes at the State University of New Y< 
at Plattsburgh. They received extra coi 
for participation. 


The procedure was the s as for 
hology $ 
ollege 
credit 


Total recall, number of categories re i, and 
mean words per recalled category were an ed by 
separate 2 X 2 X 26 (Presentation conditi < Cu- 
ing condition X Ss) analyses of variance. man- 
Keuls comparisons among individual :- ment 


combinations were made. 

Results. "Total recall, category recall a: 
words per recalled category (WPC) are shown by. 
percentages in Table 2. Total Cued reca! (CR) 
was higher than Noncued recall (NCR), F (i. 25) = 
14.06, p < .005; and Blocked presentatio: was 
higher than Unblocked, F (1:05): 8.77, p < Ol 
The interaction was not significant, 

Category recall was higher for CR than for 
F (1, 25) = 44.30, p < .005 Unblocked ca: 
recall was higher than Block d, F (1, 25) - 
b « .025, largely because of higher Unblocked 
category recall; however, the interaction w 
significant. 

Mean words per recalled category (WPC) was 
higher for Blocked than for Unblocked presentation, 
F (1, 25) = 22.09, p < .005. There was no interac- 
tion or effect due to Cuing. 

Overall, the results replicate those of Experiment 
I. Cuing facilitates recall by increasing category 
recall. Mean WPC is not affected. 

Discussion. The cued facilitation with bo h 
Blocked and Unblocked category presentation using 
category labels and extralist category items as cues 
is consistent with Tulving and Pearlstone's (1956) 
formulation of recall as a process of (a) recalling 
higher order memory units, and (b) recalling con- 
tents of the higher order units, Cuing increased 
category recall (higher order units), but did not 
affect recall of items within a category. 

Previous results (Lewis, 1971; Slamecka, 1968) 
showed no increase in recall for unblocked lists with 
list-item cues. Present findings indicate consistent 
facilitation with extralist cues, regardless of presen- 
tation condition. This Suggests that critical-item 
scoring procedures used with list-item cues may 


mean 


ICR; 
gory 
7.08, 
NCR 

not 


TABLE 2 


Percent TorAL RECAI CATEGORY RECALL, AND WORDS 
PER CATEGORY : EXPERIMENT IT (ExrRALIST 
CATEGORY ITEM CUES) 


Presentation 
Blocked Unblocked 
Recall con- È iw 
dition 

Words Words 
Total | Cate- per Total | Cate- | per 
recall | gories | cate- | recall Bories | cate- 
gory gory 
Cued 309 | 949 | 53.5 | 45.0 | 962 | 468 
Noncued | 434 | 76.6 | 571 | 40$ | 856 | 481 


h 
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introduce sizable artifacts, which would account for 
the apparent discrepancy between previous and 
present findings. 

Slamecka (1972) pointed out that cuing does not 
increase recall if unaided category recall is nearly 
perfect. In this context, the smaller differences be- 
tween cuing conditions observed with Unblocked 
presentation is understandable. Noncued category 
recall is substantially higher for Unblocked than for 
Blocked presentation. There is less opportunity for 
cuing to add categories. 

The higher Unblocked category recall might re- 
flect differential Blocked-Unblocked memory organi- 
zation. With Blocked presentation, it is most 
probable that Ss will use E-defined categories to 
Organize information for storage and retrieval. 
With Unblocked presentation, 5s may develop some- 
what different categories (for example, instead of a 
single “bird” category, 5 may use two, as “song- 
birds" and “birds of prey"). Moreover, serial 
position effects favor higher category recall with 
Unblocked presentation. Although the distractor 
task between list presentation and recall tends to 
reduce the characteristic free-recall recency peak, 
such a task has little effect on primacy (Glanzer & 
Cunitz, 1966). The first few words on an Unblocked 
list are from several categories, but the first few 
words on a Blocked list are all from the same 
category. 

Taken overall, the results of the present experi- 
ments demonstrate that cuing can facilitate recall 
with both Blocked and Unblocked lists. As such, 
these findings appear inconsistent with those of 


Dong (1972), who reported no cued facilitation with 
category label cues on an unblocked list. Dong 
presented a given S with only one list; in the 
present experiments Ss received several lists, both 
cued and noncued. Lewis (1971) noted that on 
every measure (total recall, category recall and 
WPC) differences between cuing conditions were 
greater for the last list of a series than for the first 
list. Although Ss in the present experiments could 
not anticipate cues for any particular list, it may be 
speculated that they learned to use cues more 
effectively after having received cues several times. 
This speculation is consistent with both Lewis' 
(1971) findings and Dong’s suggestion that cuing 
effects may reflect strategies and memory control 
processes. 
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RECALL OF ANTONYMS FROM SHORT-TERM MEMORY + 


LAIRD S. CERMAK? 


Psychology Service, Boston Veterans Administration Hospital 


An experiment was designed to investigate the effect of introducing antonyms 
hort-term memory distractor task. 
ised when the trial on which they occurred 
s preceded by several trials each containing words that were antonyms of 
the final trial. However, recall did not increase when only the immediately 
ed the antonyms. 


on adjacent trials of a 
that recall of the antonyms incr 
Wi 


preceding trial contain: 
priming hypothesis is given. 


Proactive facilitation (PF) has been used to de- 
scribe instances in which retrieval of a particular 
word is facilitated by the fact that the word shares 
a feature in common with one or more previously 
presented words. Proactive facilitation has been 
demonstrated by Cermak (1970), who found, during 
the course of a distractor experiment, that when the 
to-be-remembered words in a particular triad are 
acoustically identical to words presented on the 
preceding trial (i.e, they are homonyms) their re- 
trieval is facilitated. However, PF does not occur 
if the relationship between two adjacent trials is 
based upon the semantic features of the words, 
For example, synonyms, antonyms, and highly 
associated words all failed to produce facilitated 
retrieval of the second member of the relationship. 

The failure of synonyms and associates to produce 

an increase in recall can be explained by the fact 
that these words also all belong to the same cla: of 
material and probably interfere with, rather than 
facilitate, one another's retrieval. However, inter- 
ference between antonyms was unexpected since 
these words come from “opposite” categories and 
should be differentially encoded. One possible ex- 
planation for the antonym result might be that 
the procedure that was used, one in which only two 
words shared the critical relationship, may not have 
been sufficient to produce differential encoding. The 
usual procedure used to study differential encoding 
ability presents several instances of a particular re- 
lationship before testing for the ability to detect 
and utilize a differential relationship (Wickens, 
1970). Perhaps a subject cannot encode antonyms 
as belonging to two separate categories unless a 
differential "relationship" has already been estab- 
lished. The present report investigates this possi- 
bility by preceding the introduction of an antonym 
relationship with either several unrelated word triads 
(Experiment I) or with several synonymous word 
triads (Experiment II), 


1 This research was supported in part by National Institute of 
Mental Health Grant MH-21234-01 to the author. Apprecia- 
tion is extended to Alan Sholk and John Harrison for their 
help in the collection of data, 

? Also of the Aphasia Research Unit, Neurology Department, 
Boston University School of Medicine, Requests for reprints 
should be sent to Laird S. Cermak, Veterans Administration 
Hospital, 150 South Huntington Avenue, Boston, Massa- 
chusetts 01230. 
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It was discovered 


An interpretation based on the 


E 


PERIMENT | 


Method. The Ss were 48 introductory psy logy 
students at Tufts University who chose to ti 
pate in the experiment as partial fulfillmer of a 
course requirement, The Ss were ra ndomly as ned 
in order of their appearance to one of th. ‘wo 
groups, with 24 in each group. 

The short-term memory (STM) distractor -ech- 
nique was used. In this procedure a wor: ad 
was projected on a screen and the S repeated the 
words once; the S was then distracted from re- 
hearsal until a signal for recall was presented. T'he 
distractor task consisted of counting backward by 
threes from a number flashed on a een in time 
with a metronome beating at one beet per t 
Four additional trials followed consisting of dii: 
word triads but using the same procedure 

The words used in this experiment were che in 
such a way that an antonym existed for each ord. 
Five such word triads were used so that each triad 
could appear in each of the five positions in the 
series. In other words, five basic slides were counter- 
balanced with the fifth trial bearing an antonym 
relationship to the fourth trial for the subject in 
the Experimental (Antonym) Group, but no rela- 
tionship at all for the subject in the Control Gre oup. 

The experiment consisted of five trials in which the 
material presented on Trial 5 bore one of the follow- 
ing relationships to the material presented on Tr 
4: Antonym Group—antonymity ; Control Group— 
no relationship. Trials 1-4 bore no relationship to 
one another for either group. In order to compare 
the Antonym Group's performance on the fifth trial 
with the Control Groups', both groups received the 
same stimulus material on that trial. The antonym 
or neutral relationship was established by present- 
ing different material on Trial 4. In other words, 
the Antonym Group received antonyms of Trial 
5 on Trial 4, whereas the Control Group received 
words unrelated to those of Trial 5 on Trial 4. 
Any differences in recall on Trial 5 could thus be 
attributed to the effects of the relationship that 
had been produced previously and not the difficulty 
level of Trial 5 material. 

The S was seated facing a screen upon which the 
material was projected by means of a Kodak 
Carousel 800 projector. The projector was timed by 
à Gerbrands tape timer. The only other apparatus 
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in the room was a metronome-used to pace dis- 
tractor task. After the instructions had been given, 
Ns practiced on the counting task until approxima- 
tion to the metronome was achieved. The timing 
and sequence of events were then initiated by the 
tape timer and the time intervals for one trial were 
as follows: asterisk (ready signal), 2 sec.; trial 
presentation, 2 sec.; retention interval, 15 sec.; 
recall interval, 10 sec.; rest interval, 6 sec. Recall 
was scored as the number of correct words given 
out loud during the 10-sec. recall interval. Follow- 
ing the end of the fifth trial, Ss were asked whether 
they had noticed any similarities or differences 
among any of the word triads, and their responses 
were noted at that time. 

Results. On Trial 5, the critical trial, the Antonym 
Group recalled 60% of the words correctly, while 
the Control Group recalled 57% correctly (Figure 1). 
There was no significant difference in recall between 
the group receiving an antonym triad on the pre- 
vious trial and a group that did not. Furthermore, 
only two Ss reported detecting the antonymity re- 
lationship. The buildup of proactive inhibition 
(PI) from Trial 1 to Trial 4 was significant (p < .01) 
for both groups and neither group showed any 
significant increase from trial 4 recall to trial 5 
recall. 


EXPERIMENT IT 


Method, The Ss were 40 introductory psychology 
students at Tufts University who chose to partici- 
pate in the experiment as partial fulfillment ofa 
course requirement. The Ss were randomly assigned 
in order of their appearance to one of the two groups 
with 20 in each group. 

The distractor technique was used but, unlike 
Experiment I in which a relationship existed only 
between Trials 4 and 5, the present experiment pre- 
sented word triads on Trials 1 through 4 that were 
synonymous with each other. In other words, the 
first word in each triad meant the same as the first 
word in each of the other triads and the second 
word meant the same as each other triad's second 
word,and soon. Trial 5 (the critical trial) contained 
words that were either antonyms of the preceding 
four triads, or were synonyms. Thus, the groups 
were differentiated on the basis of the type of rela- 
tionship existing between the words presented on 
Trial 5 with those on Trials 1-4. It must be pointed 
out that the words appearing on Trial 5 were the 
same for both groups, and, in fact, were the same 
as those used in Experiment I. Trials 1—4 were all 
antonyms of Trial 5 for the Antonym Group, but 
they were synonymous with one another. For the 
Synonym Group the same word triad appeared on 
Trial 5, but all the preceding triads were synonyms 
of this triad and were synonymous with one another. 
"Two sets of critical Trial 5 word triads were used in 
the experiment, both appearing under synonym. and 
antonym conditions. 

Results, On Trial 5, the Antonym Group re- 
called significantly more words than the Synonym 
Group, ¢ (38) = 2.20 (p < .05). The Antonym 
Group recalled. 82% of the words correctly while 
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Ficure 1. The percentage of words correctly recalled for the 


Antonym and Control Groups of Experiment I and the 
Antonym and Synonym Groups of Experiment II, 


the Synonym Group recalled only 4296 correctly 
(Figure 1). Both groups showed equivalent and 
significant (p < .01) reductions from Trial 1 recall 
to Trial 4 recall. 

A comparison between the Antonym Group of this 
Experiment and the Antonym Group in Experiment 
I can be made since the same word triads were used 
on Trial 5 of both experiments. The only difference 
between the two groups was in the number of pre- 
ceding trials that bore the antonym relationship 
with the word triad on Trial 5. In Experiment If 
four synonomous triads preceded the switch to 
antonyms and 82% of that critical trial was recalled 
correctly. In Experiment I only one such triad pre- 
ceded and only 60% was recalled correctly on the 
critical trial. This difference was significant at the 
p <.01 level, £ (98) = 2.95. Trial 4 recall per- 
formance in Experiment I was compared with the 
same performance in Experiment II in order to 
determine the difference in the extent of PI buildup 
in the two experiments. This difference was signifi- 
cant, £ (98) = 2.53, p < .05, since Ss in Experiment 
I averaged 63% (both groups combined) and Ss in 
Experiment I] averaged 46.5%. There were 
no differences between Experiments on ‘Trial 1 
performance. 


Discussion 


The results of these two experiments, taken to- 
gether, suggest that the presentation of more than 
one exemplar from a category may be necessary 
before the introduction of information from the 
opposite category will result in an increase in re- 
call probability. This was shown by the fact that 
no increase in performance occurred when only Trials 
4 and 5 contained antonyms of one another (Experi- 
ment I) but did occur when Trial 5 contained words 
that bore an antonym relationship with each of the 
preceding four trials (Experiment II). It could be 
that more than one instance of a relationship is 
needed before a subject can differentiate words that 


m 
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do not participate in this particular relationship. 
This relationship could be established in one trial 
if all the words in that triad shared in the relation- 
Ship, but more than one trial is needed in the present 
paradigm because no within triad relationships exist. 

This finding poses a problem for theories that 
assume a word can be automatically encoded along 
several different dimensions immediately upon its 
presentation. If this were the case then the buildup 
of a relationship among the words should not be a 
necessary condition for release because differential 
encoding would occur as a result of automatically 
encoding several dimensions of each word. How- 
ever, the present experiment suggests that encoding 
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SEMANTIC AND ASSOCIATIVE INTERACTIONS IN CHILDREN’S 1 
FALSE RECOGNITION : 


PHEBE CRAMER? AND ANDREW P. SCHUYLER? 
Williams College 


Previous studies of memory encoding and false recognition in children have 
tended to focus either on associative relationships or on semantic attributes, The 
present study, based on the responses of. third- and six 
that associative and semantic factors interact to produce the false recogni- 
For Synonyms, associative strength determined the magnitude 
of false recognition, while for Antonyms 


tion effect. 


Both associative relationships and semantic fea- 
tures have been shown to be successful predictors of 
errors in children’s recognition memory (e.g., Bach 
& Underwood, 1970; Cramer, 1972a). In general, 
however, those studies that have investigated the 
effect of associative relationships have not been 
concerned with the role of semantic features, while 
those that have investigated semantic features have 
not considered the possible role of associative 
relationships, 

The influence of these two variables was investi- 
gated by Felzen and Anisfeld (1970), who compared 
the false-recognition errors made by third- and 
sixth-grade children to strongly and weakly associ- 
ated synonyms and antonyms, On the basis of the 
finding that strong associates produced no more 
errors than did weak associates, they concluded that 
associative relationships among words played no 
significant role in determining such errors, 


1 The research reported here was part of the junior author's 
Psychology Honors thesis, The authors gratefully acknowl 
edge the cooperation of the administration and the teachers 
at the Williamstown Central Schools in arranging for this 
research to be carried out. 

4 Requests for reprints should be sent to Phebe Cramer, 
Department of Psychology, Williams College, Williamstown’, 
Massachusetts 01267, 

! Now at Cornell Medical School, New York, New York. 
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of specific features of a word may depend up 

"priming" the subject for these features (Underwoo d, 
1972). If this is true then an alternative to the 
automatic encoding hypothesis must be sought to 


explain the PI release effect. 
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h-graders, indicated 


, it was not important. . 


This conclusion, of course, was at variance with 
the results obtained in the studies cited above, 
‘Two possible reasons for the nonconfirmatory results 
are suggested. First, previous work with adults 
has indicated that there is an interaction between 
semantic category and associative relationship in 
determining recognition errors (Cramer, 1972b). 
If such an interaction were also a determinant of 
children's performance on a recognition memory 
task, it might mask the main effect of either varia- — 
ble. Second, the nonconfirmatory findings may be 
a function of the fact that words used to represent 
the various semantic and control categories were not 
mutually exclusive. On both lists, some of the ` 
words listed as Rhymes and nearly half of the words 
listed as Controls were in fact ass ciates of the - 
itical stimuli (based on Palermo & Jenkins, 1964, 
data for Grades 4-6), Considering this observed 
overlap, it seems likely that the other 28 filler words, 
interspersed throughout the continuous list presen- - 
tation, might also be associatively related to the 
critical stimuli. Thus it is difficult to interpret the — 
basis on which false recognition errors were being 
made, 

The present study was designed to determine the | 
extent to which these two factors—semantic X | 
associative interaction and stimulus category over- 


; 
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lap—were responsible for the failure to find strength 
of associative relationship as a significant factor in 
determining children's recognition memory. The 
general continuous presentation method of Felzen 
and Anisfeld (1970) was followed. However, great 
care was taken to eliminate associative overlap 
among the experimental, control, and filler word 
categories. 

Method. The Ss were 40 third-graders and 40 
sixth-graders. An equal number of boys and girls 
was tested individually in each condition at each 
grade level. Following a practice list, consisting of 
numbers, each child was given the experimental 
test list. He was asked to indicate whether each 
item on the list was “old” or "new." The Ss within 
each grade were assigned randomly within blocks of 
four (Weak/Strong associates X 2 test orders). 
Male and female Ss were tested on alternating 
blocks. 

In order to construct the test list, a group of 20 
critical stimulus (CrS) words was selected such that 
half of the words elicited synonym word-association 
responses and half elicited antonym responses. An 
important criterion for the selection of these stimuli 
was that the associative responses should be of ap- 
proximately equal strength for both third-graders 
(Palermo & Jenkins, 1966) and sixth-graders 
(Palermo & Jenkins, 1964). Of the 10 CrS-Syn- 
onym pairs, half were strongly associated (Third 
grade: M = 40.0%; Sixth grade: M = 47.6%) and 
half were weakly associated (Third grade: M = 
1.4%; Sixth grade: M = 2.1%). Similarly, of the 
10 CrS-Antonym pairs, half were strongly associated 
th grade: M = 
41.2%) and half were weakly associated (Third 
grade: M = 1.2%; Sixth grade: M = 1.2%). The 
Synonyms and Antonyms were equated approxi- 
mately for Thorndike-Lorge (1944) familiarity. 

From these materials, two lists were prepared. 
One list consisted of the 10 strongly associated CrS- 
Synonym and CrS-Antonym pairs; the other list 
consisted of the 10 weakly associated pairs. In 
addition, each list included 10 control words (words 
that had no associative relationship to any word 
previously presented) and 12 filler words. The 
words of each list were presented in two different 
orders, to control for possible sequence effects, 
making four test lists in all, Each of the four lists 
consisted of 59 items, and was constructed such 
that a word of the same class (CrS, Synonym or 
Antonym, Control, or Filler) was assigned the same 
item position on each list. Each CrS word appeared 
twice on the list, with 7 items intervening between 
the first and second presentation. Each Synonym 
or Antonym appeared 9 items following the second 
presentation of its respective CrS. A control word 
appeared either immediately prior to or directly 
after each Synonym or Antonym. The list positions 
not occupied by CrS, Synonyms or Antonyms, or 
Control words were assigned to filler words, Each 
list was recorded on magnetic tape at a 5-sec. rate. 

Inasmuch as the study was concerned with the 
specific associative relationship between each CrS- 
Synonym or CrS-Antonym pair, it was important 


TABLE 1 


MEAN NUMBER OF "OLD" RESPONSES TO SECOND PRESENTA- 
TION OF CRITICAL STIMULUS Worps (CRS/), ASSOCIATE, 
AND CONTROL WORDS 


rss | Associ | Controla | Total Rs 
Third grade 
Weak list 9.0 L6 8 114 
Strong list 9.1 15 2 10.9 
Sixth grade 
Weak list 9.1 12 4 10.9 
Strong list 9.3 16 2 10.7 


a Maximum value = 10. 
b Includes Synonyms and Antonyms, 


to make sure that no Synonym, Antonym, or 
Control word was associatively related to any other 
preceding word in the list. Thus care was taken 
that there was no associative relationship (associa- 
tive strength = 0%) between any CrS, Synonym, 
Antonym, Control, or Filler word, and any subse- 
quent experimental (Synonym, Antonym, or Con- 
trol) word, other than the specific CrS-Synonym 
or CrS-Antonym associative relationship being 
investigated. 

Results and discussion. The Ss' responses were 
analyzed first in terms of the number of “old” 
responses made to CrS, Associate, and Control 
words. These results are presented in Table 1. 

A three-way analysis of variance of these data 
was performed. A highly significant main effect for 
Word type was obtained, F (2, 152) = 1,898.45, p 
< .001, Application of the Neuman-Keuls test in- 
dicated significantly more "old" responses were 
given to CrS words than to Associates, and more to 
Associates than to Controls, all ps < .01. There 
were no other significant main effects, 

A second analysis of variance, based on difference 
scores, considered responses to Synonym and An- 
tonym associates only. While there were no sig- 
nificant main effects, there was a significant As- 
sociate Strength X Semantic Category interaction, 
F (1, 76) = 1840, p < .001. A Neuman-Keuls 
test indicated that significantly more responses were 
made to Strong Synonyms than to Weak Synonyms, 
to Strong Synonyms than to Strong Antonyms, and 
more responses were made to Strong Antonyms 
than to Weak Synonyms (all ps < .01) (see Table 2). 

The results of the present experiment confirm the 
findings of earlier studies (e.g., Bach & Underwood, 
1970; Cramer, 1972a), by demonstrating that as- 
sociated words. produced more false-recognition 
errors in children than did nonassociated words, 
and serve to extend those earlier findings. Contrary 
to the conclusions of Felzen and Anisfeld (1970), 
when stimulus materials were selected with adequate 
concern for the possibility of confounding relation- 
ships, synonyms that were strongly associated pro- 


*Since there were twice as many opportunities to respond 
to Control words as there were to Synonyms or Antonyms, the 
difference scores for each S consisted of Synonym minus j 
Control and Antonym minus 3 Control. 
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TABLE 2 


MraN NUMBER OF “OLD” RESPONSES TO SYNONYMS AND 
ANTONYMS, CORRECTED FOR CONTROL 
Worp ERROR 


Third grade* 
Sixth grades 
Total 


^ Maximum value = 5, 


duced more errors than did synonyms that were 
weakly associated. 

This statement must be qualified by two further 
considerations. First, some other linguistic varia- 
bles, not investigated in this study, may be con- 
tributing to the results. More important, from the 
authors’ point of view, is the finding of a consistent 
pattern of interaction between semantic and as- 
sociative factors in determining false-recognition 
errors, For both grades, significantly more re- 
sponses were given to strong than to weak Syn- 
onyms; however, association strength was not im- 
portant in determining responses to Antonyms, 

Thus, the importance of association streagth 
varies with the nature of the semantic relationship. 
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More specifically, it may be that the semantic fel 
tionship of antonymity is particularly insens 

sociative strength. In a previous paper (Crai 
1972b), it was pointed out that several re 
studies have found ciative variables to ha’ 
different, and counterpredicted effect on antonyj 
as compared to other word types Inspection 6 
the published results of Felzen and Anisfeld (1970) 
reveals that the same Associative Strength X 
Semantic Category interaction appeared in t 
data. 
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Previous studies on self-embedding have not isolated the contribution of a 
limited short-term memory span to the difficulty of processing self-embedded 


sentences. 


With the procedure used, auditory presentation of semantically 


neutral sentences and the use of a memory probe technique to measure process- 

Q ing at five levels of sell-embedding, it was found that comprehension breaks 
down at three degrees of embedding. A control set of sentences, semantically 
equivalent, but with right-branching structure, were significantly better under- 
stood at three to five degrees of complexity, but not at one and two degrees. 
The results are predictable, given a seven-item short-term memory span, from 
some plausible assumptions about the manner of processing. 


An automaton with limited short-term memory 
cannot process sentences with unlimited self- 
embedding (Chomsky, 1963). Although this result 
must apply to humans, laboratory confirmation has 
proved surprisingly difficult. Miller and Isard 
(1964) showed that sentences of equal length and 
equivalent word content are memorized with increas- 
ing difficulty from zero degrees through four degrees 
of f-embedding. Howeyer, Blumenthal (1966) 
criticized this study on the grounds that naive Ss 
do not process multiply-self-embedded sentences 
correctly, but perceive them as ungrammatical 
sentences with a single embedding of conjoined 
relative clauses modifying the initial noun. Blu- 
menthal's 5s were unable to correctly rewrite three- 
degree self-embedded sentences in a more under- 
standable form. Since Ss do not perceive the 
structure, their poor performance cannot be due to 
difficulty in processing the structure. 

Other experimenters have found that humans can 
decode self-embedded sentences under suitable con- 
ditions. Schlesinger (1968) and Stolz (1967) found 
that they can be decoded if each predicate is prag- 
matically appropriate to its subject, and inappro- 
priate to the subjects of the other clauses. Self- 
embedded sentences with this property were called 
“semantically supported." To explain his results, 
Schlesinger proposed a "semantic-syntactic decod- 
ing process" in which the temporary memory store 
merely retains the nested parts (i.e., subjects and 
predicates), but no order information, and then the 
decoder matches up the parts with the help of the 
semantic cues. This decoding strategy cannot be 
used to decode semantically neutral sentences (i.e., 
sentences in which any of the predicates can prag- 
matically be predicated of any of the subjects) 
because such sentences cannot be interpreted unless 
the order of the subjects and predicates is registered. 
Stolz found that semantically neutral sentences can 


i This paper is based on part of the first author's MA thesis 
at the University of California, Santa Barbara, 1970, under the 
sponsorship of the second author. D 

2 Requests for reprints should be sent to Maija S. Blaubergs, 
Educational Psychology Department, University of Georgia, 
Athens, Georgia 30602. 


be decoded, but only after explicit training on the 
self-embedded structure. However, in Stolz's study 
the written form of the sentence was always avail- 
able as an external memory aid for the Ss to refer to, 
and thus the contribution of short-term memory 
to the difficulty of processing self-embedded sen- 
te was again not examined. 

nce none of the studies in the literature have 
shown that human short-term memory limitations 
actually interfere with the processing of self-em- 
bedded sentences, one purpose of the experiment 
reported here was to demonstrate and measure the 
extent of the interference. While the existence of a 
performance deficit is certain, the size of the deficit 
at various levels of self-embedding depends on how 
humans use their short-term memory in processing 
sentences. Measurement of the deficit therefore 
provides a basis for inferences about the manner of 
processing. 

The literature reviewed makes clear that an ade- 
quate study must present material auditorily, must 
use semantically neutral content (to preclude Ss 
from using Schlesinger's, 1968, semantically based 
decoding strategy described above), and must begin 
by training Ss on the nature of self-embedded sen- 
tences (otherwise errors will reflect lack of com- 
petence, not difficulty of processing). Since increas- 
ing self-embedding inevitably increases sentence 
length, matching right-branching sentences were 
used to estimate errors due to sentence length alone, 

Method. Forty introductory psychology students 
were assigned to one of two conditions with 8 males 
and 12 females in each condition. In both condi- 
tions, self-embedding (SE) and right-branching 
(RB) training in the understanding of the sentence 
structure used was given, followed by tests of com- 
prehension. The sentences varied from one to five 
degrees of complexity (i.e., from one to five em- 
bedded or appended relative clauses). The word 
content of equivalent sentences in the two conditions 
was the same. All sentences used in the practice 
and test sessions were semantically neutral (ie., 
any of the subjects can form a semantically sensible 
sentence with any of the predicates). 
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Since the procedure was identical for the two 

conditions except for the use of SE sentences in one 
condition and RB sentences in the other, it will be 
described here only for the SE condition. The in- 
structions read to the S by the E included an ex- 
planation of the grammatical structure of an $ 
sentence and an example illustrating how an $ 
sentence is composed of simple sentences. The ex- 
planation was accompanied by a visual display on 
cardboard strips of the examples used. The S was 
then given a series of practice SE sentences to read 
to himself. The series consisted of at least two 
sentences from each degree of embeddedness pro- 
ceeding from one degree to five degrees. Compre- 
hension of a sentence was tested by requiring the S 
to answer questions about the content, e.g., “Who 
was bickering with the Chinaman?” and “Who 
adored food?" were the questions for the SE sen- 
tence, “The Chinaman whom the Italian was bicker- 
ing with adored food." The number of questions per 
sentence depended on the number of simple sen- 
tences it was composed of, i.e., there was one ques- 
tion per simple sentence. An exhaustive list of 
questions of the kind described involving all the 
constituent simple sentences was asked in a random 
order for each practice sentence. The criterion for 
proceeding to the next degree of complexity was that 
there be no more than one error in two consecutive 
practice sentences at each level. (This criterion 
was met at each level by all Ss; SE Ss never re- 
quired more than four practice sentences to meet 
the criterion at any level, and RB Ss never more 
than three.) The S was given an opportunity to 
correct his errors at the end of the questions for 
each practice sentence. 

After the S had reached criterion at all levels of 
complexity, six test sentences for each level were 
presented by tape recorder, The order of presenta- 
tion of the sentences of the differing degrees of com- 
plexity was randomized within the set of 30 sen- 
tences that each S heard. The Ss were given as 
much time as they needed after each sentence to 
fill in a blank in a single simple sentence taken from 
the sentence they had just heard. Answers were 
written in an answer booklet that had separate 
pages for each blank. Examples of the blanks for 
the SE sentence given previously might be *— 
adored food" and “The Italian was bickering with 

-” (Only one of these sentences actually 
appeared in the answer booklet for this sentence.) 
The word to be filled in was always a noun but 
might be either the subject or object noun. Equal 
numbers of subject and object nouns (three each) 
were used at each level of SE, but they appeared 
randomly in the sequence of test sentences, The S 
was instructed not to turn the page for the next 
blank until after he had listened to the sentence for 
it, so that he did not know what information to 
listen for, but instead would try to comprehend the 
whole sentence. The S was allowed to hear each 
sentence only once and was not allowed to make 
any written notes about it. A practice sentence, 
which was reread if the S had any difficulty with it, 
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was provided before the test session beg The $i 


was allowed to guess when filling in thi 


the test sentences but not told if his s were 
correct or incorrect. 

Results. The total number of correct r ises at 
each level of complexity for each .S was orded, 
A correct response consisted of filling ii blank 
with the appropriate noun. ‘The mean ber of 
correct responses is plotted against t! els of 
complexity for both conditions (SE an B) in 
Figure 1. (A chance level calculated on t ump- 
tion that the S would choose random m all 
nouns present in the sentence is included 

A two-way, repeated measures, analys vari- 
ance with Ss nested within the two lev of the 
sentence-type factor (SE and RB) wa ed to 
analyze the data. The main effect of sentence type 
was significant, F (1, 38) — 37.4, p «.01, as was 


the main effect of sentence complexity, F (4, 152) = 
134.1. The interaction between the two was also 
i cant at the .01 level, F (4, 152) = 10.7. The 3 
ant sentence-type effect indicates that sen- 
tences with RB structure received significantly 
more correct responses than sentences with SE 
structure, The significant complexity factor in- 
dicates that level of complexity had a differential 
effect on the number of correct responses. 

Since the interaction between the two factors was 
significant, the simple main effects of each factor 


were examined for significance at I 1 of the 
other factor, Significant differences were fou or 
sentence type at the third, fourth, and {fifth levels 
of complexity (p < .01 in all cases), Furthermore, 


significant differences were found for the complexity 
factor in each type of sentence (p < .01). 

Scheffé tests were computed on all possible differ- 
ences between pairs for the level-of-complexity: 
factor for each sentence type. For the SE sentences, 
it was found that there were significant differences 4 
the .01 level between each of the first and sec 
levels of complexity and each of the third, fourth, 
and fifth levels of complexity. For the RB struc- 
ture, significant differences at b < .01 were found 
between one degree and three, four, and five degrees; 
between two degrees and four and five degrees; and 
between three and five degrees, 

Since the position of the test simple sentence 
within the longer sentence might have an effect on 
the number of errors, percentages of correct re- 
Sponses of the possible correct responses at each level 
of complexity for each position of a test simple sen- 
tence were calculated for the SE sentences. ‘These 
percentages are as follows: 100% of responses to the 
first position, i.e., outermost simple sentence, and 
100% of the responses to the last or innermost posi- 
tion, at one degree of complexity; 77.5% from the 
first position, 67.5% from the middle, and 95% from 
the innermost position at two degrees; 45% from the 
first, 30% from the middle, and 30% from the inner- 
most position at three degrees; 50% from the first, 
27.5% from the middle, and 40% from the innermost 
Position at four degrees; 75% from the first, 21.3% 
from the middle, and 20% from the innermost posi- 
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tion at five degrees. The high percentages of correct 
responses to the outermost sentence presumably 
reflect combined primacy and recency effects, 
a tendency to remember the first subject and the 
last predicate. Thus, even when the entire sentence 
is not fully processed, the S may be able to correctly 
respond to probe items from the outermost simple 
sentence in the complex SE sentence. If these in- 
creased percentages of correct responses to the outer- 
most test simple sentence at the higher levels of 
complexity are disregarded, it is clear that the scores 
at levels three, four, and five are even closer to 
chance level than the points in Figure 1 suggest. 
(A tendency for a larger percentage of correct re- 
sponses to test sentences from the last or most re- 
cently presented clause in the RB sentences was 
also observed. "Thus, the correct responses for RB 
sentences may also have included some responses 
that would not necessarily reflect processing of the 
complete complex RB sentence.) 

The significant differences found for the com- 
plexity factor over both grammatical types support 
the hypothesis that comprehension decreases with 
degree of complexity. Furthermore, the finding of 
significant differences between SE and RB at the 
third, fourth, and fifth levels of complexity sub- 
stantiates the hypothesis that the decrease in com- 
prehension is greater in the SE condition. 

Discussion. Results of the comparison between 
SE and RB conditions support the contention that 
it is complexity peculiar to SE and not merely sen- 
tence length that is causing the increased difficulty 
at the higher levels since there are no significant 
differences between SE and RB at levels one and 
two, but there are significant differences between SE 
and RB at levels three, four, and five. Length of 
sentence does contribute to increased errors since 
errors tended to increase with degree of complexity 
for both RB and SE sentences as shown by the down- 
ward curves in Figure 1, However, the sharp drop 
to near chance responses at level three for SE sen- 
tences, combined with the fact that the first signi- 
ficant difference between SE and RB appears at 
three degrees suggest that SE sentences become 
peculiarly difficult to comprehend at three degrees 
of complexity and remain so at higher degrees. 
Not understanding the structure has been ruled out 
by training the Ss. Length of sentence has been 
controlled by comparison with equivalently long 
RB sentences. 

Thus, since the memory factor associated with 
length of sentence alone was constant for SE and 
RB sentences, the difference in errors between SE 
and RB sentences at each level is an unconfounded 
measure of increased difficulty in understanding 
sentences with SE structure. The reason for this 
difficulty is presumably the increasing demands on 
the short-term memory which, it is suggested here, 
exceed the available capacity. Thus, at three de- 
grees of SE, in order to complete the simple sen- 
tences (i.e, recombine subjects with appropriate 
predicates), the decoder, hearing the predicates in 
reverse order to the subjects, must retain eight 
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different items in order if recombination is done after 
hearing the complete sentence, or somewhere be- 
tween four and eight items if recombination occurs 
starting with the first predicate heard. Only if the 
list of subjects were erased one by one as recombina- 
tions were completed before the next predicate was 
heard would there be only four items at a time 
present in the short-term memory. There is no 
reason to postulate such erasure. Thus if subjects 
and predicates are considered chunks or items of in- 
formation, which seems reasonable since in the SE 
sentence they are presented discretely to the de- 
coder, eight such items would have to be stored at 
the three-degree level. Accordingly, it is suggested 
that the breakdown of comprehension at the three- 
degree level can be accounted for by the assumption 
that the limitation of short-term memory to ap- 
proximately seven items at a time (Miller, 1956) is 
first exceeded at this level of complexity. For RB 
sentences, the simple sentences conjoined by rela- 
tive pronouns may reasonably be considered as the 
chunks of information to be retained. Thus, even 
at five degrees of complexity, a maximum of only 
six chunks need be retained, The results therefore 
support Miller's claim, 

In conclusion, decoding beyond two degrees of 
SE for semantically neutral sentences is virtually 
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impossible, presumably due to structurally intrinsic 
short-term memory limitations. 
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FREE RECALL OF SENTENCES AS A FUNCTION OF IMAGERY 
AND PREDICTABILITY! 


P. J. HOLMES axo D. J. MURRAY? 


Queen's University, Kingston, Ontario, Canada 


Four sentences were presented for immediate free recall: 
varied in imagery and in predictability. 


the sentences were 
It was found that these two variables 


predominantly affected recall from the secondary memory component of the 


seria 


position curve, but that they exercised enough of an effect in what was 


ostensibly the primary memory component to suggest that even when primary 


memory could be used for retention .Ss 


will try to process sentences in depth. 


In general high imagery and high predictability improved recall, with the 


exception that predictable 


abstract sentences were better recalled than pre- 


dictable concrete sentences for the last serial position only. 


In studies of free recall of isolated words, it has 
been suggested that the last three or four words 
presented are recalled from a. short-lasting type of 
storage known as primary memory, whereas the 
early and middle items are more likely to be recalled 
from secondary or long-term memory (see, e.g., 
Glanzer & Cunitz, 1966). As yet there seems to be 
no comparable evidence in respect of sentence re- 
tention. It is known that two variables likely to 
aid sentence retention are imagery (Paivio, 1971) 
and predictability (Rosenberg, 1968). These varia- 
bles are, however, more likely to affect secondary 
memory: for example, Madigan, McCabe, and 
Itatani (1972) showed that the imagery value of 
isolated words presented for free recall mainly 
affected the secondary memory component of free 
recall. It would therefore be predicted that in the 
free recall of sentences only the recall of the early 
sentences would vary with imagery and predicta- 
bility. Recall of the last sentence presented, which, 
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according to current views, would probably be re- 
called from primary memory, should not be affected 
by these two variables. The present experiment 
was designed to discover whether this was the case. 

Method. Four types of sentence were constructed : 
Predictable-Concrete (PC), Predictable-Abstract 
(PA), Unpredictable-Concrete (UC), and Unpre- 
dictable-Abstract (UA). "Twelve examples were 
found of each, making a total of 48 sentences. All 
sentences were of the form, “The (adjective) 
(noun) (auxiliary. plus verb) (preposition) (adjec- 
tive) (noun)," with the variable of imagery being 
manipulated via the nouns. Using the Paivio, 
Yuille, and Madigan (1968) ratings of noun imagery, 
the mean imagery values for the nouns used in the 
PC and UC sentences were 6.5 (range, 5.73 to 6.86) 
and 6.45 (range, 5.87 to 6.86), respectively. For 
the PA and UA sentences the mean ratings were 
2.77 (range, 1.93 to 3.37) and 2.85 (range, 1.63 to 
3.37), respectively. All sentences were controlled 
for individual word frequency using the Thorndike- 
Lorge (1944) ratings and all sentences had approxi- 
mately the same number of syllables (mean, 15.31 
syllables). Sentence predictability was defined in 
terms of the real world likelihood of occurrence of 
the situation described by the sentence. Typical 


example: the four types of sentence are: 


PC; The enraged alligator was chasing the 
small boy. 


UC: The angry alligator was smoking his 
favorite claw. 

PA: The serious casualty was caused by poor 
TORS 

UA: ;ew casualty was lacking the re- 


vd hypothesis. 


oted that the same individual word can 
oth in P conditions and in U conditions: 
was deliberately done in order to keep the P 

U conditions as similar as possible in respect of 
sue individual words used. 

lhe Ss were divided into four groups: each group 
received just one of the four types of sentence. 
Within each group, the 12 sentences were divided 
into three sets of 4 each, with one set of 4 sentences 
constituting the material for a single trial. Each S 
thus received three trials, The sentences were 
presented by tape recorder at a rate such that each 
sentence took 4 sec. to present, followed by a 4-sec. 
interval before the next sentence. After each trial 
of 4 sentences, S had 3 min. in which to write down 
the 4 sentences in any order he preferred. After he 
hod tried to recall the sentences, he was asked to 

r words he might still recall, 

Lhe sentences were scored both in terms of the 
number of complete correct sentences, and in terms 
of the number of critical words retained. There 
were five critical words per sentence (two adjec- 
, two souns, one verb). Since there were few 
cos between the two methods of scoring, 
only t | word analysis will be presented here. 

The ` 40 students from Introductory Psy- 
chology classes at Queen's University, with 10 Ss in 
cach group. 

Results. Figure 1 shows the percentage of correct 
critical words recalled for each of the sentence types 
as a function of the serial position of presentation. 
‘The last sentence was nearly always recalled first, 
and, following the convention used typically in 
studies of free recall of words, the retrieval of the 
elements from the last sentence was considered to 
be from primary memory. The recall of the first 
three sentences was considered to be from secondary 
memory, It is plain from Figure 1 that the differ- 
ences between the four sentence types recalled from 
primary memory were less than those same dilfer- 
ences as recalled from secondary memory. An 
analysis of variance on the data shown in Figure 1 
reveal that concrete sentences were better recalled 
than abstract, F (1, 36) = 66.26, predictable sen- 
tences were better recalled than unpredictable, 7 
(1, 36) — 18.79, and that there was a strong serial 
position effect, F (9, 324) = 17.66. All the above 
values are significant at p « .001. The only sig- 
nificant interaction (at the .01 level) was Concrete- 
ness X Serial Position, F (9, 324) — 5.19: Figure 1 
suggests that this interaction was due to the steeper 
rise of the Abstract sentences on Serial Position 4 
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FiGURE |. Recall asa function of imagery and predictability. 
for the four sentences. (Abbreviations: PA = Predictable- 
Abstract, PC = Predictable-Conerete, UC = Unpredictable- 
Concrete, and UA = Unpredictable-Abstract.) 


in comparison with that of the concrete sentences, 
and is consistent with the original hypothesis that. 
the effects of imagery should be more marked in 
secondary memory than in primary memory. 
Nevertheless it should be noted that in primary 
memory the four data points shown in Figure 1 did 
differ significantly from each other at the .05 level 
as determined by Newman-Keuls tests. 

A closer analysis showed that the reason the PA 
words were so well recalled from primary memory 
was that the second adjective and noun were par- 
ticularly well recalled in comparison with the case 
for the other three groups. Since an analysis of 
recall by grammatical class showed that at nearly 
all serial positions the sentences were recalled in a. 
“forward” manner, that is, with the first adjective 
and noun being better recalled than the second 
adjective and noun, it would seem that PA subjects 
paid more attention to the end of the sentences in 
primary memory than was normally the case, Ir 


dentally, the fact of “forward” recall suggests that : 


Ss did indeed try to retain the sentences as units, 
rather than as collections of individual word: 
haps the first adjective-noun pair formed a 
ceptual peg" on which to hook the second adjective- 
noun pair (Paivio, 1971). The more predictable 
and concrete the total sentence, the easier this would 
be to do. 

Discussion. Imagery and predictability clearly 
affected recall from secondary memory, a finding 
consistent with the results of previous work. But 
in what was ostensibly retrieval from primary mem- 
ory, there were two surprises: retrieval did vary 
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with imagery and predictability, and PA recall was 
best of all. Two choices are open to us: either we 
reject the primary-secondary memory dichotomy, 
at least insofar as sentence retention is concerned, 
or we assume that Ss' strategies for dealing with 
sentences were such as to make it unlikely that the 
use of primary memory in sentence retention is the 
same as the use of primary memory for isolated 
words. Since Figure 1 shows that there was a clear 
serial position effect, we prefer to adopt the latter 
view. The fact that sentences were recalled in a 
forward manner at all serial positions suggests that, 
even though it was theoretically possible to store 
the last sentence as a collection of isolated words 
“echoing” for a few seconds, in fact Ss preferred to 
process the last sentence in depth just as they had 
been doing for the earlier sentences. The advantage 
for the PA sentences may have resulted from an 
excessive concentration on the storage of the last 
sentence: the difficulty of the preceding three sen- 
tences may have led Ss to compensate by storing 
the last PA sentence both in terms of meaning and 
at the same time relying on echoic memory to a 
greater extent than was necessary for the easier 
PC sentences. If this speculation is correct, it may 
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be inferred that there is a flexibility in t 

primary memory which has perhaps been i 
overlooked in the existing literature, In turn, this 
would suggest that some of the available 2 
for primary versus secondary memory actually 
flects differences in processing strategies rather t] 


two types of storage (cf. Craik & Lockhart, 1972) 
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ATTENUATION OF BLOCKING WITH SHIFTS IN REWARD: 


OF SCHEDULE-GENERATED 
EXTUAL CUES! 


` THE INVOLVEME? 
CONT 


JAMES H. NEELY anp ALLAN R. WAGNER °? 


Yale University 


Responding»sto an element, X, of a stimulus compound, AX, which signaled 
the availability of reward, was investigated in four discrete-trial bar-press 
experiments. “Blocking” of the acquisition of responding to X was produced 
in each case for some groups by separately training A alone as a s nal for 
reward, The principle issue was whether or not blocking would be attenuated, 
ie., whether or not there would be more responding to X if the reward condi- 
tions associated with AX were less favorable than those associated with A 
alone, as was reported by J. M. Feldman in apparent conflict with the R. A. 
Rescorla and A. R. Wagner model. Blocking was attenuated in Experiments 
I and II when the reward manipulation involved the percentage of rewarded 


trials and the magnitude of reward, respectively, and the A-alone tra 
preceded AX training. Blocking was not 
IV, which involved the same reward manipulations a 
but the A-alone training was intermingled with AX training. 
interpreted as suggesting that contextual stimuli generated 


respectively, 
The results were 


by the prevailing reward schedule may 


ing 
attenuated in Experiments III and 
Experiments | and II, 


need to be taken into account in 


extension of the Rescorla- Wagner model of Pavlovian conditioning to instru- 


mental learning. 


The so-called “blocking effect" occurs in 
Pavlovian conditioning (e.g., Kamin, 1969) 
when reinforced training to an elementary 
conditioned stimulus (CS) (e.g., A) is ad- 
ministered prior to reinforced training to a 
compound (e.g. AX) formed from that 
element and an added cue. The essential 
observation is that S will then be less 


1 This research was supported in part by National 
Science Foundation Grant GB-30299X to the second 
author and was conducted while the first author was 
a National Science Foundation predoctoral fellow. 

? Requests for reprints should be addressed to 
Allan R. Wagner, Department of Psychology, Yale 
University, 333 Cedar Street, New Haven, Con- 
necticut 06510. 
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likely to evidence conditioned responding 
to the added cue, X, than if S had received 
only the compound. training: Pretraining 
to A somehow “blocks” the conditioning 
that would otherwise have occurred to X 
as a result of the compound trials. The 
same effect can be demonstrated when re- 
inforced A trials are intermingled with, 
rather than entirely preceding, the rein- 
forced AX trials. That is, S will again 
respond less to X alone than if training 
involves only the AX compound (Wagner, 
1969). 

To account for this effect, as well as a 
number of related observations, Wagner 
and Rescorla (e.g., 1972) have proposed 
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that the change in signal value which 
accrues to any cue as a result of a Pavlovian 
training trial depends upon the aggregate 
signal value (V) of all the CSs present on 
that trial, in relationship to the asymptotic 
signal value (Y) that can be maintained by 
the consequent reinforcement or nonrein- 
forcement which occurs. Specifically, it is 
assumed that the change in signal value to 
the ith cue in a compound of n cues may 
be represented as 


AV; = abi; — V). 


where V = EV. 

In this formulation, the values of the 
two rate parameters a and 8 are restricted 
to the unit interval, the former being deter- 
mined by the characteristics of the specific 
component cue involved, the latter, like 
the value of \, being determined by the 
characteristics of the reinforcing event. 
Important for present consideration is the 
implication for the blocking paradigm that 
when cue A is reinforced alone it will serve 
to increase the value of V4 and hence y 
appropriate to subsequent AX trials, and 
thus decrease the discrepancy, ^ — V, de- 
termining the further increment in signal 
value that may be enjoyed by either A or 
X as a result of a compound trial. 

In fact, the model leads to a richer set 
of deductions within the general blocking 
paradigm if one relaxes the condition that 
the unconditioned stimulus (UCS) magni- 
tude or intensity be the same on A and AX 
trials. For example, if the UCS (and hence 
X) is less on A than AX trials, V4 cannot as 
nearly approach the ^ appropriate to the AX 
trials, so that greater increments will be al- 
lowed in Vx on such trials. That is, blocking 
should not be as complete. Alternatively, if 
the UCS is greater on A than AX trials, 
blocking should either be more complete 
than would be the case if the UCS were the 
same, or Vx should become negative so 
that X will act as an inhibitory cue. The 
latter case should result when the A-alone 
training is sufficient to make V4 greater 
than the ^ appropriate to the AX trials. 
Then \ — V will be negative on such com- 
pound trials, resulting in a decrement in 
both V4 and Vx, necessarily driving Vx 


a 


negative. (See Rescorla and Wagner, 1972, 
and Wagner and Rescorla, 1972, for more 
detailed derivations.) 

There are now several studies relevant 
to these predictions. The majority have 
employed conditioned emotional response 
(CER) conditioning in rats, and these are 
generally supportive of the model. Kamin 
(1969), for example, first paired Cue A w ith 
a 1-ma. shock and then the compound, AX, 
with a 4-ma. shock. This treatment pro- 
duced more conditioned suppression to X 
alone (less blocking) than comparison treat- 
ments in which A and AX training involved 
equivalent shock levels, either high or low. 
Kamin also reported an attenuation of 
blocking when A was trained with a single 
UCS pulse, whereas subsequent AX trials 
involved a double pulse. 

Using the same general procedures, 
Kamin (personal communication, June 
1973) has also investigated the blocking 
paradigm under conditions involving a 
shift in UCS intensity between the compo- 
nent and compound occasions, but when 
the shift involved a reduction in intensity, 
i.e., when A was paired with a more intense 
shock than was AX. Since the obtained 
blocking was nearly complete in the com- 
parison condition without a shift in UCS 
intensity, the study was insensitive to the 
prediction of the model that greater block- 
ing (or inhibitory learning) should occur 
under the shift conditions. Important, 
however, was the fact that no attenuation 
of blocking was detected as was seen in the 
previously cited study (Kamin, 1969) when 
A was paired with a less intense shock than 
was the AX compound. 

Rescorla, in a CER experiment reported 
by Wagner and Rescorla (1972), followed 
each AX compound by a .5-sec. shock. 
Intermingled A presentations were either 
paired with the same .5-sec. shock, in a 
standard blocking procedure, or were paired 
with a longer and presumably more aver- 
sive 5.0-sec. UCS. Subsequent testing of 
X, which involved transfer toa conditioned- 
inhibition training schedule, revealed that 
X had acquired less of an excitatory ten- 
dency (or more of an inhibitory tendency) 
in the treatment in which A alone was 


x 
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ed with the stronger UCS. Wagner 
Rescorla interpreted this finding as 
ing provisional support for their model 
proceeded to point out that the con- 
‘ional technique (e.g, Pavlov, 1927) 
raking a cue, X, a conditioned inhibi 
viz., reinforcing A and nonreinforcing 
is but the limiting case of the generally 
juate condition specified by their model, 
; is, the greater reinforcement of A than 


In contrast to the relatively consistent 
utcomes of the available CER studies is 
in investigation by Feldman (1971) involv- 
ing a discrete-trial food-rewarded bar-press 
situation, In brief, it appears that in such 
situations a change in reinforcement be- 
tween A and AX occasions may uniformly 
itenuate blocking, whether A or AX is 
ihe more favorably reinforced. Although 
the Rescorla and Wagner (1972) model is 

cifically addressed to Pavlovian rather 

a instrumental learning situations, the 
cneral pattern of findings concerning over- 
hadowing, blocking, inhibitory learning, 
ete, which the model was calculated to 

duce, have otherwise been indiscrimina- 

in the two situations (e.g., Wagner, 
1269; Wagner, Logan, Haberlandt, & Price, 
1968), Thus, it is important to attempt to 
reconcile the apparent conflicts that the 
Feldman data occasion both with respect 
to the Rescorla and Wagner model and the 
available CER studies. 

The four experiments reported here were 
designed to comment upon the generality 
of Feldman's (1971) problematic finding 
that blocking is attenuated when A is more 
favorably reinforced than is AX. Experi- 
ment I was an attempt to replicate the 
inding using the same reinforcement pa- 
rameter as did Feldian, viz., different 
percentages of reinforced trials. Experi- 
ment II was similarly designed but em- 
sloyed variations in reward amount rather 
than percentage. Experiments III and IV 
used the same reward parameters as Ex- 
periments I and II, respectively, but in- 
volved an intermingled presentation of A 
and AX trials as opposed to successive 
phases of training. As will be pointed out, 
the pattern of findings suggests a modifica- 


753 


tion in application of the Rescorla and 
Wagner (1972) model to account for varia- 
tion in the effects of shifting reward 
parameters in different situations. 


EXPERIMENT | 


Feldman (1971) ran six groups of rats in 
a bar-pressing situation in which reinforce- 
ment availability was announced by either 
an isolated cue, A, or a compound, AX. 
For three of the groups the AX compound 
was associated with 100% reinforcement 
(AX-100) while for the remaining three 
groups it was associated with 25% rein- 
forcement (AX-25). In either condition 
one of the three groups was pretrained on 
A with 100% reinforcement (A-100), an- 


other with 25% reinforcement (A-25), 
while the third, control, group was not 
pretrained. 


In test, the responding to X alone showed 
the usual blocking effect when the rein- 
forcement percentage was the same on A 
and AX occasions, ie., in spite of the 
equivalent reinforcement of X, the A-25/ 
AX-25 group responded less to X than the 
group receiving only AX-25, while the 
‘A-100/AX-100 group responded less to X 
than the group receiving only AX-100. 
'The novel observation, however, was that 
the two pretrained groups that experienced 
a shift in reinforcement percentage between 
A and AX occasions (A-25/AX-100 and 
A-100/AX-25) showed more responding to 
X (less blocking) than did the aforemen- 
tioned blocking groups that received. the 
equivalent reinforcement percentages on 
AX trials. 

The Rescorla and Wagner (1972) model 
can account for the fact that more respond- 
ing to X was observed in the A-25/AX-100 
group than in the A-100/AX-100 group. 
]t does not anticipate, however, that more 
responding to X would be observed in the 
A-100/AX-25 group than in the A-25/ 
AX-25 group. As long as the À associated 
with nonreinforcement is less than the À 
associated with reinforcement, it should 
always be the case that after an equal 
number of trials the signal value of a con- 
tinuously reinforced cue will be greater 
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than the signal value of a partially rein- 
forced cue. Thus, on reinforced AX trials, 
Vx should have been incremented less in 
the group for which A was pretrained with 
100% reinforcement than in the group for 
which A was pretrained with 25% reinforce- 
ment. Likewise, on nonreinforced AX 
trials, Vx should have been decremented 
more in the group for which A was pre- 
trained with 100% reinforcement than in 
the group for which A was pretrained with 
2595 reinforcement. The overall effect 
predicted by the Rescorla and Wagner 
model is that there should have been /ess, 
rather than more, responding to X alone 
in the A-100/AX-25 group than in the 
A-25/AX-25 group. 

Experiment | was designed to evaluate 
the reproducibility of those aspects of the 
Feldman (1971) data embarrassing to the 
Rescorla and Wagner model. 


Method 


Subjects, The Ss were 18 experimentally naive 
male albino rats weighing 275-375 gm. at the start 
of the experiment, Each S was individually housed 
with free access to water and was maintained at 
75%-80% of its original weight via limited feedings 
given immediately after each experimental session, 
Two Ss died, and 1 S was discarded for failure to 
respond during shaping, leaving 15 Ss from which 
data will be reported, 

Apparatus. Experimental sessions were con- 
ducted in one of six identical 24.2 X 32.4 X 18.7 
cm. operant chambers housed in sound-attenuated 
enclosures, A ventilation fan produced a continu- 
ous masking noise of 75 db. (General Radio sound- 
pressure meter, 20-kHz. scale), while a .143-w. 
jeweled house light located directly above the food 
well, in the center of an end wall, provided a low 
level of illumination, The manipulandum was a 
lever located 8.3 cm. above the floor and 2.0 cm. to 
the left of the food well, 

A 1,000-Hz. 80-db. tonal cue was provided by a 
Foringer stimulus generator (Model 1166-4-MI) and 
was delivered through a 7.5 x 14.0 cm. speaker 
located under the level of the grid floor directly 
below the food well. A diffuse visual cue was pro- 
vided by a shielded 3-w, lamp located at the ceiling. 
W hen appropriate to the reinforcement contingen- 
cies, a .045-gm. Noyes food pellet was delivered to 
the food well from a Davis Model PD-109A feeder, 
All experimental events were controlled by electro- 
mechanical equipment in an adjacent room. 

Experimental design. Three conditions of training 
equivalent to those used by Feldman (1971) were 
employed to evaluate the effects of reducing reward 
percentage between the A and AX occasions, Group 


AX-25 was trained only with the AX compound, 
signaling reward on 25% of the trials. Groups 
A-25/AX-25 and A-100/AX-25 received an equiva- 


lent phase of AX training but were first pretrained 
with Cue A alone signaling reward on either 25% 
or 100% of the trials as indicated. Following 5 
loss, five Ss remained in each group. For al 
the auditory stimulus was designated as A and the 
visual stimulus as X. 

Shaping. Over the first five days of the experi- 
ment, Ss were trained to bar press for food reward 
and gradually adjusted to a trial regime. O ch 
of Days 1-3, 5 was exposed to four cycle a 
continuous reinforcement and extinction sc! le 
consisting of 13 min. in which either A or A^ 
present (depending upon which cue was first « 
uled in training) and each bar press was rewa: 
followed by 2 min. in which no cue was present an 
bar pressing was not rewarded. On Days 4-5, a 
trial procedure was initiated wherein the appropriate 
cue was briefly presented at regular intervals duriny 
I-hr. ons, signaling the only occasions in which 
reward was available. If a response occurri 
stimulus was terminated with reinforcemen 
the absence of a response, the stimulus wa 
matically turned off, initially after 15 sec., bi 
a gradual shortening to 6 sec. The intertrial iuie 
val, as measured from the onset of adjacent pro- 
grammed trials, was originally 15 sec. and was 
gradually increased to 30 sec. by the end of Day 5 
Finally, a pretrial period was prescribed, just ; 
to each scheduled cue presentation, such that a 
response in this interval would cancel the trial 
occurrence, The pretrial duration was initially .5 
sec. and was gradually increased to 5.0 sec. Thus, 
by the end of shaping, the conditions were in effect 
which prevailed throughout training—a 5-sec pre 
trial period, a 6-sec. (maximum) A or AX presenta 
tion, and a 30-sec. intertrial interval. 

Element training. Groups A-25/AX-25 and A 
100/AX-25 received six days of traini g with the 
A cue alone prior to compound training. Each 
daily session was scheduled to include 96 possibl 
trial occasions. In Group A-100/AX-25, e: A 
presentation announced the availability of reinforce 
ment, whereas in Group A-25/AX-25, reinforcc ment 
was available on only 25% of the trials in 
random sequence, 

Group AX-25 received no element training but 
had the shaping sessions delayed so that all three 
groups entered compound training at the same time 
in relationship to the initiation of deprivation. 

Compound training. Reward training terminated 
for all groups with eight daily sessions with the AX 
compound scheduled to announce reinforcement on 
25% of the trials. As in the element training, the 
schedule called for 96 trial occasions within each 
session. 

Testing. Responding to X alone as well as to A 
and the AX compound was evaluated in a final 
experimental session. Following 20 AX trials with 
the reward contingencies still in effect, all Ss were 
administered a series of 165 test trials in which a 
bar press terminated the stimulus but was 
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There were 55 test trials to each of A, 
ing the 


rewarded. 
X, and A) 
restriction that successive blocks of 15 trials in- 
clude 5 to each cue. In order to insure that all 
' test. trials were received, the pretrial, time-out 
contingency was omitted during testing. 


in a pseudorandom sequence he 


Results and Discussion 


All Ss rapidly adopted discriminated bar 
pressing during the training sequence such 
that on the first postshaping day the per- 
centages of trials presented (i.e., not pre- 
vented by a pretrial response) were 88.0, 
85.0, and 89.0, while the percentages of 
those trials presented which were responded 
to were 90.0, 86.0, and 92.0 for each of 
Groups A-25/AX-25, A-100/AX-25, and 
AX-25, respectively. The only remarkable 
deviation from such performance was on 
the first day in which the X cue was intro- 
duced in the pretrained groups. On this 
day, percentage responding to the trial cue 
dropped for each of the 10 shifted Ss (p < 
.01, sign test) in comparison with the pre- 
ceding day of element training, so that the 
mean percentage responding was 69.0 and 
63.0 for Groups A-25/AX-25 and A-100 
AX-25, respectively. With the exception 
of this transient evidence of "external in- 
hibition," however, response levels were so 
uniformly high as to preclude the detection 
of any potential treatment differences 
during training. 

Table 1 presents the percentage responses 
to each of the test stimuli for each of the 
three groups. Inspection of the data from 
Group AX-25 suggests that the visual X 
cue was more salient than the auditory A 
cue, the former being responded to on 
50.2% of its test trials, the latter on only 
19.6%, after equivalent compound training. 
In comparison, Group A-25/AX-25 re- 
sponded considerably more frequently to 
the pretrained A cue and less to the X cue, 
so as to reverse the ordering of responding 
to the two cues. Group A-100/AX-25, in 
a similar comparison to Group AX-25, 
evidenced less of an advantage in respond- 
ing to the pretrained A cue and no apparent 
decrease in responding to X. 

Analyses of variance indicated that the 
differences among the three groups in re- 
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Cue. 
Group [= = eee 
| AX 
AX-25 1.6 | 73.1 
A-25/AX-25 62.5 71.6 
A-100/AX-28 | 64.0 


Note, See text for explanation of group names, 


sponding to X were statistically reliable, 
F (2,12) = 5.54, p < .025. Subsequent / 
tests, employing the overall within-groups 
variance as the error estimate, revealed that 
Group A-25/AX-25 responded significantly 
less to X than did either Group AX-25, ! 
(12) = 2.91, p <.05; or Group A-100/ 
t (12) = 2.75, p < .05, while the 
latter twogroups were not reliably different, 
L « 1.0. 

Similar analyses of responding to the A 
cue indicated a statistically reliable differ- 
ence among the three groups, F (2, 12) 
= 681, p < .025, with the responding 
of Group A-25/AX-25 being significantly 
greater than that of Group AX-25, t (12) 
= 3.69, p < .01. Although the responding 
of Group A-100/AX-25 was greater than 
that of Group AX-25 and less than that 
of Group A-25/AX- these differences 
did not reach conventional levels of sta- 
tistical reliability in either case, / (12) = 
1.88, p « .10; and t (12) = 1.81, p < .10, 
respectively. There were no reliable dif- 
ferences among the groups in responding to 
the AX compound (F = 1.09). 

The major result of Experiment 1 was to 
replicate the Feldman (1971) finding: 
There was an attenuation of the blocking 
of responding to X when A was more 
favorably reinforced during pretraining 
(Group A-100/AX-25) as compared to a 
procedure involving the equivalent rein- 
forcement of component and compound 
(Group A-25/AX-25). 

While this finding is apparently inconsis- 
tent with the Rescorla and Wagner (1972) 
model, it should also be noted that a pre- 
sumptive condition was not obviously met. 
That is, there was no evidence that pre- 
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training A with the higher percentage re- 
inforcement resulted in greater signal value 
to A. Indeed, the test data showed some- 
what less responding to A following 100% 
as compared to 25% reinforcement in 
pretraining. As testing involved an extinc- 
tion series, the relative A responding of 
Groups A-100/AX-25 and A-25/AX-25 
may of course be taken to reflect the 
ubiquitous "partial-reinforcement effect." 
But as far as the Rescorla and Wagner 
model is concerned, this only raises the 
question of how such an effect is to be 
explained and leaves unchanged the fact 
that there was no evidence that the cue 
with which X was trained in compound 
had been pretrained to a higher signal value 
in Group A-100/AX-25 than in Group 
A-25/AX-25. 


EXPERIMENT II 


Reduction in reward percentage on the 
occasion of shift between component and 
compound training appears to lead to a 
reproducible attenuation of blocking, at 
least in an instrumental bar-pressing situa- 
tion such as that employed by Feldman 
(1971) and in Experiment I. The im- 
mediate question which this raises is how 
such an observation may be related to the 
contradictory evidence of Kamin (personal 
communication, June 1973) and Rescorla 
(cited in Rescorla & Wagner, 1972), ob- 
tained in a CER situation. 

Other than the major paradigmatic dif- 
ferences between the two training situa- 
tions, the one instrumental and appetitive, 
the other Pavlovian and aversive, there is 
the obvious difference in the reinforcement 
parameter manipulated, percentage of re- 
inforced trials in the present situation as 
contrasted with the magnitude of a con- 
sistently occurring reinforcement in the 
Kamin (personal communication, June 
1973) and Rescorla (cited in Rescorla & 
Wagner, 1972) studies. To evaluate this 
difference, Experiment I] was designed 
with conditions analogous to those of 
Experiment I but with variation in the 
magnitude of food reward rather than per- 
centage reinforcement, 


Method 


The apparatus, S population, and procedure 
except as noted, were identical to those of Experi 
ment I. Twelve Ss were randomly assigned to each 
of three experimental groups designated according 
to the number of pellets of reward consistently 
received for each discriminated bar press in the 
separate phases of element and compound training, 
ie, Group AX-1, Group A-1/AX-1, and Group 
X-1. 

Shaping. Preliminary shaping was carried out in 
eight daily 30-min. sessions. On each of Days 1—4, 
S was exposed to two cycles of a continuous rein- 
forcement and extinction schedule as in Experiment 
I. Over Days 5-8, Ss were gradually adjusted to 
the trial regime as in Experiment I, ept that the 
intertrial interval was brought to 60 sec., and Group 
A-5/AX-1 received five pellets reward for each re- 
sponse to the trial cue. 

Element training. Groups A-1/AN-1 and A-5/ 
AX-1 received 20 days of training with the A cue 
alone prior to compound training. zach daily 
session was scheduled to include 16 possible trial 
occasions, and on Days 6, 8, 10, 12, 14, 16, and 18 
the receipt of 16 trials was insured by terminating 
the session for each S only when this number had 
been completed, For Ss in Group A-1/AX-1, each 
trial response terminating the cue was reinforced 
with one .045-gm. Noyes pellet. For Ss in Group 
-l, each similar response was reinforced 
with five pellets delivered at 150-ms tervals. 

Compound training. For the final phase of train- 
ing, all three groups received 20 daily sessions of 16 
scheduled trials with the AX compound and one- 
pellet rewards. 

Testing. Responding to the compound and com- 
ponent cues was evaluated over 4 test sessions. On 
Day 1, 4 rewarded AX trials were administered, 
then 18 nonreinforced test trials, in such an order 
that 2 A, 2 X, and 2 AX trials occurred in each 
On each of Days 2-4, 
the 18-trial test sequence was repeated to produce 
a total of 24 tests to each cue. 


Results and Discussion 


More pretrial responding occurred than 
in Experiment I, but again the groups did 
not differ reliably in this respect nor in 
percentage of trial responses within any 
phase of training. During the common 
compound phase, Groups A-1/AX-1, AX-1, 
and A-5/AX-1, respectively, received 91%, 
88%, and 92% of the scheduled trials, 
(F < 1), and responded on 88%, 93%, 
and 92% of the occasions (F = 1.26). 

Table 2 presents the percentage of re- 
sponses to the two component cues and the 
compound oyer the test series for each of 
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the three groups. In general, the pattern 
of data repeats that of Experiment I. The X 
cue was more frequently responded to than 
A following equivalent joint training in 
Group AX-1. In comparison, Group A-1 

AX-1 responded considerably more often 
to the pretrained A and less to X. Group 
A-5/AX-1 evidenced no greater, indeed 
slightly less, advantage over Group AX-1 
in responding to the pretrained A cue than 
did Group A-1/AX-1, with no apparent 
comparative decrease in responding to X. 

Analysis of variance indicated that the 
overall difference among the three groups 
in responding to X was statistically reliable, 
F (2, 33) = 3.64, p < .05, while subsequent 
contrasts revealed that Group A-1/AX-1 
responded significantly less to X than did 
either Group AX-1, £ (33) = 2.54, p < .05, 
or Group A-5/AX-1, ¢ (33) = 2.04, p = 
.05, with the latter two groups not reliably 
different (t < 1). Similar analysis of re- 
sponding to A indicated a statistically 
reliable difference among the three groups, 
F (2, 33) = 12.88, p < .001, with the re- 
sponding of Group AX-1 being significantly 
less than that of either Group A-1/AX-1, 
t (33) = 4.65, p < .01, or Group A-5 AX-1, 
L (33) = 4.06, p < .01, which themselves 
did not differ (t <1). There were no 
reliable differences among the groups in 
responding to AX (F < 1). 

It appears that the attenuation of block- 
ing reported by Feldman (1971) and in 
Experiment I, when the pretrained A cue 
was more favorably rewarded than the sub- 
sequent AX compound, is not peculiar to 
variation in reward percentage but extends 
as well to variation in reward magnitude. 


EXPERIMENT III 


The results of Experiments I and II are 
apparently inconsistent with the results 
obtained by Kamin (personal communica- 
tion, June 1973) and Rescorla (cited in 
Wagner & Rescorla, 1972) in CER condi- 
tioning and correspondingly at variance 
with the Wagner and Rescorla model which 
anticipates the CER data. In attempting 
to reconcile these differences, it may be 
helpful to recognize that there are other 
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TABLE 2 
Responses TO EACH 


PERCENTA! 


Group | reme xn 
x A AX 
| 45.8 28.5 75.0 
|. 26.8 66.7 61.1 
| 42 61.8 TAT 


Note, See text for explanation of group names, 


major effects of manipulating reinforcement 
percentage and magnitude that are more 
typical of instrumental than Pavlovian 
conditioning and also not obviously ac- 
counted for by the Wagner and Rescorla 
model. It is possible that a manner of 
application of the model that would em- 
brace these major reinforcement effects 
would also accommodate the blocking data 
of present concern. 

One of the most reproducible and robust 
phenomena in instrumental learning situa- 
tions is the partial-reinforcement effect, 
i.e., the fact that reinforcing only a portion 
of the trial responses produces more per- 
sistent responding during an extinction 
series than. does consistent reinforcement 
(e.g., Capaldi & Hart, 1962 ; Wagner, 1961). 
In comparison, in studies involving Pavlov- 
ian conditioning of infrahuman Ss, the 
partial reinforcement effect has frequently 
not been obtained (e.g., Thomas & Wagner, 
1964; Vardaris & Fitzgerald, 1969 ; Wagner, 
Siegel, & Fein, 1967) or has been relatively 
evanescent (e.g., Wagner, Siegel, Thomas, 
& Ellison, 1964). Likewise, whereas it is 
a common observation in instrumental- 
learning situations that small reward mag- 
nitudes during acquisition produce more 
persistent extinction responding than do 
large reward magnitudes (e.g., Hulse, 1958; 
Wagner, 1961), such an effect has not been 
observed in comparable Pavlovian investi- 
gations (e.g., Wagner et al., 1964). 

Without added assumptions, the Rescorla 
and Wagner (1972) model would predict 
extinction performance more in line with 
that typically seen in Pavlovian condition- 
ing as opposed to instrumental learning. 
Assuming that A has a low (e.g., zero) value 
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on nonreinforced trials and increases with 
increasing magnitudes of reward (Wagner 
& Rescorla, 1972), the greater the percen- 
tage or magnitude of reward, the greater 
should be the effective limit toward which 
conditioning can proceed and hence the 
greater the signal value from which ex- 
tinction is begun. 

A common theoretical device for ac- 
counting for the partial-reinforcement effect 
and the slower extinction following smaller 
rewards is to assume that the reinforcement 
schedule, be it in acquisition or extinction, 
generates stimuli that function in com- 
pound with the nominal cues. Thus, for 
example, Capaldi (e.g., 1966) emphasizes 
the stimulus “aftereffects” of reward and 
nonreward, and Amsel (e.g., 1962) empha- 
sizes the cue properties of “anticipatory 
reward" and “anticipatory frustration." 
Whatever the specific identification of the 
schedule-generated stimuli, the argument 
is commonly made that shifting from an 
aquisition schedule to an extinction sched- 
ule produces less of a change in stimulation 
for partially reinforced Ss than continu- 
ously reinforced Ss and less of a change 
for small-reward Ss than for large-reward 
Ss. Thus, for example, partially reinforced 
Ss are assumed to be more resistant to 
extinction than continuously reinforced Ss, 
not because of a "stronger" response ten- 
dency, but because they have presumably 
been trained to respond in the presence of 
cues that more closely approximate those 
that occur during extinction. 

Such reasoning, of course, is perfectly 
compatible with the Wagner and Rescorla 
(1972) model. It simply leads to an en- 
tiched conceptualization of the stimuli that 
must potentially be taken into account in 
application of the model in any situation. 
Thus, for example, it might be assumed 
that when a nominal cue, A, is rewarded 
on a 25% schedule during acquisition, the 
functional stimulus should better be repre- 
sented as AS, with the latter term standing 
for those stimuli generated by the particular 
partial reinforcement schedule. Rescorla 


è This notation is deliberately noncommital con- 
cerning the specific source and character of the 
schedule-generated stimuli since such is not im- 


and Wagner (1972) in several applications 
of their model have made important appeal 
to “situational” cues acting in compound 
with the nominal CS. It apparently should 
further be acknowledged that certain of 
these situational cues are a consequence of 
the prevailing reward schedule. To handle 
the fact that such extinction phenomena 
as the partial-reinforcement effect, which 
are presumed to depend upon the influence 
of schedule-generated stimuli, are more 
prevalent in instrumental as compared to 
Pavlovian conditioning, it need only be 
assumed that the relative saliences (a 
values) of the nominal and schedule-gener- 
ated cues are likely to differ in the two 
situations under conventional experimental 
arrangements such that the schedule-gen- 
erated cues typically come to carry a 
greater relative signal value in instrumental 
as compared to Pavlovian conditioning. 
This formulation has important implica- 
tions for cases in which reward parameters 
are shifted in a blocking paradigm, as in 
Experiments I and II, if it is assumed that 
there are, in fact, salient schedule-produced 
cues that must be taken into account. 
That is, it must then be assumed that the 
reward shift produces not only a shift in 
X but a change in the stimuli present during 
the pretraining and compound phases. 
Such stimulus change should consistently 
work to attenuate blocking of the added 
cue regardless of whether the reward shift 
is toward a higher or lower value. It may 
be seen that if S is pretrained with AS’ 
rewarded and then with ASX rewarded, 
the degree of blocking to X will be deter- 
mined by the signal value which is held by 
AS as a result of AS’ training, and this will 
be limited by differences in S and S’ which 
make Vg less than Vs’. When AS’ is 
associated with a less favorable reward 
schedule than ASX, both the lower \ during 
pretraining and the change in cues between 


portant here. It would be anticipated, however 
that in more precise treatment it would be ne 
sary to acknowledge that the schedule-generated 
stimuli are not constant over a series of trials, but 
change systematically with each reinforcement and 
nonreinforcement occasion, with time since such 
occasions, and likely with other variables as well 
(see, e.g., Capaldi, 1966). 
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the former and latter occasions should work 
to minimize the signal value of AS and 
allow acquisition to X, i.e, to attenuate 
blocking. When AS' is associated with a 
more favorable reward schedule than ASX, 
the greater À during pretraining and the 
shift in stimuli should have opposing in- 
fluences upon the subsequent acquisition 
to X: The greater À should work to produce 
a greater signal value of AS so as to mini- 
mize acquisition to X (or produce inhibi- 
tory learning), while the shift in stimuli 
should serve to minimize the signal value 
of AS and allow acquisition to X. Accord- 
ing to this treatment, it would be expected 
that increasing reinforcement between pre- 
training and compound training would con- 
sistently attenuate blocking, whether it be 
in CER conditioning (e.g, Kamin, 1969) 
or discriminated bar-press learning (Feld- 
man, 1971), whereas the results of decreas- 
ing reinforcement would be expected to be 
more variable, depending upon the relative 
magnitude of the opposing ^ and stimulus- 
change effects in different experimental 
situations. In this light, it is significant 
that when decreasing reinforcement has 
been found to attenuate blocking it has 
been in instrumental reward learning (Feld- 
man, 1971; Experiments I and II above) 
where there is ample other reason for as- 
suming that schedule-produced stimuli are 
important controllers of behavior, rather 
than in CER conditioning where such an 
assumption is less demanded, as has been 
pointed out above. 

This set of arguments makes analysis of 
the blocking paradigm relatively compli- 
cated. One must be concerned, for ex- 
ample, with the schedule-produced stimuli 
that are present during testing as well as 
during training. That is, it must be as- 
sumed that responding during an exti nction 
test session is not assessed to X alone but 
to XS", where S” represents those stimuli 
produced by the prevailing nonreinforce- 
ment condition, Then the relevant theo- 
retical question is not simply the degree 
of acquisition to be expected to X under 
various blocking treatments, but the signal 
value to be expected to XS", where S" 
can be assumed to be more similar to the 
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stimulation produced by partial reinforce- 
ment than by continuous reinforcement, 
and more similar to the stimulation pro- 
duced by small rewards than by large 
rewards (e.g., Capaldi, 1966). 

As unwieldy and indeterminate as this 
interpretation may be, it allows at least 
one clear prediction. It suggests that the 
only factor acting to attenuate blocking 
when A is more favorably reinforced than 
AX is the change in schedule-generated 
stimuli between the occasions of compo- 
nent and compound training. Thus, if A 
were more favorably reinforced than AX, 
but in the context of the same schedule- 
generated stimuli, there should be no atten- 
uation of blocking. Indeed, the remaining 
factors, such as the greater \ on A than 
AX occasions, should then be allowed to 
produce enhanced blocking or the acquisi- 
tion of inhibitory tendencies to X, as 
otherwise predicted by the Rescorla and 
Wagner (1972) model. 

Experiments III and IV were designed 
to evaluate this possibility. It is known 
that blocking will occur when A and AX 
trials are intermingled, as well as in se- 
quential phases of training (e.g., Wagner, 
1969), In such an instance, the prevailing 
schedule-generated stimuli which may be 
in compound with the nominal cues can 
be assumed to be equivalent on the two 
kinds of trials. Thus, if A were to be 
preferentially rewarded in comparison with 
AX, both kinds of trials would still occur 
in the context of the same schedule-gener- 
ated stimuli, resulting from the inter- 
mingling of the separate rewards. In this 
case, there should be no attenuation of 
blocking as was obtained in Experiments | 
and II, which employed separate phases 
of A and AX training. Experiment IH 
repeated the blocking comparison of Ex- 
periment I, involving different percentages 
of reinforcement, but with an intermingling 
of A and AX trials, Experiment IV simi- 
larly repeated the blocking comparison of 
Experiment II, involving different magni- 
tudes of reward. 


Method 


The apparatus, 5 population and procedures, 
except as noted, were identical to those of Experi- 
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ment I, Six Ss were randomly assigned to each of 
three experimental groups designated according to 
the percentage of reinforcement scheduled in the 
presence of the A element and AX compound. 
Group AX-25 was trained only with the AX com- 
pound signaling reward on 25% of the trials. Group 
A-25;AX-25 and Group 100;AX-25 received 
equivalent Ha with AX but had intermingled 
trials with A alone signaling reward on either 259 
or 100% of the trials, r 

Shaping. Over the fi five days of the experi- 
ment, 5s were gradually adjusted to a trial regime 
in a manner identical to Experiment I except that 
the cue presentations in Group A X-25 and 
Group A-100;AX-25 were equally distributed be- 
tween A and AX, while Group AX-25 received only 
AX: 

Training. Element and/or compound training 
was carried out over 14 sessions, equivalent to the 
total number in Experiment I, but in this case, all 
Ss received 48 scheduled AX trials daily. Group 
ived no further tri hile Group 
l; and Group A-100;A) received 48 
additional daily trials with the A cue a 
spersed among the AX trials according to an ir- 
regular sequence. 

Testing. Following 20 trials with the reward 
contingenci still in effect, responding to the 
several cues was evaluated in a final nonreinforced 
test series identical to that in Experiment f. 


one inter- 


Results 


As in the previous experiments, all Ss 
rapidly adopted discriminated bar pressing 
during the training sequence, revealing no 
notable differences among the three groups 
or between the A and AX trials within 
the groups receiving both cues. 

'The final test data of major interest 
are presented in Table 3. Comparing the 
percentage responding to X of Group 
A-25;AX-25 with that of Group AX-25 re- 
veals a blocking effect similar in magnitude 
to that observed in the comparable percent- 
age groups of Experiment I. The major 
finding of Experiment III is to be seen in 


TABLE 3 


MEAN PERCENTAGE OF BAR-PRESS RESPONSES TO 


EACH STIMUL 
Cue 
Group — = 
x A AX 
57.8 26.1 89.8 
37.4 76.7 79.2 
30.9 62.1 61.0 


Note, See text for explanation of group names, 


the relative responding to X in Group 
A-100;AX- Rather than responding 
similarly to Group AX-25, as did the com- 
parable Group in Experiment I, Group A- 
100;AX-25 evidenced no attentuation of 
blocking, indeed responded somewhat less 
to X than did Group A-25;AX-25. Anal- 
ysis of variance indicated that the difference 
among the three groups in responding to X 
was statistically reliable, F (2, 15) = 5.80, 
p < .05, with Group A-100;A X-25 as well 
as Group A-25;AX-25 responding signifi- 
cantly less than Group AX-25, ¢ (15) = 
3.25, p < .01; and ¢ (15) = 2.46, p < .05, 
respectively. The difference between the 
two groups trained with A alone was not 
reliable (t < 1). 

The pattern of responding in test to A 
and AX was less radically different from 
that of Experiment I. Again, training A 
alone increased the test responding to this 
cue in Group A-25;AX-25 and Group 
A-100;AX-25 ascompared to Group AX-25, 
and somewhat less in Group A-100;A2 


in which A alone was consistently rein- 
forced, than in Group A-25;AX- in 
which the cue was partially reinforced. 


Notable also was a tendency similar to 
that observed in Experiment I for the 
group in which A alone was consistently 
reinforced to respond less frequently to the 
AX compound than did the group trained 
only with the compound. Separate analy- 
ses of variance performed on the respond- 
ing to A and to AX each revealed signifi- 
cant differences among the several groups, 
F (2,15) = 10.37, p < .01; and F (2, 15) 
= 5.12, p < .05, respectively. However, 
in neither case was the contrast between 
the twogroups trained with A alone reliable. 
Group A-25;AX-25 and Group A-100;AX- 
25 both responded significantly more to 
A than did Group AX-25, ¢ (15) = 4.43, 
pb < .01; and ¢ (15) = 3.15, p < .01, re- 
spectively, while Group A-100;AX-25 also 
responded significantly less to the AX 
compound than did Group AX-25, ¢ (15) 
= 3.17, p < .01. 


EXPERIMENT IV 


The results of Experiment III, in which 
A and AX training trials were intermingled, 
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stand in contrast to those of Experiment I, 
in which A and AX training trials were 
presented in sequential phases: In Experi- 
ment I, blocking was abolished when A was 
more consistently reinforced than was AX ; 
in Experiment III, blocking was not at- 
tenuated under such circumstances. Ex- 
periment IV was consequently conducted 
as a modification of Experiment II, involv- 
ing intermingled as opposed to phased 
training. P 

a 


Meihod 


The apparatus, S population and procedures, 
except as noted, were identical to those of Experi- 
ment II, and when different, were patterned after 
those of Experiment IHI. Twelve Ss were randomly 
assigned to each of three experimental groups desig- 
nated according to the number of pellets of reward 
scheduled for a discriminated bar press in the 
presence of the A element and the AX compound 
during training, ie., Group AX-1, Group A-1;AX-1 
and Group A-5;AX-1. 

Shaping. Over the first eight days of the experi- 
Ss were gradually adjusted to a trial regime 
ing of a 5-sec. pretrial period, a 6-sec. 
mum) CS, and a 60-sec. intertrial interval, 
ical to Experiment II. The only change in 
shaping from that in Experiment II was that A and 
AX were equally presented from the outset in Group 
A-1;AX-1 and Group A-5;AX-1, whereas only the 
AX compound was presented in Group AX 
xperiment II, the five-pellet rewards to A were 
in Group A-5;AX-1. 
Training. Acquisition was distributed over 40 
sessions equivalent to the total length of element and 
compound training in Experiment II. All Ss had 
eight AX trials scheduled daily. Group A-1;AX-1 
and Group A-5;AX-1 had an additional 
trials scheduled daily, irregularly interspersed among 
the AX trials. To minimize the number of trials 
that could be missed a result of any pretrial 
responding early in acquisition, occasional sessions 
within the first half of training were terminated for 
each S only after the scheduled number of A or AX 
trials was received, in a manner similar to Experi- 
ment II. 

Testing. Responding to the compound and com- 
ponent: cues was evaluated over four nonreinforced 
test sessions identical to those of Experiment II. 
Each session included six A, six X, and six AX test 
trials, 


Results 


Acquisition performance, as in the previ- 
ous experiments, was generally so high as 
to obscure the detection of group differ- 
ences, There was, however, evidence of 
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TABLE 4 
MEAN PERCENT. 
Eacu Ti 
| Cue 
Group EE ce N 
eres 
| 590 | | 77.4 
122271 74.7 
35.4 69.5 
Note. See text for explanation of group names. 
the higher reward magnitude in Group 


A-5;AX-1 as compared to Group A-1;AX-1 
in a reliably more frequent responding 
(90.5% vs. 73.9%) to A alone over the first 
eight sessions, ¢ (22) = 3.27, p < .05, be- 
yond which the groups did not differ. 

The percentages of bar-press responding 
to the several cues during testing are pre- 
sented in Table 4. As expected, blocking 
was observed in the X responding of Group 
A-1;AX-1 as compared to Group AX-1, 
The important observation is that an equiv- 
alent blocking was seen in the X responding 
of Group A-5;AX-1. This finding con- 
trasts sharply with that from the compara- 
ble reward group of Experiment II involving 
sequential phases of A and AX training, 
but is consistent with the results of Ex- 
periment III when A was also more 
favorably reinforced than AX in an inter- 
mingled schedule. Analysis of variance 
indicated a significant difference among the 
three treatment groups in responding to X, 
F (2, 33) = 6.58, p < .005, attributable to 
the lower frequency of response in Group 
A-1;AX-1 and Group A-5;AX-1 in com- 
parison to Group AX-1, £ (33) = 3.16, p < 
.01;and£ (33) = 3.11, < .01, respectively. 

The lower level of responding. to X in 
Group A-1;AX-1 and Group A-5;AX-1 in 
comparison to Group AX-1 was accom- 
panied by a higher level of responding to 
the separately trained A cue. Analysis of 
variance performed on the frequencies of 
responding to A indicated a reliable differ- 
ence among the three treatment groups, F 
(2, 33) = 30.08, p < .005, with Group 
AX-1 responding less than either Group 
A-1;AX-1, f (33) = 6.72, p < .01, or Group 
A-5;AX-1, ¢ (33) = 6.92, p < .01, and no 
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reliable difference between the latter two 
groups. None of the differences in respond- 
ing to AX approached statistical reliability. 


GENERAL DISCUSSION 


In an instrumental-learning situation in 
which responding to one cue, A, is pretrained 
prior to training on a compound, AX, blocking 
of the acquisition of responding to X is at- 
tenuated by reducing the percentage or mag- 
nitude of reward with the introduction of the 
compound (Feldman, 1971; Experiments I and 
II above). If, however, A and AX are con- 
temporaneously trained in the same sessions, 
no similar attenuation of blocking results from 
less favorably rewarding responding to AX vs. 
A (Experiments III and IV above). 

Any theoretical interpretation of this pattern 
must somehow discount the importance of the 
differential rewards per se as producing the 
attenuation of blocking seen in Experiments I 
and Il. The view that has been presented 
here is that the reward schedule, at least in 
some situations, contributes important con- 
textual stimuli which work in compound with 
the nominal cues to control S's behavior and 
influence the learning which occurs. Changing 
contextual stimuli at the time of introduction 
of a new cue may then be assumed to decrease 
the likelihood that the new cue will be treated 
as redundant and increase the likelihood that 
its signaling relationship with reward will be 
learned about. Such reasoning is compatible 
with the Rescorla and Wagner (e.g., 1972) ac- 
count of the blocking phenomenon as well as 
alternative attentional treatments (e.g., Suth- 
erland & Mackintosh, 1971). 

The implication of the Rescorla and Wagner 
(1972) model remains, however, that if the 
contribution of the schedule-generated stimuli 
could be ignored, preferentially rewarding A 
as compared to AX should either enhance 
blocking or make X a conditioned inhibitor, 
depending upon the degree of training in- 
volved. It thus remains problematic for the 
theory that Experiments III and IV, which 
sought to create equivalent schedule-generated 
contextual cues during the differential reward- 
ing of A and AX, did not provide evidence of 
enhanced blocking in comparison with the 
eases in which A and AX were equivalently 
rewarded. 

It is not clear how seriously this apparent 
failure of the model should be viewed. First 
of all, of course, it may simply represent a 
detection failure. With the relatively low 


levels of responding to X in Groups À-25;^ 

and A-1;A X-1it may be difficult to demon 

yet less responding in Group A-100;AX-25 : 
Group A-5;AX-1, respectively. Indeed 
comparison conditions were provided to co 
ment upon the degree to which the respondi: 
to X observed in these blocking groups wi 
greater than that which would have been see: 
to an untrained cue. More importantly, one 
should recognize that the designs of Experi 
ments III and IV were such as to insure that 
A and AX were trained in the presence of the 
same schedule-generated contextual stimuli 
within any blocking group. The designs did 
not insure that A or AX were trained in the 
presence of the same schedule-generated stimuli 
across the different blocking groups in either 
experiment, or therefore that the change in 
schedule-generated stimuli would be equiva- 
lent in the different blocking groups when the 
nonreinforced test sessions were introduced. 
Thus, one could argue that X was indeed more 
effectively blocked in Group A-5;AX-1 than 
in Group A-1;AX-1 of Experiment IV for 
example, but this tendency was obscured by 
the differential change in schedule-generated 
stimuli between training and testing in the 
two cases. Just as clearly, however, any such 
arguments would be a posteriori, with the 
issue better decided by further investigation 
in which potential differences in schedule- 
generated stimuli might be better specified and 
controlled. 

Additional complexities are introduced into 
any theoretical analysis when one assumes that 
schedule-generated stimuli must be taken into 
account along with the nominal cues. Such 
an assumption has been extremely useful, how- 
ever, in understanding the effects of reward ma- 
nipulations in instrumental-learning situations 
(e.g., Amsel, 1962; Capaldi, 1966; Wagner, 
1961) and is likely to be as fruitful in applica- 
tion of the Rescorla and Wagner (1972) model 
to such situations, whatever the ultimate reso- 
lution of the manner in which reward shifts 
influence blocking. 
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ANAGRAM SOLVING AS A FUNCTION OF WORD IMAGERY 
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Two experiments are reported which examine the role of imagery in anagram 
solving. In Experiment I Ss attempted 12 anagrams constructed from nouns 
of high or low imagery value, with either no information, structural informa- 
tion, or semantic information as aids to solution. All words had a Thorndike- 
Lorge frequency value of 20 or more words per million, and other variables 
known to influence anagram solution time were controlled (word length, 
number of possible solutions, frequency of successive letter pairs, and letter 
order) High imagery words were found significantly faster than low imagery 
words. Both structural and semantic clues facilitated performance, but 
structural information had a greater effect on the solution of low imagery words 
and semantic information had more effect on high imagery words. lt was sug- 
gested that this might reflect the use of either letter manipulation or word 
production strategies under the different clue conditions, In Experiment lI 
the main elfect of imagery was replicated with different word lists and addi- 


tional controls. The imagery effect was 


again highly significant and clearly 


independent of word frequency and associative meaningfulness. 


The image-evoking (J) value of stimuli 
is now well established as an important 
variable in associative learning and mem- 
ory (Paivio, 1969). Imagery value is co- 
ordinated to a concrete-abstract stimulus 
dimension, extending from objects to pic- 
tures to concrete nouns to abstract nouns, 
in decreasing order of concreteness. ltem 
concreteness or J value has been shown to 
exert an effect on paired-associate learning, 
recognition memory, free recall, and, under 
some circumstances, serial learning. 

The aim of the present study was to 
extend the findings relating to the role 
of imagery in verbal learning and retrieval 
to the solution of anagrams. There has 
been some recent research interest in this 
area (Jablonski & Mueller, 1972), but 
findings have been equivocal. The effect 
of word imagery, if present, must occur 
before the word is known, as in the case 
of the well-established effect of word fre- 
quency on anagram solving (Dominowski, 
1966; Mayzner & Tresselt, 1966). 

The frequency effect has been explained 
in terms of the “spew hypothesis” of 
Underwood and Schulz (1960), which says 
that the order of emission of verbal units 


1 Kathleen Dewing passed away since this article 
was written. Requests for reprints should be sent 
to Paul Hetherington, University of Western Aus- 
tralia, Nedlands, Western Australia 6009, Australia. 


is directly related to their frequency of 
occurrence in the language. When the S 
looks at the letters which when rearranged 
will make up the solution word, it is 
claimed that he makes implicit word re- 
sponses similar to free recall (Safren, 1962) 
or to free association (Mayzner, Tresselt, 
& Helbock, 1964) and that if the word is 
of high frequency, it will be more likely 
to be among the responses the S makes. 
If noun imagery facilitates the availability 
of the words as units, it would follow that 
anagrams of high-7 words would be easier 
to solve than those of low-/ words. 

In the first experiment an attempt was 
made to explore the anagram solving pro- 
cess in more detail by providing Ss with 
partial information about the solution 
words. The interaction between J value 
and structural and semantic clues to the 
solution words was examined. The struc- 
tural information provided was the initial 
and final letter of the solution word. The 
semantic information was the title of the 
category of which the word was a member, 
ie. the word's superordinate. The aim 
here was to determine whether the types 
of clues would show any differential effect 
on anagram solution times depending on 
whether the solution words were of high 
or low I value. It may be argued that 
if a word is of low J value, the likelihood 
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of its appearing among words called to 
mind as suggested by the letters of the 
anagram is lessened. The Ss then may 
tend to adopt the strategy of sorting out 
the letters more or less mechanically until 
they recognize the solution word. In such 
cases Ss would be expected to find a clue 
giving the position of 2 of the letters of 
the solution word of more use than se- 
mantic information. 

In the second experiment an attempt 
was made to replicate the main effect of 
solution-word imagery found in Experi- 
ment I using different word lists. By as- 
signing an estimate of A — 50 and AA 
= 100 to the frequency value of items, 
it was possible to control frequency more 
precisely. In this experiment the effect of 


verbal associative meaningfulness, m (No- 


ble, 1952), was also controlled. 


EXPERIMENT I 


Method. A 2 X3 repeated-measures factorial 
design was employed. Every S was given 2 ana- 
grams to solve under each of the experimental 
conditions making a total of 12 problems per S. 
The independent variables were (a) solution-word 
imagery, and (b) type of clue given, viz., no clue, 
first and last letter of solution word (structural 
clue), or the title of the category of which it is 
a member (semantic clue). For example, the 
semantic clues for high-Z words CABIN and STORM 
were DWELLING and WEATHER CONDITION, and for 
low-I words SHAME and MERCY, clues were LING 
and MORAL SENTIMENT. 

The degree of association between each solution 
word and its semantic clue was rated on a 5-point 
scale by 10 independent raters without any knowl- 
edge of the purpose of the study. There was no 
significant difference between ratings for low-I 
words (M = 3.8, SD =.72) and high-/ words 
(M = 4.0, SD = .82 

Solution words were selected from Paivio, Yuille, 
and Madigan's (1968) word list. Low-I words 
had an J value of less than 4.20 and high-Z words 
a value above 6.40. All were 5-letter common 
nouns, and no other 5-letter common nouns could 
be made by rearranging the letters. All words 
occurred with a frequency of 20 and above per 
million in the language by Thorndike and Lorge's 
(1944) tables. Since bigram frequency (BF)—total 
frequency of occurrence of successive letter pairs 
in solution word—may be a factor in anagram 
solving (Dominowski & Duncan, 1964), only solu- 
tion words of similar BF were included. According 
to tables by Underwood and Schulz (1960), fairly 
extreme BF values are represented by ZEBRA 
(1,780) and EATER (11,812). Words used in the 


765 


study were all in the BF range, 3,500-8,500. The 
words selected were EVENT, FOLLY, MERCY, TRUTH, 
ARRAY, and SHAME (low-I words), and ANKLE, 
PIANO, STORM, CABIN, CHAIR, and sucar (high-I 
words). 

Three anagrams for each solution word were 
constructed by 3 times randomly ordering the 
letters of each of the 12 words. This procedure 
was adopted to minimize any interfering effects 
arising from anagram letter order or anagram BF. 
The 3 sets of anagrams resulting from 3 times 
randomly rearranging the letters of each of the 
12 words were combined factorially with the 3 clue 
conditions to give 9 resulting lists of anagrams, 
each of the treatment conditions being assigned 
to each anagram. 

Seventy-two male and female students enrolled 
in first-year psychology at the University of Western 
Australia served as Ss. The Ss were randomly 
assigned to each of the 9 anagram lists. The 
nature of the task was explained, and examples 
given of both clue types. 

The Ss were given 3 practice trials, one with 
each type of clue and one with no clue. The 
practice trials were followed by the 12 experi- 
mental cards presented one at a time in a random 
order, The time taken by S to solve each anagram 
problem was recorded by To control for Æ 
bias, cards were placed fi downward and pre- 
sented to Ss without Æ seeing the anagram until 
after solution times had been recorded. No pro- 
vision was made to control for trial effects since 
it has generally been found (Mayzner & Tresselt, 
1962) that Ss do not show any great deal of im- 
provement on such tasks until after at least several 
weeks' practice. 


Results. Subjects were required to solve 
2 anagrams under each of the 6 experi- 
mental conditions, and the Ss’ scores for 
each set of 2 problems were averaged. 
Since the distribution of response times 
was highly skewed, a logarithmie trans- 
formation of the form .X — log (X; + 1) 
was applied to the resulting data prior to 
within-S analysis of variance. The log 
mean solution times for word I values and 
clue conditions are shown in Table 1. 

Anagrams of high-/ words were solved 
faster than anagrams of low-Z words to a 
highly significant degree, RO E 27) 
64.864, p < .001. The variance under the 
3 clue conditions was also highly signifi- 
cant, F (2, 142) = 94.154, p < .001. In- 
teraction between I value and clue type 
was significant, F (2, 142) — 5.306, p 
< 01. This interaction includes the no- 
clue condition. The Semantic vs. Struc- 
tural clue X I Value interaction was also 
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"TABLE 1 


LoG Mean SOLUTION Times FOR HIGH- AND 
Low-IMAGERY WORDS UNDER STRUCTURAL, 
Semantic, AND No CLUE CONDITIONS 


High | Low 
Clue condition. imag- | imag- M 
ery | ery 
No clue 1.358 | 1.566 1.462 
Structural clue | .902 | 1.078 .990 
Semantic clue | .704 | 1.182 -943 
M .988 


1.275 | Grand Mean —1.132 


significant, F (1, 71) = 8.212, p < .01, and 
the No Clue vs. Average Clue X 7 Value 
interaction was not significant, F (1, 71) 
= 1.897. Structural clues had a greater 
effect on the solution of low-7 words and 
semantic information on high-/ words. 


EXPERIMENT II 


Method. The second experiment replicated the 
main effect found for 7 in Experiment | using 
different words and additional controls for solution 
word attributes, As in Experiment I the number 
of letters, bigram frequency of the solution word, 
and number of possible solutions for each anagram 
were controlled. In addition, the number of 
redundant letters in each anagram and frequency 
and meaningfulness of the solution words were 
equated for high- and low-Z lis The words 
selected were POWER, CHASM, GRIEF, VENOM, CHAOS, 
and DRAMA (low-/ words) and TRUCK, SPRAY, 
HOTEL, PLANK, FLASK, and DRESS (high-/ words); 
m values used were calculated according to Paivio 
et al. (1968), following Noble's (1952) production 
method for obtaining m data. The procedure for 
constructing and presenting the anagrams was the 
same as for Experiment I, except that no clues 
were given. Twenty-four male and female second- 
year psychology students served as 


Results. Mean response time for high-7 
words was 32.0 sec. (SD = 46.6) and for 
low-/ words 50.6 sec. (SD — 42.4). An- 
grams constructed from high-7 nouns were 
solved significantly faster than anagrams 
constructed from low-/ nouns (f= 3.5, 
p < .001). 


DISCUSSION 


The results of this study clearly demon- 
strate that solution-word imagery has a strong 
effect on anagram solving, and that this effect 
is independent of word frequency or associa- 
tive verbal meaningfulness. Imagery must 
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exert its influence before the o of 
the word and in this sense displ: 1 the 
same effect as frequency. The ition 


that suggests itself is in similar t 

When Ss are presented with a set « 
bled letters they at least in part recal 
to match these letters (Safren 
frequency words are more likely 
by S since they are towards : top of 
hierarchy of available words. 
studies have found such results. The present 
findings suggest that the image-arousing po- 
tential of nouns also affects the availability 
of nouns as uni This view would predict 
that high-/ words, because of their more 
ready availability, have a greater probability 
of being among those words suggested by the 
letters of the anagram. Word 


» be recalled 


intification 


is expedited and the anagram solved more 
easily. 

Reviews of anagram solving earch in- 
dicate that both parts and wholes contribute 
to the solution of the problem (Jo!:nson, 1966). 


Studies carried out within an S mediational 
model (e.g., Mayzner & Tresselt, 1966) have 
tended to restrict themselves to only one 
aspect of the processes involved, i.e., letter 
arrangement. Anagrams are treated as stim- 
uli which evoke a variety of implicit responses 
in S based on successive rearrangements of 
the letters into new combinations, and little 
attention is paid to the characteristics of the 
word as a whole. 

However, some evidence suggests that 
gram difficulty in general is mor 
on the characteristics of the rd 


than of the anagram (Domi; 19057). 
The S's solution attempt is word 
which has more, less, or oth rs than 


does the anagram. The S jun 
(word), then returns to match it against the 
anagram; the progression is from the word 
to the anagram rather than from the anagram 
to the word (Johnson, 1966). The availability 
of the solution word appears to be very im- 
portant in the anagram task, and the present 
study provides further support for this view. 
The interaction between solution-word im- 
agery and clue type may throw some light 
on the strategies employed by Ss in the solu- 
tion process. Rearranging letters and pro- 
ducing words are different types of responses, 
but they are likely to occur simultaneously 
in anagram solving when no clues are pro- 
vided, and need not be independent of each 
other (Dominowski, 1966). The present 
study showed that both semantic and struc- 


to his goal 
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tural information improved performance, but 
semantic clues facilitated the solution of high-7 
words more than low-J words and vice versa 
for structural clues. This may indicate that 
word production and letter arrangement are 
used differentially as strategies under those 
clue conditions. However, it is possible that 
the concrete and abstract category titles may 
not have been equally informative as solution 
clues, either because of differences in category 
set size or in the strength of the associative 
link between the word and its superordinate. 
An attempt was made to measure the degree 
of this association; the associative bond was 
judged by independent raters to be slightly 
stronger for high-J words, but the difference 
was not statistically significant. Further work 
is needed to clarify the issue, and the present 
findings should be interpreted with caution. 

The results of both experiments show that 
imagery exerts an influence on availability of 
items per se and does not act purely as à 
mediator between stimulus and response. 
Experimental investigations of concreteness 
and the image-arousing value of stimuli have 
so far tended to concentrate on paired-asso- 
ciate learning, and have led to a rather 
narrow view that the effect of imagery is 
chiefly to cement associative relationships 
(e.g, Bower, 1972). The findings of the 
study reported here, taken in conjunction 
with investigations of the effects of imagery 
on free recall and recognition memory, suggest 
that imagery is much more than a mediator 
and may prove to be an effective variable in 
a wide range of cognitive tasks. 
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TRANSFER OF INFORMATION FROM SHOR'T 
TO LONG-TERM MEMORY! 
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Wesleyan 


The present study examined current hy 
fer from short-term memory (STM) 

Peterson STM task with word triplets 
3, 6, 9, and 18 sec. One-third of the Ss 


University 


ypotheses concerning information t: 
to long-term memory (LTM) usi 
presented over retention intervals o 
counted backward during the inter 


one-third employed a relational imagery code, and the last third were und: 


tracted and uninstructed in coding. 
L'TM recall test of all stimuli. 
of the subsequent testing. 
of information in STM had an effect c 
nificant effect. These data were discus: 
sufficient for storage of information in 


The present study examines the condi- 
tions of information transfer from short- 
term memory (STM) to long-term memory 
(LTM) in light of current theories of 
memory (Atkinson & Shiffrin, 1968; Waugh 
& Norman, 1965). Such theories assume 
that information put into STM is quickly 
lost unless maintained by a process of 
rehearsal. Rehearsal may have other func- 
tions aside from maintaining information in 
STM. Atkinson and Shiffrin hypothesize 
that rehearsal "serves the purpose of in- 
creasing the strength built up in a long- 
term store, both by increasing the length 
of stay in STS (during which time a trace 
is built up in LTS) and by giving coding 
and other storage processes time to operate 
[p. 111].” Hence, the longer the duration 
of information in STM, the higher the 
probability of its transfer to LTM. Re- 
hearsal thus enables information to be 
automatically transferred into LTM and 
allows time for control processes, such as 
coding, to be applied. 

The present research is an attempt to 
examine the above issues within the context 
of the Peterson and Peterson (1959) para- 


1 This research was supported by Wesleyan Uni- 
versity Research grants in support of scholarship 
awarded to both authors. Appreciation is expressed 
to James P. Rizza, who collected the data for this 
experiment. 

2 Requests for reprints should be sent to Vito 
Modigliani, Department of Psychology, Wesleyan 
University, Middletown, Connecticut 06457. 
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forgetting in STM when a rehca 
venting distractor task was emi yed for 
0-18 sec. Meunier et al. (1972) extended 
this by comparing STM performance be- 
tween groups which differed by the presence 
or absence of a distractor task. Forgetting 
in STM was observed only in the group 
engaging in the distractor task. However, 
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The present study 
Meunier et al. (1972) b in 
and LTM performance « i different 
instructional sets in í Peterson and 
Peterson (1959) paradigm. The Ss were 
instructed to employ a relational imagery 
coding strategy or a rehearsal-preventing 
distractor task, or were not instructed to 
engage in any particular activity during 
the retention interval of 0-18 sec. More- 
over, half of the Ss in each condition were 
warned of a retention test on all items at 
the end of the experiment. 

It was hypothesized that only Ss in the 
distractor group would show forgetting in 
the STM trials. Short-term memory per- 
formance in the coding and free groups 
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would be essentially perfect over all STM 
retention intervals. With regard to the 
more important LTM performance, it was 
expected that Ss engaged in imagery coding 
during the STM task would show signifi- 
cantly better LTM performance than all 
other Ss, and that Ss warned of subsequent 
testing should perform somewhat better 
than Ss not so warned, because of the 
possibility of spontaneous coding. 


METHOD 


Design. The experiment consisted of 2 phases, 
The first was an STM task, in which Ss were pre- 
sented with 3 word triplets to be recalled after a 
time interval of 0, 3, 6, 9, or 18 sec. This was 
followed by a recall phase, in which each S was asked 
to write down all the words he could remember. 

The STM retention interval was a within-Ss vari- 
able, while warning information and STM instruc- 
tional set were both between Ss. The experiment, 
therefore, can be considered a 5 X 2 X 3 factorial. 

Subjects. There were 10 Ss in each of the 6 
groups. They were Wesleyan University students 
ned up voluntarily for the experiment and 
were paid $2 for their services. No attempt was 
made to control for sex, and Ss w randomly 
assigned to experimental conditions as they showed 
up for the experiment. 

Materials. The words used in this experiment 
consisted of concrete nouns with a frequency count 
of A or AA in the Thorndike-Lorge (1944) G count. 
Only nouns of 3-5 letters in length were selected to 
constitute the initial pool from which all successive 
selections were made, From this pool, 150 nouns 
were chosen to constitute the experimental words. 
They were then randomly grouped into 50 triplets 
with the restriction that no words in a triplet begin 
with the same letter and that they not be obvious 
associates of one another. 

Each triplet was typed in capital letters on the 
same row, and approximately in the middle of a 
12.7 X 20.4 cm. white index card. At the bottom 
of each card, a 3-digit number was also typed. The 
distance between the words and the number was 
approximately 6 cm. The 50 cards so obtained were 
thoroughly shuffed to yield one random order of 
presentation. A second order of stimuli was ob- 
tained in the same fashion. In addition to the 50 
experimental cards, there were 5 warm-up cards, 
constructed in the same manner. The numbers on 
the bottom of all cards were taken from a table of 
random numbers. 

Other materials consisted of an electric auditory 
clicker set at 3 clicks/sec and a screen which was 
put on top of a desk shielding S from E. There 
was an opening in the screen, slightly smaller than 
the cards being used, through which Æ presented 
all stimuli. 

Procedure. Four different sequences of the STM 
retention intervals were used, each of which con- 
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sisted of 10 blocks of 5 trials each, Each block 
contained a random permutation of the 5 retention 
intervals being used. Thus, over the 50 trials, each 
retention interval appeared 10 times. 

Each of the 2 stimulus sequences was combined 
with each of the 4 STM retention interval sequences 
to yield 8 different sequences. Each of these was 
used with the first 8 Ss in each group, with the 
remaining 2 Ss given 2 of the 8 sequences selected 
at random. 

During each trial, S was to read orally each of the 
3 words followed by each of the digits, one at a time 
and in time with the auditory clicks. Reading time, 
determined by click frequency, took 2 sec.—1 sec. 
for the words and 1 sec. for the number. The .S was 
explicitly asked not to pause between the words and 
the number, so as to minimize the opportunity for 
rehearsal at that point, As soon as 5 read the last 
digit of the number, Æ started a timer set at 0, 3, 
6, 9, or 18sec. At the end of the retention interval, 
E said “recall.” The S was given approximately 
10 sec. to respond. As soon as he responded, Æ re- 
corded the response and then, after saying “ready,” 
proceeded to the next presentation. The minimun 
time from the cue for recall to the next presentation 
was about 5 sec. 

In the distractor condition, S was instructed to 
count backward by 3's beginning with the number 
he had just read. He was to count by pronouncing 
each digit of each number in time with the clicks 
until Æ gave a signal to recall the words. He was 
also told to avoid rehearsing the triplet and to 
concentrate on the counting task. 

In the free condition, 5 was told that after he had 
read the last digit of the number there would be a 
period of time, varying in length from trial to trial, 
at the end of which Æ would give a signal for him 
to recall the words. 

In the code condition, 5 was instructed to form 
an imaginary scene of the referents of the words he 
had just seen during the time between the presenta- 
tion of the triplet and the signal for recall. The 
relational imagery instructions were the same as 
those used in a previous study (see Seamon, 1972). 
It was stressed that the visual scene should be one 
in which the single images touched each other and 
interacted to form meaningful, if not always realistic, 
units. An example of an appropriate construction, 
such as a car sitting in a tree with a belt around the 
tree, was provided to insure that S understood the 
task. No S reported any difficulty. 

Within each of the above conditions, half of the 
Ss were told that after the completion of the present 
task there would follow another session, in which 
they would be asked to recall all the words they 
had previously studied. For the other half of the 
Ss, no mention was made of the subsequent task. 
Irrespective of initial instructions, all Ss, after com- 
pleting the STM phase, were given a sheet of paper 
and asked to write down as many of the words they 
had studied in the preceding trials as they could 
remember, They were allowed approximately 7 min. 
for recall, and guessing was discouraged. 
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RESULTS 


Responses in the STM and recall phases 
were scored in terms of correct triplets and 
correct words recalled. The latter measure 
was thought to be more sensitive to possible 
differences than the former. Triplets were 
scored correct regardless of word order. 

In all analyses, none of the main or 
interaction effects associated with fore- 
knowledge of the recall test approached 
significance (all ps > .10). The warning 
variable, therefore, was not considered 
further, and groups which differed only in 
this factor were combined. 

Short-term memory retention data, Mean 
percentage of correct triplets and words re- 
called in this phase of the experiment are 
shown in Table 1. There was little vari- 
ability associated with the code or free 
conditions, recall being uniformly high at 
all STM intervals. Recall in the distractor 
condition, however, whether scored for 
words or triplets, showed the decay function 
typical of this paradigm. As expected, the 
retention function for triplets showed a 
greater decrement over time than that for 
words, since the scoring criterion was more 
stringent for triplets. 

Long-term memory recall data. Mean 
percentages of correct words and triplets 
recalled after the completion of the STM 
trials are shown in the top and bottom 
sections of Figure 1, respectively. Since a 

floor effect for the distractor and free con- 
ditions was evident when recall was scored 
for triplets, only recall for words was 


TABLE 1 


CENTAGE OF SHORT-TERM MEMORY RE- 
CALL FOR TRIPLETS AND WORDS 


MEAN PE 


Experimental 
condition 


0 3 6 9 | s 
Code | 
Triplets | 98.5 .0 | 98, í 
Words 99.5 99,0 | 99. 99.7 
Free 
Triplets | 99.5 98.5 T. 96.0 


Distractor 
Triplets | 95.5 
Words 98.8 
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FIGURE. 1. Mean percentage of correct recall of 
individual words and triplets for the Code (C), 
Free (F), and Distractor (D) conditions as a function 
of the short-term memory retention interval. 


analyzed statistically. An overall analysis 
of variance showed that STM retention 
interval had a significant effect on the 
subsequent LTM recall of words, F (4, 216) 
= 14.58, p < .01, with Neuman-Keuls 
comparisons showing recall to be better at 
3 sec. than at 0 sec. (p < .01) and at 18 
sec. than at 3 sec. (p < .05). A significant 
STM Interval X Group interaction was 
also found, F (8, 216) = 7.27, p < .01, 
indicating that relative recall performance 
between groups was not constant over all 
STM retention intervals. The code con- 
dition was poorest at the shorter intervals, 
but best at the longer intervals. The main 
effect of group was not significant? 
Analyses of simple main effects followed 
by Neuman-Keuls comparisons for each 
experimental group found no significant 
comparisons between intervals for the free 
condition, significantly better recall at 3 
sec. than at 0 sec. for the distractor condi- 
tion (p «.05), and significantly better 


* Essentially the same findings were obtained in a 
recognition test given to all Ss after the recall test. 
Recognition performance was similar and did not 
increase with STM interval length for the free and 
distractor conditions, while the performance for the 
code condition was initially poorer than that of the 
other conditions, but improved through the 9-sec. 
interval to surpass them. These data do not, how- 
ever, provide independent information. 
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recall at 3 sec. than at 0 sec. (p < .05) and 
at 9 sec. than at 6 sec. (p < .01) for the 
code condition. Comparisons of groups as 
a function of STM interval found signifi- 
cant differences at 3, 9, and 18 sec., with 
the distractor condition better than the 
code condition at 3 sec. (p < .05), and the 
reverse true at 9 and 18 sec. (both ps < .01). 
Other differences were not significant, al- 
though the distractor-code comparison ap- 
proached significance at 0 sec. (p < .10). 

The mean number of false recalls, i.e., 
words not from either the warm-up or 
experimental trials, was 2.0 across all 
groups, as compared to an overall mean 
number of experimental words recalled of 
23.4. 

Serial position dala. The STM phase 
of the experiment lasted approximately 15 
min. In the second phase of the experi- 
ment, therefore, some of the to-be-recalled 
words had been studied as long ago as 15 
min, and some as recently as 2-4 min. 
before recall. It was desirable to examine, 
albeit informally, whether serial position in 
the STM list had an effect on recall. For 
this purpose, the STM list, consisting of 
10 blocks of 5 triplets each, was divided 
into 3 groups, viz., Blocks 1 3, 4-7, and 
8-10, covering the initial, middle, and end 
portions of the list, respectively. Two 
overall trends were noted. First, regardless 
of list position, words studied at 0 and 3 
sec. were always recalled more poorly in 
the code condition than in the distractor 
condition, with the free condition inter- 
mediate. The reverse was true for words 
studied at 9 and 18 sec. This indicates 
that the recall pattern shown in Figure 1 
was fairly constant over the range of 
possible list positions. Second, a strong 
recency effect was present, with much 
better recall from Blocks 8-10 than from 
Blocks 1-3 or 4-7. 


DISCUSSION 


The major focus of this study was on the 
mechanisms of information transfer from STM 
to LTM. Such mechanisms will be di: ussed 
with reference to the following 4 propositions, 
which summarize the major tenets of current 
theories of memory on this subject. (a) Amount 


of information transfer is an increasing function 
of the time an item remains in STM (Atkinson 
& Shiffrin, 1968; Waugh & Norman, 1965). (b) 
Rehearsal strengthens LTM traces (Bjork, 
1970). To the extent that rehearsal keeps an 
item in STM, this proposition is a corollary of 
the first. However, since rehearsal per se is 
currently receiving a great deal of attention, 
we prefer to keep these 2 propositions separate. 
(c) Rehearsal provides more time for coding 
processes to operate, and (d) memory traces in 
LTM can be strengthened by such coding 
processes 

Consistent with earlier data (Jacoby & 
Bartz, 1972; Meunier et al., 1972), the present 
findings offer no support for the proposition 
that prolonged stay (up to 18 sec.) in STM 
increases an item's probability of transfer to 
LTM. This is shown both by an analysis of 
the free condition itself, and by a comparison 
with the distractor condition. First, the find- 
ing of essentially perfect retention in the free 
condition over all intervals in the STM trials 
sates that triplets were held in STM for 
ng intervals up to 18 sec. However, no 
1 retention was observed as a 
function of STM interval length, a result 
arly at odds with the proposition being 
ussed. Second, while the distractor and 
conditions were qualitatively and quanti- 
tatively different in STM recall, they were 
alike in LTM retention. The STM decay 
function found in the distractor condition 
means that items were lost from STM very 
rapidly. In spite of this, they were remem- 
bered as well as items which, in the free condi- 
tion, had been kept in STM for longer periods 
of time, again at odds with the proposition in 


question. 

With respect to the proposition that re- 
hearsal, directl -orindirectly, increases an item's 
strength in LTM, the situation is more am- 
biguous. The present data rule out the possi- 
bility that rehearsal, by virtue of simply keep- 
ing items in STM, automatically transfers 
them to LTM. Further, if it is assumed that 
Ss engaged in rehearsal to maintain the items 
in STM over the retention intervals, the LTM 
performance of the free and distractor con- 
ditions would suggest that either rehearsal 
does not strengthen LTM traces, or the amount 
of rehearsal was the same for both conditions 
and over all intervals. This follows from the 
fact that there was more opportunity for 
rehearsal in the longer STM intervals than the 
shorter intervals and in the free condition than 
in the distractor condition. 
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Does rehearsal provide time for coding pro- 
cesses to operate? A comparison of the free 
and code conditions would suggest that spon- 
taneous coding strategies, if employed, were 
not very effective in engendering L'TM reten- 
tion, or that Ss simply did not engage in 
coding processes when given the opportunity 
in this task. It is still possible, of course, that 
under different conditions, effects of spon- 
taneous coding would be evidenced. 

We shall now consider the fourth of the 
propositions under scrutiny, the one concerning 
coding processes. As Figure 1 shows, LTM 
performance of the code group was poorest for 
words shown at 0-sec. STM intervals and 
increased up to asymptotic level as the STM 
interval increased. At 9- and 18-sec. STM 
intervals, this group showed better LTM 
performance than either the free or distractor 
groups. The monotonic increase in LTM 
retention as a function of STM interval in the 
code group suggests that it takes time to apply 
the imagery code to words, a time that can be 
estimated to be within 3-9 sec., judging from 
the recall function. This parameter may 
actually be expected to vary with the number 
and concreteness of the words employed. 
"Moreover, it can also be expected to vary with 
practice. This can be inferred from the fact 
that the highest relative increase, as a function 
of trial blocks in the STM list, occurred for 
items shown at ()-sec. STM intervals, while the 
smallest relative increase occurred for items 
shown at 9 and 18 sec. The indication is, 
therefore, that Ss became better at imagery 
coding with practice, and this increased effi- 
ciency was most beneficial at very short STM 
intervals. 

The finding that the code condition, given 
sufficient time, produced superior LTM reten- 
tion as compared to the free and distractor 
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conditions is consistent with previous frdins 
(Bower, 1972) indicating the superiority 
mnemonic device over other methods and wit! 
the view that organization enhances 
retention. Thus, imagery coding is cl 
sufficient condition for storage of infor: 

in LTM and, as judged by the lack of a w 
effect, intent to remember may be superi! 
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GOOD CONTINUATION REVISITED ! 


LUBOMIR S. PRYTULAK ° 


University of Western Ontario, London, Ontario, Canada 


"Twenty female Ss inspected patterns of three lines of dots radiating from a 
common dot, in which one line (the shaft) was longer than the other two 
lines (the arms). Which arm the shaft appeared to group with, and which of 
two arms in different patterns the shaft grouped with more strongly, were 
predicted from three predictors: (a) rectilinearity—grouping is stronger with 
the arm closer to forming a 180° angle with the shaft, the function between 
strength of grouping and angle being nonlinear; (b) orientation—grouping is 
stronger with an arm on the vertical-horizontal rather than an oblique arm; 
and (c) enclosure—grouping is stronger when the arm closer to rectilinearity 
is ipsilateral with the other arm than when it is contralateral. Computer- 
generated predictions were correct 16/16 times when predicting grouping within 
individual patterns and were correct 103/109 times when predicting inequali- 
ties of grouping between patterns. 


Within each line, dots were placed at 5-mm. inter- 


The principle of good continuation was 
introduced into the psychological literature 
in 1923 by Max Wertheimer (1958). In 
several demonstration displays (Figures 
1-5, pp. 126-127), he showed that when 
three straight lines of dots radiate from a 
common point, the two lines forming a 180° 
angle appear to group more with each other 
than either appears to group with the third 
line. This is the fundamental good con- 
tinuation effect which the present paper 
attempts to elaborate. More particularly, 
using three-line patterns similar to Wert- 
heimer's, three questions are explored. 
First, what are the grouping effects at vari- 
ous angle juxtapositions, including ones 
that do not contain 180°? Second, does the 
orientation of the lines with respect to the 
vertical-horizontalaffect grouping? Third, 
does ipsilateral vs. contralateral juxtaposi- 
tion of two lines at constant angles to a 
third line affect grouping? 


METHOD 


Stimuli, Each stimulus was made up of 2-mm.- 
diam. circular black dots on a 21.6 X 27.9 cm. 
white card. The dots were arranged in three lines 
radiating from a single shared dot, as in Figure 1. 


1'This research was supported by Grant APA-390 
from the National Research Council of Canada. 

? Requests for reprints should be sent to Lubomir 
S. Prytulak, Department of Psychology, University 
of Western Ontario, London, Ontario N6A 3K7, 
Canada, 


vals, A line could appear at any of eight orienta- 
tions—four on the vertical and horizontal and four 
45° off the vertical or horizontal, One of the three 
lines (the shaft) was longer than the others—it con- 
tained four dots, counting the shared dot. The re- 
maining two lines (the arms) contained three dots, 
again counting the shared dot. As the shaft ap- 
peared at eight orientations, and for each of these 


the two arms assumed = 21 pairs of orienta- 


tions, the number of stimuli was 168. When one of 
the arms came closer to forming a 180° angle with 
the shaft, it was referred to as Arm a and the other 
arm as Arm b; when the arms were symmetrical 
about the shaft, however, this labeling could not be 
used. Figure 1 shows a stimulus in which Arm a 
is at 90? to the shaft and Arm b is at 45? to the 
shaft—the “angle of an arm" always refers to the 
smaller angle between that arm and the shaft. 

Subjects. Twenty female students at the Uni- 
versity of Western Ontario were paid to participate 
in a series of experiments on grouping and visual 
illusions, of which the present experiment was one, 
At no time, however, did E discuss with S the nature 
of the experiments, or apprise her of data from 
earlier experiments or of hypotheses relevant to the 
current experiment. 

Procedure. The S placed a stimulus card on a 
stand tilted back 15° from the vertical (where its 
illuminance was .088 cd/cm?), rested her chin in a 
chin rest (which gave a viewing distance of 45 cm.), 
pushed a button which turned a quietly-sounding 
buzzer off for 5 sec., and fixated the shared dot 


Figure 1, Example of a stimulus display. 
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during the 5 sec; of silence. During fixation, she 
decided which of the two arms appeared to group 
with, or belong to, the shaft. She indicated her 
experience of grouping by writing an integer from 
—3 to +3 excluding 0. Positive values indicated 
that the shaft appeared to group with Arm a and 
negative values, with Arm b (a small replica of the 
stimulus drawn in the upper-left corner of the 
stimulus card labeled Arms a and b). The integers 
1, 2, and 3 indicated that grouping was weak, 
moderate, or strong. After recording her rating, S 
found the stimulus number on the back of the card 
and wrote it beside her rating. Before running each 
55, the deck of cards was rearranged by shuffling as 
well as by dealing the cards into separate piles in 
haphazard order, then picking the piles up in hap- 
hazard order. 

Data simplification. In order to reduce the task 
of data presentation and data interpretation to 
manageable proportions, 144 of the 168 stimuli were 
sorted into 18 patterns of 8 stimuli each. (The re- 
maining 24 stimuli were ones in which the arms were 
symmetrical about the shaft and will be discussed 
below.) The 8 stimuli that were considered to be 
instances of a single pattern were stimuli in which 
the two arms bore the same relation to the shaft 
and in which the shaft was either always on the 
vertical-horizontal or always oblique. Below the 
abscissa in Figure 2, for example, Pattern 1 stands 
for 8 stimuli, 4 of which can be generated by rotating 
Pattern 1 in 90? steps, and the other 4, by rotating 
à mirror image of Pattern 1 in 90? steps. Si milarly, 

Pattern 10 stands for 8 stimuli whose arms bear the 
same relation to the shaft as in Pattern 1, but now 
with the shaft oblique. The data in Figure 2 were 
produced by averaging together each S's ratings to 
the 8 stimuli within 1 pattern, then calculating the 
means and standard errors of the means across Ss. 

What is predicted. Two sorts of predictions are 
made. The first is which arm within a pattern will 
group more strongly with the predic- 
tions are called swilhin-pattern predictions and con- 
sist merely of specifying the sign of the mean 
ratings, 

The second sort of prediction specifies inequalities 
between means. These predictions are called 
between-pattern predictions. Between-pattern pre- 
dictions, however, were attempted only for pairs 
of means originating from comparable patterns. 
Comparable patterns are pairs of patterns in which 
one of the arms in the first pattern 

angle to the shaft as one of the 
pattern, These two arms are called common arms, 
and the remaining two arms are called target arms, 
Thus, Ib-2b are common arms and 1a-2a are target 
arms; la-I5b are common arms and 1b-15a are 
target arms; and so on, In between-pattern com- 
parisons in which the angles of the arms in one 
pattern are the same as the angles of the arms in 
another pattern (1-5, 2-4, and others), the a arms 
will be considered to be the target arms. 
The between-pattern predictions generated below 
will be for the target arms of comparable displays. 
"Thus, no prediction will be ventured for Patterns 


at the same 
arms in the second 
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1-9 or 1-18 because they contain no common arms 
and, therefore, no target arms for which predictions 


can be made. Of the (5) = 153 combinations of 


patterns, 129 proved to be comparable. 

Three predictors. "Three predicto 
rectilinearity, orientation, and enclosure. 
described below. 

The first predictor, rectilinearity, states that à 
shaft groups more strongly with the arm with which 
it comes closer to forming a 180° angle. The in- 
crease in the grouping of an arm to the shaft, how- 
ever, is considered to be a nonlinear function of 
angle. If we start off by imagining a linear func- 
tion, we can say that at every 45? increase in angle, 
the increase in strength of grouping is “moderate,” 
in comparison with which any larger increase would 
be "large." If we now imagine that the point in 
the function above 135? is pulled down so as to be 
on a level with the point above 90°, the 902-1359 
increment will be zero, and the 1352-180? increment 
will be large. By inference, the 459-135? increment 
will be moderate, and the 459-180? and 90°-180° 
increments will each be large. In short, all cor 
parisons in which one arm is at 180? imply a strong 
rectilinearity effect, all comparisons in which the 
two arms are at 90? and 135? imply no rectilinearity 
effect, and all other comparisons imply a moderate 
rectilinearity effect. 

For an example of a within-pa 
we see that in Pattern 1, Arm la is at 90° and Arm 
Ib is at 45?—grouping, therefore, is predicted to be 
with la so that the sign of the mean rating should 
be positive. In between-pattern prediction, we 
might compare the two target arms la and 3a—3a 
is at 180° and is, therefore, predicted to receive a 
higher rating than 1a which is at 90°, (Wheneve: 
a target arm is a b arm, the sign of the mean ratin 
for that pattern must be reversed. In predicti» 

1b < 15a, for example, the data are considered 
show =1,27 < — .14.) Rectilinearity, however, 
unable to generate a prediction in every instance 
Thus, it generates no within-pattern prediction for 
Patterns 6, 8, 15, and 17 (because within cac! 
pattern the two arms are at 90° and 135°) and 
generates no between-pattern prediction for Pat 
terns 1-5 or 1-10, for example (because the target 
arms are at equal angles to the shaft), or in the 
case of 1-2 or 9-16, for example (because the target 
arms are at 90° and 135°). 

The second predictor, orientation, states that a 
shaft groups more strongly with an arm on the ver- 
tical or horizontal than with an obliquearm. Within 
Pattern 1, for example, 1a is horizontal and Ib is 
oblique—thus, orientation joins rectilinearity in 
predicting la » 1b. In the case of Pattern 10, 
however, orientation works against rectilinearity— 
whereas rectilinearity predicts 10a > 10b, orienta- 
tion predicts that 10a < 10b. (The resolution of 

contradictory predictions is discussed in the next 
section.) In cases where both arms are oblique 
(like Pattern 2) or both vertical-horizontal (like 
Pattern 11), orientation makes no prediction. 

In between-pattern comparisons, the orientation 
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prediction is more complex to derive. First, each 
target arm is assigned a value of +1, 0, or —1 de- 
pending on whether the within-pattern prediction 
is that the target arm is helped, unaffected, or hurt 
by orientation, Second, the values assigned the 
target arms within each pattern are compared—a 
shaft is predicted to group more strongly with the 
target arm which has the higher value, otherwise no 
prediction is made. For example, la gets +1 and 
10a gets —1—therefore, orientation predicts 1a 
> 10a; la(4-1) — 2a(0)—therefore, orientation pre- 
dicts 1a > 2a; and 2a(0) — 11a(0)—therefore, 
orientation makes no prediction, 

The third predictor, enclosure, is used only in 
between-pattern comparisons and only when the 
target arms in comparable displays are at the same 
angle to the shaft, Enclosure specifies that the shaft 
groups more strongly with the target arm when the 
target arm is ipsilateral with, and thus enclo 
the common arm, Illustrative predictions are 
la > Sa and 2a > 4a. 

Contradictory predictions. When only one predic- 
tor was applicable, or when two predictors were 
applicable and both predicted the same thing, the 
prediction was said to be unequivocal. (In no 
comparison were all three predictors applicable.) 
In the of equivocal predictions, however (when 
two predictors were applicable and made opposite 
predictions), a simple tie-breaking rule was invoked 
—when rectilinearity contradicted orientation, recti- 
linearity was expected to dominate when the recti- 
linearity effect was large (one target arm was at 
180°) and orientation was expected to dominate 
when the rectilinearity effect was zero (the target 
arms were at 90° and 135°)—although the latter 
case could have been classed as unequivocal, it was 
classed as equivocal so as to distinguish it from cases 
in which rectilinearity differences did not exist and 
rectilinearity was totally inapplicable. In all other 
cases of contradiction, no prediction was made— 
the antagonistic predictors were considered to be 
equivalent in strength so that no overall prediction 
could be ventured with safety, 

Stimuli with symmetrical arms. Data from 24 
stimuli will not be discussed—these are stimuli in 
which the arms were symmetrical about the shaft 
so as to forma Y, a T, or an arrowhead. They will 
not be discussed for four reasons: (a) no predictor 
was able to generate a pre ion on the sign of the 
mean rating within a given stimulus; (b) in between- 
stimulus comparisons, there was no basis for deciding 
which arms to call common arms and which to call 
target arms, and thus there was no way of knowing 
which arm a prediction should be made for; (c) 
four, rather than eight, stimuli existed per pattern; 
and (d) no systematic tendency in the ratings given 
these stimuli was apparent. 


RESULTS 


Figure 2 is organized as follows. The 
two functions on the left present data from 
patterns in which Arm b is at 45°, with 
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FiGURE2, Means with 1 standard error of each 
mean for 18 stimulus patterns, (An instance of one 
of the 8 stimuli in each pattern is shown below the 
abscissa in schematized form, with Arm a dashed, 
Positive ratings indicate that the shaft grouped 
with Arm a; negative ratings, with Arm b.) 


Arm a rotating from 90° ipsilateral with 
Arm b, to 90° contralateral to Arm b. The 
two middle functions present data from 
patterns in which Arm b is at 90°, with 
Arm a rotating from 135° ipsilateral to 
135° contralateral. The two right-hand 
data points are from patterns in which 
Arm b is at 135? and Arm a at 180°, 
Finally, with the exceptions noted below, 
the solid functions present data from pat- 
terns in which orientation promotes group- 
ing toward Arm a, and the dashed func- 
tions present data from patterns in which 
orientation promotes grouping toward Arm 
b. 

Within-pattern predictions. Rectilinearity 
and orientation both predicted grouping 
toward Arm a in Patterns 1, 3, 5, and 9. 
Rectilinearity alone predicted grouping 
toward Arm a in Patterns 2, 4, 7, 11, 13, 
and 16. Orientation alone predicted group- 
ing toward Arm a in Patterns 6 and 8 and 
toward Arm b in Patterns 15 and 17 (in 
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these four patterns, the rectilinearity dif- 
ference was the impotent 90?—135?). Orien- 
tation favoring Arm b opposed rectilinearity 
favoring Arm a in Patterns 12 and 18, but 
rectilinearity was predicted to dominate 
because of the presence of a 180° arm. 
Finally, no prediction was made for Pat- 
terns 10 and 14 where orientation favoring 
Arm b was opposed by a moderate recti- 
linearity effect 7(arms at 45° and 90°) 
favoring Arm a. 

Overall, then, the algorithm made pre- 
dictions on the sign of the mean rating 
given 16 of the 18 patterns, and was correct 
16/16 times, p < .001 in the binomial dis- 
tribution 7 = 16, p = .5. (The binomial 
distribution was used to calculate all prob- 
abilities reported below.) 

Between-pattern predictions. As noted 
above, the algorithm for generating be- 
tween-pattern predictions showed that 129 
of the 153 pattern combinations were com- 
parable. In 20 of the 129, however, the 
algorithm generated no prediction for the 
following reasons. In 3 comparisons (2-11, 
4-13, and 7-16), no predictor applied. As 
the orientation predictor was among those 
that failed, it was unable to determine 
which member in each of the three pairs 
should go in the solid “Orientation Helps 
Arm a” functions and which in the dashed 
“Orientation Helps Arm b" functions. Ac- 
cordingly, Patterns 2, 4, and 7 were placed 
in the solid functions in order to keep the 
solid and dashed functions from crossing. 

In 17 more comparisons, two predictors 
made contradictory predictions which the 
algorithm was not programmed to resolve — 
in 2 comparisons (8-15 and 5-10), orienta- 
tion opposed enclosure and in 15 com- 
parisons, rectilinearity opposed orientation 
with target arms at 45° and 90° (2-6, 8; 
4-6, 8; 6-11, 13; 7-12; 8-11, 13; and 12-16) 
or at 45° and 135° (9-12; 10-15, 17; and 
14-15, 17). 

Of the 109 comparisons that remain, 
predictions were generated and were correct 
103/109 times (p < .001). These may be 
broken down into 75/79 correct (p « .001) 
when predictions were unequivocal and 
28/30 correct (p < .001) when predictions 

were equiyocal, 


LUBOMIR 5, 


PRYTULAK 


A further breakdown of the 79 unequivo- 
cal predictions shows: (a) 41/45 correct 
(p < .001) when rectilincarity and orienta- 
tion made compatible predictions (predic- 
tion confirmed : 1—6, 7, 8, 16; 2-3, 9, 14, 15, 


17; 3-4, 7, 10, 11, 13, 14, 16, 18; 4—9, 14, 
15, 17; 5-6, 7, 8, 16; 6-9; 7-15, 17, 18; 
8-9; 9-11, 13; 11—14, 15, 17; 13-14, 15, 17; 


15-16; and 16-17, 18; prediction discon- 
firmed: 2-10; 4-10; 10-11, 13)—the 4 in- 
correct predictions, it will be noted, all 
involve Pattern 10, whose mean was ex- 
pected to be lower, even lower than the 
mean of Pattern 13; (b) 18/18 correct 
(p < .001) in comparisons in which recti- 
linearity was the only applicable predictor 
(1-3, 15, 17; 3-5, 9; 5-15, 17; 6-10, 14, 18; 
8-10, 14, 18; 9-15, 17; 10-12; and 12-14, 
18); (c) 8/8 correct (p < .01) in compari- 
sons in which enclosure was the only 
plicable predictor (1-5; 2-4, 13; 4-11; 6-8; 
10-14; 11-13; and 15-17); (d) 6/6 correct 
(p < .05) in comparisons in which orienta- 
tion was the only applicable predictor 
(1-10; 3-12; 5-14; 6-15; 8-17; and 9-18); 
and (e) 2/2 correct (p > .05) in compari- 
sons in which orientation and enclosure 
made compatible predictions (1-14 anc 
6-17). 

A further breakdown of the 30 equivoca 
predictions shows (a) 19/20 correct (^ 
-001) when rectilinearity contradicted « 
tation but when rectilinearity domi: 
because one of the target arms was at 1 40? 
(prediction confirmed: 2-12, 18; 4-12, 18; 
3-12; 6-7, 16; 7-8, 10, 14; 8-16; 10-16: 
11-12, 18; 12-13; 13-18; 14-16; 15-18; and 
17-18; prediction disconfirmed : 1-12) ; (b) 
9/10 correct (p < .01) when orientation 
dominated the impotent 909-135? rectilin- 
earity difference (prediction confirmed : 1-2, 
4, 11, 13; 4-5; 5-11, 13; 7-9; and 9-16; 
prediction disconfirmed : 2-5). 

Although the above listing of predictions 
gives a precise and complete account of 
what the algorithm does and does not ac- 
complish, it may be helpful to note a few 
of the key effects as they appear in Figure 
2. First, the difference between solid and 
dashed functions reflects the effect of orien- 
tation helping Arm a in the first case and 
helping Arm b in the second (with the 
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exception of Patterns 2-11, 4-13, and 7-16, 
as discussed above). Second, enclosure 
accounts for the means to the left of the 
two 180° positions on the abscissa being 
higher than the corresponding means to 
the right. Thus, 1 > 5, 2 > 4, 10 > 14, 
11 > 13, 6>8, and 15 » 17. Third, if 
Arm a’s approach to rectilinearity facili- 
tates grouping, why is there a drop in the 
function from Pattern 1 to 2 or from 5 to 
4? The answer is that the 90°-135° step 
toward rectilinearity is the one that fails 
to enhance grouping, and that 2a and 4a 
lose the help of orientation that boosted 1a 
and 5a. Fourth, the rise from 14 to 13 
and the incorrectly predicted rise from 10 
to 11 were based on Arm a's gaining strength 
from orientation, the rectilinearity differ- 
ence again being impotent. Fifth, the 
ordering 7b > 9b >3b (recall that when 
a target arm is a b arm, the sign of the mean 
rating must be reversed) can be explained 
as follows: 7b > 9b (orientation, rectilin- 
earity impotent); 9b > 3b (rectilinearity) ; 
and 7b > 3b (rectilinearity and orienta- 
tion). Sixth, the ordering 18b > 16b > 
12b can be partially explained as follows: 
18b > 16b (orientation, rectilinearity im- 
potent); 18b > 12b (rectilinearity); 16b- 
12b, however, generates no prediction (a 
moderate rectilinearity effect favors 16b 
but orientation favors 12b). Seventh, 15b 
> 11b and 17b > 13b (rectilinearity and 
orientation). These, then, are a few of the 
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bigger effects that catch the eye in scanning 
Figure 2. 


Discussion 


Overall, the simple algorithm employed 
above made 16/16 correct within-pattern pre- 
dictions and 103/109 correct between-pattern 
predictions. It does not seem too much to 
say, then, that the algorithm generates an un- 
precedented bounty of predictions and that 
these turn out to be mostly correct. 

The success of the algorithm suggests that 
Max Wertheimer's (1958) original observation 
that juxtaposition at 180? produces stronger 
grouping than at other angles needs to be 
elaborated in two ways. First, the function 
relating strength of grouping to angle of juxta- 
position appears to be nonlinear—compared 
to the slope between 45° and 90°, the slope is 
relatively shallow between 90° and 135° and 
is relatively steep between 135° and 180°. 
Second, good continuation is not a function 
merely of the degree of rectilinearity; it is a 
compound of three phenomena—rectilinearity, 
orientation, and enclosure. As one of these can 
enhance or cancel another, any attempt to 
examine one must be guided by an awareness 
of all three. 
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The hypothesi is tested that the phenomenon of recognition failure of re- 
callable words is attributable to the discrepancy between semantic proper 
of encoded target words and the semantic interpretation of corresponding recog- 
nition test words. Target words were presented for study and encoding in the 
presence of specific list cues, and subsequently tested with strong associates 
of target words serving as extralist cues. In one critical encoding condition, 
the list cues were semantically congruous with target words; in another, they 
were identical with target words. Results showed that congruous encoding 
rendered extralist cues only slightly less ineffective than incongruous encoding, 
and that under the conditions of “identical” encoding extralist cues were least 
effective. The results thus provided little support for the hypothesis, sug- 
gesting instead that specificity of encoding of word events in episodic memory 


transcends the semantic meaning of words. 


This article describes two experiments 
designed to explore the phenomenon of 
recognition failure of recallable words re- 
cently reported by Tulving and Thomson 
(1973). In the Tulving and Thomson ex- 
periments, target words T (eg., CHAIR) 
were presented at input in the company 
of cue words C (eg. glue) Subjects 
expected to be tested for recall of T with 
C as a cue. When, instead of presenting C 
as a cue, the experimenter provided another 
word X (e.g., fable)—which was a close 
semantic associate of the target word T 
but which had not appeared anywhere in 
the list—as an extralist retrieval cue, sub- 
jects could not readily use it as an aid in 
recall of T. Retrieval of the target word 
T was rather poor even in a subject-gener- 
ated recognition test. In this test, subjects 
were instructed to write down free-associa- 
tion responses to the cue X, and then were 
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asked to identify the generated responses 
as copies of target words from the studied 
list. Subjects had no difficulty generating 
copies of target words, but they did have 
difficulty recognizing them as "old" words 
from the studied list. In one experiment, 
for instance, only 24% of generated target 
words were correctly recognized in such a 
subject-generated recognition test, although 
63% of all target words were subsequently 
recalled when the word C that had been 
paired with T at input was presented as 
cue. 

The effectiveness of extralist retrieval 
cues consisting of strong associates of target 
words is a well-established phenomenon, 
under a variety of experimental conditions 
(e.g., Bahrick, 1969; Light, 1972; McLeod, 
Williams, & Broadbent, 1971; Thomson & 
Tulving, 1970), and hence the lack of their 
potency under the conditions of the Tulving 
and Thomson (1973) experiments is of some 
interest. What renders extralist cues in- 
effective? Why cannot certain recallable 
words be recognized ? 

A general interpretation of all acts of 
successful and unsuccessful retrieval of in- 
formation from episodic memory is given 
by the encoding specificity principle (Tulv- 
ing & Thomson, 1973). It states that the 
properties of the memory trace of a word 
event are determined by specific encoding 
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operations performed on the input stimuli, 
and that it is these properties, rather than 
the properties of the word in semantic 
memory, that determine the effectiveness 
of any given stimulus as a retrieval cue for 
the event. The principle suggests that if 
a stimulus in the retrieval environment 
renders possible or facilitates recall of the 
target word T, the retrieval information 
was appropriate to or compatible with the 
information contained in the episodic trace 
of T. Conversely, if a particular stimulus 
is ineffective in retrieving a particular trace, 
the conclusion follows that the appropriate 
relation was lacking. It is not yet known 
exactly what constitutes the "appropriate" 
relation between information contained in 
the trace and information present in the 
general retrieval environment or in a spe- 
cific cue. Nor is the nature of the retrieval 
process as such clear. In absence of such 
knowledge, we assume that the unknown 
relation is one of "similarity" or "informa- 
tional overlap," and that the retrieval. 
process consists of some sort of combining 
of information from the two sources. 

The failure of recognition of recallable 
target words in the Tulving and Thomson 
(1973) experiments thus can be interpreted 
as a special case of encoding specificity. It 
can be assumed that the trace of the target 
word T that had been encoded in relation 
to a particular list cue C did not possess 
sufficient overlap with the copy of the 
target word that was generated as a seman- 
tic associate of the extralist cue X. This is 
why X failed to bring about recovery of in- 
formation in the trace of the target T. The 
encoding specificity principle, however, says 
nothing further about conditions or mecha- 
nisms responsible for inadequate overlap or 
complementarity between the trace of the 
C-T complex and the copy of T. But, if 
one accepts the reasoning so far, it is pos- 
sible to ask the next question: Why do the 
encoding operations performed by the sys- 
tem on the target word T sometimes create 
a trace whose informational content can- 
not be matched by that of cue X, a strong 
semantic associate of T? Thus, for in- 
stance, why does the trace of the target 
word CHAIR, when presented as a part of 
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the glue-CHAIR compound, differ from the 
trace of the target word CHAIR presented 
by itself, so that in the former case cues 
table and chair are relatively ineffective, 
although they both are quite effective under 
different experimental conditions (Bahrick, 
1969; Light, 1972; McLeod et al., 1971; 
Thomson & Tulving, 1970; Tulving & 
Thomson, 1973) ? 

One hypothetical answer to this question 
was studied and evaluated in two experi- 
ments reported in this article. The answer 
consists of two parts: (a) what is stored, or 
at least what is retained after an interval, 
about a target word in a typical list-item 
memory experiment is the "meaning" of 
the word, and (b) pairing of the target 
word T with a certain list cue C at input 
produces a specific meaning of T that differs 
from the meaning of the same lexical unit 
perceived or produced in a different con- 
text, such as in a free-association test in 
which some other word X elicits T. Thus, 
CHAIR presented in a to-be-remembered list 
of words as a part of the glwe-CHAIR com- 
pound is assumed to have a different se- 
mantic representation in memory than 
CHAIR generated as a semantic associate to 
the word fable. The failure of recognition 
of recallable target words, the argument 
goes, then comes about for the same reason 
as the decrement in recognition of a homo- 
graphic target word when the semantic in- 
terpretation of its copy at the time of the 
test is changed through changes in its 
verbal context (e.g., Light & Carter-Sobell, 
1970; Winograd & Conn, 1971). 

This semantic explanation of encoding 
specificity manifested in the phenomenon 
of recognition failure of recallable words 
was tested in the experiments described 
here. The experimental paradigm used 
was one in which the lexical identity of the 
target words was held constant and in 
which the effectiveness of various retrieval 
cues was examined as a function of seman- 
tic overlap between encoding context of 
the target word and the retrieval cue. 
Thus, retrieval of target word T was tested 
with extralist cue X following the presenta- 
tion of the target word in one of three 
input contexts: (a) in the company of cue 
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words whose dominant semantic meaning 
was relatively incongruous with that of the 
target word T and the extralist cue X; 
(b) with cue words whose meaning was 
congruous with that of the semantic target 
word and the extralist cue; and (c) with 
cue words that were identical with target 
words. Thus, for instance, with the word 
CHAIR serving as target, the Incongruous 
encoding context or condition was repre- 
sented by the pair glwe-cuair, the Congru- 
ous condition by the pair furnish-CHAIR, 
and the Identical condition by cAuzr-CHAIR, 
The extralist retrieval cue was /able. It 
was assumed that the semantic information 
contained in the retrieval cue table would 
be most compatible with the stored infor- 
mation about the target word CHAIR pre- 
sented in the chazr-CHAIR compound (Iden- 
tical encoding), somewhat less compatible 
with CHAIR presented in the furnish-CHAIR 
compound (Congruous encoding), and least 
compatible with information about CHAIR 
that had appeared as a part of the glue- 
CHAIR compound (Incongruous encoding). 
In keeping with the semantic encoding 
hypothesis it was expected that the effec- 
tiveness of the extralist cue would be di- 
rectly related to the semantic similarity 
or compatibility between the cue and the 
target word. 


EXPERIMENT | 
Method 


Materials and design, Target words presented in 
three different encoding contexts were recalled in 
three successive The three encoding condi- 
tions were all represented within a single list that 
contained 24 target words. Each target word was 
accompanied at presentation by another word, a 
list, or input cue. Three types of list cues defined 
the three encoding conditions: (a) incongruous, (b) 
congruous, and (¢) identical. The designations 
refer to the semantic relations between target words 
or subsequently presented extralist cues on the one 
hand and the list cues on the other. Each type of 
list cue appeared with 8 target words in the 

Three succ: 


sive retention tests were those used 
in previous experiments by Tulving and Thomson 
(1973): (a) a test in which strong extralist associates 
of target words served as retrieval cues, (b) a recog- 
nition test in which copies of target words were 
generated by Ss as free associations to strong 
extralist associates of target words, and (c) a test in 
which list cues served as retrieval cues, 


ENDEL TULVING 


The basic set of materials used in the const: ction 
of experimental lists is presented in Table L “ach 
of the 24 target words is shown together ~ its 
extralist associate which was used in the fir aed 
recall test, and with both the Congruous a he 
Incongruous input cue. The 24 target word re 
specifically selected from the materials us in 
earlier experiments described by Tulving and 7 l= 
son (1973) to yield low recognition scores. Exi t 


cues and the incongruous input cues of thes 4 
target words were the same as those used in « r 
experiments. The identical list cues are not sh« n 
in Table 1, for obvious reasons. Extralist and. .- 
congruous cues had been selected in the carior 
experiments (Thomson & Tulving, 1970) from t 
sets of free-association norms, In these norm 
target words were primary associates of extrali: 
cues; they were given as responses to incongruou: 
cues 17 of the time, and extralist and incongruous 
cues were not related. The congruous input cues 
were generated by Æ, for the purpose of the present 
experiment, to represent words closely related 
semantically to both the target words and their 
extralist associates. By perusing the listing of the 
experimental materials displayed in Table 1, the 
reader can convince himself that the congruous lis! 
cues are more closely related to the target word 
and the corresponding extralist cue words than ar« 
the incongruous cues. 

For the purpose of balancing of specific words and 
their encoding conditions, three different lists we: 
constructed of the materials that appear in Table i 
The 24 target words were divided into three subset 
of 8, each subset being used with a different set of 
input cues (incongruous, congruous, and identical) 
in one of the three lists. The lists were used with 
equal frequency in the experiment (specifically, 
since there were 50 Ss, with 17, 17, and 16 Ss 
respectively, in each group). Half of the target 
words in each subset of 8 were tested in the first 
retention test (with extralist associates of targets 
às cues) while the other half was tested in the 
second test (subject-generated recognition test), 
with the two halves counterbalanced across the two 
tests. Thus, each of the 24 target words was repre- 
se nted with equal frequency in all conditions of the 
experiment, 

Prior to the presentation of the critical experi- 
mental list, Ss were given a single practice and set- 
establishing list. It consisted of 24 target words, 
half of which were accompanied by input cues 
identical with target words and half accompanied 
by incongruous cue words comparable to those used 
in the critical experimental lists, 

Subjects and procedure. Fifty Ss, undergraduate 
students at Yale University and the Southern 
Connecticut State College, served in Experiment I, 
either in fulfillment of course requirements or for 
pay. They were tested in small groups of from 2 to 
4 persons, in a session lasting approximately 40 min. 

The initial instructions, given prior to the presen- 
tation of the set-establishing list, informed Ss of the 
general nature of their task and the type of ma- 
terials to be presented. They were told that they 
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would see pairs of words presented on the screen in 
front of the room, that their task was to remember 
the capitalized target words, but that they should 
also pay attention to the cue word accompanying 
each target word and notice the relation between 
them, since these cue words would help them to 
remember the target words. No special mention 
was made of the fact that some of the pairs would 
consist of identical words, nor were Ss told anything 
particular about the retention tests. 

The cue-target pairs from the set-establishing list 
were presented by means of a slide projector, at 
the rate of 3 sec/pair. At the end of the presenta- 
tion Ss were asked to open their recall booklets that 
had n handed out to them at the beginning of 
the session, turn to page 3 in the booklet, read the 
instructions, and proceed with the retention test. 
On this recall sheet the 24 input cues from the list 
were presented in an order unrelated to the input 
order. The instructions at the top of the recall 
sheet told Ss to write down the capitalized words 
that had accompanied the given word cues in the 
list. Subjects were given 3 min, for the completion 
of this test. 

Next, Ss were told that they would be shown a 
new list of pairs of words, and that their task was 
the same as before—to remember each capitalized 
target word, noting the relation between it and i 
cue word. The nature of the relation between 
target and cue words was not specified or further 
commented upon. 

The 24 pairs of the experimental list were then 
presented, again at the rate of 3 sec/pair. At the 
end of the presentation Ss were asked to turn to 
page 6 of the recall booklet, read the instructions on 
the top of the page, and then proceed, (The pages 
for recall tests in the recall booklet were always 
separated by two numbered blank pages, hence this 
numbering system.) The recall instructions in- 
formed Ss that their task was to try to recall the 
capitalized words they had seen in the list, but that 
this time the cues presented on the page were 
different from those they had seen in the input list. 
The Ss were told to look at each cue word given 
on the recall sheet, see whether it reminded them of 
any target words they had just seen, and, if so, 
write it down beside the cue word. Twelve extralist 
cues, corresponding to four target words from each 
of the three encoding conditions, appeared on the 
recall sheet. Subjects were given 3 min. for this 
task. 

Next, Ss were asked to turn to page 9 in the recall 
booklet, which contained 12 extralist cues corre- 
sponding to the target words that had not yet been 
tested, Extralist cues, each followed by four spaces 
for free-association responses, were listed in a 
column on the left-hand side of the page. The in- 
structions at the top of the page told Ss that their 
task now was to produce free associations to the 
presented cue words. They were to write down four 
words in response to each of the cue words they 
saw on the page, words that the cue word “made 
them think of." This was an unpaced task, and Ss 
were given as much time as they needed to complete 
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TABLE 1 
"TARGET WORDS AND CUES 


T, : Congruous Incongn 
Target words Pete pedis 
BALL. tennis player whistle 
BLUE sky heaven pretty 
CHAIR table furnish glue 
COAT |- lining cloth covering 
COLD | hot fire ground 
DAY | night evening sun 
DIRTY | clean wash barn 
FLOWER bloom leaf fruit 
| eat meal moth 
grass lawn cheese 
finger arm tool 
soft smooth glass 
low down hope 
small size stomach 
LIGHT dark black head 
MAN | woman child command 
want desire bath 
OPE closed shut country 
ROUND | square circle cabbage 
SHORT. long thin stem 
SLOW | fast speed memory 
WATER lake fish whisky 
WET | dry moisture | cave 
V | blow move | noise 


it. When all Ss in the group had finished writing, 
they were told to look at each word they had 
written down, decide whether or not it had occurred 
one of the target. (capitalized) words on the pre- 
ceding study list, and, if they thought it had, draw 
a circle around the word. Four minutes were given 
for the completion of this task. 

Finally, 5s were asked to turn to page 12 of the 
recall booklet, read the instructions on the top of 
it, and proceed as instructed. This page contained 
all 24 input cues, in a scrambled order with respect 
to the order of their original appearance in the 
study list. The instructions informed Ss that these 
were the input cues from the list, and that their 
task was to write down as many target (capitalized) 
words from the study list as they could, each one 
beside its corresponding cue word. Four minutes 
were allowed for this task, 

The experimental session concluded with Æ ex- 
plaining the purpose of the experiment to Ss and 
answering any questions they had. 


EXPERIME 


The Ss in Experiment ] seemed to do 
considerably better in the first retention 
test (cuing with extralist associates) than 
in the second (subject-generated recogni- 
tion test). Since these data suggested that 
the order of the two tests may have been 
an important source of variance in the 
retention scores, Experiment II was con- 
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ducted to replicate Experiment I in every 
respect, with the sole exception of the order 
of the two tests. In Experiment II, the 
first of three tests was the subject-generated 
recognition test, while the second one was 
the cued recall test with extralist associates 
as cues. Otherwise the procedure was 
identical with that used in Experiment I 
in all details. The same materials were 
used both in the set-establishing list and 
the experimental list, the same counter- 
balancing procedure was followed, and the 
same final test with input cues was given. 
Thirty-six new Ss from the same sources as 
in Experiment I served in this experiment. 


REsuLTS 


Set-establishing lists. The mean number 
of words recalled from the first set-estab- 
lishing list was 16.7 in Experiment I, and 
17.8 in Experiment II. In both experi- 
ments slightly more target words were re- 
called to identical than to incongruous cues 
(M = 8.8 and 9.5 in the two experiments, 
respectively), The mean number of in- 
trusion errors was 1.34 and 1.83 per S in 
Experiments I and Il, respectively. Of 
these intrusion errors in the two experi- 
ments, 44% and 58%, respectively, were 
"identical" intrusions: S gave a response 
identical with the cue although in the input 
list the cue had been paired with another 
word. It is possible that the higher recall 
of target words to identical cues was at 
least partly a consequence of inflation of 
recall scores by unidentifiable intrusions, 
but since it is not entirely clear what would 
constitute an appropriate correction, no 
attempt was made to assess the possible 
guessing bias. 

Scoring and analysis. Analysis of the 
cued recall data, in all tests in which extra- 
list or input cues were used, was straight- 
forward. ich S's recall score was the 
number of target words recalled, from a 
maximum of four or a maximum of eight per 
condition in the two tests, respectively. A 
stringent scoring criterion was used: S re- 
ceived credit for every target word only if 
it was correctly paired with its cue. (The 
inclusion in the data of target words 
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recalled to incorrect cues, or to n jar- 
ticular cues, would not have chang the 
conclusions.) 

Intrusion errors were ignored in all avoiy- 
ses. It is not immediately obviou: w 
their inclusion in the results would ch: e 
the overall conclusions drawn from e 
experiments. 


In scoring the protocols from the su! 
generated recognition tests, Ss were givon 
credit only for those copies of target words 
generated and recognized that belonged | 
the half of the input list being tested i: 
the recognition test. In Experiment | 
there was a total of 33 cases in the protocols 
of the 50 Ss where a copy of the target 
word had been generated as a part of the 
Íree-association procedure and correctly 
recognized even though these words nomi- 
nally belonged to the other half of the list 
that had been tested in the immediately 
preceding cued recall test using extralist 
retrieval cues. Since it was impossible to 
rule out the possibility that the words from 
the other half of the list constituted impor- 
tations from the immediately precedin 
test rather than representing Ss’ memory 
for the material seen in the input list 
(particularly since most of these generated 
and recognized words had been in fact 
recalled by Ss in the immediately preceding 
test), they were excluded from the analysis. 

The data from the subject-generated 
recognition tests required a special treat- 
ment inasmuch as the maximum possible 
score of correct identification. of target 
items depended upon the number of copies 
of target items any particular S had gener- 
ated in the free-association test. There 
were three essential steps in the analysis of 
these data. 

First, the number of copies of target 
items correctly generated in the free-associ- 
ation test were tabulated for each S, sepa- 
rately for target words from each of the 
three encoding contexts. The means of 
these scores were 2.84, 2.76, and 2.91 for 
the incongruous, congruous, and identical 
encoding conditions, respectively, in Ex- 
periment I, and 2.98, 3.11, and 2.44, for 
the same three encoding conditions in Ex- 
periment II. There is no obvious explana- 
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tion for the lower value of this statistic in 
the identical encoding condition in Experi- 
ment II, and since the same decrement was 
not observed in Experiment I, whatever 
implications this finding might have will 
be ignored in what follows. 

The second step consisted of the calcula- 
tion of proportions of copies of target words 
that were recognized by each .S in each 
encoding context. Thus, for instance, if a 
given S generated three copies of the target 
item in a particular encoding condition, and 
recognized one of these, his recognition 
score (hit rate) was $ or .33. The means 
of these proportions are provided as sum- 
mary statistics for the subject-generated 
recognition test in Table 2. In a small 
number of cases, when S did not generate 
any copies of target words for a particular 
encoding condition in the free-association 
test, and hence could not possibly have 
recognized any, his score was considered 
indeterminate and was not entered into any 
analyses. 

The third step in the analysis of the 
recognition scores was necessary for the 
purposes of the statistical evaluation of the 
data. Since the distributions of the recog- 
nition hit rates from individual Ss in dif- 
ferent encoding conditions in most cases 
were not distributed normally, statistic: 
tests involving these measures were deemed 
inappropriate, Another measure of recog- 
nition performance was used instead. For 
each S in each encoding condition a right 
minus wrong (R — W) score was calcu- 
lated.’ Each target word whose copy had 
been generated and correctly recognized 
was considered R while each generated 
copy of a target word that was not recog- 
nized was considered W. For instance, an 
S who generated three copies of targets in 
a particular encoding condition, and recog- 
nized one of these, received an R — W score 
of 1 — 2 = — 1. The differences of these 
R — W scores for all three comparisons 
(incongruous ys. congruous, incongruous 
vs. identical, and congruous vs. identical) 
were distributed normally, thus permitting 


5] am grateful to Perry Gluckman for suggesting 
this analysis. 
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the use of / tests for related measures. 
Thus, while the data in Table 2 are sum- 
marized as mean hit rates, the correspond- 
ing analyses were conducted on the R — W 
scores as explained. "There was, of course, 
a high although not a perfect correlation, 
within each of the encoding conditions, 
between individual Ss’ hit rates and R — W 
scores, the coefficient exceeding .90 in all 
cases. 

Retrieval data. Table 2 presents a sum- 
mary of the data on retrieval of target 
words from three different encoding con- 
ditions in three different tests. 

Consider first the mean proportions of 
target words retrieved in various tests, 
with data pooled over the three encoding 
conditions in each test. ‘Two observations 
are of interest. First, the order of the first 
two tests, both involving extralist cues, 
appears to have exerted considerable effect 
on the level of retrieval: Ss’ performance 
was higher in the first test (cued recall 
with extralist cues in Experiment I, and 
subject-generated recognition test in Ex- 
periment II) than in the second test. To 
evaluate test-order effects, a ¢ test for un- 
related measures was done comparing the 
differences between Ss’ scores on the cued 
recall and recognition tests in Experiment 
I with the same difference scores in Experi- 
ment Il. The obtained value for ¢ (84) 
was 4.57, p < .01. It appears, therefore, 
that some general test interference was 
produced in the course of the first test that 
suppressed the level of performance in the 
second, The proportions of copies of target 
items generated in the recognition test were 
identical at 71% in both experiments, sug- 
gesting that it is only the episodic and not 
the semantic (Tulving, 1972) component 
of the task that reflects interference, but 
given the design of the experiments, it is 
difficult to say anything more about the 
nature and sources of the test interference. 

The second point with respect to re- 
trieval scores pooled over different encoding 
conditions concerns the superiority of the 
performance in the final cued recall test 
over that in the other two tests, in both 
experiments. Statistical evaluation of the 
data was accomplished by means of ¢ tests 
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TABLE 2 


MEAN PERCENTAGES AND STANDARD DEV 


RECOGNIZED IN THREE 


TIONS OF TARGET Wonps Re: 
TS AS A FUNCTION OF ENCODING Con 


ALLED AND 
T 


Test 
Banennet and Cued recall, extralist cues | Subject-generated recognition Cued recall, list eue 
encoding contex A EL 
M SD M SD M SD 
Experiment I (N = 50) 5 2 
cone 51.5 34.0 26,3" 31.5 | 60.5 
Congruous 52.0 27.4 36.5" 32.3 49.8 
Identical 39.0 30.0 24.6" 31.1 46.5 
M 47.5 23.4 28.6" 22.9 52.2 
Experiment II (N — 36) » 
dene imdis : 28.5 | 29.4 42.4» 35.7 51.0 23.2 
Congruous 36.1 | 29.6 46.1^ 30.4 51.4 22.9 
Identical 22.9 25.6 26.8» 27.2 36.8 30.3 
M 29.2 20.1 39,9^ 22.5 46.4 19.3 
AN =45. 
bN = 32. 


after the data in various tests were normal- 
ized in order to equate the three tests for 
maximum scores possible. The resultant 
t test was highly significant when the per- 
formance in the third test was compared 
with the performance in the second test in 
each of the two experiments, / (49) = 6.75 
in Experiment I, and / (35) = 5.28 in Ex- 
periment II, both ps < .01. The / values 
were not significant when recall with input 
cues was compared with probability of 
retrieval in the first test in each of two 
experiments, / (49) = 1.74 in Experiment 
I, and ¢ (35) = 1.88 in Experiment II, in 
both cases .05 < p < .10. 

The data of primary interest concern 
differences in retrieval of target words that 
had been presented in different input con- 
texts in the study list. The differences 
between encoding conditions were evalu- 
ated by means of ¢ tests for related mea- 
sures, separately for each of the three tests 
in both experiments. The results of these 
tests are tabulated in Table 3. 

Although there was a slight numerical 
superiority in extralist cued recall and 
subject-generated recognition scores in the 
congruous input cue conditions oyer those 
in the incongruous input cue conditions, 
statistical analyses of the data, summarized 


in Table 3, showed that these differences 


did not reach acceptable degrees of relia- 
bility. The largest difference between the 
incongruous and congruous encoding con- 
ditions was observed in the subject-gener- 
ated recognition test in Experiment I 
(26.3% vs. 36.5%), yielding a £ (44) = 
1.47, .05 < p < .10. In Experiment I, 
recall in response to list cues of target 
words that had appeared in the context of 
incongruous input cues was higher (60.5%) 
than recall of target words that had ap- 
peared in the context of congruous encod- 
ing cues (49.8%), t (49) = 2.80, p < .01, 
and this difference might be regarded as 
tempering the conclusion about lack of 
evidence for the effect of encoding condi- 
tions on retrievability of target words by 
extralist cues or their copies, but since 
Experiment II did not replicate this find- 
ing, its implications cannot be taken too 
seriously. 

Since each to-be-remembered list word 
served as a target word in two successive 
tests, it was possible to examine recall or 
nonrecall of individual words in one test in 
relation to their recall or nonrecall in the 
other test. Indeed, the phenomenon of 
recognition failure of recallable words that 
provided the starting point for the present 
experiments can be most directly defined 
by demonstrating that subjects cannot 
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TABLE 3 


SUMMARY OF / STATISTICS FROM TESTS COMPARING RECALL AND RECOGNITION 
SCORES BETWEEN PAIRS OF ENCODING CONDITIONS 


Cued recall, 
extralist cues 


Cued recall, 


Subject-generated 
input cues 


recognition 


Comparison 


ARCA UU Ee Experiment | Experiment | Experiment 
Hi I Hu 


Congruous vs. Incongruous Al 1.12 1.47 78 —2,80** 08 
Incongruous vs. Identical 2.31* 1.10 A8 2.31* 3.10** 2.82** 
Congruous vs. Identical 2.99** 2.45** 2.19* 2.68** .88 2.60** 


dentify copies of target words as “old” 
but can produce these same words in pres- 
ence of list cues. In the analysis of the 
"fate" of individual words that follows, re- 
trieval of target words in extralist cue 
tests is compared with their retrieval in the 
final list-cue test. 

Frequencies of words that are recalled in 
both the extralist cue test and the final 
list-cue test, as well as words that are re- 
called in neither of these two tests, are of 
little immediate interest. "These data may 
reflect general "ease" or "difficulty" of 
words, breadth of encoding or failure of 
storage, effects of recoding or strengthening 
of traces in the first test, and other similar 
factors. Words that were retrieved only 
in the extralist cue test or only in the list- 
cue test provide more interesting data from 
the point of view of encoding specificity. 
These data are tabulated in Table 4. 

Table 4 lists total frequencies of target 
words that were recalled or recognized in 
the extralist cue test but not recalled in the 
final list-cue test, and frequencies of target 


words not recalled or recognized in the 
extralist cue test but recalled in the final 
test. These two categories of words will 
be referred to as RN words and NR words, 
respectively, indicating the words’ recall 
(R) or nonrecall (N) in the first (extralist 
cue) and the second (list cue) test. The 
left-hand panel of Table 4 presents data 
for target words initially tested in the 
extralist cued recall test, and the right-hand 
panel contains data for target words tested 
in the subject-generated recognition test. 
Each entry in Table 4 consists of two 
figures. The first gives the total frequency 
(with the data pooled over all 50 Ss in 
Experiment I and 36 Ss in Experiment 11) 
of RN words, while the second represents 
the total frequency of NR words. Each 
entry thus can be thought of as a ratio of 
two quantities, the number of RN words to 
the number of NR words, and the nu- 
merical value of this ratio, or some trans- 
formation of it, could be used as a rough 
index of the magnitude of the recognition 
failure of recallable words or the magnitude 


TABLE 4 


TOTAL FREQUENCIES OF TARGET WORDS RECALLED ONLY IN TES 
FREQUENCIES OF WORDS RECALLED ONLY IN 


WITH EXTRALIST CUES, IN RELATION TO 


T WITH List C 


Recall with extralist cues Subject-generated recognition test 


Encoding condition 


Experiment I | Experiment II Total Experiment I | Experiment IT Total 
Incongruous 17/45 15/54 32/99 10/47 17/26 27/13 
Congruous 27/33 9/32 36/65 15/29 16/28 31/57 
Identical 15/40 8/33 23/73 8/30 11/20 19/50. 
Total 59/118 91/237 33/106 44/74 77/180 


32/119 


186 ENDEL 
of the failure of extralist cues in retrieval 
of words known to be represented in the 
episodic memory store. 

Three features of the data in Table 4 
should be mentioned. First, the data, 
pooled over both experiments, are rather 
similar in the two panels of Table 4, sug- 
gesting that underlying retrieval processes 
are probably not entirely dissimilar in 
extralist cued recall test and the subject- 
generated recognition test. Second, in all 
three encoding conditions, in both experi- 
ments and in both tests, the number of 
target words accessible only through list 
cues (NR words) was higher than that of 
words accessible only through extralist cues 
(RN words), Third, extralist cues were 
somewhat more effective for target words 
encoded in the congruous input condition 
than for those encoded under the other 
two conditions, as judged by the ratio of 
RN words to NR words, although the rela- 
tively small number of observations on 
Which these data are based renders the dif- 
ferences statistically unreliable. 


Discussion 


The primary purpose of the experiments 
was to evaluate the hypothesis that specif- 
icity of encoding manifested in the phenomenon 
of recognition failure of recallable words 
(Tulving & Thomson, 1973) is à matter of 
Specificity of semantic meaning of the to-be- 
remembered words and retrieval cu Since 
in earlier experiments the target words were 
encoded in relation to list cues possessing 
little semantic similarity to target words and 
extralist cues, it seemed reasonable to 
that considerable semantic discrepancy existed 
between the target word as stored in that 
particular context and the literal copy of the 
target word presented as retrieval cue in the 
subject-generated recognition test, The gra- 
phemic units might have been the same in 
both cases, but they represented different 
bundles of semantic information. 

The results of the present experiments do 
not provide much support for this semantic 
interpretation. of encoding specificity. Some 
facilitation of retrieval of target words by 
extralist cues was observed under conditions 
Where target words had been encoded in rela- 
tion to cue words semantically much more 


es. 
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compatible with both target words and extra- 
list cues, in comparison with retrieval of target 
words encoded in relation to less congruous 
cues, but this facilitation was rather «mall and 
statistically not reliable. Furthermore, a con- 
siderable number of target words encoded in 
presence of congruous input cues were still not 
recognized in the subject-generated recognition 
t and could not be recalled to strongly as- 
sociated extralist cues, even though they were 
recalled subsequently in presence of list cues, 
indicating that relevant information was avail- 
able in the store. 


tes 


Finally, extralist cues were 
least effective, both in the recall a the 
subject-generated recognition tests, when tar- 
get words were presented under the identical 
encoding condition where the input was 
nominally most compatible with the arget 


word. 

It is entirely possible that the hypotimsis of 
semantic determination of encoding specificity: 
fared so poorly in these experiments ly 
because the congruous encoding conditions 
were not congruous enough and the identical 
encoding conditions did not produce semantic 
encoding. If greater care were taken to en 
sure a high degree of semantic overlap between 
list cues and. extralist cues the hypo! hesis 
might give a better account of itself; 
theless, the fact that it is not easy to eliminate 
the encoding specificity effect by manipulating 
the relation between input cues and extralist 
cues suggests that encoding of a familiar word 
in certain episodic memory tasks (Tulving, 
1972) may entail degrees of specificity that 
cannot be achieved by changing semantic 
context of words. It may be more appropriate, 
therefore, to think about properties of memory 
traces of particular events and e ! 
going beyond the properties of wi 
municative units of language, 
clear what these nonsem 

The low 


Never- 


pisodes as 
ords as com 
It is : 

antic properties are, 
level of recall and recognition of 
target words that had been presented in the 
input list under identical encoding conditions 
was quite unexpected. Several previous ex- 
periments, cited in the introduction, have 
shown that associated extralist retrieval cues 
are quite effective in facilitating recall of tar- 
get words when these words occur in the study 
list without any manipulated context, and 
when the subjects are left free to encode them 
any way they want. There was no particular 
reason to expect, before doing the experiments 
described here, that the presentation of two 
identical copies of a word, one as “cue” and 
the other one as "target," would make any 
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substantial difference in the encoding of these 
list words. Yet the data were quite clear in 
showing that the Identical encoding condition 
was associated with lowest levels of recall and 
recognition, both in extralist cue tests and the 
final input cue test. It is this finding that 
appears to be most difficult to reconcile with 
the “semantic” interpretation of encoding 
specificity, suggesting that other or at least 
additional factors are involved. 

The data from the practice list, in which 
immediate recall of target words was just 
about the same under identical encoding and 
incongruous encoding conditions, suggest that 
it was not the initial level of storage of “identi- 
cally" encoded words that was impaired but 
rather that the low level of subsequent recall 
was some consequence of events that took 
place after the study of the list. It is difficult 
to say anything definite about this impairment 
on the basis of existing data. It may be that 
subjects encoded target words under identical 
conditions primarily in terms of their phonetic 
properties, in terms of a clang association be- 
tween the cue and the target word, and that 
this information was lost more rapidly than 
the specific "semantic" information, or that 
the subsequently presented retrieval cues were 
somewhat less appropriate for providing access 
to this kind of information. It is also con- 
ceivable that the low level of retrieval of 
target words under identical conditions was à 
consequence of a high degree of similarity of 
encoding operations performed on these target 
words during the presentation of the list, If 
part of the information stored in the memory 
trace of an event represents the encoding 
operation performed on the input (Tulving & 
Thomson, 1973), then a high degree of simi- 
larity of encoding operations performed on a 
number of list items could render the resulting 
traces less unique and hence somehow less 
readily retrievable (Lesgold & Goldman, 1973 
von Restorff, 1933). Be it as it may, the 
present. data rather clearly suggest that the 
identity relation between two items, one desig- 
nated as cue and the other as target, does not 
represent the extreme position on the dimen- 
sion of associative or semantic relatedness. 

Finally, in relation to earlier experiments 
(Tulving & Thomson, 1973) the present data 
showed rather a high degree of effectiveness of 
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extralist retrieval cues, despite the fact that 
the materials for the present experiment were 
specially selected from among those used in 
earlier studies to yield a minimal level of 
retrieval in presence of extralist cues. It was 
hoped that such minimal levels of retrieval 
would have made it easier to demonstrate the 
effect of a congruous semantic input con- 
text. It may be that the use of a study 
list containing different sorts of cue-target 
relations was responsible for the discrepancy 
between the present and earlier experiments 
with respect to effectiveness of extralist cues, 
although for the time being this must remain 
pure speculation. 
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BODY TEMPERATURE AND TEMPORAL ACUITY : 


JAMES F. O'HANLON} JAMES J, McGRATH,* ann MICHAEL E. McCAULEY 


Human Factors Research, Inc., Goleta, California 


‘Two experiments were conducted to determine the relationship between tem- 
poral acuity and body core temperature; a third experiment served asa control. 
In Experiment I, thresholds for the perception of the succession of dichoptic 
light flashes were tested when body core temperature was normal and after it 
was increased to 38.3°C. by diathermy. In Experiment II, critical flicker fre- 
quency (CFF) thresholds and an electroencephalogram (EEG) were measured 
during euthermic, hyperthermic (39.0°C.), and recovery conditions. Results 
lof. both experiments indicated that temporal acuity increased with hyper- 
thermia. Perception of succession and CFF were both facilitated by increasing 
body core temperature. The EEG alpha frequency also varied directly with 
body temperature, but was not significantly correlated with CFF. In Experi- 
ment III, no change in pupil diameter was found during hyperthermia, elimi- 
nating an interpretation of the results of Experiments I and II based on the 
Ferry-Porter law. These results were interpreted as su pporting “psychological 
moment” theory, 


A common premise shared by many 
theorists (Craik, 1948; Kristofferson, 1967 ; 
McReynolds, 1953; Shallice, 1964; Stroud, 
1955; Walter, 1950; White, 1963; Wiener, 
1948) is that the nervous system does not 
process all sensory inputs continuously, but 
briefly samples each input and routes 
selected inputs serially into the perceptual 
process, The time required to complete 
one sampling cycle has become known as 
the “psychological moment.” According 
to Harter (1967), an assumption frequently 
made by the "moment" theorists is that 
the nervous system codes all sensory in- 
formation. in terms of sequential order of 
occurrence, All information arriving in 
the nervous system within one moment is 
encoded similarly and perceived as instan- 
taneous and simultaneous; information ar- 
riving over the course of more than one 
moment is perceived as having duration. 
Discrete stimuli which are perceived in 
different moments are perceived as differ- 
ing in time of occurrence, 

According to Stroud (1955, p. 181), iden- 
tical visual events that differ in time of 
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occurrence are psychologically differen- 
tiated at a maximum rate of one per mo 
ment. By his reasoning, temporal acuit; 
(the ability to discriminate between th: 
occurrence of events in time) would b 
proportional to the moment frequency. 

Still other theorists have used the mo 
ment concept to explain the perception of 
time passing (Treisman, 1963; White, 
1963). Their premise has been that somc- 
how the nervous system registers the ac- 
cumulation of moments between events in 
time in a manner permitting the perception 
of the relative durations between those 
events. To those authors, the mechanism 
responsible for the moment also functions 
as a neural pacemaker which governs the 
perceived rate of time passing, That as- 
sumption implies that a factor which in- 
fluences the neural pacemaker would affect 
both temporal acuity and the perception 
of time passing. Whether hyperthermia is 
such a factor is the question addressed in 
the present research. 

Induced hyperthermia has been found to 
increase the perceived rate of time passing ; 
i.e., subjective estimates of a unit of time 
are systematically shortened as body tem- 
perature increases (Hoagland, 1933 ; Kleber, 
Lhamon, & Goldstone, 1963; O'Hanlon, 
Danish, & McGrath, 1968). Hoagland at- 
tributed this result to the physiochemical 
effect of temperature on a hypothetical 
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pacemaker located in the central nervous 
system. However, some investigators (Bell, 
1965; Bell & Provins, 1963) have failed to 
find any significant relationship between 
hyperthermia and performance in time- 
keeping tasks. But those negative results 
have been attributed to possible confound- 
ing effects of interpolated tasks and the 
relatively small induced changes in body 
temperature (Fox, Bradbury, Hampton, 
& Legg, 1967). 

Various investigators have sought to de- 
termine whether induced hypothermia, as 
well as hyperthermia, affects time percep- 
tion in the manner predicted by Hoagland's 
pacemaker model. In two studies (Fox 
et al., 1967; Lockhart, 1971), an increase 
in the perceived rate of time passing was 
found in both hyperthermic and hypo- 
thermic conditions. However, Baddeley 
(1966) found a significant decrease in the 
perceived rate of time passing in scuba. 
divers exposed to cold water as predicted 
by the model. Further support was pro- 
vided by O'Hanlon et al., who found that 
the perceived rate of time passing de- 
creased with hypothermia and increased 
with hyperthermia in the same Ss. There- 
fore, while the evidence is by no means 
unanimous, there appears to be a substan- 
tial degree of support for the existence of a 
temperature-dependent neural pacemaker. 

If the neural pacemaker and the psycho- 
logical moment are related, as suggested 
by White (1963), then hyperthermia would 
be expected to increase temporal acuity. 
Thus, the major purpose of the experi- 
ments described here was to determine if 
temporal acuity, like the perception of 
time passing, varies with hyperthermia. A 
secondary purpose of this research was to 
ascertain whether a change in the brain’s 
dominant alpha rhythm parallels any hy- 
perthermia-induced change in temporal 
acuity. Several authors have suggested a 
relationship between alpha frequency and 
temporal acuity} (Harter, 1967 ; Kristoffer- 
son, 1966, 1967), but no conclusive evidence 
has been offered (Legg, 1968). 

Two separate experiments were per- 
formed to determine" the relationship be- 
tween temporal acuity and hyperthermia. 


In the first, temporal acuity was determined 
from a test of the perception of succession 
(Type 2 test, Fraisse, 1963) ; in the second, 
it was determined using a version of the 
test for critical flicker frequency (CFF), 
Electroencephalogram (EEG) recordings 
were taken during the second experiment. 
A third experiment was conducted to as- 
certain whether hyperthermia changes pu- 
pillary diameter in any way which might 
affect interpretation of the results from the 
former experiments. 


EXPERIMENT I 
Method 


Subjects. The Ss were 11 male 21-28-yr.-old 
college students, who were paid for their participa- 
tion. All had experienced induced hyperthermia in 
a previous experiment. 

Experimental. setting. 
of two adjacent rooms. 


The laboratory consisted 
The S's room contained a 
‘ital bed, a shortwave diathermy má- 
heim, No. SW-660), and a visual 
display. room contained apparatus for 
controlling the display and recording Ss’ tempera- 
ture and responses 

Apparatus, The v 


isual display was à standard 
Lafayette binocular viewing chamber (No. 1202C). 
The display contained two Sylvania glow-modulator 
lamps (No. R1166), each of which illuminated a 
circular diffusion screen with a diameter of 1.3 cm. 
The two difusion screens were 1.0 em. apart and 
were viewed from a distance of 38 cm. through an 
eyepi An opaque septum within the chamber 
permitted. dichoptic stimulus presentation. Each 
lamp provided a flash luminance of about 44.56 
cd/m? inst a background of less than .16 cd/m? 
at the eyepiece. The display control was a modified 
Lafayette flicker-fusion unit (N 1202A). The 
modification permitted the administration of two 
flashes, 10 msec. in duration, one to each eye, Two 
Hunter timers controlled the duration of the interval 
between flashes of the two lights. Tympanie tem- 
perature, which approximates brain temperature 
(Benzinger & ‘Taylor, 1963), was recorded by à 
Digitec digital thermometer (No. 500) connected by 
cable to a thermistor fixed in S's external auditory 
meatu ainst the tympanic membrane, 

Task. Two pairs of light flashes were presented 
in the visual dis one pair was simultaneous, 
the other was successive, The onsets of successive 
flashes were separated by 10-, 20-, 30-, or 40-msec. 
intervals, and all flashes were of 10-msec. duration, 
Upon observing both pairs of flashes, 5 was required 
to state which pair consisted of successive flashes. 
If he was unsure, both pairs were pr ented again. 
The task conformed to the psychophysical met hod 
of constant stimuli with a forced-choice response. 

Procedure. Each S was scheduled for a sequence 
of three sessions with 1 wk. between sessions. 


790 


3 


100 


> 


90r 


80} 


70} 


PER CENT CORRECT RESPONSES 
T 


60} 
so Le : : iue 
o 10 20 30 40 
INTERVAL BETWEEN SUCCESSIVE 
FLASHES (MILLISECONDS) 
Ficure 1. The percentage of correct responses 
by 11 S a function of the duration between 


successive flashes in the first control condition (C1), 
hyperthermia condition (H), and the second control 
condition (C3). 


In e; 
Sweat suit and reclined on the 
Upon request, he 

thermos, which wa: 
sary. 


In the training condition, § performed the task 
in 40 trials with knowledge of results after each 
trial, and then in four blocks of 40 trials without 
knowledge of results, In C, S performed the task 


TABLE 1 


OF A NEWMAN-KEULS TEST or 

RENCE IN TOTAL NUMBER OF 

Correcr RESPONSES ny 11 S, 
COMPARING ALL PossiBLE 

PAIRS or CONDITIONS 


SUMMARY 
Dr 


Comparison Total difference q df ? 
H-C; 63 6.80 | 3,20] <.01 
H-C, 55 5.94 | 2,20 | <01 

CC: 86 |2,20| ns 
Note. Abbreviations: H = hyperthermia, C, 


= first 
condition, and Cs = second control emai’ contral 
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in four blocks of 40 trials, Between 
rested for 5 min. At the end of Cy, S rest 
min., and then began H. In H, diathermy 
plied to the thorax, abdomen, and thivhs 
S's tympanic temperature was raised from | 
age euthermic level of 37.0°C. to 38.3% om 
60-90 min. of diathermy were required tc m- 
plish this change in body temperature he 
criterion temperature, diathermy was disco d, 
and S performed the task in four blocks of 4 ls 
each. Tympanic temperature was maintaii e- 
tween 38.25°C. and 38.35°C. by applying diat ly 
necessary during the 5-min. rest p ls 
1 blocks of trials. In Cs the procedure | is 
me as in Condition C,. 

In all conditions, each block of 40 trials 
following characteristics : (a) In 20t 
of flashes were simultaneous-succ ive, and in . ) 
trials they were succes: ‘e-simultaneous. (b) TI 
sequence of flashes in the successive pair was left 
right in 20 trials, and right-left in 20 trials. (c) 
Each interflash interval (10, 20, 30, or 40 msec. ) 
occurred 10 times in the 40 succ. ssive pairs, Each 
interflash interval was used an equal number oi 
times with each order of pairs and each sequenc« 
of flashes. The order of pair presentations and the 
order of interflash intervals were randomized ind 
pendently within each block of trials, 


s he 
r 15 
ap- 
hat 
cr- 


Results 


Training session data from each § werc 
inspected to ensure that he had arrived at 
a stable level of performance. These data 
were not analyzed further., 

The percentage of correct responses was 
tabulated for each interflash interval in 
Conditions Ci, H, and Cs. These percent- 
ages are shown in Figure 1 as functions of 
the duration of the interflash interval. The 
Percentage of correct Tesponses was greater 
in H than in either of the control condi- 
tions for all values of interflash intervals, 
The differences among performances in 
the C,, H, and C, conditions were tested 
for statistical significance using a two-way 
mixed-design analysis of variance of the 
total number of Correct responses, That 
analysis revealed that the differences among 
total correct responses in the three condi- 
tions were significant, F (2, 20) = 3.74, 
P<.05. A Newman-Keuls test was ap- 
plied to determine the significance of the 
differences between all Possible pairs of 


conditions. The results are summarized in 
Table 1, 


The difference 
Correct responses 


in the total number of 
between the H session 


ti 
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and each control condition was significant 
beyond p < .01. There was no significant 
difference between the control conditions. 


EXPERIMENT II 
Method. 


Subjects. The same 11 men who had served as 5s 
in the first experiment served in the second. 

Apparatus. The display control apparatus was 
nodified for estimating Ss’ CFF, according to the 
sion of the psychophysical method of limits 
leveloped by von Békésy (1947). Briefly, the 
imps were made to flash in unison with a 50/50 
ight to dark ratio, and the flash frequency was 
de to vary continuously at a rate of .5 Hz/sec. 
l'his was done with a reversible servomotor which 
drove the oscillator that controlled the flicker rate, 
as well as a potentiometer which yielded a voltage 
analog of flicker frequency to drive a pen in a strip- 
chart recorder. After initial adjustment, S deter- 
mined the duration of frequency change by depres- 
sing a handheld switch which caused the servomotor 
to reverse direction after an 8-10-sec. delay. These 
responses were recorded by a brief pen deflection 
on the chart recording. 

Two channels of a Gilson polygraph (M8PM) 
recorded EEG from Grass silver subdermal elec- 
trodes placed over S's right and left occipital 
cortices. The active recording electrodes were 
referenced to linked, clip-type, silver electrodes 
attached to both earlobes. 

Task. The task consisted of pressing the response 
switch whenever the lamp’s brightness appeared 
constant after it had been flickering, or whenever 
the brightness seemed to flicker after it had been 
constant. The former response marked S's fusion 
threshold, the latter response marked his flicker 
threshold. The midpoint of a line between fusion 
and flicker thresholds on the chart recording was 
the measurement of CFF. 

Procedure. Each S participated in three sessions 
scheduled 1 wk. apart. Conditions Ci, H, and C2 
were administered in successive sessions. 

In Cı and C; the task was admi stered in four 
10-min. tests, interspaced with 5-min. rests. Re- 
cordings of EEG were made during 5-min. periods 
before the first test and after the final test. During 
all EEG recordings, 5 reclined quietly with his eyes 
closed. 

In H, S rested as before for 5 min. while his EEG 
was recorded. Next, he undertook a 10-min. version 
of the CFF test. Diathermy was then administered 
to S for 30 min., and his CFF was tested as before. 
The sequence of heating and testing was repeated 
until S's tympanic temperature had risen by 2.0°C. 
This took 90-120 min. to complete. After reaching 
the criterion temperature, S was allowed to recover 
to his normal temperature. During recovery, S’s 
CFF and EEG were measured in alternate 10- and 
5-min. periods. Recovery lasted 55-75 min. 

Eight separate CFF determinations were made in 
each 10-min. test. These were averaged to provide 
a single CFF measurement for each S in each test. 
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FiGuRE 2. Mean critical flicker frequency as a 
function of the change in tympanic temperature 
during the hypert hermia condition. 


Results 


In both control conditions, C1 and C», 
Ss mean CFF varied in a nonsignificant 
manner between about 43.5 Hz. and 44.3 
Hz. In H, mean CFF varied as a function 
of tympanic temperature as shown in 
Figure 2. 

The data in Figure 2 were obtained by 
computing the mean CFF for each .5?-C. 
interval of changing tympanic tempera- 
ture. However, all Ss were not tested 
within the same .5?-C. intervals due to 
differential body heating rates, so the mean 
values shown in Figure 2 were necessarily 
based upon a different number of observa- 
tions. The N for each mean is included in 
Figure 2 below the respective data point. 
The product-moment correlation coefficient 
between mean CFF and mean tympanic 
temperature was 7 = .18, p < .05. 

The EEG recordings from one S were 
practically devoid of alpha activity. These 
data were not used. The mean (2: SE) 
alpha frequency for the remaining 10 Ss 
was 10.7 Hz. (-.25) in both Ci and Co. 


MEAN ALPHA FREQUENCY (Hz) 


BEFORE 2 


HEATING E IN 


CHANG 
TYMPANIC TEMPERATURE (°C) 
Figure 3. Mean alpha frequency (+SE) be- 
fore hyperthermia and as a function of the change 
in tympanic temperature during recovery from 
hyperthermia, 


During H, mean (2-SE) alpha frequency 
varied with tympanic temperature as shown 
in Figure 3. 

Even before heating, the mean alpha 
frequency was higher in H than the 10.7 
Hz. of €, and Cs. There was a further 
increase upon heating the Ss, up toa maxi- 
mum mean of 11.5 Hz., followed by a 
decline as their temperatures returned to 
normal. The difference between the mean 
alpha frequency in C and the first measure- 
ment (preheating) in H approached sig- 
nificance, £ (9) = 249, 5 « .10. The dif- 
ference between mean alpha frequency in 
C, and the second measurement in H (i.e., 
during maximum hyperthermia) was sig- 
nificant, 1 (7) = 4.05, p < .01. However, 
the difference between the first and second 
measurement in H was not significant, 
L(7) = 127. 

In spite of the increase in mean alpha 
frequency during heating, no significant 
correlation was found between alpha fre- 
quency and CFF in either C, or H, The 
correlation between the changes in CFF 
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occurred with 
ire was simis 


and in alpha frequency thai 
changing tympanic temperat 
larly nonsignificant. 


EXPERI^ 


from Experi- 
the failure 


Interpretation of th: 
ment II was made di 


to control or measure lary diam- 
eters while their core t changed. 
A systematic increase į Jiameter 
with temperature mig! sed the 
observed change in ( to the 
Ferry-Porter law (B: The 
relationship between cor and 
pupillary diameter was n led by an 
extensive review of th: Ihere- 
fore Experiment Ill o 
supply the data requi ig 
whether any such rel: 
Method 

Subjects. The Ss were three 27 thy 
males. 

Apparatus, The diather: 


used to induce hyperthermi: 
complished with a Nikon I’ 
Nikkor 55 mm. F3.5 len 


measured from photographic « dial 
calipers (Mitutoyo, No. 505 empera= 
ture was measured, using th thermometer, 

Procedure, The Ss were heated toa i erature 
of 39.0°C, Heating was accomplished in 34 min,, 
depending on the S, At 15-min, intervals during 


heating, a series of three photographs of S's left 
were taken within a 30- 
criterion temper: 
and a similar se 


re 
After rc aching 
llowed 


>. peric 
ure, Ss were 
of phot« 


to recover, 
phs was taken at 


approximately 15-min. inter itil a drop of 
1.0°C. during recovery. Th of recovery 
was obtained in 1640 min ding on the; 
During photography, .Ss loc n point 
on a blank field at a distance o uminance 


at S's eye was a constant 164 cd/m?, A circular 
paper spot of known diameter was attached to S's 
lower eyelid as a calibration for pupil measurement 
from photographic enlargements, 


Results and Discussion 


Pupil diameter during heating and re- 
covery is shown in Figure 4, where each 
data point represents the mean pupil diam- 
eter of the series of three photographs. 

According to Hakerem (1967), spontane- 
ous changes in pupillary diameter under 
dim illumination are of the order of 1-1.5 
mm., and typically greater fluctuation oc- 
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curs with normal light conditions. It is 
apparent from Figure 4 that there were 
only the normal spontaneous fluctuations, 
but no systematic change in pupil diameter 
with hyperthermia. 

According to the F erry-Porter law, with 
constant luminance, an increase in pupillary 
diameter of approximately 58%, more than 
doubling the retinal illuminance, would be 
required in order to increase CFF to the 
degree recorded in Experiment ll, i.e, 4 
Hz. Because no such change in pupillary 
diameter was observed as a consequence of 
hyperthermia in Experiment Ill, it may 
be concluded that the increase in CFF was 
attributable to factors other than a change 
in pupillary diameter. 


GENERAL DISCUSSION 


The results from both Experiments I and Il 
indicate that temporal acuity increased during 
hyperthermia. Perception of succe sion and 
CFF were both facilitated by incr 
temperature. 

In Experiment I, under control conditions, 
Ss' 75% correct response level corresponded 
to an interflash interval of about 17.3 msec. 
This value will therefore be considered the 
,average threshold of succession in the present 
study. 

The average threshold of succession may be 
used to estimate the duration of the psycho- 
logical moment. An interval separating events 
by half of the duration of the moment should 
lead to a rate of correct responding that is 
50% greater than the rate expected by chance 
alone. If the first and second events fell 
within different halves of the same moment, 
they would be perceived as succes sive. Since 
the first event has an equal likelihood of falling 
any time within the moment, that particular 
pair of events would be perceived as simul- 
taneous half of the time and as successive half 
of the time. 

If S responds at the 75% correct level on a 
forced-choice basis, his threshold of succession 
is actually producing correct responses at a 
rate of 50% greater than the rate (50%) ex- 
pected by chance alone, From this, it might 
be reasoned that the threshold of succession is 
half the duration of the moment. However, 
this reasoning ignores the fact that the flashes 
used to demark the interflash interval had 
themselves a finite duration—10 msec. in this 
experiment. Neurophysiological evidence in- 
dicates that elements in the retina and cortex 
respond vigorously to both the onset and offset 
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FiGvRE 4. Pupil diameter during hyperthermia 
and recovery for three Ss. 


of a light stimulus (see the review by Gross- 
man, 1967, pp. 213-260). Presumably, the 
first neural events would arise from the onset 
of the first flash; the final neural events follow 
the offset of the second flash. Thus, the dura- 
tion of the second flash, 10 msec., should be 
added to the threshold of succession, 17.3 
msec., as an estimate of half of the duration of 
the moment. This results in a value of 54.6 
msec, as our best estimate of the S's average 
normal duration of the psychological moment 
in this experiment. 

Our estimate of the psychological moment 
is in remarkable correspondence with Kristof- 
ferson's (1966, p. 7) average estimate of 54.3 
msec, The correspondence is even more sur- 
prising in that Kristofferson's test of the per- 
ception of succession employed long-lasting 
(2 sec.) and mixed sensory (auditory and 
visual) pairs of test stimuli. 

During hyperthermia, Ss! threshold of suc- 
cession dropped to an average of 15.7 msec., 
a reduction of 1.6 msec. from control condi- 
tions, It was estimated, in the manner de- 
scribed above, that their average moment was 
shortened to 51.4 msec., à reduction of about 
6% from control conditions. 
periment II, the CFF was shown to 
© as a function of increasing body tem- 
perature. Lockhart (1971) reported similar 
changes in Ss’ CFFs as they were tested in a 
hot (49.2? C.) environment, which raised their 
mean body temperature by .6°C. The failure 
to demonstrate any systematic effect of hy- 
perthermia on pupillary diameter in Experi- 
ment [II leads us to interpret these findings 
as further evidence that temporal acuity varies 
as a function of body temperature. 
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The EEG alpha frequency also increased 
with body temperature in Experiment II, 
which supports the findings of Hoagland 
(1936). The relative changes in alpha fre- 
quency and CFF may be similar enough (a 
maximum 7% increase in alpha; a correspond- 
ing 6% increase in CFF) to warrant further 
consideration of a relationship between these 
variables. However, neither we nor Lockhart 
(1971) found any significant relationship be- 
tween changes in alpha frequency and CFF, 

It is of interest to note that the mean alpha 
frequency recorded prior to heating in Condi- 
tion H was greater than the control values. 
Though this result was only marginally signifi- 
cant (p < .10), it may indicate that anticipa- 
tion of heating accelerated Ss’ alpha frequency. 

In the present experiments, temporal acuity, 
as judged from tests of the perception of suc- 
cession and CFF, increased with core tempera- 
ture. Previous studies, cited above, have 
shown that temperature exerts a similar 
influence upon the perception of time passing. 
Both phenomena may be explained as resulting 
from a shortening of the psychological moment; 
and both may reflect the direct action of tem- 
perature on a neural pacemaker, 
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JUDGMENT OF RECENCY FOR PICTURES AND WORDS' 


GARY L. 


, TERRY C. DANIEL, AND NEIL R. BARTLETT 
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An experiment was conducted comparing judgment of recency (JOR) for line 


drawings 2 
groups of 15 5s w 
presentation rates. Upor 


id printed words representing common objects. 
> tested using either pictures or words at either of two 
an item's second occurrence, which was marked to 


In a mixed design, 


be easily distinguished as a test, Ss were required to make a judgment of 


recency, their bes 
lag length increa 


stimate of the actual lag for that item. 
ed, accuracy of rece 


It was found that as 


ncy judgment declined. More import- 


antly, an interaction was found between stimulus type and rate of presenta- 


tion. 


Words were found to be superior to pictures at a rate of 2-sec, exposure 


per stimulus, while this pattern was reversed at the 4-sec, exposure rate, The 
results are discussed in reference to a memory-strength interpretation of JOR 


and the dual-coding theory of Paivio. 


How is the judged recency of a remem- 
bered event determined? Recent accounts 
have related judged recency of an event to 
the memory strength for that event. 
Memory strength is usually assumed to be 
maximal at the time of occurrence of the 
event and then to decline over time. The 
stronger the memory trace for an event, 
the more recently it will be judged to have 
occurred. The most explicit memory- 
strength interpretation was presented by 
Hinrichs (1970; Hinrichs & Buschke, 1968), 
who extended the basic strength and de- 
cision processes proposed in models of event 
memory to the judgment of recency (JOR). 
He proposed a two-process memory-strength 
theory of JOR that described the data for 
recency judgments of letters reasonably 
well. 

A characteristic finding in both recall 
and recognition memory paradigms is that 
information presented pictorially is less 
susceptible to forgetting than is informa- 
tion presented verbally (Paivio, 1971; 
Shepard, 1967). This has been interpreted 
as indicating slower decline (decay) of 
memory strength for pictorial information 
than verbal information. Pictorial and 
verbal materials are known to produce dif- 
ME ects ye a aN, 

1 This research is based in part on a thes sub- 
mitted in partial fulfillment of the requirements for 
the master's degree in the Department of Psychology 
at the University of Arizona. 

2 Requests for reprints should be addressed to 
Neil R. Bartlett, Department of Psychology, Uni- 
versity of Arizona, Tucson, Arizona 85721. 


ferent effects in a variety of other para- 
digms as well. 

The dual-coding theory of Paivio (1971) 
distinguishes between sequential and paral- 
lel accessing of information in memory ; the 
hypothetical verbal system being suited to 
sequential processing, while the image sys- 
tem is a parallel processing system that 
organizes input spatially. The theory gen- 
erally predicts superiority of words to 
pictures in serial tasks, but negligible dif- 
ferences between pictures and words are 
expected if the pictures could have been 
easily labeled. 

Pictures have been found to be superior 
to words in serial tasks using recency para- 
digms. Yntema and Trask (1963), for 
example, found discrimination of relative 
recency (DR) of two unrelated stimuli to 
be less accurate when the items were nouns 
than when they were pictures. Using the 
same paradigm, Fo: ard (1970) presented 
pictures and words in the same sequence 
at a rate of 1.5 sec. per item. He found the 
best recency discrimination when both 
items compared were pictures. Further, 
Fozard and Weinert (1972) found superior 
JOR performance for pictures when a self- 
paced procedure was used. Even though 
Ss may have had time to verbally code 
the picture stimuli in these experiments, 
the superiority of pictures to words in these 
tasks presents some difficulty for Paivio's 
(1971) conceptualizations. 
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PICTURES 


FIGURE 1. 
a function of Rate of Presentation X Lag Length 


Mean judgment of recency (JOR) as 


treatment combination 
separately, 


for pictures and words 


The present experiment investigated judg- 
ment of recency for pictures and words. 
Picture or word stimuli were presented 
serially at either a fast (2.0 sec. per stimu- 
lus exposure) or a slow rate (4.0 sec. per 
stimulus exposure). On signaled test trials 
Ss judged the recency of occurrence of the 
indicated item. The rate factor is relevant 
to memory-strength interpretations of JOR 
(e.g., Hinrichs, 1970). The slower rate of 
presentation results in longer intervals be- 
tween items and should result in generally 
longer judgments of recency than the faster 
rate of presentation, Further, if memory 
Presented items 
time than does 

the slower rate 

of presentation should have less effect on 
items than for words. 
of Presentation, picture 
stimuli may be judged to have occurred 
more recently than word stimuli at the 
same lag. With regard to Paivio's (1971) 
hypothesis, rate of Presentation would not 
be expected to produce marked effects as 
the fastest, 2.0-sec. exposure, rate is suffi- 
ciently slow that all Ss should be able to 
code verbally the picture stimuli, 


METHOD 


Design and subjects. Two between-Ss variables, 
type of stimulus (picture or word) and stimulus 
exposure time (2 sec. or 4 sec.) were combined with 
two within-.Ss variables, trials and lag length (num- 
ber of items plus one intervening between two occur- 
rences of the same stimulus) into a. 2X2x ($ x9) 
factorial design. The Sixty Ss (30 males and 30 


females) were volunteers from introduc psy- 5 
chology courses at the Unive sity of zona. 
Groups of Ss were randomly assigned t ler a 
"picture" or a "word" stimulus condit l to 
either a 2-sec. or 4-s timulus exposur ion, 
Stimuli, Line draw of familiar obj: sted 
for prompt common naming, and their « nd- 
ing word names v photographed and : lon 
35-mm. slides. Stimuli were projected o; ar- 
projection screen by a modified Kodal sel 
(Model 801) slide projector, Random lis ich 
class of stimuli were formed with the restrict at 
no item was reused until at least 10 differ: ms 


intervened (see Hinrichs, 1970). Five subli re 
formed according to the stimulus ordering tex 


of Nickerson and Brown (1963). This o: ng 
echnique insured exactly one test at each of - ags 
1 through 9 within each su blist. These sublist re 
presented to S one after another, without pau is 
one continuous list, 
Procedure. 


ch S was instructed to mak: n 
estimate of the lag between two Occurrences of <e 
same stimulus item at the time of its second o 

ren An item's second occurrence was cl 
designated asa test. The correct recency judgm« 
they were instructed, consisted of the number 
intervening items since the most recent nont 
presentation of that item including itself, or ti 
number of intervening items plus one. The S 
sponded verbally, giving “judgments of rec 
upon each occurrence of a test. A hort practic 
session using similar stimuli was always given t 
make sure S understood the requirements of tl; 
task, 


RESULTS 


Mean JORs at each lag 
Figure 1 for all four 
tions. In spite of the 
of presentation effect for picture stimuli, 
there was a general tendency for Ss to 
underestimate the true lag at the 2.0-sec. 
rate and to overestimate the lag at the 4,0- 
sec. rate. The main effect of rate was 
statistically reliable, F (1, 56) = 4.97, p < 
-05. Inspection of Figure 1 strongly sug- 
gests that rate of presentation had differ- 
ent effects for pictures and words, but the 
Rate X Stimulus Type interaction was only 
marginal, just failing to reach the .05 level 
of significance, Apparently the lack of rate 
differences for either pictures or words at 
the short lags (one to four) prevented the 
rather dramatic differences at the longer 
lags from emerging in the analysis of vari- 
ance. This interpretation gains support 
from the significant Rate X Lag Length 
Interaction, F (8, 448) 2.19, p < .05, 


are presented in 
experimental condi- 
apparent lack of rate 


ad 


JUDGMENT OF RECENCY OF PICTURES AND WORDS 


indicating that the principal locus of rate 
effects was at the longer lags- 

There was a small but consistent ten- 
dency for judgments to become larger as a 
function of trials (experience), F (4, 224) 
= 6.27, p < 01. Judgments became larger 
on later trials, but the differences involved 
weresmall and followed nosignificant trend. 
A significant Trials X Lag Length effect 
was found, F (32, 1972) = 3.12, $ < .01. 
Again, no consistent trend was apparent in 
the data. 

Accuracy of JORs was assessed in terms 
of absolute deviation of JOR from actual 
lag. These data can be seen in Figure 1 by 
comparing mean JOR with true lag indi- 
cated by the solid line diagonal. Mean 
absolute deviations for picture and word 
stimuli at each presentation rate are also 
presented in Table 1. 

Analysis of variance revealed a signifi- 
cant Stimulus Type x Rate of Presenta- 
tion effect for mean absolute deviations 
from actual lag, F (1,56 = 5.22, p < .05. 
'The most notable difference was between 
words at the slow and fast rates. Pictures 
differed considerably less as a function of 
rate. Subsequent post hoc testing (Tukey 
HSD—Kirk, 1968) disclosed that the fast- 
word condition was significantly more ac- 
curate than the fast-picture condition. 
The slow-picture condition was significantly 
more accurate than the slow-word condi- 
tion. The fast-word and the slow-picture 


conditions were not significantly different. 


DISCUSSION 


The observed superiority of words over pic- 
tures in accuracy of JOR at the faster rate of 
presentation appears to be consistent with 
Paivio's dual-coding theory. Words arouse 
the verbal code that is crucial to performance 
on sequential tasks more readily than do 
pictures. This difference is usually restricted 
to cases where the rate of presentation is quite 
fast, reducing the possibility that pictures 
would be verbally coded. Contrary to this 
analysis, pictures were found to be superior 
to words in JOR accuracy at the slow rate of 
presentation. This finding is difficult to ex- 


„plain if verbal coding is assumed to mediate 


performance on sequential memory tasks. 
While the slow rate of presentation surely 
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TABLE 1 


'ANDARD DEVIATIONS) OF ABSOLUTE. 
TIONS FROM ACTUAL LAG 


MEANS (AND S 
DE 


Rate of presentation 


Stimulus type 


2 sec. 4 sec. 
Pictures 1.812 1.479 
(1.349) (1.555) 
Words | 1.516 1.976 
(1.316) 


(1.748) 


provided ample time for Ss to verbally label 
the pictures, it seems unlikely that the verbal 
code for the pictures was more effective in the 
JOR task than the verbal code for the words 
themselves. The availability of a verbal code 
does not appear sufficient to explain all of the 
observed differences in JOR accuracy for pic- 
tures and words. Perhaps the verbal coding 
system is preferred when rate demands are 
high, but “jmaginal” coding is more effective 
at slower rates of presentation in spite of the 
large serial processing component of the JOR 
task. 

The picture-word differences also present. 
difficulty for Hinrichs’ (1970) memory-strength 
interpretation of JOR. The slower presenta- 
tion rate did not lead to longer JORs for 
pictures as was predicted by the memory- 
strength interpretation. The slower rate ac- 
tually led to more accurate JORs than the 
faster rate. It may be that contrary to 
Hinrichs’ memory-strength hypothesis, high 
memory strength for an event need not pro- 
duce a judgment that the event occurred 
recently. Instead, high memory strength may 
produce a more accurate judgment of when the 
item occurred. The Ss presented with pictures 
in the JOR task apparently made beneficial 
use of the extra 2-sec. stimulus exposure pro- 
vided by the slow rate of presentation to over- 
come the effects of longer delays. The Ss 
presented with words apparently did not bene- 
fit sufficiently from the extra processing time 
to overcome the detrimental effects of longer 
delays. 

An extension of Morton’s (1969) “Jogogen’ 
model of memory offered by Seymour (1973) 
may prove useful in explaining picture-word 
differences as à function of rate. The model 
proposes that visual inputs for words and 
pictures are distinguished early in processing 
and undergo different coding processes. If 
the ultimate response is vocal, the route for 


' 
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words is more direct than that for pictures 
Word input, after having been graphically 
analyzed, proceeds directly to an exit node 
where it is available for vocal output. To 
reach this same node, picture input must first 
be pictorially analyzed, proceed to a pictorial 
exit node, and then be interpreted in semantic 
memory before reaching the vocal exit node. 
Because there are additional stages in the 
encoding of pictures, words would have an 
advantage over pictures in vocalization tasks 
particularly when fast rates are demanded. 
These same additional stages may contribute 
to superior performance with pictures in other 
memory tasks, 
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IN SHORT-TERM MOTOR MEMORY ! 
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Motor proactive interfe 


nce (PI), produced by requiring Ss to recall five 


linear lever displacements in reverse order after a 90-sec. retention interval, 
was completely eliminated by an instruction to forget the first four positions 
given prior to the presentation of the criterion (fifth) response and significantly 
reduced by a similar instruction given immediately before recall. The ability 
of Ss to intentionally forget prior motor responses was assumed to explain the 
inconsistent findings of short-term motor memory research, although it ap- 


peared that to-be-forgotten prior items were not completely 
since they were recalled equally as well as to-be-remembered prior items. 


“discarded,” 
The 


discussion centered on the cognitive control of motor behavior and, in par- 


ticular, on the mechanisms of set differentiation, 


selective search. 


The importance of eliminating irrelevant 
information was known to the ancients 
(Cicero, 1959, p. 427). More recently 
Bjork (1972, p. 218) has suggested that 
intentional forgetting is fundamental to the 
efficient functioning of man, since without 
it he would "degenerate to a proactive- 
interference-induced state of total con- 
fusion." Intentional or directed forgetting 
has received considerable experimental 
attention recently in the area of verbal 
short-term memory (for reviews see Bjork, 
1972; Epstein, 1972) but it has not been 
considered within the realm of motor be- 
havior. This experimentation has led to 
an increased understanding of verbal mem- 
ory and one might expect that motor 
memory research would also benefit from a 
systematic study of the effect of instruc- 
tions to forget. The present study was the 
first such investigation and, since it was 
specifically designed to test the effect of 
instructions to forget prior motor learning, 


a 
1 This paper was submitted as a PhD dissertation 
to the Physical Education Department at the 
University of Illinois. The author wishes to 
acknowledge with thanks the insightful comments 
and suggestions of committee chairman Jac 
Adams and committee members Michael J. 
Rainer Martens, and William E. Montague. 

? Requests for reprints should be sent to Leslie 
Burwitz, Department of Physical. Education and 
Recreation, Acadia University, Wolfville, Nova 
Scotia, Canada. 


selective rehearsal, and 


the results were relevant to the understand- 
ing of short-term motor memory (STMM) 
proactive interference (PI). 

The initial research concerned with the 
effect of prior motor learning on the reten- 
tion of a motor response was conducted by 
Adams and Dijkstra (1966), A Peterson- 
Perterson (Peterson & Peterson, 1959) 
paradigm was employed whereby S was 
required to learn and recall seven different 
linear displacements in turn and, in con- 
trast to the results of verbal short-term 
memory (STM) research (Keppel & Under- 
wood, 1962) there was no evidence of PI 
over trials, The null PI finding stimulated 
further research. and, although Montague 
and Hillix (1968) and Schmidt and Ascoli 
(1970) reached essentially the same con- 
clusion, Ascoli and Schmidt (1969), Stel- 
mach (1969), and Williams (1971) reported 
conflicting evidence, in that criterion (final) 
response recall was indirectly related to the 
number of prior motor learning move- 
ments. The initial motivation for the 
present research was a desire to resolve 
this inconsistency, which appeared to be a 
function of procedural differences. That 
is, the studies that found no evidence of 
PI used a Peterson-Peterson learn-recall 
paradigm — (single-trial design), whereas 
those that were successful in their efforts 
to produce PI required S to learn and 
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recall in reverse order a number of different 
movements within one trial. 

The reverse order or multiple-trial design 
was introduced by Ascoli and Schmidt 
(1969) to permit the inclusion of a number 
of motor items into STM very rapidly. 
Ascoli and Schmidt reasoned, on the basis 
of a verbal STM study conducted by Bjork, 
LaBerge, and Legrand (1968), that, be- 
cause verbal PI is reduced by an instruc- 
tion to forget previous items, prior motor 
learning may not interfere with a criterion 
response if S is allowed to "discard" pre- 
vious traces from STM. They suggested 
that motor PI is not produced in a single- 
trial design because S is allowed to inten- 
tionally forget prior motor learning. To 
some, this argument may seem irrational. 
Alter all, there is ample evidence that 
directed forgetting affects verbal learning 
and yet PI is produced in single-trial design 
experiments in verbal learning (Keppel & 
Underwood, 1962). ‘The difference between 
the findings of motor and verbal single- 
trial design experiments indicates, however, 
that there is an inherent danger in consider- 
ing that one set of principles governs be- 
havior in both domains. The single-trial 
verbal STM studies generally test the re- 
tention of trigrams (three items) on a 
dichotomous (right or wrong) scale, while 
the single-trial motor STM experiments use 
only single items and measure retention on 
a continuous (graded in millimeters or 
iin. units) scale. Thus, the verbal and 
motor single-trial experiments are not 
identical and an explanation that does not 
account for the data in one domain may 
very possibly explain behavior in the other 
area, 

The ability of S to intentionally forget 
prior motor learning is unknown but, if it 
is possible, one would appear to be justified 
in concluding that the inconsistent STMM 
PI findings are a function of procedural 
differences which permit or restrict inten- 
tional forgetting of prior learning. That is, 
if S can make use of a forget instruction to 
eliminate motor PI, one would not expect 
to find PI ina single-trial design, where S 
is allowed to learn, recall, and completely 
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forget each response prior to the pres. ʻa- 
tion of the next distance, whereas one » ild 
expect to find PI in a multiple-trial dee in, 
because S is required to remember all is- 


placements in reverse order and is give: no 
opportunity to forget prior learning. 


The underlying assumption of the pre: nt 
research was that motor PI, which o s 
in a multiple-trial design, is a manife a- 


tion of cognitive processes, which o ur 
consequent with or subsequent to encod 

To test this hypothesis the mechanis s, 
which have been proposed to expl n 
directed forgetting in verbal STM, w 
considered to determine which, if any, w 
relevant to the study of motor men 
The six mechanisms, which were subjec 
to analysis, have been referred to 
erasure, set differentiation, selective 
hearsal (Bjork et al, 1968), recall lo. 
(Reed, 1970) or anticipatory recycli: n 
(Epstein, 1969), initial registration (Rec, 
1970), and selective search (Epstein, 1972 
Support for any of the hypotheses 
assumed to lend credence to the view th: 
motor behavior is controlled by cognitis 
processes, 


Mrtuop 
Experimental Design 


Thirty Ss were randomly assigned to each of tho 
10 independent treatments of a 2 X 5 factorial de 
sign. The two levels of the first factor involved 
retention intervals of 5 and 90 sec. The five levels 
of the second factor were referred to as PI, forget 
late (FL), forget early (FE; and FE;), and con- 
trol (CO). The PI treatment involved four prior 
learning (PL) responses, a criterion response, and an 
instruction to remember all five responses in reverse 
order at recall. The FL group received four PL, 
a criterion response, and an instruction to forget the 
st four respons nted immediately prior to 
recall. FE treatment groups were given 
four PL, a criterion respouse, and an instruction to 
forget the first four responses given prior to the 
presentation of the criterion. The FE, treatment 
group was reminded immediately prior to recall to 
reproduce only the criterion response, whereas in the 
FE, condition 5s were instructed to recall all five 
responses in reverse order at that time. I "arning 
and subsequent recall of only one response acted as 
CO. Absolute and algebraic error scores were com- 
puted for the criterion response and for the prior 
learning (first four) responses recalled in the P] and 
FE; conditions. 


PI AND DIRECTED FORGETTING IN SHORT-TERM MOTOR MEMORY 


Apparatus 


‘The task employed involved linear lever-position- 
ing. The Ss were able to hear movements of the 
almost frictionless lever but they were restricted 
from viewing the apparatus by means of opaque 
goggles, The apparatus consisted of two 72-cm. 
long Thomson case-hardened steel rods, which were 
mounted at their ends into aluminum blocks so 
that the rods were horizontal, parallel, and 5-cm. 
apart. The aluminum blocks were secured onto a 
78 X22 X1 cm. aluminum base plate, which was 
in turn screwed to à wooden table of normal height. 
A slide consisting of four Thomson ball-bushing 
sleeves (cach 3-cm. Jong), an aluminum plate, and 
a steel handle (8-cm. high) moved freely along the 
trackway with a range of 0-47 cm, Eight holes, 
located in the baseboard, accepted a steel pin, which 
effectively arrested carriage movements at 5, 10, 15, 
20, 25, 30, 35, and 40 em. The criterion distance 
was 20 em. and a scale marked in millimeters was 
used by E to measure error indicated by a fine 
pointer. 


Experimental Procedure 


On entering the experimental room, 5 was seated 
in front of the apparatus so that his right shoulder 
was opposite and a full arm's length from the right- 
hand edge of the carriage. A demonstration (47 
cm.) and a standard set of instructions were given. 
The instructions informed S that he would either 
have to make one or five movements down to a stop 
and then, after a rest, he would have to reproduce 
some or all of the movements without the assistance 
of the stop. The Ss were told that they would not 
be given jnstructions as to which movements they 
would have to remember until later in the experi- 
ment and they were informed that the best pro- 
cedure was to try to remember every position until 
they heard otherwise. 

Ón the command "grasp" S took hold of the 
handle and waited for the instruction “move.” 
On this signal S moved the handle at a moderate, 
deliberate pace from right to left until he hit the 
stop. He maintained his grasp on the handle for a 
period of 3 sec, at which time he was instructed te 
"relax," which meant he was to place his hand on a 
pad located on the table in front of him The s 
quence of events described above defined a learning 
trial and for Ss in the CO treatment the retention 
interval followed. After the retention interval S 
was given recall instructions. The commands 
“regrasp” and “estimate” acted as signals for S to 
move the lever to where he thought it had been 
during learning. 

If S received PL, he made four movements before 
the sequence described above. The prior positions 
were 5 or 10 cm. longer or shorter than the criterion 
distance of 20 cm. The number of possible permuta- 
tions of previous positions was 24, The order of 
previous positions was randomized but each of the 
24 permutations occurred approximately as often. 
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The intertrial interval was 10 sec. The stop, de- 
fining the distance of each movement, was posi- 
tioned by E, and he was also responsible for all 
return movements of the handle to the start position. 

The forget instruction, given in the FE;, FEs, 
and FL treatments, informed S that he was to 
completely erase the first four responses and con- 
centrate only on the last (criterion) response. The 
FL instruction was given in a S-see, period immedi- 
ately prior to recall, and FE instructions were pre- 
sented in the last 5 sec. of the 10-sec. intertrial 
interval between the fourth and the criterion re- 
sponse. The recall instructions were presented in à 
5-sec, period immediately prior to recall. The PI 
and FE; Ss were told to recall all five responses in 
reverse order (10-sec. intertrial interval), FL and 
FE; Ss were instructed to recall only the criterion 
response, and CO Ss, to reproduce their one position. 
Following the final recall trial, a postexperimental 
interview was conducted to determine what strategy 
S used, what he thought he did with recall and 
forget items, and what effect the forget instruction 
had had. 


Subjects 


The Ss were 300 (153 male, 147 female) right- 
handed volunteers (average age, 19 yr., 9 mo.) from 
the general student body at Acadia University, 
Nova Scotia. There were approximately an equal 
number of males and females in each treatment, the 
largest discrepancy being an overload of 16 to 14 in 
either direction. 


RESULTS 
Absolute Error 


Criterion response. The analysis of ab- 
solute error indicated that Ss were on aver- 
age 2.93 cm. off target. There was no 
difference between the performance of 
males (3.00) and females (2.88). The 
Retention Interval X Treatments (2X5) 
analysis revealed that the retention interval 
and treatment main effects were significant, 
‘F (1, 290) = 37.05, p < .001, and F (4, 290) 
=8.97, p < .001, respectively. The Ss re- 
called more accurately at 5 sec. (2.21) than 
after a 90-sec. retention interval (3.57), and 
a Newman-Keuls test indicated that both 
the FL (3.32) and PI (3.99) treatments 
recalled significantly (FL at .05 level; PI 
at .01 level) less than CO (2.23), FE: 
(2.37), and FEs (2.55) conditions. None 
of the other differences was significant. 

The Retention Interval X Treatment in- 
teraction was also significant, F (4, 290) = 
3.34, p = .01. Figure 1 graphically illus- 
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Fıcure 1. Criterion response: Retention Interval 
X Treatment interaction. (Abbreviations; PI = 
proactive interference, FL — forget late, FE, = 


lorget early, remember one, FE; — forget early, 
remember five, and CO — control.) 


trates the interaction and a post hoc 
Newman-Keuls test indicated that signifi- 
cant recall decrements occurred 3-90 sec. 
under PI (p < .01) and FL (b < .05) but 
not under FE; FE;, and CO. There were 
no differences between the five treatments 
at 5 sec., but at 90 sec. PI recall (5.44) 
was significantly (p < .01) worse than FL 
(4.00), FE; (3.12), FE, (2.69), and CO 
(2.60) recall. In addition, FL recall was 
significantly (p < .05) worse than that of 
the FE; and CO treatments. 

Prior learming responses. Four treat- 
ments (PI at 5 and 90 sec. and FE; at 5 
and 90 sec.) contributed data to the Treat- 
ment X Retention Interval X Trials (PI, 
FE; X 5, 90 sec. X 1-4) repeated measures 
analysis. The treatment and retention 
interval main effects were not significant, 
F (1, 116) = 2.79, p > .05, and F (1, 116) 
<1, respectively. There was no difference 
between PL recall of Ss w 
to remember the first four responses 
(PI = 5.10) and those who had been in- 


Structed to forget prior positions (FE; — 
6.28). 


ho were trying 
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The main effect for trials was sign’ ant, 
conservative F* (1, 116) = 4.76, ; .05. 
A post hoc Newman-Keuls test in ited 
that Trial 1 (4.34) was significantl fer- 
ent from Trial 2 (5.80, .05 level), ^. al 3 
(6.48, .01 level), and Trial 4 (6.1  .05 
level). 

The Treatment X Retention Inte: | X 
Trials analysis indicated that nonc the 
interactions was important, with t! ex- 
ception of Retention Interval X als, 
which was marginally significant, cons “va- 
tive F (1, 116) = 3.02, p < .10. A ost 


hoc Newman-Keuls test revealed that 
of the eight cells differed significantly 
each other (all ps > .05). 


ne 
nm 


Algebraic Error 


Criterion response. The analysis of 
algebraic error data indicated that Ss 
tended to undershoot the criterion « y 
slightly (—.11 cm.). There was no dil 
ence between the performance of mia! 
(.03) and females (—.25). The Retenti 


mn 
Interval X Treatment (2 X 5) analysis re- 
vealed no significant effects (all Fs < 1). 
Prior learning responses. The Treatment 
X Retention Interval X Trials (2X2 x^) 
repeated measures analysis revealed a slight 
tendency to overshoot (1.9 cm.) on Trials 


1-4. "The treatment, retention interval, 
and trials main effects were not significant, 
F (1, 116) < 1, F (1, 116) = 1.62, p> .10, 
and conservative F (1, 116) = 1.57, p> 
.10, respectively, and neither were any of 
the interactions. 


Discussion 


The directed forgetting paradigm employed 
in the present study contributed in a general 
and positive way to the understanding of 
motor memory. The results support the 
hypothesis that the inconsistency in STMM PI 
research is a function of procedural differences, 
which either permit or restrict intentional for- 
getting of prior learning. Proactive inter- 
ference was produced by using a multiple- 
trial design, where S was not allowed to forget 
the first four responses since he was required to 
SISTED EE 


* The conservative F test was used for within-Ss 


data to guard against probable positive bias of 
standard F ratios, 
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reproduce them after the criterion response at 
recall. ‘The effect of PL was completely elimi- 
nated by an early instruction to forget the 
first four positions (FE) as well as being sub- 
stantially reduced by a similar instruction not 
presented until immediately prior to recall 
(FL). Thus, Ascoli and Schmidt (1969), 
Stelmach (1969), and Williams (1971) were 
able to produce PI by adopting à multiple- 
trial design, which restricts intentional for- 
getting of prior learning, while Adams and 
Dijkstra (1966), Montague and Hillix (1968), 
and Schmidt and Ascoli (1970) found no evi- 
dence of PI because they used a single-trial 
design that contained an implicit cue to forget 
previous learning prior to the presentation of 
each criterion response. 

The PI reduction following forget instruc- 
tions indicates that motor PI is not a con- 
sequence of prior motor activity per se 
Motor PI was not affected by the mere act of 
responding to or encoding prior movements 
but by operations, which can be assumed to 
be cognitively based, since they occurred con- 
tiguous with and/or subsequent to encoding. 
Verbal researchers have referred to these 
operations as erasure, set differentiation, selec- 
tive rehearsal (Bjork et al., 1968), recall load 
(Reed, 1970) or anticipatory recycling (Ep- 
stein, 1969), initial registration (Reed, 1970), 
and selective search (Epstein, 1972). The 
present study was designed to determine which, 
if any, of the above mechanisms was relevant 
to the study of motor behavior and, although 
further research is necessary to substantiate 
the findings, the interaction data indicated 
that the mechanisms of selective rehearsal, set 
differentiation, and selective search were the 
most influential. 

Analysis of the interaction revealed that the 
FE; FEs and FL treatments. reduced PI, 
that there were no differences between any 
of the five treatments at 5 sec., and that there 
was a significant difference between FE, and 
FL conditions The largest reduction in PI 
occurred under FEr. Each of the various 
hypotheses is able to account for the reduction. 
The forget early cue may have allowed S to (a) 
completely “discard” prior learning (erasure), 
(b) tag or organize recall items in such a 
way as to separate them from forget items 
(set differentiation), (c) concentrate rehearsal 
time on the criterion response (selective re- 
hearsal), (d) recall the criterion without hav- 
ing to recycle prior learning for subsequent 
recall (anticipatory recycling), (e) properly 
encode the criterion at the time of its presenta- 
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tion (initial registration), and/or (f) reduce the 
size of the search set from which the criterion 
was to be recalled (selective search). 

The second largest reduction in PI occurred 
under FE; The finding discriminates be- 
tween the six hypotheses offered to explain 
directed forgetting. The erasure, set differ- 
entiation, selective rehearsal, and initial regis- 
tration notions are able to account for the 
difference but the result contradicts the pre- 
dictions of the anticipatory recycling and 
selective search hypotheses. The anticipatory 
recycling hypothesis postulates that having to 
recycle other material for subsequent recall is 
detrimental to criterion response recall. Thus, 
it predicts no difference between FE; and PI, 
because both conditions required S to recycle 
four prior learning responses during criterion 
response recall. One might argue that the 
first four responses were stronger under PI 
because FE; Ss were instructed to forget prior 
learning and, although this argument seems 
plausible, it is undermined by the fact that 
the first four positions were recalled with equal 
facility under the two conditions. Apparently, 
the to-be-forgotten items did not interfere at 
recall in the manner outlined by the antici- 
patory recycling hypothesis despite being 
strong enough to do so. Additional evidence 
against the recycling notion was provided by 
the nonsignificant difference between FE; and 
FE, Evidently the size of the recall load did 
not affect directed forgetting. 

The nonsignificant difference between the 
first four responses under FE; and PI provides 
strong evidence against the erasure hypothesis 
and support for the set differentiation and 
selective search notions. The to-be-forgotten 
positions were not completely "discarded" but 
rather they appeared to be stored separately 
(set differentiation) and recalled equally as 
well as the to-be-remembered items, when 5 
was aware within which set to search (selective 
search). 

The set differentiation and selective search 
mechanisms are considered by Epstein (1972) 
as necessary conditions for directed forgetting 
in verbal learning. The present data indicate 
that there is no reason to doubt that partition- 
ing the input is fundamental to directed for- 
getting in motor learning but the selective 
search mechanism does not appear to be as 
vital within the realm of motor behavior. 
In verbal learning, Epstein, Massaro, and 
Wilder (1972) and Shebiske and Epstein 
(1972) report that forget instructions are only 
effective when there is a difference in search-set 
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size. In the present study, search-set size 
was equated using a procedure much the same 
as that employed by Shebilske and Epstein, 
and the difference between the FE; and PI 
treatments indicates that forget instructions 
reduce motor PI in the absence of search-set 
differences. The finding contradicts the 
search-set hypothesis and suggests that the 
selective search mechanism may be a neces- 
sary condition for directed forgetting in verbal 
learning but not motor learning. 

Of the four hypotheses capable of explaining 
the reduction of PI under FE;, the erasure 
notion appears to be the weakest, since forget 
items were not completely "discarded." The 
initial registration, selective rehearsal, and set 
differentiation explanations are considered be- 
low. The initial registration hypothesis postu- 
lates that prior motor learning impairs criterion 
response acquisition and that this, rather than 
interference, causes the PI effect. The hypoth- 
esis is able to account for the reduction of PI 
under the two forget early conditions but it 
cannot explain the fact that none of the treat- 
ments differed significantly from each other at 
5-sec. recall. If one assumes that immediate 
recall (5-sec.) reflects original learning, the 
equality of the conditions at this point in time 
indicates that prior learning does not influence 
criterion response acquisition. In addition, 
the enhanced encoding position is unable to 
explain the reduction of PI following the FL 
cue. The difference between PI and FL treat- 
ments contradicts the idea that PI reduction 
is a consequence of an event that precedes the 
presentation of the criterion response. 

The difference between the PI and FL treat- 
ments also proves to be difficult for the selec- 
tive rehearsal notion to explain. The hypoth- 
esis postulates that PI is a consequence of 
divided rehearsal time. Having to share re- 
hearsal attention over the retention interval 
is assumed to reduce the probability of the 
correct response being accurately reproduced, 
The FL and PI treatments had essentially the 
same rehearsal opportunities and consequently 
one would expect there to be no difference be- 
tween them. A proponent of the rehearsa] 
position might argue that the difference be- 
tween PI and FL treatments could be attribu- 
ted to the extra few seconds of rehearsal time 
FL Ss had between the FL cue and recall. 
This possibility, although it cannot be dis- 
missed, seems unlikely as a few seconds would 
hardly appear to be long enough to produce 
such a difference. 
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The significant reduction of PI ut 
can be explained by the erasure, antic 
recycling, selective search, and set diff 
tion hypotheses. The first three expla 
have been discussed above and wea 


sses 
have been pointed out in each. Theset Ter- 
entiation hypothesis is also unable to lain 
allthedata. Thesignificant difference b cen 
FE; and FL conditions contradicts k's 


(1970) suggestion that the ability to or 


organize recall and forget items is una! ted 
by the placement of the forget cue. Tl: nd- 
ing does not necessarily contradict tho dea 
that partitioning of the input must be as- ned 
in any account of directed forgetting (Ep vin, 
1972), but it does indicate that FE; Ss © her 
originally learn the criterion response | ter 
than FL Ss (initial registration) or that ey 
are better able to retain the criterion res) ise 
because they are allowed to concentrate ir 
rehearsal time on that response rather in 


having to divide it between five different i- 
tions (selective rehearsal). The differen: in 
initial registration was not apparent at 5 : c. 
and, although any difference in original le. a- 
ing may interact with the length of the re n- 
tion interval, it would appear that the select: ve 
rehearsal hypothesis is better able to explain 
the difference between FE; and FL. Further 
research is obviously necessary to substantiate 
this point. 

It is clear from the above discussion that, 
when S is instructed to forget a number of 
prior motor responses, he responds by invoking 
more than one of the mechanisms described 
above. The set differentiation, selective search 
and selective rehearsal operations were con- 
sidered the most important, as, in combination, 
they were able to explain all of the data with- 
out the assistance of the erasure, anticipatory 
recycling, and initial registration hypotheses. 
In addition, the latter three hypotheses re- 
ceived no direct support as forget items were 
not completely forgotten, an increased recall 
load did not prove to be detrimental to recall, 
and there were no differences between the 
five treatments at 5 sec. 

In conclusion, the present study indicates 
that the inconsistent findings in STMM PI 
research are a function of procedural differ- 
ences which either permit or restrict inten- 
tional forgetting of prior learning. That is, 
motor PI is produced not merely by the occur- 
rence of prior learning but by requiring S to 
remember five movements and reproduce them 
all in reverse order after a 90-sec. retention 
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nterval. It appears that, when S is able to 
forget prior learning, he voluntarily sets aside 
prior, potentially interfering responses, selec- 
tively rehearses the criterion response, an 
finally, selectively. searches either the recall or 
forget set to find the appropriate response, 
which is subsequently accurately reproduced 
regardless of which store it is located in. Thus, 
the mechanisms that operate in motor memory 
are assumed to be cognitively oriented and, 
although this may seem dubious to those who 
view the mental and motor domains as com- 
pletely separate, there are others (Adams, 
1971) who appreciate that higher order 
processes are involved in even the simplest 
motor behaviors. 
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REGRESSION EFFECT AND INDIVIDUAL POWER FUNC 


IONS 


OVER SESSIONS! 
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University of Notre Dame 


Stevens attributed individual differences in power function exponents mainly 


to differences in “the regression effect 
gression may largely do away with individual differences, 


and suggested that balancing for r« 
Cross-modality 


matches of loudness to apparent time duration, and vice versa, were repeated 


over six sessions for each of 20 Ss with 24 hr, between sessions. 
of variance, Kendall's W, and significant 
positive correlations across sessions, occurred before 
balances. Group exponents for duration matches inc: 


differences, indicated by analysi 


Individua 


and after regressio: 
ed over sessions, but 


those for loudness matches and those obtained by regression balance did no: 


change significantly. 


The development of direct scaling tech- 
niques, e.g, magnitude estimation and 
cross-modality matching, has shown that 
sensory magnitude grows as a power func- 
tion of stimulus intensity when data are 
averaged for a group of Ss (Stevens, 1961, 
1971). In addition, there is evidence that 
the power law may provide an adequate 
description of the data of individual .Ss 
(Marks & Stevens, 1966; Stevens & 
Guirao, 1964), although some investigators 
(Luce & Mo, 1965; Pradhan & Hoffman, 
1963) find the description a poor one. 
Even when individual data yield power 
functions, exponents vary widely across Ss, 
These individual differences appear to be 
reliable effects, since when a group of Ss 
repeats a task like magnitude estimation, 
Ss’ later exponents are correlated with their 
earlier ones (Rule, 1966). 

Luce (1972), noting the wide variation 
in exponents, has doubted the possibility 
of finding invariant relations among mea- 
sures in psychophysics as well as the de- 
velopment of a “coherent system of units 
as has been done in physics [p. 100].” 
He concludes that individual differences do 
not allow the specification of a unit of sen- 
sory magnitude apart from the individual. 

1 Based on a master's thesis completed by the 
first author at the University of Notre Dame. Also, 
presented as a paper at the meeting of the Mid- 


porieen Psychological Association, Chicago, May 
1973. 


? Requests for reprints should be sent to William 
E. Dawson, Department of Psychology, University 
of Notre Dame, Notre Dame, Indiana 46556. 


Various explanations for yncratic 


exponents have been offered. (1966) 
and Rule and Markley (1971 rpreted 
them as reflecting consistent individual re- 
sponse strategies. "They feel, > others 
(Jones & Marcus, 1961), that tend to 


use a characteristic range of the variable 
under their control with some using a 3 
narrow range and others more extended 
ones. Ekman, Hosman, Lindman, Ljung- 
berg, and Akesson (1968) pointed to differ- 
ences in perceptual sensitivity as well as to 
response biases when not all of thcir vari- 
ance due to Ss was accounted for by the 
latter. 

Teghtsoonian and Teghtsoonian (1971) 
have suggested that session-to-session cor- 
relations are transitory and are mainly due 
“to chance factors when the constraints 
introduced by repeated judging are mini- 
mized [p. 149]." They find that such 
correlations are not significan: inter- 
session intervals greater than 24 hr. and 
argue that a sensory or judgmental trait 
of individual Ss would not disappear in this 
manner. 

Stevens (1971) suggested that individual 
differences are due largely to the regression 
effect (Stevens & Greenbaum, 1966), This 
effect is a tendency for the person to use a 
shortened range of the variable under his 
control, whether it be the number con- 
tinuum as in magnitude estimation, or a 
physical continuum as in cross-modality 
matching. The shortening takes place 
about the average value of the variable 
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inder control. Such a tendency to "re- 
gress" toward the mean value of a series 
of controlled stimuli was first noted by 
Hollingworth (1910). 

Because of this effect, when one con- 
tinuum is matched to another, two differ- 
ent power functions are obtained depending 
on which is the adjusted variable. Each of 
these functions is a straight line in a log-log 
plot but one has a steeper slope than the 
other. To present both functions on the 
same plot, the ordinate and the abscissa 
are interchanged for that curve obtained 
by adjusting the variable represented on 
the abscissa. This necessitates drawing a 
line with a slope equal to the reciprocal of 
the exponent of the regularly fitted power 
function. Indow and Stevens (1966) have 
argued that the most representative func- 
tion that describes the relation between the 
two continua is a power function with an 
exponent that is the geometric mean of 
the exponents for the two power functions 
in such a plot. Such an exponent is con- 
sidered balanced for the regression effect 
and is obtained directly as the geometric 
mean of the exponent for the ordinate 
continuum and the reciprocal of the ex- 
ponent for the abscissa continuum. 

No one to date has thoroughly investi- 
gated whether balancing for regression will 
do away with reliable differences in ex- 
ponents. The present research attempts to 
answer this question, and furthermore it 
examines whether or not these effects are 
stable with repetition of the scaling task 
over several sessions. Stevens (1971) re- 
ported that on some Occasions exponents 
may change with repetition. Teghtsoonian 
and Teghtsoonian (1971) found that ex- 
ponents for apparent length and apparent 
area were steady over five sessions and 
Mitchell and Gregson (1971) reported in- 
dividual differences in olfactory exponents 
that were stable over four sessions. Other- 
wise, not much has been reported about 
repetition effects for more than two 
sessions. 

‘Two questions were of primary focus: (a) 
Do consistent individual differences occur 
across sessions? If so, do these differences 
disappear when the exponents are balanced 
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for regression in judgments? (b) Are 
group exponents stable over repeated ses- 
sions or do they change with repetition ? 
If they change, is this change symmetrical 
for both tasks, i.e., does it disappear when 
the regression effect is balanced out using 
the geometric mean? 

Apparent time duration and loudness 
were chosen as modalities primarily for the 
reproducible regression effects that have 
been obtained with them previously. Cross- 
modality matches were undertaken in 
which loudness was adjusted to match 
given time intervals and, in turn, apparent 
time duration was adjusted to match given 
levels of loudness. 


METHOD 


Subjects. The Ss were 20 naive volunteers, 16 of 
whom were undergraduate or graduate students 
attending summer school at the University of Notre 
Dame. Three Ss were wives of graduate students 
and 1 was an undergraduate not attending summer 
school, Ten men and 10 women participated. 

Apparatus. For matches of loudness to given 
time intervals, an audio oscillator produced a 1,000- 
Hz. tone whose output level was controlled by a 
variable attenuator and a sone potent iometer 
(Poulton & Stevens, 1955) in series with a pair of 
earphones. Each S listened to the tone while ad- 
justing the potentiometer for the appropriate loud- 
ness, The phones were calibrated so that the voltage 
measured across them could be converted to sound 
pressure readings in dynes/cm*. Eleven intervals 
were used: .30, .42, .60, .84, 1,20, 1.70, 2.40, 3.40, 
4.80, 6.80, and 9.60 sec. Time durations were con- 
trolled by a Hunter timer, which lighted a small 
white lamp for the appropriate interval. 

When apparent duration was the adjusted vari- 
able, S turned on the lamp for the desired duration 
by pressing a. key, and the duration was recorded 
by a Standard timer. "The levels of loudness matched 
were produced by 11 discrete sound pressure levels 
(re .0002 dynes/cm?), which varied 40-90 db. in 
5-db. steps. 

Procedure. Each S participated in six sessions 
scheduled at the same time cach day over 6 con- 
secutive days. In a session, S matched both dura- 
tion to loudness and loudness to duration with half 
of the Ss first making all the time matches and then 
all of the loudness ones. The remaining half began 
by adjusting loudn and then continued with 
duration. The order in which each modality served 
as the matched variable was randomly assigned to 
both Ss and sessions under the constraints that, for 
any S, half of the sessions began with duration and, 
for any session, half of the Ss began with duration. 
Within any session, the stimuli were presented in 
random order with S making only one match to each 
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(SEC) 


DURATION 


TIME 


40 50 60 70 80 90 
SOUND PRESSURE LEVEL 
FIGURE 1, Cross-modality matches of apparent 


duration to loudness (circles), and vice versa 
(triangles), for the data pooled over sessions and Ss, 


stimulus level. For each part of each session an 
appropriate set of instructions was read by S before 
he began, 


RESULTS 


Geometric means of the matches made 
to each stimulus were computed for the data 
pooled over (a) Ss and sessions, (b) Ss for 
each session, and (c) sessions for each S 
Power functions were then fitted to each 
of these sets of geometric means for both 
duration and loudness adjustments. Time 
and sound pressure, respectively, were the 
independent variables in the equations 
fitted. In addition, power functions were 
fitted to each S's matches for each session. 

Group data, Figure 1 shows that the 
group data combined over sessions is de- 
scribed well by power functions for both 
cross-modality matches. The duration 
exponent (DE), for the matches where 
time was adjusted, is indicated by a slope 
of .45. The loudness exponent (LE) was 
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1.24, and hence the figure gives reci- 
procal of this value, .81, for the sl. vhere 
loudness is the adjusted variable. sence 
of a regression effect is indicate: the 
difference in slope for the two ches, 
The regression angle (RA) between — .e two 
fitted lines amounts to .259 radias Tak- 
ing the geometric mean of .45 and gives 
-60 as the balanced exponent (BE 
Previous studies place the exp t for 
apparent duration at 1.10 (Steven- 1961) 
or, more recently, at 1.03 (Stevens & reen- 
baum, 1966). For loudness, exp ients. 
from .60 (Stevens, 1961) to .68 (S ens, 
1969) have been reported. Thus, ti. pre- 
dicted slope of the equisensation fur tion 
lies in the interval from ,60 1.10, or .- 5, to 
-68/1.03, or .66. If values of .64 for oud- 
ness and 1.03 for duration are take: the 
predicted slope for the cross-m« lity 


match is .62, a value only slightly | ger 
than the .60 obtained as the BE. 

Analyses of the data for each session are 
summarized in Table 1. The DEs show an 
orderly increase over sessions, but the LE 
are more stable. Again, most of the Es 
lie in the interval .55—66, Figure 2 gives 
plots of the geometric means of the matches 
for each session, and these results are well 
described by power functions as indicated 
by the straight lines fitted to the datum 
points, 

Individual data. Random samples of the 
results of individual Ss are plotted in 
Figures 3 and 4. The results, combined 
over sessions, for six Ss chosen at random 
are given in Figure 3; those for six Ss in 
single sessions are given in Figure 4. The 
individuals vary considerably in their 
manner of responding in the cross-modality 
matches. Data combined over sessions 


TABLE 1 


SESSION MEANS FOR THE FOUR MEASURES 


Session 
Measure " -— 
1 2 3 4 EY rors M 
Duration exponents 38 44 43 45 49 45 
Loudness exponents 1.21 1.33 120 122 124 121 
Balanced exponents .59 -60 ‘61 ‘02 ‘66 62 
Regression angles (in radians) -36 27 32 28 26 29 
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rent duration to loudness 
(All pairs of curves, ex- 


cept the upper left, have been shifted to'the right or downward for presentation.) 


tend to approximate a power function 
better than data for one session. 

Balanced exponents for each S in each 
session were obtained by taking the geo- 
metric mean of the appropriate duration 
and inverse loudness exponents (ILEs). 
In addition, for the same 120 combinations 
of session and S, RAs were computed. 
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Changes over sessions. To assess the 
stability of the various measures, à Treat- 
ments X Ss analysis of variance was ap- 
plied to each matrix of 120 measures. Thus, 
four analyses in all were conducted, one 
each for DEs, LEs, BEs, and RAs. The 
sessions effect was significant when dura- 
tion was adjusted, F (5, 95) = 5.56, p < 


100 120 140 160 


PRESSURE LEVEL 


duration to loudness (circles), and vice 


versa (triangles), for each of six randomly selected Ss with their data pooled 


over sessions. 


(All pairs of curves, except the upper left, 


have been shifted to 


the right or downward for presentation.) 
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.001, but not when loudness was, 
—.862, p > .05. The BEs for the group 
data tended to increase over Sessions but 
not significantly, F (5, 95) = 2.14, .05 <p 
<.1. On the other hand, RAs changed 
reliably, F (5, 95) = 3.62, p < .0t. 
Additional comparisons were undertaken 
for those measures where à sessions effect 
occurred, Duncan's new multiple range 
test indicated that the DE for Session 1 
was significantly less than those for Sessions 
5 and 6, but other comparisons of session 
means yielded no difference. With RA 
means, Session 1 was greater than Session 
6, but other comparisons were not signifi- 
cantly different, 
Individual differences, 
nonadditivity (see My 
ried out for e 
Nonadditivity 


F (5, 95) 


Tukey's test for 
ers, 1972) was car- 
ach of the four measures, 
effects were not significant 
with F (1, 94) = 2,98, 1.54, 2.47, and «1 
for DEs, LEs, BEs, and RAs, respectively. 
Thus, it was permissible to test the Ss 
effect for each of the measures. [n all 
cases this effect was Significant at the .001 
level: DEs F (19, 95) = 9.73; LEs, 
F (19, 95) 213.18; BEs, F (19, 95) 212.87; 
and RAs, F (19, 95) = 10.29. 
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The fact that the Tukey tests found the 
additive model to be appropriate implies 
that Ss were not differentially affected by 
sessions. In order to examine the consis- 
tency of S differences over sessions, Ken- 
dall's coefficient of concordance was calcu- 
lated for each of the four measures, All 
were significant at the .001 level with 
W (19) = .64, .74, .72, and .67 for DEs, 
LEs, BEs, and RAs, respectively, Similar 
evidence for persisting individual differ- 
ences was given by intersession product- 
moment correlations (df = 18), all of which 
were positive. For DEs, all 15 possible rs 
were significant at the .05 level using a one- 
tailed test and all but 2 were so at the .025 
level. With LEs, all rs were significant 
at the .01 level, while for BEs, all were re- 
liable at the .05 level with all but 2 being 
at the .005 level. The rs for RA were all 
significant at the .05 level, and all but 2 
were at the .025 level. Table 2 presents 
the intersession correlations for successive 
pairs of sessions, The Session 1 vs. Session 
2 values tend to be lower than succeeding 
correlations. 

Correlational analysis was also used to 
examine further Stevens's contention that 


REGRESSION AND INDIVIDUAL DIFFERE 


individual exponents may be mainly due to 
regression. Under Stevens's hypothesis, a 
negative correlation is expected between 
individual duration exponents and their 
orresponding inverse loudness exponents. 
Phat is, if regression balances for all in- 
dividuals are to lead to the same exponent, 
then an S with a low DE must have a 
compensatingly high ILE and vice versa. 
Cross-modal Pearson rs were calculated 
between these individual exponents both 
for each session and using individual ex- 
ponents based on all sessions. The cor- 
relations between modalities at each session 
were all positive, with only Session 2 
nificant, r (18) = .51, p < .05. The 
orrelation based on combined sessions 
was not significant, ” (18) = .30, p > .1. 


DISCUSSION 


In general, results provide additional sup- 
port for the use of direct scaling methods and 
the assertion of the power law for both the 
data of groups and individual Ss. The pres- 
ence of regression in judgments was apparent, 
and balances for this effect yielded several 
interesting results. 

Stability of the measures. A change in ex- 
ponents with repetition oc urred for apparent 
duration but not for loudness The DE change 
was in the direction of decreased regression 
over sessions, and it occurred gradually. Since 
the balanced exponents are based on the DEs 
and LEs, it is not surprising that they tend to 
increase over sessions even if not significantly 
so. Although the BEs are, in effect, stable, 
the RAs indicate that regression decreases 
significantly over sessions. Stevens and Green- 
baum (1966) suggested that regression balances 
may not be perfectly stable when they stated 
that 


the distorting constraints that affect one variable, 
say, magnitude estimation, may be different from 
the constraints that affect the other v able, 
which may involve, for example, the turning of a 
knob that has peculiar characteristics of its own. 
It seems unlikely that the errors in two such 
different tasks as magnitude estimation and mag- 
nitude production would inevitably cancel exactly 
and completely [p. 446]. 


A possible reason for the difference between 
the sessions effects for DEs and LEs lies in 
the manner in which the two continua were 
adjusted. Loudness adjustment was con- 
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TABLE 2 


INTERSESSION CORRELATIONS FOR SUCCESSIVE PAIRS 
OF SESSIONS 


cent sessions 


Measure 


1-2 | 2-3 sa | es 5-6 


Duration exponents | .45 | .56 | .66 | .77 | .68 
Loudness exponents | | 72 | .78 | .86 
Balanced exponents | .71 | .78 14 | .67 | .85 
Regression angles 39 | .69 | .54 | .66 | .69 


variable in both directions of in- 
but duration adjustment was only 
possible in an increasing direction. Loudness 
matches did not require return to a "zero" 
intensity before increasing or decreasing the 
intensity, With duration, Ss had to start 
over in order to produce a longer or shorter 
interval. There are two ways in which such 
adjustment differences could be avoided. 
First, Ss could be required to make loudness 
matches in a manner analogous to duration 
ones, that is, by increasing monotonically 
from zero loudness for all settings. Another 
approach would be to use continua. which are 
usually adjusted in a similar fashion, e.g., time 
duration and force of handgrip or loudness 
and brightness. Whether some types of ad- 
justment lead to stable exponents, while 
others do not, deserves to be investigated. 

Individual differences.’ The results, in agr 
ment with earlier research, demonstrate that 
consistent individual differences persist over 
sessions. Since only one match per stimulus 
for each continuum was made per session and 
since there were approximately 24 hr. between 
sessions, it was expected that the intersession 
correlations, especially for Session 1 vs, Session 
2, would be low and possibly nonsignificant, 
Teghtsoonian and Teghtsoonian (1971) found 
nonsignificant correlations in such à situation 
for magnitude estimation data, The present 
results, however, were obtained with cross- 
modality matching which conceivably could 
make for the difference. For both continua, 
all entries in the correlation matrices and the 
Ss effects of their analyses of variance were 
significant, indicating that there is a consis- 
tency in responding even with a 24-hr. inter- 
session interval. The highly significant Ken- 
dall's Ws indicated reliability of individual 
differences over repetitions. 
The attempt to remove individual differ- 
ences by balancing for regression was not suc- 
cessful. They were still present with BEs as 
indicated by the analysis of variance, the W, 


tinuously 
tensity, 


and the intersession correlations. Other 
sources of idiosyncratic exponents must still 
exist—perhaps several causes will be found 
responsible. The fact that the average inter- 
session correlation for BEs was about the same 
as those for the DEs and LEs suggests that 
the proportion of variance attributable to in- 
dividuals is similar in all three cases. 

The positive cross-modal correlations be- 
tween DEs and ILEs argue against Stevens's 
expectation that individual differences will 
largely vanish after a regression balance. As 
mentioned earlier, in order for all individual 
BEs to converge on the same value, a DE that 
is relatively low must be accompanied by a 
relatively high ILE, and so forth. Thus, 
negative correlations would be expected. 
Incidentally, they would also be expected by a 
“general range hypothesis.” According to such 
a hypothesis, idiosyncratic exponents would be 
due to consistent use by individuals of a rela- 
tively constant proportion (large or small) of 
the range of any adjusted variable as they go 
from continuum to continuum. Some investi- 
gators (Rule, 1966; Rule & Markley, 1971) 
have suggested that the individual differences 
arise from the idiosyncratic use of the depen- 
dent variable—a sort of response style. Accord- 
ing to a general range hypothesis, such response 
styles would be expected to transfer across 
continua. 

What other possible sources of S differences 
might there be? Stevens (1961) earlier sug- 
gested three components of variability: S's 
modulus, S's conception of a subjective ratio, 
and the operating characteristic of S's sense 
organ. Stevens (1971) rules out the latter 
because normal sense organs do not vary as 
widely as individual exponents. Differences in 
the modulus can be ruled out since they pre- 
sumably affect the multiplicative constant but 
not the exponent of a power function. Differ- 
ences in subjective ratios clearly would affect 
the exponent and it is known that they occur. 
Svenson and Akesson (1966), for example, 
find such differences but they also show that 
Ss differ in other ways as well, Accounting for 
individual variability will probably entail dis- 
covering at least several factors an 
how they operate in combination 
(1971) has suggested, 


d assessing 
as Stevens 
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PRESENTATION RATE AND INTRALIST REPETITION 
EFFECTS IN IMMEDIATE PROBE RECALL' 


V. DAVID BURNS* 
University of Maryland 


Recall in an immediate memory study was tested by à missing-item probe for 
two levels of presentation rate (1 and 4 items per second) and four intralist 
repetition conditions. The presentation rate results contradicted the single- 
process forgetting hypotheses that were considered, including constant or 
variable time decay and interference-only. Repetition effects supported a 
multistore or multiprocess hypothesi The divergent performance patterns 
exhibited by groups of 5s who were exposed to different presentation rate orders 
emphasize the importance of separating strategy from memory effects. 


Two important issues which have been sensitivity estimates as prescribed by signal 
investigated in the study of short-term detection theory (see Banks, 1970), (5) 
memory are sources of forgetting (time that Ss adhere to nonrehearsal instructions 
decay versus interference) and intralist when given, and (c) that only a single re- 
repetition effects. These issues have ordi- sponse is required per trial, as in Waugh 
narily been examined in isolation from each and Norman's (1965) probe procedure. If 
other and hence it is reasonable to investi- it is hypothesized that the only cause of 
gate their interaction; however, both are forgetting is interference from subsequently 
related to the more general problem of presented items within a list (Norman, 
whether multiple memory systems (e.g. 1966), forgetting rates should not differ 
Atkinson & Shiffrin, 1968) or multiple en- across pr ‘sentation rates. Support for this 
coding or retrieval processes (e.g., Craik & hypothesis has been found by Norman 
Lockhart, 1972; Murdock, 1972) are neces- (1966) and has been replicated (Anderson 
sary to describe human performance in the & Burns, 1973) using a missing-item probe 
immediate memory paradigm. This latter with visual presentation (four digits per 
question was examined in the present st udy exposure), required voicing of individual 
by testing the predictions of a number of items, and nonrehearsal instructions. 
“single process" models either for presenta- Alternatively, rate of forgetting is pre- 
tion rate or intralist repetition effects. dicted to be greatest at the slowest pre- 

With respect to the pre sentation rate sentation rate according to any of the 
variable, model predictions are based on following hypotheses: pure time decay, à 
the following assumptions: (a) that forget- combination of interference and time decay 
ting rates as a function of intervening (Wickelgren, 1970), or differential degrees 
items are mos appropriately measured by of processing across presentation rates 
ae (Massaro, 1970). Wickelgren (1970) has 

1 This res | is based on a master's thesis sub- reported an experiment in which this pre- 
mitted to the Graduate School of the University  diction was supported for. a recognition 


of Maryland and was completed while the author 3 
held a Maryland Fellowship. Support was fur- task. One purpose of the present study 


nished by the Biomedical Sciences Support Center Was to replicate Anderson and Burns (1973) 
of the University of Maryland to the Center for with visual presentation of individual list 
Language and Cognition, and computer time was — jtems. Estimates of sensitivity and reac- 


provided in full by the Computer Science Center. ; E eras Vara BS n2 ; 
The author wishes to thank Nancy Anderson for uon time for individual Ss were obtained 
advice and assistance throughout all stages of the 11 order to allow a test of the forgetting 


project and John Holmgren and David Horton for rate hypotheses. 
critical readings of the manuscript. 'The second variable examined in the 


? Requests for reprints should be sent to V. David i Erde UST 3 
Burns, Department of Psychology, University of present study was the requency and spac- 


Maryland, College Park, Maryland 20742. ing of intralist repetitions. For all three 
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FiGURE1. Proportion correct as a function of serial 
position for each rate and group. 


previous studies in which no evidence indi- 
cating time decay was obtained (Anderson 
& Burns, 1973; Norman, 1966; Waugh & 
Norman, 1965), stimulus sequences were 
used in which the repetition of critical 
(tested) items was not investigated. It is 
possible that the failure to find evidence 
for time decay can be partially attributed 
to the absence of a control for repetition 
of critical items. However, repetition 
effects are important in themselves, since 
at least three different outcomes are pre- 
dicted depending on the theoretical model 
chosen. First, if recall in the immediate 
memory paradigm reflects the contribution 
of two distinct memory Systems (i.e., re- 
cently presented items tend to be recalled 
from a short-term store whereas earlier 
items are retrieved from a long-term store), 
one would expect repetitions to affect per- 
formance differentially across serial posi- 
tion. This hypothesis is suggested by the 
facilitative effect of repetitions found in 
"long-term" memory studies (e.g., Waugh, 
1970) but the absence of any facilitation 
when a single item is repeated in the "probe 
memory-span" task (Jahnke, 1969). Sec- 
ond, repetitions may facilitate performance 
across all serial positions (Bernbach, 1969). 
Third, repetitions may degrade perform- 
ance if item-to-item associations mediate 
recall (e.g., Wickelgren, 1966). According 
to the associative hypothesis the repetition 
of an item in a Sequence should decrease 
the probability of recall of the items im- 
mediately following the repeated item (if 
different) because of competing A-B, A-C 
associations. "These three hypotheses were 
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tested by varying the repetition 
of the stimulus lists. 


ructur 


METHOD 

Lists of fifteen digits were composed : ing all 10 
digits (0 to 9) with the restriction t 
occur more than twice per list. Each ! 
structed according to one of the fol 
straints, which define the four repetitio: 
(a) The critical item was repeated at 
intervening item (ie, .. . E 
critical item and j is any other ! 
repeated)—condition CR-1. (b) The cr 
was repeated at a lag of six intervenin 
condition CR-6, (c) The item in the se 
preceding the critical item was repeated : 
six intervening items, and the critical i 
repeated—condition PR. (d) Neither t! 
item nor the preceding item was repeat 
list (although other items did repeat)— 
NR. When an item was purposely repeat 
always repeated on only one side of thi 
position. Hence, if j > 8 was the critic: serial 
position, the repetition was always placed before the 
critical item, and conversely for j < 8. 

Presentation rates of one and four digits per 
second (with item “on-times’ of approximately 1 
and .27 sec., respectively) were chosen to replicate 
previous studies. On any single trial only one digit? 
was probed and the critical item occurred o«ly in 
serial positions 2, 4, 5, 6, 8, 9, 10, 11, 12,0r13. Each 
5 was exposed to all (80) levels of the three main 
variables of Presentation rate X Repetition X Serial 
position. The two levels of presentation rate were 
varied across sessions, but the 40 possible Repetition 
X Serial position conditions occurred equally often 
in a random order within each session, Six Uni- 
versity of Maryland undergraduates were recruited 
as Ss and paid $2 per hour plus a bonus based on 
Proportion correct. Each S participated in nine 
sessions (120 lists per 50-min, session) at the slow 
rate and three (320 lists in 75 min.) at the fast 
presentation rate. As a control for order effects one 
group of three Ss (Group 1) was exposed to the 
slow presentation rate initially, and another group 
of three Ss (Group 2) was exposed to the fast pre- 
sentation rate initially. Because of scheduling con- 
flicts, the only constraint imposed was that Group 1 
(start slow) Ss be exposed predominately to the 
slow presentation initially, and that Group 2 (start 
fast) Ss be exposed predominately to the fast pre- 
sentation initially. The consequence of this ar- 
rangement was that Ss were exposed to the following 
sequence of sessions: : 


ditions? 


Group 1: Group 2: 


SSSSSSSFFSSF (2.55) FSSFFSSSSSSS 
SSSSSSSFFSFS (1.5 —FSFSFSSSSSSS (1.5 each) 
FSFFSSSSSSSS. 


Stimuli were displayed and responses recorded by 
means of a CRT terminal connected to the time 
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haring computer system at the University of 
Maryland. After a ready signal on the CRT, 5 
initiated the stimulus presentation. Each digit was 
displayed individually to the right of the preceding 
digit such that no two items were ever visible 
simultaneously. Each digit was read aloud as it 
appeared on the screen, and Ss were instructed not 
to rehearse previous item: "he instructions were 
repeated on numerous occastons throughout the 
experiment. 

Following presentation of the 15 digits, the miss- 
ing-item probe, consisting of the stimulus list which 
just occurred except for a single missing digit, was 
displayed and two hyphens were printed at the 
position of the item which was to be recalled. One 
of the 10 digits and a confidence judgment were 
entered while the probe remained visible, then the 
n was cleared and another ready signal given. 
Two rest intervals lasting a minimum of 1 min. 
were provided at equal intervals during the session. 


RESULTS 


Performance às measured by proportion 
correct is summarized for each presenta- 
tion rate in Figure 1. An analysis of vari- 
ance of the aresin transform of proportion 
correct indicated that performance at the 
slow presentation rate was superior, F (1, 4) 
= 15.8, p < .05 (mean difference = .087). 
Presentation rate interacted with serial 
position only for Group 1 (start slow), F 
(9, 18) = 11.5, p< O01, which accounts 
for the significant Groups x Rate X Serial 
position interaction, F (9, 36) = 3.56, 
p < .005. 

Since the forgetting rate predictions for 
all models are specified in terms of sensi- 
tivity estimates, Memory Operating Char- 
acteristic (MOC) curves were generated 
and Ay, a nonparametric estimate of sensi- 
tivity representing the area under the 
MOC, was calculated for each S and con- 
dition. The sensitivity estimates are plot- 
ted in Figure 2 for each group and presen- 
tation rate. Performance as estimated by 
A, is superior at the slow presentation rate 
for both groups. (The sharply reduced Ag 
in serial positions 12 and 13 for Group 2 
reflects. the inadequacy of this measure 
when performance is almost perfect.) 

The slopes of the linear least-squares 
fits to log A, as à function of number of 
intervening items (the reverse of serial 
position) were employed as estimates of 
forgetting rates, as specified by the models 
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position for each rate and group. 


considered above. (If Ay for serial posi- 
tions 2 or 4 exceeded the minimum A, for a 
given S and rate by „1, or if the difference 
in A, for positions 12 or 13 and the maxi- 
mum A, exceeded .1, those positions were 
not included in the slope estimate.) The 
forgetting rates obtained by this procedure 
were greater at the fast presentation rate 
for all Ss, and the slope differences were 
significant for four of six Ss (two-tailed 
confidence interval, p < 05). The zero- 
intercepts estimated by least-squares were 
greater at the fast presentation rate for 
four of six S! 

Mean reaction times were significantly 
longer at the slow presentation rate, F 
(1, 4) = 85.33, p < .005, (mean difference 
= 5. sec). This re ilt has also been ob- 
tained by Norman (1966) and Murray 
(1968). 

The overall differences among repetition 
conditions were not significant but a signifi- 
cant interaction of repetitions with presen- 
tation rate and serial position was observed 
for Group 1, F (27, 54) = 2.83, p € O01, 
Separate analyses performed on the first 
and last five positions indicated that this 
interaction was significant only for the first 
five positions tested, F (9, 19) = 3.52, p < 
.025. Performance was generally superior 
for condition CR-6 but the differences 
among conditions varied across presenta- 
tion rate and position. 

For Group 2 (start fast) repetitions inter- 
acted with serial position, F (27, 54) = 
2.08, p < .001, and the differences among 
repetition conditions were significant only 
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lor the first five positions tested, F (3, 6) 
= 6.9, p «.025. For these positions (2, 
4, 5, 6, and 8) the critical item was recalled 
with highest probability when repeated, 
regardless of lag (conditions CR-1 and 
CR-6). Recall was poorest for condition 
NR in which neither the preceding nor the 
critical item was repeated, and performance 
was intermediate for lists in which the item 
preceding the critical item was repeated 
(condition PR). Vus 

Group 2 (start fast) Ss performed sig- 
nificantly better than Group 1 (start slow) 
Ss, F (1, 4) — 26.86, b « .01 (overall dif- 
ference in proportion correct — .20), and a 
Similar pattern of group differences was 
apparent for the A, measure. The per- 
formance superiority exhibited by Group 2 
Ss was restricted primarily to early posi- 
tions at the slow rate and later positions 
at the fast rate, Aaronsson (1967) has 
reported a similar result. 


Discussion 


The forgetting rate differences obtained in 
the present study contradict all “single pro- 
cess" models considered, namely, models in 
which it is assumed that encoding across serial 
position is constant (for a given presentation 
rate and nonrehearsal instructions), and that 
forgetting results either from interference, 
time decay, or some combination. The for- 
getting rate differences obtained are consistent 
with a number of multiprocess models (e.g., 
Atkinson & Shiffrin, 1968), since performance 
for recently presented items (later serial posi- 
tions) is predicted to be independent of pre- 
sentation rate but recall of items in early 
positions is assumed to be an increasing func- 
tion of study time. In addition, the multi- 
process model can more easily account for the 
pronounced primacy effect apparent in Figures 
l and 2 without resorting to post hoc 
explanations, 

Additional evidence that the memory process 
used by Ss in this task is not unitary was indi- 


Al- 
in a complex 
Manner with presentation rate and serial posi- 
tion for one group, recall was generally facili- 
early but not 
[ This Supports the hy- 
pothesis that the Processes of encoding and/or 


V. DAVID BURNS 


retrieval for early items are analogo: +o those 
involved in "long-term" recall studie w hereas 
retention of later items more cl. corre- 
sponds to the mnemonic proces which 
takes place in the memory span task. Neither 
of the other two hypotheses conside: regard- 
ing repetition effects were supported 

The pattern of group differences s ort the 
hypothesis suggested by Aaronsso 1967), 
that Ss trained at a slow presents on rate 
tend to adopt an "immediate" ; eptual 
strategy at both presentation rates hereas 
Ss trained at a fast rate tend to opt a 
“delayed” identification strategy. $. wever, 
in the present study it may be mo: ppro- 
priate to hypothesize that the grou; Jiffer- 
ences result from a differential delay in" ceper- 
level" or "long-term memory" pr sing. 
Group 1 Ss (exposed to the slow rate ally) 
may have attempted to maximize the pth” 
of processing of individual items at h pre- 
sentation rates, However, Group 2 Ss, . sined 
initially at the fast rate, may have c ‘ayed 
"deeper-level'" processing at both prese: tion 
rates, since any processing of individuai ‘ems 
was impossible at first, The result of uc ng 
à delayed Strategy would be to acquire g; ater 


information about the relationship among 
items, hence Group 2 Ss would be expected to 
Show superior performance particularly for 
early serial positions at the slow presentation 
rate. The "immediate" Strategy has the addi- 
tional disadvantage of requiring more process- 
ing time per item, resulting in a greater proba- 
bility that the encoding of later items at the 
fast presentation rate is disrupted. 

The results of varying both presentation rate 
and intralist repetition provide further evi- 
dence for the necessity of postulating more 
than a single memory process or dimension to 
describe retention in the immediate probe 
recall paradigm, The differences between 
groups can be accounted for in terms of 
strategy differences which are compatible with 
the multiprocess model. The two memory 
Processes hypothesized may be more clearly 
differentiated by varying the degree of re- 


dundancy over more intricately patterned 
stimulus sequences, 
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EFFECTS OF POSITIVE AND NE 


GATIVE FORCE-C( 


TINGENT 


REINFORCEMENT ON THE FRUSTRATION EFFECT 
IN HUMANS?! 


GAIL DITKOFF? ANp RONALD LEY 


State University of Ne 


Ninety college undergraduates learned 


sponse under conditions of positive or negative 
received for correct training-trial respons 
received five extinction trials. 


of the reinforcement < 
Following training, 

generalized-drive interpretation of the 
extinction mean response force on a nc 


Ss 


both the positive-reinforcement and the negative- 


unexpected finding was a decrease in me: 
gent reinforcement level (Li) from late t 
by the negative-reinforcement groups 
trend in which the magnitude of the 
forcement on training trials increased, 


Amsel's (1958) frustration-drive hypoth- 
esis states that nonreinforcement follow- 
ing the development of an anticipation of 
reinforcement results in the aversive moti- 
vational state of frustration. According to 
frustration-drive theorists, frustration acts 
as a source of drive (D), which results in 
an increase in the vigor of behavior that 
immediately follows frustration (the frus- 
tration effect—FE), Empirical support for 
the notion that nonreinforcement following 
a history of positive reinforcement is moti- 
vating has been obtained with infrahuman 
Ss, using double-runway, hurdle-jump, and 
operant lever-press apparatuses (Amsel & 
Rousel, 1952; Daly, 1971; Marzocco, 1951), 
and with humans, using force, movement 
speeds, and latency of lever-press responses 
às dependent variables (Blixt & Ley, 
1969; Ryan & Watson, 1968). With re- 
Spect to negative reinforcement, a com- 
paratively small amount of research has 
been reported. Although Lambert and 
Hammond (1970) were unable to obtain 
an FE in rats using escape from shock as a 
negative reinforcer in a double runway, 
Graham and Longstreth (1970) did obtain 


! This research was supported in part by a State 
University of New York Research Foundation Grant 
in Aid to the second author, 

* Requests for reprints should be sent to Gail 
Ditkoff, who is now at the University of Pittsburgh, 
Johnstown, Pennsylvania 15904, 
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à force-discrimination lever-press re- 
reinforcement. The magnitude 
s was varied, 
tent with a 
:), during early 
2) increased for 
reinforcement groups. An 
an response force on the force-contin- 
raining trials to early extinction trials 
The Ly and the Ls data suggested a 
= increased as the magnitude of rein- 


Cor 
frustration effect (F 
onreinforced lever (L 


FEs in rats in a double-runway study in 
which shock escape served as the negative 
reinforcer. The primary purpose of the 
present study was to extend the investi- 
gation of the effects of termination of 
negative reinforcement (avoidance of mone- 
tary loss) on the FE to the realm of human 
behavior. In the context of this study, the 
termination of negative reinforcement dur- 
ing extinction trials consisted of punish- 
ment, i.e., monetary loss following responses 
which had previously been negatively rein- 
forced. On the basis of current theories of 
the FE, it was hypothesized that the 
termination of reinforcement following both 
positively and negatively reinforced train- 
ing would result in the FE in humans. 
The secondary purpose of the present 
study was to investigate the effect of differ- 
ent magnitudes of both positive and nega- 
tive reinforcement on the FE. According 
to both Amsel (1958) and Spence (1960), 
the magnitude of the FE is related to the 
strength of S's anticipation of reinforce- 
ment (ra). As the strength of rq increases, 
nonreinforcement should produce greater 
amounts of frustration. Researchers have 
found that the magnitude of the FE in- 
creased (a) when the similarity between a 
runway and its goal box was increased 
(Amsel & Hancock, 1957), (b) when S's 
nearness to task completion when he was 
thwarted was increased (Hanner & Brown, 
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1935), and. (c) when the number of rein- 
forced acquisition trials given prior to 
nonreinforeement was increased (Penny, 
1055), According to Spence (1960), another 

in which the strength of re may be 
varied is by varying the magnitude of the 
reinforcement that Ss receive during train- 
ing; ien Ss which receive larger reinforce- 
ments during training should develop 
stronger ra, and hence show greater FEs, 
than Ss which receive smaller reinforce- 
ments during training. On the basis of 
this theoretical position, the second hypoth- 
esis of the present study was that the 
magnitude of the FE would be a function 
of the magnitude of reinforcement. (both 
positive and negative) received during 
training, i.e., as the amount of reinforce- 
ment received for each correct training- 
trial response increased, the magnitude of 
the FE would increase. 

In the double-runway studies (e.g., 
Amsel & Rousel, 1952) the use of the same 
reward (ie, food) in the goal boxes of 
both the first runway (Ra) and the second 
runway (Rs) has led to the question of 
whether an increase in R, response vigor 
reflects an increase in general drive level 
or an increase in food-related motivation. 
Levine and Loesch’s (1967) finding that 
the vigor of a continuously reinforced in- 
strumental response (food-reinforced lever 
presses) increased during periods when a 
dissimilar instrumental response (water- 
reinforced chain pulls) was being extin- 
guished was interpreted as evidence for the 
hypothesis that nonreinforcement, follow- 
ing continuous reinforcement, leads to in- 
creases in D. The tertiary purpose of the 
present study was to test further the 
hypothesis that the FE reflects an increase 
in D, which will be evidenced in an in- 
crease in the force of a second response (a 
response unrelated to the reinforcement 
contingency of the frustrated response). 

By means of a double-lever force-con- 
tingent reinforcement apparatus, Ss succes- 
sively pressed each of two levers (Lı and 
L;) on each experimental trial, Correct Li 
responses were either positively or nega- 
tively reinforced during training trials 
and were not reinforced during extinction 
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trials. Lever 2 responses were never re- 
inforced. Therefore, a significant Le frus- 
tration effect (i.e., an increase in the force 
of L, responses following Lı nonreinforce- 
ment) could not be interpreted as the result 
of the termination of La reinforcement and 
would support an explanation in terms of 
the FE reflecting increases in D. 


METHOD 


Subjects. The Ss were 90 undergraduate students 
enrolled at the State University of New York at 
Albany. 

Apparatus, A modification of the force-contingent 
reinforcement apparatus designed by Blixt and Ley 
(1969) was used to measure changes in the force of 
on a force-contingent reinforcement. lever 
ie 


responsi 
(Ly) and a nonreinforced lever (L4) when à s 
of continuously reinforced responses was followed 
by nonreinforcement. These two vertical 48,26-cm.- 
high steel levers were mounted side-by-side (30.5 
cm. apart). There was no discernable movement 
of the levers when force was applied. The left-hand 
lever was painted red and the right-hand lever green. 
Two signal lights (one red and one green) were 
directly behind and approximately 5 cm, above their 
correspondingly colored levers. Between the two 
signal lights, at approximately eye level, was à 
digit counter (Lafayette Model 57-T-B). At the 
middle of each lever (front and rear) were two 
bonded strain gauges (SR-4 Type FA-50-35, 350 ohm. 
resistance, manufactured by Baldwi -Lima-Hamil- 
ton) which measured the amount of force applied to 
the levers. Pressure on the levers resulted in the 
development of a small signal volta which was 
routed through a "arrier amplifier to à Brush eight- 
channel pen recorder. (Model RD-1682-30) and 
caused a pen deflection on à moving (1 mm/sec) 
graphical tape. The graphical tape was preexperi- 
mentally scaled, and since each lever was connected 
to a separate pen, à direct reading in grams of force 
on each lever was possible. 

A test control box was manually set at a constant 
5.sec, intertrial interval. Beginning with the red 
light, each of the two signal lights was succes ively 
activated for 5-sec. durations. The red-light circuit 
activated and reset one of two force-band discrimi- 
nators that monitored voltage output from the 
strain gauges. Located on Force Band Discrimi- 
nator 2 was a control switch that enabled E to 
program the L, response force-counter. activation 
contingency; i.e E controlled the amount of force 
that would activate the counter. 

Procedure. From the total sample of 90 S 
equal number of Ss (n = 15) was randomly à igned 
to each of six reinforcement-ty pe (positive reinforce- 
ment—PR—or negative reinforcement —NR) by 
magnitude-of-reinforcement (0€, 5¢, or 10¢) groups, 
ie, PR-0¢, PR-5é, PR-10¢, NR-OF, NR-5¢, and 
NR-10¢. i 


eS 


The Ss were instructed to press La after 
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TABLE 1 


MEAN LEVER 1 AND LE 
REINFORCE: 


3 kg. or greater than 5 kg.) 
on the counter. The PR 
that each counter point rey 
PR-5¢) or 10¢ (Group PR. 
earned, while NR Ss were 


told that the 


à point would accumulate 
Ss we 


re further instructed 
presented 5¢ (Group 
10¢) that they had 
y initially 


Lever 2 1 
ati iate post- Prefrustration Imr L 
om [^ Pett itio e "We | m oum 
7 m SD M SD M | SD M SD 
23 3.80 97. 
PR-0 5.08 2.18 5.33 2.36 3.72 23 0 
: PRSE 4.63 1.05 4.58 .99 3.92 Al 4.15 5 
PR-10¢ 4,35 1,66 4.53 1.92 3.71 32 4.00 ‘ot 
NR-0¢ 5.28 1.57 5.32 1.81 3.81 .39 3.18 "ox 
NR-5¢ 4.83 1.95 5.18 2.60 3.45 A2 3.28 a 
NR-10¢ 4.21 .80 4.38 1.29 3.61 29 3.63 2 
Note, —Abbreviations: PR = positive reinforcement, NR = negative reinforcement, 
the onset of the L; signal light and to press Ls after tion trials) with the mean Ly force on the t fog 
each Ly press, but not until the onset of the Le signal training trials plus the first extinction ul (i 
light. The Ss were further instructed that the func- prefrustration trials), 
tion of Ly was to reset the recording mechanism 
pre : RESULTS 
The PR Ss were instructed that following each 
d Li press Mn RS Point would Bend rots The performance data in terms o! mean 
On the counter, while NR Ss were instructe, hai 2 : PUR 1% Ww 
following each incorrect Li press (a press less than force of response for each of the six ex; 


mental groups ( 
forcement by 0¢, 
reinforcement) on 
trials for Levers 1 


positive or negative rein» 
5¢, or 10¢ magnitude of 
pre- and postfrustration 
and 2 are given in Table 


had $4.00 to work with and 
represented 5¢ (Group N 
10¢) which woul 
The PR control 

regardless of thei 


that each counter point These data we 
R-5¢) or 10¢ (Group NR- 
d be deducted from their $4.00, 
Ss (Group PR-0£) were told that 


r performance they would be paid 


re analyzed by means of 
à four-variable mixed factorial design im 
which the between-Ss variables were re- 
inforcement magnitude (0¢, 5é, or 106) and 


$1.50 for participating in the 


should try to accumulate poir 
5s (Group NR-0£) were told 
not to accumulate points. 

Following a practice pe 
to familiarize Ss with th 
and with the propriocepti 


riod 


erer 


study, but that they 
ts, while NR control 
that they should try 


n 


» Which was designed 
perimental apparatus 


a correct Li response, 
tered until a criterion 


ve feedback associated with 
training trials were adminis- 
of five consecutive correct Li 


responses following 
reached. Following training 
trials were given during 
correctness of Li responses, 
money) accumulated for the 
one point (a loss of money— 
lated for the NR groups, 

The magnitude of the FE 
paring the response force of 
that followed 
force on extinction 
with the mean I 
training trials (Ly prefrustr; 
comparing the response force 
Which followed planned L 
the Li force 


trial 1—L, 
-» response 


© 


the tenth training 


whi 


planned L, nonreinfc 


1 nonrei 
on Extinction Tri 


trial was 
trials five extinction 
ich, regardless of the 
no points (additional 
PR groups, whereas 
punishment) accumu- 


was 


assessed by com- 
the 


first Ly response 
orcement Se ps 
postfrustration trials) 


force on the last five 
ation. trials 


) and by 
Li response 
inforcement (i.e., 
ial 2—L, postfrustra- 


of the first I 


reinforcement 
reinforcement), 
ables were tri 


type (positive or negative 
and the within-Ss vari- 
als (pre- and postfrustration 


trials) and levers (Li—the force-contingent 
reinforcement lever—and L;—the reset 
lever). 


With respect to the primary hypothesis, 
the Reinforcement Type X Trials X Levers 
interaction. was significant, F (1, 84) = 
8.70, p < .005, thus indicating that the 
Reinforcement Type X Trials interaction 
was different from Pas stor T Separate 
analyses of this interaction revealed that 
on L: the Reinforcement Type X Trials 
interaction was not Significant, F (1, 81) 
= .14, 5 > .05, whereas on Lı the Rein- 
forcement Type X Trials interaction. was, 
Tod 84) = 8.28, p < .005. The means 
of the PR and NR groups on the pre- and 
Postfrustration trials on Ls; which are 
plotted in Figure 1, clearly indicate an 
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crease in force of response from pre- to 
postfrustration trials for both the PR and 
NR groups, F (1, 84) = 411, $ «CX. 
Phat is, with respect to Lə the FE was 

(ained under negative as well as positive 
einforcement, results consistent with the 
primary. hypothesis. The observed differ- 
ence between the PR and the NR groups 
was not significant, F (1, 84) = .10, 
p.05. 

"The means of the PR and NR groups on 
the pre- and postfrustration trials on Li, 
which are plotted in Figure 2, indicate an 
observed increase in force of response from 
pre- to postfrustration trials for the PR 
group, findings consistent with the data 
based on Ls and with the primary hypoth- 
esis, but a decrease in force for the NR 
group, F (1, 84) = 5.25, p < .05, findings 
opposite to those based on PR on L; and 
opposite to those based on both PR and 
NR on Le. 

Contrary to the second hypothesis, re- 
inforcement magnitude was not significant, 
F (2, 84) = 1.19, p > .05. Furthermore, 
neither the four-factor interaction nor any 
of the two-factor or three-factor inter- 
actions including the reinforcement mag- 
nitude variable were significant. 

Consistent with the third hypothesis, re- 
sponse force on L, (a response unrelated 
to the reinforcement contingency of the 
frustrated response) showed a significant 
increase from pre- to postfrustration trials. 
This effect is illustrated in Figure 1 and 
was presented in the context of the results 
with respect to the first hypothesis. It 
should be noted, however, that the mean 
force of response on Lz (M = 4.76) was 
significantly greater than the mean force 
of response on Li (M = 3.68), F (1, 84) — 
42.26, p < .001—results that might be ex- 
pected since reinforcement was contingent 
on Ss’ restricting their force of response on 
Lı within a relatively light force band, 
whereas reinforcement never followed Le re- 
sponses, regardless of the force Ss exerted. 

Consistent with expectations based upon 
previous research (Otis, 1973), the FE was 
found to dissipate rapidly over time. 
Analyses of changes in response force from 
prefrustration trials to delayed postfrus- 
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PRE-FRUSTRATION IMMEDIATE POST- 
TRIALS FRUSTRATION TRIALS 
Ficure 1. The mean Lever 2 response force as a 


function of trials and reinforcement type (PR = 
positive reinforcement, NR — negative reinforce- 
ment). 


tration trials (Extinction Trials 3-5 on Li 
and Extinction Trials 2-4 on Ly) showed 
the relevant main effects and interactions 
to be nonsignificant. 

Table 2 presents the mean number of 
incorrect Li responses as à function of 
trial blocks for each reinforcement-ty pe 
group. A Trial Blocks X Reinforcement 
Type analysis of variance was computed 
and indicated no difference between rein- 
forcement type groups, F (1, 88)= .12, 

> .05; a significant trial blocks factor, 
F (6, 528) = 22.13, p < .001; and a non- 
significant interaction, F (6, 528) = 1.54, 
p > 05. The significant trial blocks factor 
reflects the complete elimination of incor- 
rect Lı responses in Trial Block 7, con- 
sistent with the fact that training trials 
were terminated when Ss reached a criterion 
of five consecutive correct Lı responses. 


e- WR 


FORCE OF RESPONSE (Kgs) 


SRETRATRATION IMMEDIATE POST- 
TRIALS FRUSTRATION TRIALS 

FiGuRE 2. The mean Lever 1 response force as à 
function of trials and reinforcement type (PR — 
positive reinforcement, NR — negative reinforce- 
ment). 
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TABLE 2 


MEAN NUMBER or INCORRECT LEVER | RESPONSES 
DURING TRAINING AS A FUNCTION oF TRIAL BLOCKS 
(EacH TRIAL BLOCK REPRESENTS ONE-SEVENTH 
OF THE TOTAL NUMBER OF TRAINING 
TRIALS FOR Eacu S) ror EACH 
REINFORCEMENT-TYPE 


Trial blocks 


Group 
1 2 3 4 s 6 7 
PR 
M | 1.22 | 1,22 | 1.11 | 1.04 | .98 71 | 0.0 
SD | 1.49 | 2.20 | 2.05 | 1.94 | 94 94 | 0.0 
NR 
M | 1.13 | 1.00 | 1.02 | 1.27 | 1.42 87 | 0.0 
SD | 1.36 | 1.85 | 1.98 | 2.25 | 1.16 1.14 | 0.0 


Note. Abbreviations: PR = 1 
negative reinforcement. 


itive reinforcement, NR = 


Discussion 


With respect to the primary purpose of the 
present study, the data indicate that the termi- 
nation of negative reinforcement, as well as 
positive reinforcement, results in the FE in 
human The evidence that indicates an 
increase in force of response following the 
termination of positive reinforcement supports 
Amsel's (1958) frustration-drive hypothesis 
and is consistent with the findings of Blixt and 
Ley (1969). The demonstration of the FE in 
terms of an increase in force of response follow- 
ing the termination of negative reinforcement 
is unique with respect to both human and 
animal studies, With respect to other mea- 
sures of the FE (e.g., start speeds from the 
first goal box of the double runway), the re- 
sults are consistent with those of Graham and 
Longstreth (1970), who obtained the FE in 
rats using escape from shock às the negative 
reinforcer. It is important to note, however, 
that under conditions of negative reinforce- 
ment, the FE (an increase in force of response) 
was limited to Le, the lever which was not 
instrumental in obtaining reinforcement dur- 
ing any phase of the experiment. With re- 
Spect to Li, the results were directly opposite 
of those expected, le, under conditions of 
negative reinforcement, force of L, responses 
decreased from prefrustration to postfrustra- 
tion trials. This decrease may have been the 
result of greater ave 'eness of nonreinforce- 
ment following negative reinforcement than 
nonreinforcement following positive reinforce- 
ment. In the present study, nonreinforcement 
for the negative-reinforcement groups con- 
sisted of punishment (i.e., the presentation of 
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an aversive stimulus following the . ;/rectf 
sponse); whereas nonreinforcem: for 
positive-reinforcement groups did cons 
of punishment. If punishment lea to a d 
pression in performance, perhaps greater 
aversiveness of nonreinforcement ler the 
NR condition resulted in a depre e 
sponse force which overrode the rgizing 
effects of frustration. This expla: n does 
not contradict those data that d. istrate 
the FE for both positive and nege ve reins 
forcement on Ls, since reinforcemen id sub- 


limited 
‘r rein: 
under 


sequent extinction contingencies wer: 
to Lı responses (L» responses were n: 
forced). This explanation proposes t! 
NR conditions there may be two 
which influence changes in Li respo: 
following nonreinforcement: (a) the « 
frustration, which energize performan 
resulting in an increase in Li respon 
and (b) the effects of the aversiveness cí pun- 
ishment, which depress performance, thus re- 
sulting in a decrease in Li response force; 
Consistent with this explanation, the data of 
Table 1 show that the changes in L; response 
force from prefrustration trials to postfrustra- 
tion trials were (a) a sharp decrease in force | 
by Group NR-0é, the group for which the 
effects of frustration would be expected to be 
weakest, (b) a slight decrease in force by Group 
NR-5¢, and (c) a slight increase in force by 
Group NR-10¢, the group for which the ef- 
fects of frustration would be expected to be 
strongest, 

With respect to the second purpose of the 
present study, both the Lı data and the Lg 
data showed a trend in the direction of a posi- 
tive relationship between increases in force of 
response, following reinforcement. termination, 
and magnitude of reinforcement, A perusal of. 
the Ly data of Table 1 indicates that the 10€ 
groups showed an increase in mean response 
force from prefrustration trials to postfrustra- 
tion trials of .18 kg., while both the 5¢ and 
the 0¢ groups showed increases of .15 kg. 
With respect to the Li data, the 10€ groups 
Showed an increase in mean response force 
from prefrustration trials to postfrustration 
trials of .14 kg., the 5¢ groups showed an in- 
crease of .04 kg., and the 0¢ groups showed a 
decrease of .36 kg. This trend suggests that 
greater manipulations of reinforcement mag- 
nitude May produce results consistent. with 
those predicted from Spence's (1960) theory. 

With respect to the tertiary purpose, the 
results of the present study support a general- 
ized-drive explanation of the FE. The FE 


force, 


POSITIVE AND NEGATIVE FORCE-CONTINGENT REINFORCEMENT 


vas clearly evidenced on Lz (the nonrein- 
iorced lever) under conditions of both positive 
ind negative reinforcement. In addition, 
results are consistent with those of 
ne and Loesch (1967), who demonstrated 
© generalization of the FE following termi- 
nation of positive reinforcement. Further- 
nore, the results of the present study extend 
the findings of Levine and Loesch to human 
behavior and to situations involving negative 
reinforcement. 
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THE MODALITY EFFECT: 
IS PRECATEGORICAL ACOUSTIC STORAGE RESPONSIBLE? ! 


RANDALL W, 


ENGLE ? 


King College, Bristol, Tennessee 


Auditory presentation typically gives be 
tation in short-term memory experim 


to be due to a precategorical acoustic store 
conducted using the suffix procedure with free or seria 
findings did not support the PAS interpre 


stands, That is, 


The question is a very old one. Does 
auditory presentation of materials to be 
learned or remembered result in better 
learning or memory than those materials 
presented visually, or vice versa? A corre- 
sponding question that has attracted con- 
Siderable interest is whether some people 
have primarily auditory memories while 
others favor the visual modality. This 
paper is concerned primarily with the initial 
question. If uncertainty is aroused by the 
second question then the reader is referred 
to Jensen (1971) for a partial answer. 

A great deal of evidence has been com- 
piled recently that modality of presenta- 
tion is a potent variable in short-term 
memory experiments. The usual finding 
is that retention in immediate memory is 
much better for auditorily presented ma- 
terials than for those presented visually. 
There is also evidence that visual presen- 
tation with concomitant vocalization of the 
material leads to better performance than 
just visual presentation (Murray, 1966). 
The supremacy of auditory presentation, 
known as the modality effect, has been 
observed using paired-associates (Murdock, 
1966) and free recall (Craik, 1969) as well 
às paradigms involving order information 
(Murdock, 1967). The modality effect is 
typically obtained with both recall and 
recognition type retention tests. 

A SANS 

1 The experiments were conducted as part of a 

senior seminar by Glen Camenish, Luke Houchins, 


Kathy Lane, Terri Macray, Will Kniseley h 
Saavedra and Sandy Sharp. E 


? Requests for reprints should be sent to Randall 


Engle, Department of Psychology, Kin, Coll 
Bristol, Tennessee 37620, ty VOS 
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etter performance than visual pre 


the suffix did not elimin 
should have according to the PAS interpret 


cn- 
This modality effect is theorized 

(PAS). Two experiments were 
| recall of words. The 
tation of modality effects as it now 
ate the modality effect whi: 
ation, 


In single-trial free recall situations tht 
auditory superiority is most promine 
over the last four or five serial input p 
tions, in other words, the ; cency item 
The earlier list items seem (o be affect 
very little by modality manipulations ex 
cept that occasionally a slighi facilitatio 
is found for the visual group on the primac 
items (Craik, 1969; Murdock, 1966), 

Craik (1969) and Murdock and Walke 
(1969) have attempted to delineate th 
theoretical implications for mod lity effec 
in short-term memory. One explanation 
for the results is that the short-term store 
is primarily auditory or acoustic in nature 
and auditorily presented materials enter 
the STS directly but visually presented 
materials must be translated into the 
acoustic code. This notion would predict 
auditory supremacy for all serial input 
positions and not, as is typically found, 
just those at the terminal end of the input ^ 
list. Therefore, this idea seems to be some- 
what discounted. 

Another explanation makes use of pre- 
Perceptual or prelinguistic stores, This 
explanation takes two forms. Crowder 
and Morton (1969) and Crowder (1970) 
have proposed that there is a precategorical 
acoustic store (PAS) similar in nature to 
the visual icon (Neisser, 1967) except of a 
longer duration. They argue that PAS is 
capable of holding information sufficiently. 
long to affect the immediate memory task 
(on the order of 2 sec.) Information is 
supposedly lost from PAS due to displace- 
ment by subsequent acoustic events and/or 
decay with the passage of time. Crowder 
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josits that presence of this prelinguistic 

formation for auditory presentation is 

onsible for the modality effect. Crowder 

/lorton see no need for separate long- 

n and short-term memory stores. They 

cel that the concept of a PAS coupled with 

some form of articulatory coding makes 

the notion of a short-term store counter- 
productive. 

The other attempt to explain the mo- 
dality effect also relies on the notion of a 
PAS. Craik (1969) argues that the su- 
periority of auditory presentation over the 
recency portion of serial position retention 
curves reflects the output from a post- 
linguistic or postcategorical short-term store 
hich is itself indifferent with respect to 
input mode but which can, in some in- 
stances, be augmented by relatively un- 
processed information still present in the 
visual and acoustic stores (icon and PAS). 
The modality effect is viewed as resulting 
from output from the short-term store 
being supplemented by the information in 
PAS for auditory presentation. Since the 
icon decays too quickly to be of benefit for 
visually presented items, the recency effect 
in single-trial immediate free recall is con- 
sidered to be pure output from the short- 
term store. 

As a phenomenon, the PAS has been 
studied almost exclusively by something 
called the suffix technique. The usual 
procedure, perfected by Crowder and his 
colleagues, is to present a sequence of items 
(typically digits) auditorily with a redun- 
dant nonrecalled suffix following the last 
item. This suffix is usually something like 
the word “zero” spoken at the end of each 
list. The serial recall data typically take 
the form of an increase in errors over the 
terminal (two-three) items in the suffix 
group as compared to a nonsuffix control 
group. Theoretically, the acoustic repre- 
sentation of the last few items in the list 
is still in the PAS at the time the suffix is 
presented and when the suffix enters the 
store it overwrites or interferes with the 
acoustic representation already there. 

Since both major theoretic notions re- 
garding modality effects rely on the PAS 
concept it seemed that the suffix procedure 
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might be a useful converging operation to 
tease out those factors involved in the in- 
terpretation of the modality phenomenon. 
The first experiment was set up to be a 
simple demonstration that the suffix pro- 
cedure would eliminate the superiority of 
auditory presentation over the terminal 
items in single-trial immediate free recall. 
This did not prove to be the case and thus 
led to the second experiment. 


EXPERIMENT | 


In terms of the PAS explanation, a 
redundant nonrecalled word at the end of 
auditorily and visually presented lists 
should have differential effects. If the 
suffix eliminates the ancillary information 
usually available in PAS for auditory pre- 
sentation then the auditory suffix group 
should become exactly like the visual suffix 
and nonsuffix groups over the terminal or 
recency items. In other words, the pre- 
diction of Craik (1969) and Crowder (1970) 
should be that the suffix would affect the 
visual groups little, if at all. The auditory 
nonsuffix group should exhibit the typical 
supremacy over the visual groups for the 
terminal items, while the auditory suffix 
group, as a consequence of having the 
extra PAS information removed, should be 
no different than the visual groups. This 
should yield a Modality X Suffix and a 
Modality X Suffix X Serial Position inter- 
action effect. 


Method 


Materials. Twenty-four lists of 12 items each 
were composed from the Kucera and Francis (1967) 
norms. The lists were recorded for auditory pre- 
sentation at a l-sec. rate and typed on paper for 
visual presentation by memory drum (also at à 
1-sec. rate). Each list was used in both suffix and 
nonsuffix conditions but never with the same S. 

Design. Half of the Ss received the lists via 
auditory presentation and half by the visual mode: 
The visual and auditory groups were further sub- 
divided into within-subjects suffix and nonsuffix 
conditions. In the suffix condition the lists ended 
with the word "zero." The "zero" was not to be 
recalled and S was told to either use it as a cue to 
recall or ignore it altogether if he so desired. The 
nonsuffix lists, of course, consisted of just the 12 
words without the suffi The Ss served in the 
suffix and nonsuffix conditions in separate half-hour 
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FiGURE 1. Mean free recall as a function of serial 
position, modality (auditory—A, or visual—V) and 
presence (S) or absence (NS) of a suffix. 


sessions (12 lists per session) separated by 24 hr. 
The order of the suffix and nonsuffix sessions and 
set of lists in the two sessions were counterbalanced 
across subjects, 

Subjects. The design above resulted in eight dif- 
ferent conditions and since each of the four experi- 
menters ran 1 § in each condition, 32 Ss were 
needed. The Ss were college sophomores at King 
College and served as part of a course requirement, 

Procedure. The S was seated at a table facing 
either a Lafayette memory drum or Sony tape 
recorder depending on the appropriate condition. 
The S was instructed to recall the words in any 
order he deemed desirable with 60 sec. allowed for 
him to write his recall in a prepared booklet, 


Results 


The data from the first two lists in each 
session were discarded as practice lists, 
The recall data for each serial input posi- 
tion were combined across the remaining 
10 lists for each of the four major condi- 
tions and these are depicted in Figure 1. 
Analysis of variance on the two modalities, 
two suffix conditions, and 12 serial posi- 
tions showed that only the main effect of 
serial position and the Modality X Serial 
Position interaction were significant. The 
main effect of Serial Position resulted in 
F (11, 330) = 56.06, p < .001, and the 
Modality X Serial Position interaction re- 
sulted in F (11, 330) = 7.10, p < .001. 
Neither the Modality X Suffix nor Modal- 
ity X Position x Suffix interactions were 
significant, contrary to the predictions of 
the PAS interpretation of the modality 
effect. Both interactions resulted in F < 1. 
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The absence of a Modality main effe 
F (1, 30) = 1.36, p > .10, seemed t0 
due to the nearly perfect crossover of th 
auditory and visual groups, is cle 
shown in Figure 1. Therefore sepai 
analyses were carried out for Positions 14 
and 9-12. The analysis on the primacy 
items (1-4) resulted in significant Posi 
F (3, 90) = 32.38, p < .001, and Modali a 
F (1, 30) = 5.06, p < .01, main effect 
This indicates that, for the four primacy 
positions, the visual presentation was, 
fact, superior to the auditory presentation 

Of more importance to the present study 
was the analysis of the terminal positions 
(9-12). Auditory presentation was sū 
perior to visual presentation, / (1, 30) 
8.58, p < .001, showing the usual modality 
effect for the recency items. The Suffi 
and Serial Position main effects were also 
significant, with F (1, 30) = 5.05, p <.01, 
and F (3, 90) = 54.49, p < .001, resp 
tively, None of the interactions were sig- 
nificant including the predicted Modality 
X Suffix interaction which resulted, again, 
in F <1. This means that auditory pre- 
sentation was superior to visual presenta- 
tion for all four terminal positions and that 
the suffix did hinder recall for these termi- 
nal items. But jt also means that thei 
suffix was equally effective on these termi- 
nal items for both auditory and visual 
groups. 

A further post hoc analysis was con- 
ducted in an attempt to ascertain the locus 
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FIGURE 2, Mean output position as a function 
of serial position, modality (auditory—A, or visual 
—V) and presence (S) or absence (NS) of a suffix. 
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the rather surprising experimental re- 

Following the lead of Murdock and 

- (1969) the output order of the re- 

d items was analyzed. Figure 2 shows 

mean output position as à function of 

. recalled items input position. As can 

c seen from this Figure there are no major 
between-groups differences. 


Discussion 


Let us first dispense with the superiority of 
the visual groups to the auditory groups for 
the primacy items. As was mentioned in the 
introduction to this paper, this is an occasional 
result of modality experiments. Murdock 
(1964) used four or six paired-associates with 

of the pairs being tested for retention. 

r his six item lists he found superiority for 
the visual presentation on the primacy items. 
Craik (1969), among others, has obtained this 
result with free recall, i.e., better performance 
on positions 1-3 for visual presentation than 
auditory presentation. Jensen (1971) has also 
reported data in which visual presentation is 
superior to auditory presentation. He used 
the memory span procedure and presented 
digits either visually or auditorily. The Ss 
recalled either immediately after presentation 
or after a delay of 10 sec. filled with a re- 
hearsal preventative task. Jensen found that, 
with immediate recall, auditory presentation 
was slightly superior to visual presentation. 
However, the delayed recall results exhibited 
much better performance for the visual group 
than for the auditory group. 

A probable explanation is that visual su- 
periority in these situations is due to differ- 
ential cumulative rehearsal strategies. Waugh 
(1960) has suggested that digit series are re- 
hearsed cumulatively and Rundis and Atkin- 
son (1970) have demonstrated that Ss in a 
free recall task do rehearse in a cumulative 
manner. Corballis (1966) found that with 
digit strings presented visually Ss did cumula- 
tively rehearse but with auditorily presented 
strings there was very little cumulative re- 
hearsal. Many subjects report a kind of 
“echo box" phenomenon with auditory pre- 
sentation which would fit the PAS model. This 
quite likely leads the S in auditory groups to 
rehearse less or at least differently than the S 
in a visual group. The auditory S would be a 
great deal more passive and rely on the “echo 
box" while the visual S would need to rehearse 
This would explain the dif- 
This would 


in some manner. 
ferences for the primacy positions. 
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also explain Jensen's (1971) findings since the 
PAS ("echo box") would decay leaving the 
auditory S with very little remaining informa- 
tion. The visual 5, on the other hand, would 
be better off on delayed recall since he re- 
hearsed the items. Other possible support for 
the rehearsal differences notion come from 
the fact that in free recall situations, facilitated 
primacy for visual items is usually observed 
when the subject serves in only the visual 
group or only the auditory group, i.e., between- 
Ss designs, and is not observed in within-Ss 
designs. See Craik's (1969) two experiments 
as evidence for this informal observation. 

With respect to the analysis of the terminal 
positions, the prediction of the PAS explana- 
tion of modality effects seemed to be that the 
suffix should have eliminated any superiority 
for the auditory group over the visual group. 
The suffix did hinder recall but it did so for 
both the auditory and visual groups. And 
the suffix did not bring the auditory group 
down to the level of the visual nonsuffix group. 
This clearly does not support the PAS interpre- 
tation, One possibility for the contradiction 
between present findings and prior suffix pro- 
cedure results might be that prior experiments 
using the suffix have typically used serial recall 
whereas the present experiment made use of 
free recall. 


ZxPERIMENT II 

Since most previous experiments using 
the suffix procedure to study PAS have 
used serial recall, a second experiment was 
conducted to test the possibility that type 
of retention test contributed to the failure 
to support the PAS interpretation of mo- 
dality effects. 


Method 


The same materials and design were used as in 
Experiment I with the exception that 5s were in- 
structed to recall the item: in the order they were 
presented. The Ss were 32 introductory psychology 
students who served as part of a course requirement. 


Results 


Again, the first two lists in each session 
were discarded as practice lists. The serial 
position curves are shown in Figure 3. 
The overall analysis of variance resulted 
in significant effects of Serial Position, F 
(11, 330) = 18.28, p <.01, Modality X 
Suffix, F (1, 30) = 12.76, p < .01, and Mo- 
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Ficure 3. Mean serial recall as a function of 
serial position, modality (auditory—A, or visual 
—V) and presence (S) or absence (NS) of a suffix. 


dality X Serial Position, F (11, 330) = 
1.78, 5 «:05. Since the thrust of the 
study was aimed at the terminal positions, 
à further analysis was performed on Posi- 
tions 9-12, This analysis resulted in a 
significant effect of Modality, F (1, 30) = 
5.36, p < .05, showing that, for the termi- 
nal positions, auditory presentation was 
superior to visual presentation. The main 
effect of Suffix was not significant, F (1, 30) 
= 3.84, p > .05, but the Modality X Suffix 
interaction was significant, F (1, 30) = 
13.77, b <.01. Paired comparisons showed 
the auditory suffix group to be lower than 
the auditory nonsuffix group, / (30) = 2.40, 
b « .05, but still significantly higher than 
the visual nonsuffix group, ¢ (30) = 2.29, 
b < .05. So the suffix was effective, as the 
PAS interpretation predicts, in decreasing 
recall for auditorily presented terminal 
items. However, it still was not so effec- 
tive as to make the auditory group identi- 
cal to the visual group. 
Another point of intere 
tion of Figure 3 shows the 
for primacy items found in Experiment Í, 
to be absent in the present experiment, 
This supports the Observation that the 
visual supremacy over the Primacy posi- 
tions is usually obtained with free recall, 


St is that inspec- 
visual superiority 


Discussion 


The PAS interpretation of modality effects 
seemed to predict that the addition of a re- 
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dundant nonrecalled sufix to an auditoril 

presented list would result in recall curves 

much like those of a visually presented 

While the suffix did have derogatory effects i 

both Experiment I and I xperiment II itd 

not eliminate the superiority of auditory pre 
sentation over the recency positions, 

There are several probi. inherent in usi, 
the present data to arguc ut the contribu 
tion of PAS to the modalit jects. As previ- 
ously mentioned nearly a evious research 
on the PAS has made us serial recall of 
short lists of digits. The nt experiments 
used lists of words whic vere 
longer than the typical sı 
Even though the second ex; 
recall, there is a great diffe; 
recall of an 8 or 9-item list 
digits and that of a 12-iteu 
considerably less availabiii 
mantic content than the 10 «i; 

Nevertheless, the preca 
Store, as proposed by Crow; 
(1969), should not be affec 
bility of the items stored rein or their 
semantic content. Also, the modality effect 
is certainly found in nonordered recall tasks 
Therefore, these tasks should be amenable to: 
study, 

One alternative might be to say that the 
suffix is not completely effective in eliminating 
the trace of those items in PAS or that the 
PAS is of longer duration than is typically 
Suggested. Data pertinent to this point were 
reported by Murdock and Walker (1969). 
They randomly switched from auditory to 
visual within a single list and observ, sd audi- 
tory superiority over a full 10-item list, This 
represented five full seconds and if PAS 4s 
responsible for their findings it must be of, at 
least, 5-sec, duration, 

The Murdock and Walker (1969) study used 
words and free recall, as did the present re- - 
search. This raises the completely speculative. 
possibility that PAS is physical in nature but. 
its duration and capacity may depend on the 
type of material presented, i.e, phonemes, 
digits, words, etc., and the type of task being 
used. This, of course, suggests some strategic 
control over the PAS, 

One last word of caution should be men- 
tioned with regard to the decrease in perform- 
ance for the visual sufüx group on terminal 
items. There is no obvious reason why this 
could have happened and Crowder and Morton 
(1969) found no such effect of suffix on recall 


procedure 


st of words 
ind more se 


rical acoustic 
and Mortom 
by the availa 
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ı visually presented string of digits. It may, 


tcrefore, be a spurious result. 
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RECOGNITION TIME FOR WORDS IN SHORT-TERM, LONC 
OR BOTH MEMORY STORES ! 


RICHARD C. MOHS axp RICHARD C. ATKINSON ? 
Stanford. University 
The Ss memorized a long-term set (LT. 


recognition memory test. On each tri. 
or four words was given; on some trials either half or all of the S 


were also members of the LT-set. 


ST-set words had a smaller intercept. 
LT-set words or to distractors. 


A paradigm developed by Sternberg 
(1967, 1969) has been used extensively in 
recent years to investigate search processes 
in short-term memory (STM). In this 
paradigm Ss are given a series of trials; on 
each trial S's task is to decide as quickly 
as possible whether or not a test stimulus 
(probe) is a member of a previously memor- 
ized set of items (the positive set). In 
studies using small positive sets (six items 
or less) Sternberg typically finds that reac- 
tion time (RT) is a linear increasing func- 
tion of the size of the positive set and that 
the slopes for positive and negative re- 
sponse functions are nearly equal. To ac- 
count for these results Sternberg has pro- 
posed a theory of high speed scanning in 
memory. According to this theory the 
time between stimulus presentation and 
response execution is occupied by a series 
of stages including a stimulus encoding 
stage, an exhaustive serial comparison pro- 
cess, and a response selection stage. Be- 
cause the comparison process is assumed 
to be serial and exhaustive, this model 
predicts a linear increase in RT with the 
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-set) of 24 words before the start of a 


The Ss gave a positive response to test 
words that were: (a) only in the ST-set (new ST-set words), (b) only in the 
LT-set (LT-set words), or (c) in both sets 
responses were given to words not in either set (distractors), 
(RT) to both old and new ST-set words was a function of ST-set size; the RT 
functions for these two conditions had equal slopes but the function for old 
The ST-set size had no effect on RT to 
In contrast to the results usua 
using this paradigm, RT to distractors was less than RT to L1 
The results are interpreted in terms of a model which assumes that .Ss assess 
the “familiarity” of a test word before searching the memory sets. 


RM, 


a new short-term set (ST-set) of two 


-set words 


(old ST-set words). Negative 


Reaction time 


y obtained 
t words. 


size of the positive set and equal slopes for 
positive and negative response functions. 

Although the serial-exhaustive sca: ing 
model has been supported by a number of 
short-term. recognition experiments, data 
inconsistent with the predictions of the 
model have been reported by severa! in- 
vestigators (see Nickerson, 1972). In an 
attempt to account for some of thesc in- 
consistencies Atkinson and Juola (1973, 
1974) have proposed that Ss may, in some 
cases, be able to execute fast responses 
based on a test item’s “familiarity” without 
scanning the positive set. The familiarity 
model was originally formulated (Atkinson 
& Juola, 1973) to account for RT and error 
data from recognition memory studies using 
long, well-memorized lists of items stored 
in long-term memory (LTM); it has sub- 
sequently been applied both to short-term 
recognition studies and to studies involving 
searches of both STM and LTM (Atkinson, 
Herrmann, & Wescourt, 1974; Atkinson 
& Juola, 1974). The model assumes 
that test items (e.g., digits, words) are 
Tepresented in memory as nodes and that 
each node, when accessed, can yield an 
index of that item's familiarity. Familiar- 
ity values are assumed to range over a 
continuum of values with the familiarity. 
of a particular node being a function of the 
time since that node was last accessed 


830 


» 


RECOGNITION TIME FOR WORDS IN 


iasve to the number of times that node 

ias been accessed in the past. That is, a 
ingle presentation of an infrequent item 
ncreases that item's familiarity more than 
oes a single presentation of a frequently 
curring item. Subsets of the total set 
f test items are defined in terms of the 
frequency and recency of presentation of 
items in that subset; for each subset there 
is a density function which reflects the 
probability that a member of that subset 
will yield a particular familiarity value. 
It is usually assumed that all familiarity 
distributions are normal and have a com- 
mon variance but that the mean of a dis- 
tribution increases with frequency and 
recency of presentation. 

If the pool of test items used in a recog- 
nition memory test is large enough so that 
stimuli not in the positive set (distractors) 
are repeated infrequently, the mean famil- 
iarity value for items in the positive set 
(target items) may be considerably higher 
than the mean familiarity value for dis- 
tractor items. In such a situation Ss may 
adopt a strategy involving a check on each 
test item's familiarity value before the test 
item is scanned against the positive set. 
If the test item’s familiarity value falls 
above an upper criterion or below a lower 
criterion, S executes a positive or a nega- 
tive response, respectively, without a mem- 
ory scan, Only if the test item's famil- 
iarity value falls between these two criteria 
does S conduct a search of the positive set. 
The slope and intercept of the RT func- 
tion, then, are predicted to depend not only 
on the encoding, comparison, and response 
processes, but also on the proportion of 
trials on which Ss execute fast familiarity 
responses. This proportion, in turn, de- 
pends on the distributions of familiarity 
values for targe nd distractors and the 
position of the decision criteria relative to 
these familiarity distributions. 

The present experiment was designed to 
investigate recognition memory processes 
in a task in which the items to be recog- 
nized were stored in LTM, in STM, or in 
both memory stores. The Ss were given 
a long-term memory set (LT-set) of 24 
words to be memorized before the start of 
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the recognition test and were given an 
additional short-term memory set (ST-set) 
of 2 or 4 words at the start of each test 
trial. The Ss’ task was to give a positive 
response to test words from the LT-set or 
the current ST-set and to give a negative 
response to test words not in either set. 
Previous experiments (Mohs, Wescourt, & 
Atkinson, 1973; Wescourt & Atkinson, 
1973) using this paradigm have shown that 
RT to test items drawn from the ST-set 
is a function of ST-set size, but that RT 
to items drawn from the LT-set and to 
distractor test stimuli does not depend on 
ST-set size. The present experiment dif- 
fered from these previous experiments in 
that some of the words used in the ST-sets 
appeared on more than one trial and in 
more than one memory set; on some trials 
in the present experiment either all or half 
of the ST-set words were also members of 
the LT-set. Thus, there were three types 
of target words in this experiment: (a) 
words in the LT-set but not in the current 
ST-set, (b) words in the current. ST-set 
but not in the LT-set (new ST-set words), 
and (c) words in both memory sets (old 
ST-set words). 

Of primary concern in the present ex- 
periment was the relationship between the 
RT functions for positive responses to new 
and old ST-set words. For a given set of 
decision criteria the familiarity model pre- 
dicts that the proportion of correct fast 
familiarity responses for a subset of items 
from the positive set varies directly with 
the mean familiarity value of that subset. 
If words in both STM and LTM at the 
time of test have a higher mean familiarity 
than do words in STM alone, the model 
predicts both a smaller intercept and a 
smaller slope for old ST-set words than 
for new ST-set words; that is, the model 
predicts that Ss would conduct memory 
scans on fewer of the trials with old ST-set 
probes than on trials with new ST-set 
probes. 


METHOD 


Subjects. Eight female students at Stanford Uni- 
versity served as Ss. Each S was paid $5 for 
participating in two test sessions. 
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Stimuli. Words for the L'T-set, new ST-set words, 
and words used as distractor test stimuli were taken 
from the Thorndike-Lorge (1944) word list. All 
words were four to eight letters in length, had two 
syllables, and had frequency counts of more than 
20 per million, New ST-set words and distractor 
words were each used on only one trial in the 
entire experiment; that is, they were presented only 
once to each S. A single list of 24 words was used 
as the LT-set for all Ss, The first 2 and the last 2 
words on the list were the same for all 5s and were 
tested only on warm-up ‘trials. The middle 20 
words on the list were given in a different random 
order to each S. 

Ten blocks of 28 trials each were constructed. 

Two of the blocks served as practice blocks and 
were given at the beginning of the first test session; 
these two blocks were constructed in the same 
manner as were the eight experimental blocks. 
Within each block, ST-sets of two and four words 
appeared equally often, For each of the two ST-set 
sizes each block contained 4 trials on which the 
test word was a member of the ST-set: 1 trial on 
which the probe was a new ST-set word drawn from 
a pure ST-set (i.e., an ST-set containing only new 
words), 1 trial on which the probe was an old ST-set 
word drawn from a pure ST-set (i.e., an ST-set 
containing only old words), 1 trial on which the 
probe was a new ST-set word drawn from a mixed 
ST-set (ie, an ST-set containing half new 
half old words), and 1 trial on which the probe w 
an old ST-set word drawn from a mixed ST-set. 
For each ST-set size each block contained 3 trials 
on which the probe was a member of the LT-set 
but not a member of the ST-set: 1 trial on which 
the ST-set contained all new words, 1 trial on which 
the ST-set contained all old words, and 1 trial on 
which the ST-set was mixed. Negative test trials 
were constructed so that negative and positive 
responses were required equally often for each ST-set 
size and type (pure or mixed). Within each block 
the test trials were presented in a different random 
order, 

Within each block each of the middle 20 L'T-set 
words appeared either in an ST-set or as a test 
probe at least twice and not more than three times, 
Over all 10 blocks of trials each of the middle 20 
L'I-set words was tested twice as an old ST-set 

word and three times as a long-term memory item 
(i.e., on a trial when it was not a member of the 
t). No LT-set word was tested more than 
once in a single block. For mixed ST-sets the posi- 
tion of the old words was balanced so that they 
appeared equally often in each position for each 
trial type. Also, the serial position of test words 
drawn from ST-sets was counterbalanced across 
blocks for each trial type. 

Two additional blocks of six trials each were con- 
structed for use as warm-up trials. One block was 
given at the start of each session. These trials were 
similar to those used in the regular test blocks except 
that one of the first two and one of the last two 
LT-set words were tested in each warm-up block. 
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Apparatus. The ST-sets for each tri ere pre 
sented through the speaker system of a t ecorder, 
Each test stimulus was typed in ca | letters 
on a white 6 X 9 in. index card wit n IBM 
Executive Registry typewriter. The ili were 


presented in an lconix tachistoscope 
peared at the center of the visual fx 
tended a visual angle of about 2 degree 
target of four dots in the shape of a r: 
displayed at the center of the field for 
to the onset of the test stimulus. The 
dark between trials. 

On a table to the right of S three | 
were arranged along an arc, The S co 
forefinger on the center button between 
make short movements to the left or rig 
one of the two response buttons. Hal 


hey ap- 
id sub- 
fixation 
igle was 
c. prior 
lay was 


buttons 
est her 
ils and 
> strike 
the 58 


were assigned the right button as the j^ ive re- 
sponse and the left button as the negativ: ponse; 
the assignment was reversed for the oth: The 
S held in her left hand a button used nitiate 


display of the test stimulus on each trial 


Procedure, Each S was contacted by ne at 
least 18 hr. prior to the first test sessio The S 
was given the LT-set and was told to me: ze the 


list so that it could be recalled in correct s 
at the first test session. Test sessions w« 
uled for two successive days. 

All Ss satisfied a learning criterion for th 
by giving both written and oral recalls of th 
in correct serial order at the start of the «t test 
session. After completing both recalls correctly $ 
was seated at the tachistoscope and given instruc- 
tions about the task. The Æ explained that the 
experiment involved a series of test trials and that 
the following sequence would be followed on each 
trial: (a) the E would start each trial by saying 
"ready"; (b) the E would turn on the tape recorder 
and 5 would hear an ST-set of either two or four 
words; (c) the S would then repeat the ST-set 
aloud; (d) when S was ready she was to push the 
start button in her left hand; (e) this would cause 
à fixation box to appear in the tachistoscope for .5 
sec., and then a test word would appear; (/) the 
lest word would remain visible until § made a 
positive or a negative response by pressing the ap- 

priate button; (g) the S would then repeat the 
-set aloud once more (this was done to ensure 
that old ST-set words were maintained in STM as 
well as LTM during the recognition test); and (h) 
the E would record the RT and a new trial would 
begin. The E then explained fully the nature of the 
ST-sets; the S was told that some of the ST-sets 
would be all new words, that some would contain 
only words already in the LT-set, and that some 
would contain half new and half old words. The S 
was not told that only the middle 20 words of the 
LT-set would be tested during regular test blocks. 
The S was told that the positive set on each trial 
consisted of the words in the LT-set, words presente 
from the tape recorder on that trial, and words in 
both the ST-set and the LT-set. It was explained 
that new ST-set words and distractor words would 
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n only one trial in the entire experiment. 

instructions emphasized that S was to respond 

dly as possible while trying to avoid errors. 

i was sure that S understood the instructions, 

frst warm-up block was given. After the 

uin-up block S was asked if she had any questions. 

Vhen S’s questions, if any, were answered, the two 

sractice blocks and three test blocks were given. 

during testing Ss were informed whenever they had 

nade an error. The time between trials was about 
10 sec. 

At the start of the second day's session S was 
again required to give both a written and an oral 
recall of the LT-set, The instructions were then 
-eviewed and S was given another warm-up block 
and five regular test blocks. Each session lasted 
about 1 hr. 


RusULTS AND DISCUSSION 


The RT data from eight test blocks, the 
last three given in the first session and the 
five given in the second session, were in- 
cluded in the analysis. Data from trials 
on which an incorrect response was made 
were excluded from the analysis. 

A preliminary analysis of the RT data 
for positive responses to words in the ST- 
set suggested that ST-set type (pure vs. 
mixed) had no effect on RT. This variable 
was included in the experiment in order to 
determine whether the familiarity value of 
a test word from the ST-set was affected 
by the familiarity of other ST-set words. 
An analysis of variance using S means as 
scores was performed with ST-set type, 
ST-set size (two or four words), and probe 
type (new or old) as factors. The analysis 
showed that the main effect of ST-set type 
and all interactions involving ST-set type 
were not significant, F (1, 7) <1 in all 
cases. 

Since ST-set type clearly had no effect 
on RT, the data for pure and mixed ST- 
sets were pooled. Table 1 presents a sum- 
mary of the pooled data. Trials on which 
Ss gave a positive response to a new ST-set 
word are labeled Y(ST-new); trials on 
which Ss gave a positive response to am 
old ST-set word are labeled Y (ST-old) ; 
trials on which Ss gave à positive response 
to an LT-set word not in the ST-set are 
labeled Y (LT); and trials on which Ss 
gave a negative response to à distractor 
word are labeled N(D). In the second 
column of Table 1 are the two ST-set sizes 
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TABLE 1 


NUMBER OF OBSERVATIONS, REACTION TIME MEANS, 
STANDARD DEVIATIONS, AND ERROR RATES FOR 
ALL TRIAL TYPES 


gent ST-set ac a Sp. | Pee 
Condition size RAM (msec.) | (msec.) Er 
rese rcm AE 
vera | 2 | BE] BS | | 3 
van | 2 | 192 | M | as | 86 
w [21] [|S 


for each condition, and in the third column 
is the total number of observations made 
for all Ss over eight test blocks. Using the 
raw data collapsed over ST-set types, the 
mean RT for each of the eight trial types 
was computed for each S. Outlying scores 
for each S were eliminated from computa- 
tions by the following method: (a) for 
each trial type, the mean was calculated, 
(b) each score that was more than 1.5 
times the mean was deleted, and (c) the 
mean was recalculated from the remaining 
scores. This procedure eliminated about 
5% of the scores. The mean and standard 
deviation of S means for each of the trial 
types are presented in the fourth and fifth 
columns of Table 1. The las column pre- 
sents the error rate for each trial type. 

A graph of the means from Table 1 is 
presented in Figure 1. Lines for the Y (LT) 
and N(D) conditions were drawn at the 
mean RT for the two ST-set sizes while 
the lines for the Y (ST-new) and Y (ST-old) 
conditions connect the two points for each 
condition. An equation for each line is 
given in the figure. 

A comparison of the RT functions for 
the Y(ST-old) and Y (ST-new) conditions 
presented in Figure 1 indicates that the 
intercept for the Y (ST-old) condition is 
slightly smaller than the intercept for the 
Y(ST-new) condition but that the slopes 
for the two conditions are nearly identical. 
A two-factor analysis of variance using 
individual S means for these two condi- 
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nu stimuli even when response fre. cney 
[ VitT) «769 controlled. An encoding differ coul 
Snes Sage have resulted, then, because « ST-sel 
750 L SES words were each used as probes sev 
trials while new ST-set words e each 
presented only once in the entire « iment, 
g The fact that no evidence wa ind in- 
= 700 is dicating a greater mean familia: for old 
= ST-set words than for new SI words 
Z suggests that a limit on the rang: of mean 
5 650 [at vier Nei BR E nla yee Fis ee inclu: in 
a t arity model. hat is, the stribu 
3 tion of familiarity values for it is curs 

rently being maintained in ST^! has a 

YIST-OLD) = 572 + 16.0 M : 

600 |= mean that is as great, or nearly great, 
as the mean familiarity value for any set 
of items in LTM, STM, or both stores. 

Paired £ tests using individual 5 means 

o as scores indicated that ST-set size had no 

Se LSE M) effect on RT for either the Y(LT) condi- 
FIGURE 1. Mean reaction time vs. short-term tion, ¢ (7) < 1, or the N(D) condition, 2 
(ST) set for all conditions. (See text for (7) < 1, Previous experiments using & 


explanation of conditi uer à ES 
piena Non of conditions.) similar paradigm (Mohs et al., 1973; Wes- 


court & Atkinson, 1973) have also found 
tions as scores showed that the effect of that ST-set had an effect on RT to 
ST-set size on RT was significant, F (1, 7) probes from the ST-set but had no effect 
= 10.7, p < .025, that the effect of probe On RT to probes from the LT-sct or to 
type was marginally significant, F (1, 7) distractor test stimuli. A model that 
= 4.1, .05 < p < .10, and that the inter- accounts for this basic result by assuming’ 
action between these two variables was not that Ss search STM and LTM in parallel 
significant, F (1, 7) « 1. The lack of a rather than sequentially is presented in 
significant interaction between set size and both of the cited papers. 
probe type suggests that the difference in The present results are unique, however, 
RT between new and old ST-set words in that the N(D) condition had a smaller 
was not due to a difference in familiarity; RT than did the Y (LT) condition. In both 
a greater mean familiarity for old ST-set of the previous studies of this type (Mohs 
words would have produced both a smaller et al, 1973; Wescourt & Atkinson, 1973) 
slope and a smaller intercept for the RT to distractor words was over 40 msec. 
Y(ST-old) function. It is possible, how- greater than RT to LT-set words. The 
ever, that the intercept difference shown familiarity model can account for this re- 
in Figure 1 reflects the fact that old ST-set versal if it is assumed that Ss in the present 
words were encoded more quickly than experiment set upper and lower criteria in 
were new ST-set words. That recognition such a way that a high proportion of LT- 
involves a stimulus encoding Process which set words had familiarity values between 
is separate from the comparison process the criteria and so that a high proportion 
has been demonstrated by Sternberg (1967). of the distractor words had familiarity 
Furthermore, this interpretation of the in- values below the lower criterion. The 
tercept difference is consistent with the reason for Ss’ positioning the decision cri- 
results of previous experiments in which teria in such a manner in the present ex- 
familiarity could play no role (Kreuger, periment may be the following: Some Ss 
Ue Bee x DO Showing reported that they often decided whether 
j € frequent test a test word was a member of the ST-sety 
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L'f-set, or both memory sets before 
ing a positive response. By positioning 
ss decision criteria in the manner de- 
‚ed above, Ss could ensure that memory 
wches would be conducted on a high 
voportion of the positive trials and that 
high proportion of the distractor words 
ould be correctly rejected on the basis of 
amiliarity. 

The error data shown in Table 1 provide 
irther support for this explanation of the 
oversal. The familiarity model predicts 
hat errors occur when the familiarity value 

for a member of the positive set falls below 
the tower criterion or when the familiarity 
value for a distractor falls above the upper 
criterion. We are assuming here three 
;verlapping distributions of familiarity 
values, each normal and with common 
variance; we assume that the distribution 
for distractors had a low mean, that the 
distribution for ST-set words had a high 
mean, and that the mean for LT-set words 
was intermediate. If most of the famili- 

^ arity values for LT-set words were between 
the criteria, few distractors would have 
familiarity values above the upper cri- 
terion and few ST-set words would have 
familiarity values below the lower criterion. 
Given these criteria, then, the model pre- 
dicts lower error rates for ST-set words 
and distractors than for LT-set words. 
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PICTORIAL ELABORATION AND RECALL OF 
MULTILIST PAIRED ASSOCIATES ! 


EDWARD J. FORBES? anp HAYNE W. REESE 


West Virginia University 


In learning multiple paired-associate lists, all with the same stimuli, Ss may 
form a new image for each pair or may progressively elaborate the image for 
a List 1 pair by incorporating each new response item into the image. In the 
latter, imagery and response-repetition effects are confounded. Analogues of 
these imagery types were manipulated by imposed pictorial elaboration in 4 
paired-associate lists, with college Ss. A control for response repetition was 
included. The elaboration types were found to be equally superior to their 
controls, for both serial and free recall, and for both immediate and 1-w. 
delayed testing. Thus, elaboration effects are not lost in long-term retention, 
serial information is not lost in progressive elaboration, and response repetition 
and elaboration have separable effects. 


It is now well established that nonverbal 
mnemonics, or images, facilitate the learn- 
ing and short-term retention of a paired- 
associate list when the images represent 
interactions between the referents of the 
stimulus and response items (see review by 
Paivio, 1971). However, there is contra- 
dictory evidence about whether long-term 
retention is facilitated by these interactive 
images (e.g., Palermo, 1970). 

In multilist paired-associate tasks in 
which the same stimuli appear with new 
responses in the different lists, there is some 
evidence that interactive images facilitate 
performance, perhaps by reducing associa- 
tive interference between lists (e.g., Bower 
& Reitman, 1972; see brief review by 
Reese, 1972), One mnemonic strategy that 
Ss sometimes report (Bugelski, 1968) is to 
elaborate a progressively more complex 
image, or “grand imaginal scene" (Bower, 
1972), for a given stimulus by incorporating 
into the original interactive image each new 
response to be associated with that stimu- 
lus in the successive lists. Bower (1970) 
had suggested that information about serial 
order might be lost in these built-up im- 
ages; but Bower and Reitman (1972) ob- 
tained contradictory evidence. 

Bower and Reitman compared two 

1A report of the study was given orally at the 


meeting of the Eastern Psychological Association, 
Boston, April 1972. 


? Requests for reprints should be sent to Edward 
n Forbes, Department of Psychology, West Virginia 
University, Morgantown, West Virginia 26506. 
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groups, one given progressive ela! vation 
instructions and the other instri «d to 


form an interactive image for e pair 
and to replace it with a new image + each 
new list. Free and serial recall werc vetter 
in the progressive-elaboration group than 
in the single-response elaboration :roup« 
Furthermore, the effects were obtaiscd in 
both short-term and long-term (i wk) 


retention. 

With progressive elaboration there is 
repetition of old-list responses in each new- 
list elaboration of the image. This repeti- 
tion could be the source of the superiority 
of progressive elaboration, It would facili- 
tate free recall by increasing the number 
of learning “trials,” and it could facilitate 
serial recall by giving the response items 
different "frequencies" in Underwood's 
(1969) sense, as Reese (1970) suggested. 
However, if repetition is the source of facili- 
tation, it should be equally effective without 
the context of imagery. A problem, then, 
is to determine whether repetition per se is 
the effective variable in progressive elabora- 
tion or whether the interactive imagery 
involved provides further facilitation. In 
the Bower and Reitman (1972) study, there 
were no control conditions in which imagery 
instructions were omitted, and conse- 
quently there was no way to separate the 
repetition and imagery effects. 

To control for effects of repetition, Reese 
(1972) used imposed pictorial materials 
generating analogues of the imagery and 
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trol conditions. The analogue of giving 
ery instructions is to show, for each 
picture of the stimulus and response 
interacting; the analogue of omitting 
imagery instructions is to show the 
imulus and response items side by side 

d not interacting. In both kinds of con- 

| condition, S might spontaneously 

vm interactive images, but the proba- 

lity of interactive images is assumed to 

e reduced. It was found in a retention 
st that progressive elaboration was su- 

‘rior to single-response elaboration, as in 
ihe Bower and Reitman (1972) study; but 
with response repetition controlled, reten- 
tion was the same in the two kinds of 
elaboration condition, and in fact neither 
was superior to its comparable control con- 
dition, contrary to the implications of the 
Bower and Reitman study. However, 
Reese tested recognition memory instead of 
recall, and the recognition test was so con- 
structed that serial retention could not be 
tested. (In addition, his Ss were young 
children.) 

'The present study can be considered to 
be an extension of the Bower and Reitman 
study, with the response repetition. con- 
trols included (although this study was 
actually designed and run before the Bower 
and Reitman study was published). Be- 
cause of the nature of the control condi- 
tions, it was decided to use imposed pic- 
torial materials, as in the Reese (1972) 
study, but with recall instead of recogni- 
tion tests. Free and serial recall were 
tested immediately and again after a one- 
week delay. 


METHOD 


Design. The experimental manipulation involved 
the presence or absence of imposed pictorial elabora- 
tion. There were four conditions, each with four 
lis! (a) In a single-response elaboration condition, 
the stimulus and response items in a given pair in a 
given list were depicted as inte ng in some way 
(sce Reese, 1972, for details). The "scene" was 
shown only in the list in which it was appropriate. 
(b) In a “Standard” control condition, the stimulus 
and response items in a pair were depicted side by 
side and not interacting in any way. A given pair 
was shown only in the list in which it was ap- 
propriate. This condition is comparable to the 
single-response elaboration condition, in that both 
involved presentation of only the currently appro- 


837 


priate response items in each list. (c) In a progres- 
sive-elaboration condition, each new response item 
was added to the scene from the preceding list to 
form a progressively more complex scene that finally 
incorporated all four of the appropriate response 
items, The interaction between a response item 
and the stimulus item was exactly the same as in the 
single-response elaboration condition. (d) In a 
“row” control condition, the stimulus and response 
items in a pair in List 1 were presented side by side 
and not interacting, exactly as in List 1 for the 
"standard" control condition. In each successive 
list, however, each new response item was added to 
the row without deleting the response items already 
in the row. This condition is comparable to the 
progressive-elaboration condition, in that both in- 
volved presentation of the currently appropriate 
response items in each list together with all previ- 
ously appropriate response items, thus controlling 
the amount of sheer repetition of the response items. 

"The four conditions form a two-by-two factorial 
matrix with the dimensions: (a) Picture Treatment 
—imposed vs. no imposed elaboration—and (b) 
Response Treatment—response items repeated vs. 
not repeated in subsequent lists, In addition, the 
design included two other variables of experimental 
concern, both manipulated within Ss: (c) Test— 
free vs. serial recall—and (d) List. A fifth variable 
included in the analyses was (e) Order of Tests, re- 
flecting counterbalancing of the free and serial recall 
tests. The data from a sixth variable—immediate 
and delayed testing—were analyzed separately. 

Subjects. The Ss were 48 students in sections 
an introductory psychology course at West Virginia 
University, who volunteered to servi Ss for extra 
credit in the course. They were run in groups of 2 
to 11 persons. These groups were assigned to con- 
ditions in such a way that time of testing was varied, 
the sex distribution was roughly balanced, and inso- 
far as possible there was rotation through the con- 
ditions, roughly equating control or experimental 
conditions in number of Ss before repeating the 
rotation. 

Materials. The materials were identical to those 
used and described by Reese (1972). They were 
line drawings of animals, used as stimuli, and of 
common objects, used as responses. They were 
divided into four five-pair lists, all with the same 
stimulus items and all with different response items. 
The pairings were random, except that obviously 
high associates were not paired. The materials 
were photographed for slide projection as positives, 
Projection time was controlled by Hunter decade 
timers. 

Procedure. ‘The Ss were run in subgroups, in each 
of which all Ss served in the same condition, but 
roughly half were given one sequence of the free 
and serial recall tests and the others were given the 
reverse sequence. 

A study-test procedure was used, in which eight 
study trials and two test trials were given. Each 
list was presented twice for study, one pair at a 
time, before the next list was presented, The order 
of the pairs was changed in a predetermined, ran- 
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dom way for each trial, with the restriction that 
across the eight study trials (two per list, four lists) 
the mean within-list positions of the stimuli were 
about the same for all stimuli. The presentation 
rate was 4 sec. per pair. 

The test trials were given immediately after the 
eighth study trial. Free recall was tested in one 
of these trials and serial recall in the other. Half 
of the Ss had the free-recall test first and the others 
had the serial-recall test first. The answer sheet 
for the free-recall test included a list of word labels 
of the stimulus items in the same predetermined 
random sequence for all Ss, and beside each word 
was a line 5 in. long (12.7 cm). The instructions, 
printed at the top of the sheet, were to write down 
the names of the response items in any order, The 
answer sheet for the serial-recall test included a list 
of the words, in a different predetermined random 
sequence from that in the free-recall test to reduce 
transfer effects, and beside each word were four 
short lines. The instructions were to write down 
the names of the response items in the original order 
of appearance, The duration of each test trial was 
3 min. Guessing was explicitly encouraged in the 
instructions for each test. 

Seven days after the initial session, the Ss were 
given the two test trials again, in the same sequence 
as in their initial test and with no further study 
presentations, The Ss had not been told that these 
retests would be given. The Ss were tested in groups 
of 15 to 20 in their classroom (these groups included 
Ss participating in another study). 


RESULTS 


Before the detailed findings are pre- 
sented, it is useful to note that the Bower 
and Reitman (1972) findings were broadly 
replicated. In both free and serial recall, 
tested immediately and after the delay, the 
progressive-elaboration group was superior 
to the single-response elaboration group 
(all ps < .01) However, when these 
groups were compared with their respec- 
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tive control groups, the two type: | elabo- 
ration were found to be almos: exactly 
equal in overall effectiveness, as letailed 
below. 

Immediate tests. Figure 1 show e mean 
numbers of correct responses p. list in 
the elaboration and control group. ‘or free 
and serial recall. It can be seen t, that 
both kinds of elaboration were f tative, 
relative to the control conditions. in con- 
firmation, the main effect of pic treat- 
ment was significant, F (1, 40 15.42, 
p < .001; and although picture © tment 
interacted significantly with list, / 3, 120) 
= 3.08, p < .05, follow-up f tests © icated 
that the difference was significa: in all 
lists except the first one (List 1,/ — 1.19; 
all other /s > 3.00, p < .01; the s are 
Cochran-Cox approximations, wii error 
terms derived from the overall an. vsis of 


variance). Second, within each response- 
treatment condition it can be seen that free 
recall was better than serial rec but 
that the relative effect of elaboration. (ie, 
the elaboration vs. control difference) was 
about the same for serial recall as {or free 


recall. In confirmation, picture treatment 
and test did not interact with each other 
(F < 1.00) nor jointly with any other 
variable (largest F = 1.85, for Picture 
"Treatment X Test X Response Treatment 
X Order of Tests, df = 1, 40). Finally, it 
can be seen that the relative effects of 
elaboration on free and serial recall were 
about as strong in the repeated-response 
condition as in the single-response condi- 
tion. In confirmation, picture treatment, 
test, and response treatment did not inter- 
act (all Fs involving all three variables < 
1.00 except that F = 1.85 for the four-way 
interaction mentioned above). These re- 
sults show that elaboration facilitated per- 
formance, and affected serial and free recall 
in the same way. Furthermore, single- 
response elaboration was as facilitative as 
progressive elaboration when the effect of 
response repetition was controlled. 

The source of facilitation in the elabora- 
tion groups seems to have been a reduction 
in interlist interference. It has already 
been noted that the elaboration and con- 
trol groups did not differ significantly in 
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List 1, and examination of Figure 1 shows 


that the curves of the elaboration groups 
were ‘latter than those of the control 
groups. The Picture Treatment X List in- 
teraction was significant, as already men- 


tioned, and follow-up / tests confirmed the 
apparent trends. In the combined elabora- 
tion groups, List 1 performance was sig- 
nificantly superior to List 3 and 4 perform- 
ance, and no other differences between 
lists were significant. - Thus, performance 
dropped after List 1 but immediately 
leveledoff. In thecombined control groups, 
performance was significantly better in List 
1 than in all other lists, and significantly 
better in List 2 than in List 3. Thus, per- 
formance dropped continuously through 
the first three lists before leveling off. 

It can also be seen in Figure 1 that typi- 
cal serial position curves were obtained in 
the single-response groups and that there 
was a marked primacy effect and no recency 
effect in the repeated-response groups. 
These trends were confirmed by 4 tests, 
which were justified by a significant inter- 
action between response treatment and 
list, F (3, 120) = 10.89, p < .001. It is, 
of course, not diffi 


icult to explain the pri- 
macy effect in the repeated-response con- 
ditions. The effect, which has also been 
obtained in a recognition test (Reese, 1972), 
presumably resulted [rom the extra practice 
provided when the previously relevant 
responses were repeated. 

The only other significant effects are of 
no systematic interest : the main effects of 
response treatment (of no interest because 
of the interaction with list), test, order of 
tests, and list; and the interactions of test 
with order of tests and with list. 

Delayed tests. The data from the delayed 
tests are summarized in Figure 2. (Group 
means were substituted for the data of one 
S who missed the delayed tests, with appro- 
priate adjustment of df in the analyses of 
variance.) Overall recall was better in the 
elaboration conditions than in the control 
conditions, F (1, 39) = 10.98, p < .005, 
but the apparent effect of response treat- 
ment was not significant, F (1, 39) — 3.72, 
p = 0f. 

The only other significant effects were 
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Ficurr 2. Recall in delayed tests. 

the main effects of test and list (ps < .001) 
and their interaction, F (3, 117) = 3.71, p 
< 025. Follow-up / tests showed that the 
recency effect disappeared but the primacy 
effect remained, and was stronger for free 
recall than for serial recall. The disap- 
pearance of the recency effect wiped out 
the effect of response treatment on the 
shape of the retention curves. The effect 
of elaboration on the shape of the curves 
did not approach significance (F = 1912) 

The superiority of the elaboration condi- 
tions on the delayed tests is ambiguous 
because of the differences in immediate 
recall; more was learned, therefore more 
wasretained. To determine whether elabo- 
ration affected long-term retention inde- 
pendently of amount learned, the amount 
of loss was analyzed. (The conditional 
probability of a correct response in a 
delayed test, given a correct response in 
the analogous immediate test, could also 
be used, but because of reminiscence effects 
it should underestimate retention.) 

The only significant effect involving pic- 
ture treatment was the triple interaction 
with list and order of tests, F (3, 117) 
3.58, p < .025. Of the eight elaboration 
vs. control differences involved in this in- 
teraction, four favored the elaboration 
group and four the control group, but there 
was no obvious pattern, overall, and only 
one difference was significant, favoring the 
elaboration group in List 1 on the test 
given second. Thus, in general, loss was 
not reliably related to picture treatment, 
indicating that amount lost was generally 
unrelated to amount originally learned. 
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The response treatment by list interac- 
tion was significant, F (3, 117) — 6.95, 
p «.001. Follow-up £ tests showed that 
in the single-response conditions the most 
loss was in the recency effect, as theoretical 
explanations of this effect would predict; 
and in the repeated-response conditions the 
most loss was in the primacy effect. 

'The only other significant effects were 
order of tests, test, and test by list. 


Discussion 


Short-term retention. The results show 
that response repetition facilitates learning of 
earlier-list responses, presumably because of 
the extra practice available. Response repeti- 
tion in the context of elaboration provides more 
facilitation than response repetition without 
elaboration, but no more than is provided by 
single-response elaboration (presumably with- 
out response repetition). By analogy, it can be 
inferred that built-up images facilitate recall 
partly because of the effect of interactive 
imagery on retention and partly because of the 
response-repetition effect on learning. These 
effects are additive, and equally strong for 
Íree and serial recall. 

It was expected that free recall would be 
facilitated more by progressive elaboration 
than by single-response elaboration. However, 
they produced equal facilitation, when response 
repetition was controlled, by equally reducing 
interlist interference. By analogy, the im- 
agery component of built-up images functions 
like that of single-response interactive images, 
and the function is to reduce interference. 
Thus, interactive imagery per se may produce 
à basic amount of facilitation, regardless of the 
complexity of the image (see also Bower, 1972). 

An alternative interpretation is possible. 
Recoding of the stimulus items may have 
occurred in the single-response elaboration 
condition (see Keppel & Zavortink, 1969) but 
not in the progressive-elaboration condition. 
If so, then the cause of facilitation in the 
former condition would be reduced stimulus 
similarity (converting A-B A-C to A-B D-C), 
and the cause of facilitation in the progressive- 
elaboration condition might be chunking of the 
materials (see, e.g., Horowitz, Lampel, & 
Takanishi, 1969; Taylor, Josberger, & Prentice, 
1970). It is possible, in other words, that the 
finding of equal facilitation for the two kinds 
of elaboration was fortuitous and does not 
reflect an underlying single source of facilita- 
tion. However, these particular different 
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sources of facilitation do not accou for the 
finding that progressive elaboration ^ l single- 
response elaboration equally facilii. «d serial 
recall as well as free recall. It ther: ' seems 
more reasonable to posit a singl urce of 
facilitation in the two elaboration ditions, 
presumably an interactive imagery ponent, 

Why it works is not clear, but it ears to 

function similarly regardless of the plexity 

of elaboration. 

Long-term retention. For long-t reten- 
tion, neither kind of elaboration pro ed any 
facilitation. Elaboration effects ap; ‘d, but 
if elaboration had facilitated long-t: reten- 
tion, there would have been less /^ in the 
elaboration conditions than in the co: ol con- 
ditions. Thus, elaboration did no: prove 
long-term retention; but neither did it: erfere. 
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Two hypotheses regarding the recall of 
agery were investigated in a cued-recall 
ceptual-peg hypothesis for paired-assoc 
creteness of the cue word 
while a constructive approach to image 


context used to construct the image would be the essential element. 
Ss formed images for 20 noun triplets with corresponding 
later presented with one member of each triplet and 
Results indicated that the elicitation of the context was the 
concretenes 


two members. 
single factor determining recall, while 


Minnesota 


noun triplets through the use of im- 
study. An extension of Paivio's con- 
iate learning predicted that the con- 


would be the critical factor in determining recall, 


ry predicted that the retrieval of the 
Sixty-four 
contexts and were 
asked to recall the other 


had no consistent effect. 


Implications of the results for the processes involved in remembering through 


imaging are discussed. 


Imagery has been shown to be an ex- 
tremely potent variable in many types of 
learning and memory tasks (see Bower, 
1972, and Paivio, 1971, for reviews). 
While not a greal deal is known about the 
mechanism by which imagery works, 
several hypotheses have been offered to 
explain its effects. For example, to ac- 
count for the facilitative effect of imagery 
on paired-associate learning, Paivio (1963, 
1965) has offered a conceptual-peg hy- 
pothesis. This hypothesis states that when 
two words are presented together on a 
study trial, S forms a compound image of 
the referents of these words, such that 
when one of the words is subsequently pre- 
sented to S on a test trial, it redintegrates 
the compound image from which the re- 
sponse term is decoded. 

From this hypothesis it follows that recall 
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should depend upon the imagery value (J) 
of the members of the pair and, in particu- 
lar, upon the J of the stimlus term. The 
reasoning for this is that on recall trials, 
stimulus terms rated high in Z will be more 
likely to redintegrate the compound image 
than those rated low in Z. A substantial 
amount of research has been generated by 
this hypothesis and, almost without ex- 
ception, has supported the basic position 
(e.g., Paivio, 1965, 1969, 1971). 

Alternatively, if imagery is viewed as 
a constructive process (Bartlett, 1932; 
Neisser, 1967), the conceptual-peg hypoth- 
esis no longer appears relevant because 
images are not thought to be "stored" and 
"retrieved" assuch. Instead, an important 
component of the constructive approach to 
imagery is the setting or context used to 
construct the image, and the critical ele- 
ment at the time of recall is the elicitation 
of this context from which the image can 
be reconstructed. In other words, a con- 
crete word would be effective as a recall 
cue only insofar as it elicited the context 
in which the image was originally con- 
structed. ‘There is nothing inherent in the 
concreteness itself that makes it particu- 
larly effective, and, in fact, an abstract 
cue word could be more effective than a 
concrete word as a recall cue if it were 
more likely to elicit the context. 

The present experiment was designed to 
evaluate these two views of imaging. 
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TABLE 1 


SAMPLE Ser or Items as THEY APPEARED TO Ss 
IN THE ASSOCIATIVE RATING Task 


Cue-context. Responses (high-low) 


BnipE-WEDDING 
GLOVE-WEDDING 
BEAUTY-WEDDING 
CANDLE-WEDDING 
LovE-WEDDING 
Vows-WEDDING 


yop aee ie ue 
DIOS 
pre 


Word triplets were used as the materials 
to be remembered, and each triplet was 
provided with a context in which the words 
could be readily imaged. The Ss' task 
was to construct an image for the three 
words in the setting provided by the con- 
text word, such that, when later presented 
with one of the items in the triplet, they 
would be able to recall the other two. The 
conceptual-peg hypothesis predicts that 
the concreteness of the cue word is the 
critical element for determining recall; 
whereas the constructive view contends 
that the essential factor for recall is the 
capacity of the cue word to elicit the con- 
text in which the image was constructed. 
These two variables were factorially manip- 
ulated to determine which is prepotent. 


PRELIMINARY STUDY 


The primary variables in this study in- 
volved scaled attributes of the cue words. 
For example, suppose Ss were to form an 
image for the words, DREss, RING, and 
CANDLE, in the context of WEDDING. |f 
CANDLE were to be the cue word for recall, 
it would be essential to know the Z of 
CANDLE and the likelihood with which it 
would elicit the context, WEDDING, at the 
time of recall. Therefore, imagery and 
associative norms were compiled on the 
materials. The noun triplets were selected 
by E such that the three words were ob- 
viously related to a common setting or 
context. Two of the items were common to 
all experimental conditions and were con- 
crete nouns which could be readily imaged 
in the contexts. The third member of the 
triplet, the cue word, varied on the two 
scaled dimensions of imagery and associa- 
tion value. The latter dimension assessed 
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the capacity of the cue word licit its 


context. To scale the cue w« or each 
context, several candidates for | »ur-fold 
classification of concrete-abstra: d high- 


low association value of cue w to con- 


text were selected by Æ for th ; norm- 
ing exercises, 
Associative Rating Ta 

A particular type of associat orming 
procedure was developed to 'ss the 
degree to which the cue words v. ` id elicit 
the context words in this exper it. E 
was imperative that the measur: ct the 
likelihood with which a particular <e word 
(one of the members of the trip! would 
elicit the particular context word sed for 
the formation of the image of th: :riplet. 
In this sense, the present norming sk was 
developed to reflect the processes | -oposed 
to take place at the time of recall. A list 
of potential stimulus terms was sclected, 
and each term was paired with its context 


for presentation to a group of Ss. 


Method 


Subjects. The Ss were 36 male and 40 female 
volunteers from an introductory psychology course 
at the University of Minnesota. Each 5 received 
course credit for participating. 

Procedure. The associative rating task was con- 
ducted in one session. Table 1 shows the format for 
the exercise in which several pairs of words (cue 
words with contexts) were grouped under their 
appropriate contexts. The Ss were told to in- 
terpret the pairs from the frame of reference of the 
context word which preceded each group of pairs. 
The Ss’ task was to estimate for each pair what 
proportion of a large group of people would respond 
with the right-hand member of each pair (context) 
when presented the left member (cue word). The 
5s made their responses by circling a number from 
1 through 4, where 1 indicated that a low proportion 
and 4 indicated that a high proportion of the popu- 
lation would give the member on the right when 
presented with the member on the left. Each S was 
given a booklet with 90 pairs grouped under 22 
appropriate contexts. The Ss were given an explana- 
tion of free-association procedures as well as what 
was required of them in this task. They were tol 
to work rapidly but carefully and were allowed to - 
proceed at their own pace. 


Imagery Rating Task 
Method 


. Subjects. Forty males and 60 females from an 
introductory psychology class at the University 9! 


p~ 


— 


mu 
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volunteered to participate in this 
Each S received course credit for 
ng. 
c. The left members of the pairs used 
.sociative norming exercise were presented 
n imagery rating task similar to that of 
», Yuille, and Madigan (1968). Each word 
red beside a 7-point rating scale, with 7 
wring the high end of the scale and 1 anchoring 
»w end. The Ss were given imagery rating 
ictions similar to those of Paivio et al. and 
allowed to work at their own pace. 


ined. Resulls 


he two norming tasks yielded two 
values for each cue word, (a) a mean 
ion value between itself and its con- 
ind (b) a mean imagery rating. For 
^ context, the cue word candidates were 
oss-classified on the association value and 
magery dimensions to obtain an item in 
each cell of the four-fold classification. 
The members of the classification were de- 
termined by selecting words from the ex- 
tremes of the two dimensions. For the 
association value dimension, à word was 
classified as high if it had a mean of 
approximately three or better and low if 
its mean was two or less. On the imagery 
dimension, words with a mean of six or 
better constituted concrete words, while 
words with a mean of four or less con- 
stituted abstract words. Of the 22 con- 
texts appearing in the norming exercise, 
13 yielded candidates fulfilling these cross- 
classification requirements. Table 2 dis- 
plays the means in a four-fold array for the 
13 target triplets. 


MAIN EXPERIMENT 


Method 


Subjects. ‘Thirty-two males and 32 females from 
an introductory psychology course at the University 
of Minnesota served as Ss. Each S volunteered and 
received course credit for participating. 

Design, A2X2X2 randomized block design 
was used, with Ss being blocked on sex. The other 
two between-Ss variables were imagery (concrete 
and abstract) and association value (high and low). 
‘All three variables were crossed, and Ss were assigned 
to the various combinations of imagery and associa- 
tion value by their order of appearance at the 
laboratory. 

Materials. The list consisted of 20 triplets and 
contexts. The first 3 and last 4 groupings were 
filler items, while the middle 13 triplets were the 
target items resulting from the preliminary study. 
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TABLE 2 


CELL AND MARGINAL MEAN CLASSIFICATIONS OF 
TARGET IrEMS ON IMAGERY (J) AND Asso- 
CIATION VALUE (AV) DIMENSIONS 


Imagery 
Association value | ————— ——,— ——| Mean AV 
Abstract Concrete 
Low 
AV 1.68 1.53 
I 2.73 
High 
AV 3.29 3.43 
I 3.57 
Mean I 3.15 


Each target triplet consisted of a context and two 
items that remained constant under all experimental 
conditions. The third member of each triplet 
varied according to the particular combinations of 
imagery and association value. For example, for 
the context WEDDING, the two constant items were 
DRESS and RING and the four cue words were as 
follow concrete-high association value (AV), 
BRIDE; concrete-low AV, CANDLE abstract—-high 
AV, vows; and abstract-low AV, BEAUTY, 

Procedure. The Ss were run in groups of 4-10. 
They were read instructions regarding the nature of 
imagery and given an example of how they might 
form compound images for word triple In addi- 
tion, they were told to use the provided context as 
a setting in which to form the image and to try to 
remember the triplet, such that when they were 
given one of the members of the triplet later, they 
would be able to recall the other two words. The 
Ss were then given four examples for which they 
practiced forming image The triplets were read 
to it a rate of one triplet every 15 , followed 
by a 1-min, subtraction task, After this, Ss were 
read one word from each triplet and allowed 30 sec. 
to write the other two words. j 


Results 


The first 3 and last 4 triplets on the list 
were fillers; therefore, only the remaining 
13 triplets were included in the analysis. 
A 2X2 2 analysis of variance was per- 
formed on the recall scores, with the three 
between-S variables being imagery (ab- 
stract and concrete), association value (low 
and high), and sex (female and male). 

The recall means for the four groups are 
shown in Table 3. The analysis of variance 
on these scores revealed two significant 
main effects: association value, F (1, 56) 
=58.43, p < .0001; and sex, F (1, 56) 
=4.51, p < .04. None of the other main 
effects or interactions was significant. The 
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TABLE 3 
MEAN RECALL (SDs IN PARENTHESES) 


Imagery 
Association value 
Abstract Concrete 
Low 12.50 (5.06) 12.75 (5.50) 
High 21.81 (2.86) 20.75 (4.65) 


mean recall for the high-AV condition was 
21.28 as compared to 12.62 for the low-AV 
condition, while the means for the two 
imagery conditions were quite similar (ab- 
stract = 17.16, concrete 16.75) The 
mean recall for the females was 18.16 
and for the males, 15.75. Interactions in- 
volving sex were not significant; therefore, 
this variable was not analyzed any further. 


Discussion 


The results of this experiment were quite 
unambiguous: Differences in recall were due 
solely to the association value of the cue word 
to the context, and the imagery value of the 
cue words had no effect. ‘This finding would 
not be predicted from the conceptual-peg 
hypothesis, but it is entirely consistent with 
the constructive process view of imagery. 

The important point to be realized from these 
results is that the critical factor for recall was 
not the concreteness of the cue words per se; 
rather, it was the function that the cue words 
served, In other words, in most of the studies 
demonstrating the importance of the con- 
creteness of the stimulus term in cued recall 
(e.g., Paivio, 1965; Yarmey & Paivio, 1965), it 
is quite possible that the concreteness of the 
cue word was confounded with the term's 
capacity to elicit the context used to construct 
the image. In these studies, the concrete 
stimulus terms may also have been high in 
likelihood of eliciting the context used to con- 
struct the image, thereby yielding positive 
results. The present experiment did not con- 
found these two factors and consequently 
demonstrated that the critical factor was the 
stimulus term’s capacity to elicit the context. 

Context is quite vaguely defined here and 
is meant to refer to the general setting from 
which the image can be reconstructed. When 
the statement “eliciting the context” is used, 
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it is meant to refer to the act of re oring the 
setting used to construct the orig «| image J 
Nothing is offered here as to how process 
actually takes place; instead, results 
demonstrate the importance of th einstate- i 


ment of the context. 


The facilitatory effect of contex: vr themes 


on the recall of prose is well wn (ef, 
Bartlett, 1932; Bransford & Joh: n, 1972; 
Dooling & Lachman, 1971). Gi ily, the 
position taken is that the context theme is 
recalled along with a few details | the re- 
mainder is reconstructed. The re s of the? 
present experiment indicate that th: — "instate- 
ment of a context is also a critical | ment in M 
recall through imagery, and they are — nsistent 
with the argument offered for the r- of con- 
texts in prose recall. 
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Both fourth-grade children and adult Ss searched faster for target letters 


through common words than nonwords (scrambled collections of letters), 
through third-order pseudowords than nonwords. In fact, both fourth- 


graders and adults reduced their search time by nearly the same proportion: 
10% through common words, 3% through third-order pseudowords. Better 
readers tended to be faster searchers, but there was no consistent relationship 


between reading ability 
speed letter research. 


Adults can exploit their knowledge 
of word structure in a letter search 
task. Krueger's (1970b) undergraduate Ss 
searched faster for target letters through 
real words and third-order pseudowords 
than through nonwords (scrambled collec 
tions of letters). In the present study, 
Experiment ] examined whether fourth- 
grade children, whose knowledge and 
far ty with words is much more 

;, yet has had some opportunity to 
velop, also would search faster through 
words than nonwords. Experiment I also 
examined whether fourth-grade children 
would follow the pattern shown by adult 
Ss, by searching faster through repeated 
than nonrepeated letter strings. (Krueger, 
1970a) and searching faster through co- 
herent sentences than the same words in 
scrambled lists (Krueger, 1970b). 


EXPERIMENT I 
Method 
Subjects, "Twenty City College of New York 
undergraduates, all paid volunteers, served as the 


1 This study was supported by NIH Grant HD- 
04869-01-02-03 to the first author. The authors are 
grateful to Harry Rosenberg, Sheri Berenbaum, and 
Ora Ezrachi for assistance in tabulating the data, 
and to A. Rodriquez, principal, V. Lieber and H. 
Pierce, assistant principals, and C. Lazar and R. 
Stampp, teachers, at Public School 192, nhattan, 
for their assistance in obtaining the fourth-grade Ss. 

? Requests for reprints should be sent to Lester 
E. Krueger, Department of Psychology, City College 
of the City University of New York, New York, 
New York 10031. 


and the ability to use the redundancy in words to 


adult Ss, Ten pupils from Public School 192 in 
Manhattan served as the fourth-grade Ss (median 
age: 9 yr., 7 mo.). English was the native language 
of all Ss. All Ss had 20/40 vision (corrected) or 
better, 

Apparatus and materials, Stimuli were presented 
at 4 mL, intensity on a Gerbrands Model T-3B-1 
tachistosco The exposure field was illuminated 
when S pressed a foot pedal at the beginning of a 
trial, and was extinguished immediately upon S's 
response, 

In Sets 1 and 2, each card contained a single 
(target) letter, centered above a column of five six- 
letter words or nonwords, typed in all-capitals, one 
toa line ona 4 X 6 in. white index card, in 10 pitch, 
IBM Courier 72, All typed lines were double-spaced, 

In Set 1, the five six-letter items on each card were 
either all words or all nonwords. From the 140 
common words used by Krueger (1970b, Experiment 
I), 14 words were chosen in a quasi-random manner 
to match as closely as possible the letter fre- 
y distributions at each letter position in the 
140-item set. Fourteen nonwords likewise were 
drawn from the corresponding 140-item nonword 
set. The 14 items chosen in each case served as re- 
peated items; 120 of the 126 remaining items were 
selected for one-time use. A total of 192 cards 
were constructed: 72 with repeated words, 24 with 
nonrepeated words, 72 with repeated nonwords, 
and 24 with nonrepeated nonwords, The same item 
never appeared twice on a card, Each repeated item 
appeared on an average 25.7 cards, On 27 of the 
72 repeated cards, and 9 of the 24 nonrepeated 
cards, the target letter at the top was not present 
in any of the five items below. When the target 
letter present, it appeared equally often in 
either Line 1, 2, 3, 4, or 5, and always appeared 
in either Letter Position 2, 3, 4, or 5, but never 1 or 6. 
Each letter position was used equally often for 
words and nonwords and for repeated and non- 
repeated displays. The 192 cards were divided into 
three equivalent subsets of 64 each, which were 
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TABLE 1 


EXPERIMENT l: RELATIVE REDUCTION IN Sk. 
TIME FOR ApULTS AND FOURTH-GRADE Ss 


Third- 
Repeated | Common order Coherent 
vernon, | wordsvs. | pseudo. |y scram- 
repeated corre- | words vs. r 
Group | wordsand | sponding | ` corre- | bled pass- 
nonwords | nonwords* | sponding (Set 3) 
(Set 1) (Set 1) | nonwords : 
(Set 2) 
Adults 22% 84% | 3.1% | 1.0% 
Fourth E is a Be 
graders 8.5% 10.9% 2.5% 4.2% 


a Based only on repeated-item displays. 


randomized differently for each S; the order of the 
subsets also was rotated across the Ss. 

In Set 2, 16 cards contained five third-order 
pseudowords and 16 contained corresponding non- 
words, selected from a set used by Krueger (1970b). 
No items were repeated. In each subset of 16 cards, 
the target letter was not present in 6 cards, and in 
the remaining 10 cards appeared twice in each of the 
five rows. The 32 cards were presented in a differ- 
ent random order for each S. 

In Set 3, the card contained an extended passage, 
either in its original (coherent) version or in a 
randomly rearranged (scrambled) version (for fur- 
ther details, see Krueger, 1970b). The single letter 
was triple-spaced above the set of five lines, which 
were double-spaced, flush left. The 36 cards were 
presented in a different random order for each S; 
18 contained coherent passages and 18 contained 
scrambled passages. In 6 of the 18, the target 
letter did not appear at all in the passage; in two 
cases each, it appeared in Lines 1, 2, and 4, and in 
three cases each, it appeared in Lines 3 and 5. 

Procedure, The Ss were told to search down the 
card for the letter shown at the top. Half of the Ss 
were told to press the left hand button (“yes”) 
upon finding the target letter, and the right hand 
button (“no”) after searching the entire array with- 
out finding the target; for the other half of the Ss, 
the hand assignment was reversed. The Ss were 
told to search as quickly as possible, but to keep 
their errors to a minimum. A buzzer sounded when 
55 pressed the incorrect button; all errors were ex- 
cluded before search time was averaged, 

Eight practice cards containing rare words or 
corresponding nonwords were presented first, fol- 
lowed by Set 1, then Set. 2 The Ss next rated 56 
items (a randomized list of 14 repeated words, 14 
nonrepeated words, 14 repeated nonwords, and 14 
nonrepeated nonwords) as to whether they had been 
presented more than once in Set 1. The adult Ss 
next received the Nelson-Denny Reading Test, 
Form A, which measures vocabulary, reading com- 
prehension, and reading rate. Eighteen of the 20 
adult Ss subsequently received Set 3, on either the 
same or a later date. 

Each child received Set 3 at a later date, followed 
by the Gates-MacGinitie Reading Test, Survey D 
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Form 1M. Scores also were avai 
Metropolitan Achievement Test, 
Form H, administered at the child's 
four months previously. Data from : 
Set 3 were excluded because of failure | 
instructions properly. 


Results 


Search performance. The adult 
about twice as fast as the fourth 
to scan down an average 3.75 it« 
Set 1 nonrepeated-nonword dis; vs, fom 
instance, took an average 3.3* c. fom 
adults and 6.75 sec. for fourth 
In Set 1, mean search time w: ignifi- 
cantly lower for adults than fourt aders 
F (1, 28) = 49.62, p < .001, for v 
nonword displays, F (1, 28) 
p < .001, and for repeated-item t! 
repeated-item displays, F (1, 28 
p «.001. Significant interactic: 
found for Adults vs. Fourth-g: 
Words vs. Nonwords, F (1, 28 5.18 
p < .05, and for Adults vs. Fourth 
X Repeated vs. Nonrepeated, F (! 
11.04, p < .01, reflecting the faci 
search time decreased more for 
graders than for adults when word ori 
repeated-item displays were presented: 
When the decrease in search time is ¢X* 
pressed not in absolute, but in relative: 
terms, that is, as the percent reduction of 
baseline nonword or nonrepeated search 
time (see Table 1), fourth-graders still 
differed significantly from the adults om 
repeated vs. nonrepeated, ¢ (28) = 2.86 
p < .01, but not on words vs. nonwords. < 

In Set 1, the error rate was significantly: 
lower for adults than fourth graders, 7.1% 
vs. 12.6%, F (1, 28) = 12.14, p «Qt 
and for word than nonword displays, 5.17% 
vs. 11.2%, F (1, 28) = 14.06, p < -0014 
Although the repeated vs. nonrepeated 
main effect (8.5% vs. 9.3%) was not signifi - 
cant, the Word vs. Nonword X Repeated 
vs. Nonrepeated interaction was significant - 
F (1, 28) = 6.21, p « .025, reflecting the 
fact that the repeated-nonrepeated differ 
ence was larger among nonword tham; 
word displays. No other interaction Wé 
significant. 

The Ss rated 56 words and nonword 
Írom Set 1 as to whether they had occurred: 


LETTER SEARCH THROUGH WORDS AND NONWORDS 


; once on the search displays. 
liott's (1964) table, d^ values were 
d, based on the frequency with 
s answered “yes” to repeated items 
ite) and nonrepeated items (false 
rate). The mean d’ value was signifi- 
y greater than zero, both for adults, 
1.21, ¢ (19) = 9.64, p< .001, and 
rth-grade Ss, d' = 1.00, ! (9) = 4.15, 
< 005; the difference between adults and 
‘rth graders was not significant. Mean 
vas significantly greater for words than 
awords, F (1, 28) = 14.79, p < .001, and 
word-nonword difference was virtually 
i\e same, for adults, 1.51 vs. .91, and 
1 graders, 1.28 vs. .71. 
; Set 2, the decrease in search time for 
ürd-order pseudowords (vs. correspond- 
ing nonwords) was significant for adults, 
i (19) = 2.29, p < .05, but not for fourth 
graders. The relative reduction in search 
time, shown in Table 1, was significantly 
smaller for the third-order pseudowords in 
Set 2 than the common words in Set 1, 
F (1,28) = 20.10, p < .001. This factor 
did not interact significantly with adults 
vs. fourth graders. The mean error rate 
did differ significantly between third- 
oseudowords and nonwords, for 
cher adults (3.1% vs. 4.7%) or fourth- 
vade Ss (13.8% vs. 13.8%). 
in Set 3, search was faster, but not signi- 
ficantly so, through coherent passages than 
scrambled ones, among both adults and 
fourth graders (see Table 1). Similarly, 
error rate was slightly, but not significantly 
lower for coherent passages, for adults 
(18.3% vs. 18.9%) and fourth graders 


(20.6% vs. 21.1%). 
Correlations between search and reading 


measures. The various reading measures 
were highly and usually significantly 


(p « .05) intercorrelated. The correla- 
tions ranged from 7 = 4-.47 to 4.96 
among the children's measures (Metro- 
politan: word knowledge, reading; Gates- 
MacGinitie: speed, accuracy, vocabulary, 
comprehension), and r = + 35 to +.89 
among the adult measures (Nelson-Denny : 
reading rate, vocabulary, comprehension). 
The high correlations indicate that there 
was a good spread in reading ability, both 
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among the children and the adults. Thus, 
on the vocabulary test for the adults, the 
scores ranged from the 1st to the 93rd 
percentile, based on Grade 16 norms, with 
the median score at the 51st percentile. 
For the children, on the total read- 
ing ability measure of the Metropolitan 
Achievement Test, the scores ranged from 
the 18th to the 92nd percentile, based on 
Grade 4 norms, with the median score at 
the 48th percentile. 

Search time invariably correlated nega- 
tively with the various reading measures, 
indicating that poor readers generally took 
longer to locate the target letter than did 
good readers. (The correlations were com- 
puted separately, and yielded quite similar 
results, for search time through repeated 
common words and search time through 
repeated nonwords.) Among fourth graders 
these correlations were significant (p < .05) 
for all four Gates-MacGinitie measures, 
ranging from r = — .63 to —.73, but not 
for either Metropolitan measure, ranging 
from r = — .36 to —.56. Among adults 
the correlations, ranging from r = — 14 
to —.32, were not significant. 

No trend was apparent at all, however, 
in the correlations between reading mea- 
sures and the absolute or relative reduction 
in search time for various types of material 
(ie., words, repeated items, coherent 
passages). The correlations were generally 
quite small, and were negative about as 
often as positive. The only significant cor- 
elations were between reading rate in 
adults and absolute (r = — .55, P < .02) 
and relative (r = — .49, p < .05) reduc- 
tion in search time through coherent 
passages. 


Discussion 


As the significant d' scores indicate, both 
adult and fourth-grade Ss discriminated rather 
well between the repeated and nonrepeated 
items. Schulman (1971) obtained similar re- 
sults when he tested Ss on their recognition 
memory of words they had scanned through 
for the letter A. Neisser (1964), on the other 
hand, reported that Ss usually could not dis- 
tinguish words that had appeared on a list 
from words that had not, when tested im- 
mediately after a scan. Neisser proposed a 
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hierarchical model in which nontarget letters 
are analyzed only so deeply that they may be 
discarded from further consideration. Neisser's 
well-practiced Ss reported that they did not 
even "see" nontarget letters during search for 
a target letter. The present findings, based on 
the performance of relatively unpracticed Ss 
who were set for accuracy as well as speed, 
indicate that nontarget letters not only are 
seen, but may even be organized and remem- 
bered as entire words or nonwords. 

The adults searched about twice as fast as the 
fourth graders, which confirms previous work 
with third graders (Forsman, 1967) and fourth 
graders (Leslie & Calfee, 1971). The present 
finding that in the fourth grade the better 
readers search faster than do poor readers, 
though, contradicts two recent studies: Leslie 
and Calfee (1971), using second, fourth, and 
sixth graders, found no relationship between 
reading level and search rate for target words, 
and Katz and Wicklund (1972), using second 
and sixth graders, found no difference between 
good and poor readers in letter-search rate 
through nonsense items. On the other hand, 
Spring (1971), using third, fourth, and fifth 
graders, found longer latencies for poor than 
normal readers on the "same'-"different" 
judgment of two adjacent letters, 

Since older Ss and better readers exhibited 
superiority on the general skill of visual search, 
one would have expected them to exhibit an 
even greater superiority on the specific skill 
of exploiting their knowledge of English word 
Structure to speed search for letters. it 
turned out, however, reduction in search time 
through words was as great for children as for 
adults, as great for poor readers as for good 
readers, "This result will be discussed more 
fully after Experiment II has been presented. 
Experiment II, in which 40 adult Ss partici- 
pated, provided another test of whether the 
exploitation of word structure in letter search 
is related to reading ability, 
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EXPERIMENT II 
Method. 


Subjects. Forty City College of New York under- 
graduates served as paid Ss, 
Apparatus and materials, Search displays were 


presented in a light-tight wooden box cont 


« aining a 
semitransparent (one-way) mirror (12 X 15 in); 
5 could see into the box whenever four 40-w, lamps 


ere lit, On each trial, Æ first slid a plank 
ning the 8$ X 11 in. stimulus sheet into the 
, and then closed a switch which illuminated the 
lamps and started a Standard Electric, Model S-1 
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timer (calibrated to .01 sec.). The S's respons 
extinguished the lamps and stopped the timer, 
The sets of 140 common words and 140 corre 
sponding nonwords used were the same as in Bx 
ment I. Each stimulus sheet contained either 2§ 
six-letter words or 25 six-letter nonwords, typed 
one column, single-spaced, in 12 pitch, IBM Pi 
tige Elite 72. Sampling of items was random, excep 
that no word or nonword appeared twice on a gi 
list of 25. Each S saw 22 word and 22 nonword 
displays, which were randomly intermixed, TI 
target letter was actually present in 15 of the 
displays, appearing once each in Item Positions 2 
and 4, and in all odd-numbered positions from 1 to 
25 (for further detai ee Krueger, 1970b). 
Procedure. On each trial, after S saw the ta 
letter and indicated he was ready, E turned on th 
timer and lamps. The S was told to press th 
right-hand (“yes”) switch as soon as he found th 
target letter, or, if it was not present, to press the 
left-hand ("no") switch. The S received thret 
practice trials. The S was informed when he pres 
the incorrect switch, in which case his search time 
was discarded for both the word and nonword di 
plays ying a target at the particular locatiom 
After completing the letter se: 
the Nelson-Denny Reading Test, Form A, and the) 
Wellesley Spelling Scale, Form 1. 


Results 


Search performance. Mean search time, 
through an average 15.8 items, was 8.48 sec. 
for word displays and 7.71 sec. for nonword: 
displays. The 9.1% relative reduction in 
search time for words (vs. nonwords), was 
significant, £ (39) = 4.8, p < .001. The 
error rate was somewhat, but not sig- 
nificantly, higher for nonword (11.390) 
than word (9.2%) displays. 

Correlations between search and reading 
and spelling measures. As in Experiment l, 
the reading and spelling measures were 
highly and usually significantly (p < .05) 
intercorrelated (range:r = + .26 to +.73), 
indicating again a good spread in reading 
ability. Thus, on the vocabulary test, the 
scores ranged from the 2nd to the 99th 
percentile, based on Grade 16 norms, Wi 
the median score at the 72nd percentile. 
Also as in Experiment I, reading and spell- 
ing measures generally correlated negatively ! 
with search time for words and nonwords, 
r= + .03 to —.34. That is, better readers: 
tended to search faster than poor readers 
The only significant correlation, though, 
was that between reading rate and meam 
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st ‘ime through words, r = — 34, 


Experiment I, the correlations 

the reading measures and the 

ion in search time for words (vs. 
ords), were all positive, both for 
jute reduction (r = +.03 to +.32) and 

i- relative reduction (r = +.09 to +.39). 
iat is, better readers tended to take more 
dvantage of the familiar contexts of words 
n reducing search time. The only signifi- 
-ant correlations, though, were those be- 
veen reading comprehension and relative 
duction in search time for words, r — + 
39, p < .02, and reading comprehension 
id absolute reduction, r = + .32, P < .05. 


GENERAL DISCUSSION 


Perhaps the most striking finding of the 
present study is that fourth-grade children 
made as much use of the redundancy in English 
and near-English letter sequences as did adults. 
Both adults and fourth-grade children speeded 
their search for letters by about 10% when non- 
words were replaced by common words, by 
about 3% when nonwords were replaced by 
third-orler pseudowords (see Table 1). These 
rosu! ue in very nicely with the findings of 

nd Smith (1970), who found that the 

to use intraword redundancy in the 
identification of letters increased only up to 
the fourth grade, where adult-level skill was 
achieved. 

The mastery of letter and word recognition 
skills by the fourth grade is very important, 
because the child is thereby freed to develop 
more complex skills in reading. Singer (1970) 
reported that the ‘‘system for attaining Speed 
of Reading undergoes a developmental shift 
from a predominance of visual perceptual 
abilities at the third grade to a more equitable 
organization of visual perceptual and word 
meaning factors at the sixth grade level [pp. 
215-216]. 

Of course, the visual perceptual component 
in reading may continue to improve to some 
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extent well past the fourth grade, and better 
readers may ever surpass poor readers, as 
shown, for instance, by the fact that even 
among adults, the better readers tended to 
search faster for target letters. Among 
adults, however, reading ability accounted 
for no more than about 10% of the variation 
on search time. Among the fourth graders, 
by contrast, reading ability, as variously in- 
dexed, accounted for about 15-50% of the 
variation on search time. 
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Constant-order paired-associate lists were used in which the numbers 1 

were stimulus terms, and 24 nouns were response terms. The order of t 

nouns was varied across five lists to produce a different number of hierarchic 
conceptual levels in the lists. There were two degrees of original learning a: 
three types of retention tests after 24 hr. The study-test method was use 
Learning rate was related directly to the degree of conceptual structure, Ù 
retention was uninfluenced by structure. A further experiment showed th: 
the direct relation between recall and structure found in an earlier study is t 


TAN 


be attributed to the antic’ 


ation method in which information at recall is i: 


an amount that is directly related to the conceptual structure. 


In a previous study (Underwood & 
Zimmerman, 1973), 16 words were ordered 
serially so that a three-level conceptual 
hierarchical structure resulted. If S, in 
learning the list, followed the rules implied 
by this structure, placement of each word 
in its appropriate position within the list 
was possible, Other lists were constructed 
from the same words in a way such as to 
violate the appropriateness of successive 
conceptual levels. The purpose of this 
previous study was to determine the role 
of conceptual structure on the learning and 
retention of the lists. Two findings 
emerged. First, learning rate was related 
directly to degree of conceptual structure 
up toa point, and second, recall after 24 hr. 
was related directly to degree of structure. 
This latter finding conformed in general to 
the notion that associations learned in the 
laboratory which are compatible with al- 
ready established associations will show 
less rapid forgetting than will be the case 
for associations which are in conflict with 
established habits. In the previous study, 
this latter case was represented at the 
extreme by the 16 words presented in 
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that for 
onceptual 
propriate 

In fact, 
gave no fi 
the ran- 


random order. It was presun 
such a list the long-establishec 
habits would interfere with th« 
ordering of the words at rec^" 
however, the overt-error da 
evidence that the poorer recal! 
dom list than of the structur list was | 
due to such interference. Thus, the reasons 
for the differences in recall remained 


obscure. 1 


The intent of the present study wasii 
identify more precisely the characteristic | 
of the memory for structured and unstruc- | 
tured lists. By so doing it was believed 
that the characteristic or characterist 
responsible for the differences in retention | 
might be isolated. Two different levels ol 
learning were used as a means of varying } 
the degree to which the conceptual struc | 
ture had become a part of the memory for - 
the lists. Since the utilization of a con- j 
ceptual structure to mediate item place- | 
ment at the time of retention may 9€ 
critically time dependent, three different 
types of retention tests were used, namely) 
paced and unpaced recall, and unpaced - 
associative matching. 


MAIN EXPERIMENT 
Method 


Lists. Five different lists were constructed, all 
from the following 24 words: robin, owl, tubo 
trout, guppy, sturgeon, apple, lemon, fig, 108% nd 
marigold, beer, rum, sherry, milk, soda, cocot, denma 


A, which were essentially equivalent 
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tal, sapphire, iron, brass, tungsten. These 24 words, 
presented as in the order listed above, formed the 
most highly structured list, to be called List 5. It 
will be noted that at the lowest conceptual level in 
List 5 there are three instances of each of eight con- 
cepts, birds, fish, fruit, flowers, alcoholic beverages, 
nonalcoholic beverages, precious stones, and metals. 
^t the next conceptual level there are four concepts, 
animals, plants, beverages, and minerals. At the 
third level, living and nonliving things divide the 
list in half. 

In List 5 the three instances within each concept 
were ordered serially (as above) such that the first 
instances had high frequency in the Battig-Monta- 
gue (1969) norms, the second medium frequency, 
and the third, low frequency. Given this ordering 
the S could, in a manner of speaking, run off the 
three instances within a concept according to à 
frequency rule, In the previous study (Underwood 
& Zimmerman, 1973), this variable, with two in- 
stances of each concept, did not influence learning. 
However, there were reasons to believe that the 
earlier lists were not entirely satisfactory for a test 
f, the frequency rule and so this variable was in- 
cluded again in the present experiment. 

The nature of the other four lists may now be 
described. For List 4 the conceptual structure re- 
mained the same as in List 5, but the order of the 
three instances within each of the eight concepts 
wes randomized to neutralize any influence of a 
i-frequency rule. In List 3 the order of the 
.nimal names was arranged so that the concepts 
js and fishes were not appropriate for three 

ive items, but the concept animal was appro- 
Liate for a block of six words (robin, trout, sturgeon, 
The same was true for plants, 

"Therefore, the four intact 
d mediate placement only 
For List 2 only the 
intained so that 
the implementation of this distinction by the 5 
would restrict placement to halves of the list. 


wo 


as constant-order paired- 
associate lists with the numbers 1-24 in order as 
stimuli and with the words as response terms. In 
the previous experiment, this procedure gave results 
to those found 
‘when the words were presented as a true serial list. 

Conditions. One variable, of course, was defined 
by the five lists as explained above. A second vari- 
able was the degree of learning prior to the 24-hr. 
retention interval. Half of the 5s learned to a 
criterion of 12 correct responses on à single trial, 
half to a criterion of 20 correct responses on a single 
trial. A third variable was the nature of the reten- 


t tion test. Half of the Ss were given a paced-recall 
ES test, the rate being the same as used during learning. 
For these Ss, recall was followed by relearning to 


"ne perfect recitation but with a minimum of three 
"learning trials after the recall trial. Half of the 
Ss were given unpaced retention tests consisting of 
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two steps. Initially, a sheet was provided the S 
on which the numbers 1 through 24 were listed with a 
blank after each. The 5 was given unlimited time 
to write down all of the appropriate response ternis 
he could, guessing being encouraged. Following this 
step, a list of the 24 response words was provided the 
S on a second sheet and he was asked to match each 
word with a number, using each only once. He was 
required to fill each blank with a word even if it 
involved guessing. 

The five lists, two degrees of learning, and two 
types of retention tests resulted in 20 different 
conditions. Four further conditions were added to 
provide controls for possible differential performance 
on the postcriterial trials of learning. The study- 
test method was used during learning, and although 
the use of this method is normally expected to 
minimize differences on postcriterial trials as a func- 
tion of rate of learning, it was believed necessary to 
provide some minimal information about the mat- 
ter. Consequently, four groups were given im- 
mediate retention tests after achieving the cri- 
terion of 20 correct responses on a single trial. 
Two of these groups learned List 1, and two learned 
List 5. For each list, one group was given paced 
recall and relearning, one unpaced recall, followed 
by matching. 

There were 24 groups of college-student Ss, one 
corresponding to each of the 24 unique conditions, 
with 18 Ss in each group. Assignment toa particular 
group was made from a schedule containing 18 blocks 
of conditions, with each condition occurring once 
within each block. A different random order was 
used for the conditions within each block. 

Procedure. All lists were presented for alternate 
study and test trials at a 1.5-sec. rate for both. 
When the appropriate criterion was achieved, the 
S either was dismissed from the laboratory (to re- 
turn 24 hr. later) or was given an immediate reten- 
On the immediate paced tests the experi- 
menter stopped the memory drum, told the S$ that 
he would now have another test trial, with further 
study and test trials to follow, and that he should 
try to get as many correct as possible, For the 
unpaced tests (both immediate and delayed) the 
recall sheet was given the S and he was asked to 
follow the printed instructions on the sheet as the 
experimenter read these instructions aloud. He 
was given a second sheet (described earlier) for the 
matching test. All Ss having 24-hr. tests were re- 
minded that they had learned a list of words the 
previous day and that a test of their memories for 
this list was the purpose of the present. session. 

Finally, all 5s having the 24-hr, retention tes! 
were given an en-ended questionnaire concerning 
rchearsal activities they might have engaged in over 
the 24 hr. They were asked to describe any experi- 
ences they had with the words during the 24-hr. 
period, whether they had rehearsed or thought about 
the words, and so on. Honesty and accuracy were 
emphasized since (the S was told) such replies were 
valuable in helping to understand the nature of 


memory. 


tion test. 
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FIGURE 1, Mean number of trials to reach two 


different criteria of learning (12 correct on a single 
trial, and 20 correct), for lists varying in conceptual 
structure from low (List 1) to high (List 5). 


Resulls 


Original learning. For an examination 
of learning as a function of list structure, 
the Ss who would subsequently receive dif- 
ferent retention tests were combined to 
provide 36 Ss having had each of the five 
lists at each criterion of learning. The 
data are plotted in Figure 1 in terms of 
the mean number of trials required to 
reach the two criteria, For convenience, 
the five lists are equally spaced along the 
baseline, indicating increasing structure 
from List 1 to List 5. It can be seen that 
as list structure increased, trials to learn 
decreased. "This is true for both criteria, 
and summed across the two, the effect is 
significant statistically, F (4, 350) = 8.08, 
b «.01. Although the influence of list 
structure appeared to be somewhat greater 
for the higher criterion of learning than for 
the lower criterion, the interaction was not 
reliable, F (4, 350) — 2.14, p > .05. In 
the previous study (Underwood & Zimmer- 
man, 1973), learning rate increased from 
List 1 through List 3 with no further in- 
crease for Lists 4 and 5. It is not known 
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whether this difference is due 
of learning (anticipation versi 
or to list differences, Ther 
evidence in the earlier study 

the instances defining the low: 
cepts were not always underst 

It will be remembered that ! 
from List 4 only in terms oí 
of the three words within eac 
concepts. For List 5 the 
ordered from high to low in 
frequency with which the i 
produced to the concept nam: 
and Montague (1969) normati« 
List 4, the three words wer: 
As seen in Figure 1, List 5 was 
rapidly than was List 4, and 
for both criteria. An analysis 
with Lists 4 and 5 as one varial 
two criteria as the other, show« 
ence between» the lists to 
F (1,140) = 5.45, p < .05. 

The effect of word frequency i 
was also examined. Three scores 
tained for each S, these repre 
number of times the eight hig 
instances were given correctly, the numb 
of times the eight medium-frequency 
stances were given correctly, and the num 
ber of times the eight low-frequency mM- 
stances were given correctly. Across all 
lists combined for the lower criterion © 
learning the three means were 13.99, 11.02 
and 9.84, for the high-, medium-, and low 
frequency instances, respectively (F E 
87.59). These differences essentially dis 
appeared at the higher criterion of learn- 
ing for Lists 4 and 5, but were still quite 
evident for the other three lists. It ap: 
pears, therefore, that one of the character 
istics of initial learning of these lists by, 
the study-test method was that eithe 
because of ease of learning of words with 
high frequencies, or because of priority; 
effects, or both, the S acquired the highs 
frequency instances initially regardless O1 
the conceptual structure of the list. 

Still another finding could be interprete 
as a word-frequency effect. Some observa 
tions of the experimenters suggested tha 
the learning of the concepts differed Í ; 
men and Women. The learning of Listy 


rds were 
ns of the 
ices were 
the Battig 
tudy, I 

idomized. 
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4 5 was evaluated to see if these ob- 
;ns had validity. The 144 Ss (both 

. of learning) consisted of 77 men 

17 women. The number of correct re- 
uses within each of the eight concepts 

; determined for each group. These are 

'tted in Figure 2 for two reasons. First, 
hey show the nature of the serial-position 
surye which obtains even for the highly 
structured lists. Second, they indicate 
differential learning of certain concepts by 
men and women. Overall, the learning of 
the two groups did not differ (F < 1), but 
the interaction between concepts and sex 
was reliable, F (7, 994) = 3.13, p < .01. 
l'hree concepts produced substantial differ- 
ences in learning between men and women, 
namely, flowers, alcoholic beverages, and 
precious stones. It may be that cultural 
experiences have resulted in women having 
a greater familiarity for words representing 
flowers and precious stones than is true for 
men, and that the opposite is true for 
alcoholic beverages. In any event, these 
results are similar to those reported by 
Bousfield and Cohen (1956) when they 
used free-recall learning of groups of con- 

piually related words. 

Figure 2 suggests that the words which 
represented the first occurrence of a non- 
living concept (alcoholic beverages) caused 
a distinct break in the position-performance 
curve. However, this is primarily a func- 
tion of the particular words involved. For 
Lists 2 and 3, where the living-nonliving 
break also came between Positions 12 and 
13, there was no evidence of discontinuity 
between the two positions. k 

Overt errors. The first overt error analysis 
to be presented dealt only with the number 
of errors across lists. The 360 Ss from 
whom the acquisition data were derived (as 
plotted in Figure 1) were used. The error 
measure was the proportion of times an 
error was produced per opportunity (num- 
ber of overt errors/number of omissions 
plus overt errors). For the 10 groups this 
ratio varied between .10 and .19. The 
values were greater statistically for the 
high-criterion groups than for the low- 
criterion groups, F (1, 350) = 8.33, p < .01, 
the means being .16 and .12. The errors 
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as a function of the particular concepts (and their 
positions in the list) by men and women. 


increased with list structure (.11, .11, .15, 
.15, .16), F (4, 350) = 4.08, p < .01. The 
F for the interaction was less than one. 
Undoubtedly, some correct responses re- 
sulted from guesses within concepts, par- 
ticularly for the more structured lists, but 
in view of the relatively low proportion of 
overt errors to cases of not responding, it 
does not seem likely that mere guessing is 
heavily involved in the correct-response 
data. 

For Lists 4 and 5, overt errors can be 
identified as having one of four levels of 
appropriateness in terms of the conceptual 
structure for these lists. These will be 
described and illustrated. An error may 
occur within the appropriate block of three 
instances representing the same concept. 
Saying “guppy” to one of the two stimuli 
to which it was an incorrect response in 
the block of three fishes was such an error, 
and will be called here an error at Level 1. 
Six words were involved in the animal 
block; if “guppy” was given to one of the 
three stimulus terms appropriate for birds, 
it was defined as a Level-2 error. "Twelve 
words were included in the living block; 
if “guppy” was given to any of the six 
stimuli paired with plants, it was called a 
Level-3 error. Finally, if “guppy” was 
given as a response to any of the 12 stimuli 
paired with nonliving objects, it was called 
a Level-4 error. All errors were classified 
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PERCENT ERRORS AT EACH L 
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Ficure 3. Locus of overt errors as a function 
of list structure and degree of learning. (The num- 
bers 1, 2, 3, and 4 represent an increase in the dis- 
crepancy between the locus of an error and its correct 
position as defined by the conceptual structure of 
Lists 4 and 5.) 


into one of these mutually exclusive cate- 
gories for Lists 4 and 5. Conceptual level 
and position within Lists 4 and 5 are per- 
fectly confounded. Position is also tied to 
stimulus number, Therefore, any analysis 
of the appropriateness of overt errors to the 
conceptual structure must consider the 
number of such errors which were produced 
by position per se. To handle this prob- 
lem, the overt errors made in learning all 
five lists were categorized in exactly the 
same way as was done for Lists 4 and 5. 
For example, if, for List 1 (no structure), a 
misplaced response occurred among the 
first three pairs, it was classified as a Level-1 
error, just as was done for List 5. Or, if an 
error made to the stimulus-term 9 was an 
appropriate response for the stimulus-term 
3, it was classed as a Level-3 error, To 
summarize: for all lists the errors were 
classified as falling within one of eight 
blocks of 3 positions (Level 1), one of 
four blocks of 6 positions (Level 2), one of 
two blocks of 12 positions (Level 3), and, 
all others (Level 4). 

The errors made by each S were allocated 
to the appropriate level and the percentage 
of errors in each level was determined, The 
means of these values were used to con- 
struct Figure 3. The sums of the percen- 
tages for a given list do not always equal 
100 since a few Ss produced no overt errors, 
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and the means shown in Figure | were zm 
ways based on N = 18. It shou ' noted | 
that list structure is along th "scissa, 
with the error levels (1, 2, 3, as the 
plotting parameter. The left p repre- 
sents the data for the Ss who | d toa 
criterion of 12 correct, the right anel for 
those who learned to a criterion ^ 20 cor- 


rect. Several features may be no d. For 
Lists 4 and 5, the greatest pe: age of 
errors by far was that for errors f: ng into 
Level 1. Over half the errors ma: by the 
Ss learning these lists represen! errors 
that were approrpiate to the 1 owest 
conceptual categories. The Leve! ` values 
for List 1, values which should resent 
in relatively pure form the effects of posi- 
tion as such, are at 25% for lower 


criterion, 40% for the higher criter on. In 


List 3 the narrowest conceptual c tegory 
consisted of six words. If cor eptual 
structure were used by the ,S in th. place- 
ment of items in this list, he could :educe 
his possibilities to one of six positi: As 


may be seen in Figure 3, the maximum 
number of Level-2 errors occurred in learn- 
ing List 3, these errors reflecting place- 
ment within a block of six words. For 
List 1, the random list, the maximum fre- 
quency of errors for the lower criterion was 
in Level 4, which means giving a response 
in one half of the list which actually be- 
longed in the other half. There is only 
slight evidence that the living-nonliving 
distinction influenced the errors, this being 
shown in the fact that fewer errors at Level 
4 were made by the Ss learning List 2 than 
by those learning List 1. As would be 
expected, the major difference in the errors 
for the two criteria of learning was that 
there was an increase in Level-1 errors 
from the lower to the higher criterion, 
with the increase being the greatest for the 
lists with low structure. In summary, 
Figure 1 showed that learning was related 
directly to the number of appropriate con- 
ceptual levels involved in the list; the 
error data of Figure 3 indicate that these 
conceptual levels aided learning because 
they limited the number of possible nu- 
merical stimuli or positions for which a par- 
ticular word was appropriate. 


oy 


x 


There is the possibility that with zero 
conceptual structure (List 1), the presence 
of conceptually related words actually in- 
terfered with learning. If such interference 
was present, it was not manifest in the error 
data. The errors made in learning List 1 
were divided into two categories, those 
which were given to a stimulus for which 
another instance of the same concept was 
appropriate, and those which were not ap- 
propriate in the above sense. The concepts 
used were the eight with the three instances 
each. To illustrate: saying “guppy” to the 
stimulus appropriate for trout would be 
viewed as evidence for interference; giving 
“guppy” to the stimulus for milk, would 
not. Errors which would constitute evi- 
dence for interference constituted 7.6% of 
the total errors. Chance responding would 
be expected to yield 8.7% of such errors. 
This indicates that interference resulting 
from the conceptual relationships among 
the words was minimal in the unstructured 
list 

Retention. The retention data will be 
presented first for the Ss who learned to a 
criterion of 12 correct on a single trial. 
The results for all three retention tests are 
shown in Figure 4. To replicate the earlier 
study (Underwood & Zimmerman, 1973), 
performance should have increased as list 
structure increased. It is obvious from 
Figure 4 that list structure had little effect 
on retention. ‘The means for the paced 
24-hr. recall showed little variance (F < 1). 
For unpaced recall, performance was about 
the same for Lists 1 through 4, with an in- 
crease for List 5. However, the F for the 
recall for the five lists was only 1.37. It 
can be seen that performance on the last 
trial of learning was higher for List 5 than 
for the other four lists. It is likely, there- 
fore, that the retention of List 5 for both 
paced recall and matching reflected this 
higher degree of learning, although it is not 
clear why paced recall for this list should 
not have been more influenced by this 
factor than it was. The fact that matching 
exceeded unpaced recall indicates that the 
Ss were unable to recall some response 
words but could pair them appropriately 
when given the matching test. Again, 
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Ficure 4. Number of correct responses on the 
various retention tests after 24 hr. following original 
learning to 12 correct responses. (Number correct 
on the last learning trial is also shown.) 


however, the lists did not produce differ- 
ential performance on the matching task 
(F = 1,46). 

No immediate retention test was given 
to Ss learning to the lower criterion so it is 
difficult to calculate the amount of for- 
getting which took place over the 24 hr, 
However, the number correct on the last 
trial of original learning does give some 
basis for a rough estimate. An immediate 
test would probably have shown some loss 
due to the fall normally found after reach- 
ing a criterion. As will be shown later, it 
would not be unreasonable to expect this 
loss to be at least one to two items. If so, 
the amount of forgetting was between 30% 
to 40% for the paced recall for the five lists. 
The critical fact is that list structure was 
unrelated to the amount lost over 24 hr 

The next data to be examined are the 
retention scores for those Ss who learned to 
a criterion of 20 correct responses. ‘These 
are shown in Figure 5. For these compari- 
sons, immediate tests are available for Lists 
1 and 5 (dotted lines). It may be noted 
that the mean number correct on the last 
test trial of original learning varied between 
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Figure 5, Number correct on the various 24-hr. 
retention tests following original learning to 20 
correct responses. (Number correct on the im- 
mediate tests for Lists 1 and 5 is also shown.) 


21 and 22 correct responses. Paced recall 
means varied between 12.28 for List 5, 
and 16.11 for List 3. Yet, overall the F 
was only 1.74. Because of the apparent 
difference in paced recall as a function of 
list structure for the two levels of original 
learning (Figure 4 versus Figure 5), an 
analysis was performed on these 10 groups. 
List structure was not a reliable source of 
variance (F < 1), level of original learning 
was (F — 93.14), but the interaction was 
not (F = 1.66). Thus, in spite of apparent 
differences when viewed graphically, both 
levels of original learning resulted in the 
same conclusion, namely, that list structure 
and paced recall were not related. Figure 5 
suggests the possibility that there was an 
interaction between immediate paced recall 
and 24 hr. paced recall for Lists 1 and 5. 
A statistical analysis fails to confirm the 
reliability of this interaction, F (1, 68) = 
3.34, p > .05. Using the immediate paced 
recall as a base, forgetting for paced recall 
is estimated at 25% across the lists asa 
whole. 
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It is obvious that forgetting als 
over the 24 hr. for the Ss give: 
recall. Again, however, list st: 
not related to the amount of r 
the unpaced procedure (F < 1 
matching scores almost perfecti 
the unpaced recall except the | 
little higher. In summary: th« 
data for both levels of learning, 
unpaced, and matching, all lead t 
clusion that there was no syste: 
tionship between list structure an: 
using the study-test method. 

The number of misplaced resp ses at 
recall was examined for all pa recall 
groups. These numbers did not d «rasa 
function of the criterion of learn s, but 
for both levels of learning the nu: ber of 
misplaced responses at recall incr: sed as 
list structure increased, F (4, 170 5.81, 
p < .01. This same relationship w- found 
during learning and so is not a phenomenon 
peculiar to recall. For unpaced recall, 
however, list structure did not {uence 
the number of incorrectly-paired responses 
given. 

As noted earlier, during learning of the 
constant-order paired-associate lists, very 
clear serial-position effects were present. 
"These position curves were in evidence for 
all of the types of retention tests. Further- 
more, there was a strong relationship be- 
tween the number of times an item was 
given correctly in original learning and the | 
number of times given correctly at recall, 
which is to indicate that the position effects 
remained essentially constant from learn- 
ing to recall. For example, for the lower 
criterion, the correlations between number 
of times correct in original learning and 
number of times correct at recall varied be- 
tween .77 and .91 for the five lists. Degree 
of original learning was obviously a power- 
ful determiner of the items which were 
recalled. 

Kelearning. Differences in relearning as à 
function of list structure appeared on the 
first test trial after recall, and in terms of 
trials to reach a criterion of one perfect 
recitation, were reliable, F (4, 170) = 4.14, 
b < .01, with, of course, rate of relearning 
being directly related to structure. Thus, 
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li ture, which did not influence recall, 
q reinstated during relearning the 


ir cit had had during original learning. 
carsal and recall. The Ss were given 
a» open-ended questionnaire concerning 
t rehearsal activities over the 24 hr. 
d replies to these questionnaires were 
r ed on a 9-point scale for the amount of 
arsal implied in the protocols. These 
ngs were carried out independently by 
e different people.  Interrater reli- 
iities were determined for 18 subblocks 
20 Ss each. Of the 54 correlations 
ossible, 11 were between .71—.80, 29 be- 
cen .81-.90, and 14 were .91 or greater. 
There were 20 groups of Ss of 18 each 
having 24-hr. recall. All correlations be- 
tween rated rehearsal activity and reten- 
tion were positive, varying between .14 and 
.81. There was no relationship between 
rated rehearsal activity and list structure, 
and the magnitude of the correlations be- 
tween rehearsal and retention did not vary 
systematically as a function of list structure. 
The positive relationship between re- 
ported rehearsal and retention allows 
several alternative interpretations. For 
«ample, rehearsal may have increased re- 
tention, or, as another interpretation, Ss 
with good retention may have rehearsed. 
The concern of the present study was 
whether or not rehearsal differed as a func- 
tion of list structure. Since it didn't, it 
scems unlikely that the failure to find an 
influence of list structure on retention 
could be due to differential rehearsal. Also, 
if the present evidence on rehearsal can be 
generalized to the earlier study (Underwood 
& Zimmerman, 1973), it seems unlikely 
that the positive relationship between list 
structure and retention as reported for 
that study was due to differential rehearsal. 


DISCUSSION 


In the previous study, paced recall and list 
structure were directly related and the for- 
getting over 24 hr. for the list with the highest 
structure was only 5%. In the present study, 
list structure was unrelated to either paced or 
unpaced recall, and the forgetting for the 
highest criterion of learning used was estimated 
at 25%, The discrepancy between these two 
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studies is the 
scussion. 

The contradiction in the retention results 
for the two studies pointed immediately to a 
method difference as the likely source for the 
contradiction. In the previous study the 
anticipation method was used. During the 
recall trial, therefore, the 5 would be informed 
of the particular concept whose instances were 
appropriate at the moment for the structured 
lists. If, for example, he remembered there 
were six animal names in succession, the ap- 
pearance of the first instance would inform 
him that the-next several positions also con- 
tained animal names. With the study-test 
method, the S would not be given this infor- 
mation. If he did not give the first instance 
of a concept at the appropriate point (to the 
correct stimulus), his following responses 
would be incorrect unless he remembered a 
particular response term associated with a 
stimulus term within the series of concept in- 
stances. There were, in fact, some cases in 
which S did give three to six correct response 
words in the correct order but which were 
scored as wrong because the initial response in 
the series was not paired with the appropriate 
stimulus, This did not occur frequently (and 
the data show that scoring these as correct 
did not change the basic conclusion), but the 
uncertainty felt by the $ may have prevented 
him from responding overtly. List structure 
clearly influenced learning and relearning, but 
some portion of the learning which allowed the 
S to align response terms and stimulus terms 
correctly during learning must have been for- 
gotten over the 24 hr. It is possible that the 
numerical stimulus terms were not always 
used as the effective stimulus terms and that 
the lists were treated more as serial lists than 
paired-associate lists, although what this 
means theoretically is not known. It appears 
that if S had learned, for example (for Lists 
4 and 5), that number 1 was paired with the 
first bird instance, number 4 with the first of 
three fish instances, and so on, that this part of 
the memory was lost over 24 hr. Otherwise, 
there is no reason why providing this infor- 
mation at recall (as is done in part under the 
anticipation method) should result in better 
performance. 

It seemed necessary to make a test of the 
notion that the discrepancy in the recall re- 
sults for the two experiments was due to a 
difference in methods. In the auxiliary experi- 
ment, to be reported now, Lists 1 and 5 were 
presented for anticipation learning, with re- 


topic of this preliminary 
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MEAN NUMBER OF TRIALS 


1 5 
LIST STRUCTURE 


Ficure 6. Learning and relearning scores 
Lists 1 and 3 as a function of the study-test (S-T) 
and anticipation (Ant.) methods. 


call taken after 24 hr. The expectation was 
that recall of these two lists would differ, with 
List 5 giving better recall than List 1. 


AUXILIARY EXPERIMENT 
Method 


Lists 1 and 5 were presented at a 1.5:1.5-see. rate 
for anticipation learning. he criterion of original 
learning was 12 correct responses on a single trial. 
Paced recall occurred after 24 hr. with relearning 
carried to one perfect trial, but a minimum of three 
t beyond the recall trial. Each list learned 
by a separate group of 18 5 gned to one of the 
two lists by a block-randomized schedule, 


Results 


In presenting the results, comparisons 
will be made with the two groups of Ss 
from the major experiment having the 
study-test method and paced recall, and 
who had learned Lists 1 and 5 to a criterion 
of 12 correct responses before the retention. 
interval. There were 18 Ss in each of these 
two groups. Since the auxiliary experi- 
ment was conducted after the major ex- 
periment, it is not known if the groups 
(study-test vs. anticipation) represent the 
same or equivalent populations. Therefore, 
differences in the levels of performance will 
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not be readily interpretable, altho: ^: inter- # 
actions between the methods d lists 
should be meaningful. 

The mean numbers of trials to ^ rn to a 
criterion of 12 correct, and the n num- 
bers of trials to relearn to one [x trial, 
are shown for the four groups i: ure 6. 
For original learning, list structu ppears 
to have a greater influence for an ‘pation 
learning than it does for the s test 
method. List structure is signifi: (F= & 
25.33), as is the interaction, F | 68) = 
4.19, p < .05. The interaction tween 
list structure and method does s occur 
during relearning, indicating t thé 
method's influence on learning is  nfined 
to the early stages. 

It will be remembered that for th: :tudy- 
test method the frequency of concept in- 
stances was directly related to rning. 
This was quite evident at the lower « . terion 
of learning. For List 5, under the «:iticipa- 
tion method of the auxiliary exp-«iment, 
the reverse was found. More speciñcally, 
two findings held across all eight « incepts 
of three instances each. First, the initial 


word of the three (the high-frequency in- 
stance) was never given correctly more 
times than the second instance (the me- 
dium-frequency instance). Second, the third 
word in each of the eight concept triads 
(the low-frequency instance) was always 
given correctly more times than the first 
instance of the succeeding concept. For 
List 1 under anticipation learning, the 
effect of word frequency was in evidence 
just as was true for all of the lists learned 
by the study-test method. The above 
facts would indicate that item learning 
under the two methods would be more 
reliable for List 1 than for List 5. The 
product-moment correlations for item learn- 
ing were .83 for List 1, and .58 for List 5. 
The positive relationship for List 5 reflects 
the commonality in learning by the two 
methods produced by serial position of the 
words, differences which were apparent 
under both methods. 

Mean overt errors per opportunity were 
greater in learning under the anticipation 
method (.22) than under the study-test 
method (.13), and the difference was re- 
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lia i, 68) = 9.24, p < .01. This may 
re a greater tendency to guess under 
t! icipation method than under the 
s test method. However, of the overt 
ei made under the two methods, the 
pe antage of these errors within the ap- 
I iate concept position (Level-1 errors 
as escribed earlier) was about the same for 
I 5, being 53% for anticipation and 54% 


f. the study-test method. 
he mean numbers of correct responses 
ecall are shown in Figure 7, along with 
mean number correct on the last learn- 
trial. Under the anticipation method, 
call was directly related to list structure, 
(1, 34) = 4.97, p «.05. The inter- 
tion between lists and methods for recall 
was also reliable, F (1, 68) — 4.44, p « .05. 
Recall for List 5 under the anticipation 
method was higher than performance on 
the last learning trial, 24 hr. earlier. To 
some extent, these comparisons are all in 
error, a matter which needs discussion. 
Consider first the recall of List 1 under 
the two methods. A conclusion from an 
inspection of Figure 7 might be that recall 
: superior following anticipation learning 
; that following learning by the study- 
st method. However, two factors must 
considered. First, the criterion fall 
which may occur uader the study-test 
method, and second, the learning which 
occurred on the last trial under the antici- 
pation method. ‘This last issue can be 
handled directly. A multiple-entry prob- 
ability analysis. (Underwood, 1964) was 
carried out to project performance to the 
hypothetical next trial under the anticipa- 
tion method. The mean expected value 
was 14.89, Since recall was 11.67 items, 
forgetting over 24 hr. was 3.22 items, or 
22%. To incorporate the criterion fall into 
the calculations for List 1 for the study- 
test method requires a rough estimation of 
values. In the main experiment, groups 
were given a recall test immediately after 
reaching a criterion of 20 correct responses. 
On this immediate test under paced recall, 
performance fell from a mean correct of 
20.89 correct on the last test trial to 17.72 
correct on an immediately following test 
trial. This represents a loss of 3.17 items 
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Recall 


Last trial OL 


MEAN NUMBER CORRECT 


LIST STRUCTURE 


Ficure 7. Number correct on the last trial of 
original learning and number recalled after 24 hr. 
for the study-test and anticipation methods. 


which is referred to here as the criterion 
fall. None of the groups had an immediate 
test following learning to a criterion of 12 
correct responses. Assume, however, that 
the group having the study-test method 
and learning to a criterion of 12 correct 
responses actually forgot 22%, the same 
as forgotten by the group having the 
anticipation method. This would require 
a criterion fall of 2.75 items. In light of 
the criterion fall of 3.17 items shown by 
the group learning to 20 correct responses, 
a fall of 2.75 items for those learning to 12 
correct would not seem to be seriously in 
error. Given this assumption, the con- 
clusion is that for List 1, the amount of 
forgetting shown under the two methods is 
roughly the same. 

Turning next to List 5, it must first be 
noted that it was not possible to project 
“next trial" performance for the Ss learning 
under the anticipation method because 
several of them learned in one or two trials. 
As an estimate, however, the percentage 
projection obtained for List 1 was used. 
This gives a mean expected next-trial value 
of 17.03 items. Since recall was 15.69 
items on the average, the loss over 24 hr. 
was about 7%. Deriving an expected loss 
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TABLE 1 


COMPARISON OF STUDY-TEST AND ANTICIPATION 
METHODS WITH REGARD TO CONCEPT INSTANCE 


LEARNING AND RECALL FOR Lis 
Anticipation Study 
Correct E DIT SE Ty 
responses 
Last OL Recall | Last OL | Recall 
Three 17 | 162 | 210 | 19 
Two 
1&2 22 14 18 18 
1&3 28 20 18 10 
2&3 62 68 18 8 
One 
1 6 2 11 16 
2 13 13 2 1 
3 21 12 5 9 


Note, See text for complete explanation, 


for List 5 following study-test learning 
(using the value of 2.75 for the criterion 
fall) gives a loss of 3.75 items from 12.92, 
or 29% forgetting. ‘These values must be 
considered approximations, but they lead 
to the conclusion that the loss was about 
the same under the two methods for the 
unstructured lists, but that the anticipation 
method led to better recall than did the 
study-test method for the highly struc- 
tured list. Both the estimate of loss for 
List 1 for the anticipation method (22%) 
and the loss for List 5 under the same 
method (7%) correspond to the findings 
of the earlier published experiment using 
this method (Underwood & Zimmerman, 
1973). 

The sources of the differences in the re- 
tention for List 5 following the learning 
by the two methods needs more detailed 
examination. The data which seem to aid 
in reaching decisions about the sources of 
differences are shown in Table 1. The 
data sheets were examined for the last 
trial of original learning and for the recall 
trial. A tabulation was made of the num- 
ber of correct responses which resulted from 
producing all three instances of a concept, 
the number which resulted from producing 
only two instances of a concept correctly, 
and the number which resulted from giving 
only one correct response from among the 
three possible for each concept. When two 
correct responses were given, they were 
broken down by position within the three 
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possible positions (1 & 2, 1 & ) 8S 
When a single correct response given 
the number falling in each o > thre 
positions was noted. All of this matio 
is given in Table 1 for the rigina 
learning trial, and for the recall under 
both methods, For example, ie las 
learning trial under the pation 
method, there were 39 cases i ich a 
three instances were given cor y» 
sulting in 117 correct responses, latt 
value being shown in Table | r th 
same condition there were 13 ses di 
which a single correct response give 
for a concept and which consis: f the 
second of the three instances in series 
of three. 

Several facts are to be noted in ible t. 
First, under the anticipation met! ther 
was an actual increase over 24 ho n the 
number of cases in which all three inces 
were correctly given (117 to 162). | view 


of the learning which may have takı 
on the last anticipation trial of ining, 
this increase must be viewed cai)! iouslys 
Nevertheless, it is in marked contrast to” 
the results found with the stud y-test 
method, where there was a loss of 108 
(210 — 102) complete triads. The second 
fact to note is the difference between the 
two methods in both learning and recall 
when less than three instances were given 
correctly. Under the anticipation method, 
the most probable positions for correct re- 
sponses when two were given are Positions 1 
2 and 3, and Position 3 when a single cor- 
rect response was produced. Comparable 
relationships do not exist for the study-test 
method, where there was more or less 
constancy among the positions. The third 
fact is that if the number of cases in which 
the first instance (by position) of a concept 
was given correctly was calculated (regard- 
less of the outcome for the two following 
instances) for recall, the sums were found 
to be 198 cases for anticipation, and 146 
for study—test. This seems to indicate that, 
(a) associative learning between the first 
instance of the concept and its position or 
stimulus number was greater or better dur- 
ing anticipation learning than during 
study-test learning, or (b), showing the S 
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the ouse terms during recall allowed 
hi leduce" the subsequent occurrence 
of instance of a different concept. 
Be tors may be involved, although the 
la ems more reasonable. Finally, the 
di ; Table 1 are clear in demonstrating 
th: "allure of S to recall the first instance 
ol concept provides verv little penalty 
ur r the anticipation procedure, since he 
( proceed to give other instances of the 
& cept for the two following positions. 


ie evidence seems to point to the fact 
learning under the study-test method 
clatively fragile or weak with regard to 
itions or stimulus numbers which mark 
first instance of a new concept. The 

me is probably true for the anticipation 
arning. However, on the recall trial the 
5 having learned by the study-test method 
receives no feedback information and he 
has few means to apply corrective pro- 
cedures based on any knowledge he had 
about the conceptual structure. In effect, 
he was reduced to responding on an item 
by item basis, much as is the case for an 
nstructured list. On the other hand, the 
having anticipation learning could apply 
rective procedures based on his knowl- 
about the conceptual structure of the 


GENERAL DISCUSSION 


The higher the conceptual organization of 
the lists the more rapid was the learning. It 
is presumed that this relationship was pro- 
duced because the greater the number of the 
conceptual levels the more precise was the 
placement of an item if the rules indicated by 
the conceptual relationships were followed. 
Nevertheless, this should not be taken to mean 
that the hierarchical structure in its totality 
entered into the learning of the most com- 
pletely structured lists. The 5 could learn 
the sequence of eight concepts of three in- 
stances each without reference to the higher- 
order concepts present in the list (animals, 
plants, beverages, minerals). So also, List 3 
could be learned by reference only to the four 
successive concepts of six instances each and 
without the living-nonliving distinction per se 
entering into the learning. The error data 
were clear in showing that at least one level of 
conceptual responding was involved in the 
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learning of Lists 3, 4, and 5, but these error 
data do not speak to the question of whether 
two or more conceptual levels were involved 
in learning Lists 4 and 5. To determine if 
two or more levels were involved would re- 
quire the use of a list in which the blocks of 
three instances were ordered randomly with 
respect to the more inclusive concepts. A 
list of this type was not included in the present 
study. 

In the previous study (Underwood & 
Zimmerman, 1973), learning was facilitated up 
through the structure corresponding to List 3 
in the present study. No further enhancement 
occurred for Lists 4 and 5, This is believed 
due to the fact that the present lists were made 
up of more obvious concepts and more ob- 
vious concept instances than were the pre- 
vious lists, Unlike the present findings, fre- 
quency of the concept instance with a concept 
did not influence learning in the previous 
experiment. In view of the findings of the 
auxiliary experiment, it seems now that this 
contradiction. is another by-product of the 
differences in the learning by the two methods. 

The major purpose of the present study was 
to attempt to identify the component(s) in 
memory which had led to the increase in re- 
call over 24 hr. as list structure increased. In 
an inelegant way, this study was quite success- 
ful in achieving its purpose. The use of the 
study-test method of learning was followed by 
a complete lack of any differences in the recall 
tests, whether paced or unpaced. Only relearn- 
ing, reflecting the same relationship as was 
found in original learning, was related to list 
structure. With the study-test method the 
original learning was apparently based on very 
weak associations between stimulus number or 
position and words marking the conceptual 
changes which occurred throughout the list. 
The loss of these associations over 24 hr. essen- 
tially made the conceptual nature of the lists 
useless as a recall vehicle. The items which 
were recalled were apparently based upon 
factors specific to them. 

Considering now the broader context of 
these studies, the question may be raised 
about the systematic importance of the earlier 
study (Underwood & Zimmerman, 1973), and 
the auxiliary experiment of the present report, 
in both of which the anticipation method was 
used. Recall and list structure were directly 
related in these studies. However, the error 
data from this previous study gave no support 
to the idea that forgetting of unstructured lists 
was produced by interference among the con- 


ceptual associations among the words. In the 
present study there was no evidence that con- 
ceptual associations played any interfering role 
in learning the unstructured lists. It does not 
seem now, therefore, that these types of lists 
have any special or pointed use for the study 
of interference as a source of forgetting. 
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CHANGE IN SPEAKER'S VOICE AND RELEASE 
FROM PROACTIVE INHIBITION 


JOHN M. GARDINER? anp PAULINE C. CAMERON? 


The City University, London, England 


Four triads of unrelated words were presented auditorily in the Brown- 


Peterson distractor task. 
spoken in a female voice 


Over Trials 1 to 3, half of the Ss heard the words 
and half in a male voice. On Trial 4, Ss in a control 


group heard the words in the same voice as on preceding trials and Ss in an 


experimental group heard the words in the alternate voice. 
tion (PI) developed rapidly over Trials 1 to 3 in both groups. 


Proactive inhibi- 
On Trial 4, 


however, Ss in the experimental group recalled significantly more words than 


did Ss in the control. It wa: 


concluded that Ss can utilize change in speaker's 


voice to reduce the cumulative effects of PI in short-term memory. 


Considerable interest has been shown 
ecently in the investigation of those at- 
tributes or dimensions of verbal items which 
Ss can utilize or identify in a variety of 
memory tasks (Underwood, 1969; Wickens, 
1970). One strategy, which has yielded a 
considerable body of evidence on the en- 
coding of various attributes of verbal items, 
has been the use of the release from pro- 
ctive inhibition (PI) technique. This 

chnique capitalizes upon the rapid build- 
up of PI which occurs over the first few 
tials in the Brown-Peterson distractor task 
('eppel & Underwood, 1962) to determine 
tlie extent to which Ss can utilize a specific 
change in the nature of the to-be-remem- 
Dered material to reduce the cumulative 
effects of PI. Wickens (1970, 1972) has 
reviewed many studies which demonstrate 
that when the "category" of the material 
is changed after three or four trials, recall 
will be better than if the category remains 
unchanged. Wickens has argued that the 
amount of recovery in performance (or 
release from PI) on the critical trial, as a 
function of any particular “category shift,” 
provides an index of the relative effective- 
ness of that category as an attribute or 
encoding dimension of memory. 


! Requests for reprints should be sent to John M. 
Gardiner, Department of Social Science and Human- 
ities, The City University, St. John Street, London 
ECIV 4PB, England. 

* Now at the Department of Psychology, Brunel 
University, Uxbridge, Middlesex, England. 


A second strategy has been to examine 
more directly Ss’ retention of informa- 
tion about various attributes of verbal 
items, as, for example, in determining the 
extent to which Ss can identify input 
modality following recall of a list of 
words presented in mixed modes (Bray & 
Batchelder, 1972). In this situation, there 
is evidence not only that Ss are able to 
make accurate judgments concerning be- 
tween-modality attributes (whether an item 
was presented visually or auditorily), but 
also concerning within-modality attributes 
(whether an item was presented in upper- 
or lowercase letters, or in a male or female 
voice). Hintzman, Block, and Inskeep 
(1972), for example, gave Ss an unexpected 
final recognition test following several free- 
recall trials in which the word lists were 
presented in one of three mixed modes: 
visual or auditory; upper- or lowercase; 
male or female voice. In all cases, the 
results showed that, given recognition of 
the item in the final test, Ss could identify 
the input mode with considerable accuracy 
for at least several minutes after presenta- 
tion. In a rather different vein, Craik and 
Kirsner (in press) investigated the persis- 
tence of information about speaker's voice 
(maleor female) using the continuous recog- 
nition paradigm of Shepard and Teghtsoon- 
ian (1961). Craik and Kirsner found that 
when test words were represented in the 
same voice as in the initial presentation, 
recognition performance was better than 
when the second presentation was in a 
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different voice. This “same voice" facilita- 
tion did not decline over a 2-min. lag. 
There is thus good evidence that Ss 
retain various aspects of input mode in- 
formation for at least several minutes fol- 
lowing item presentation, and also that 
such information may in certain circum- 
stances facilitate memory performance. 
These results are important in that they 
imply that the representational charac- 
teristics of verbal material may persist for 
much longer than many theorists have 


maintained (e.g, Crowder & Morton, 
1969). 
Such evidence suggests also that Ss 


might be able to utilize input modality 
attributes to reduce the cumulative effects 
of PI in the Brown-Peterson distractor 
task. Indeed, several recent studies (Hop- 
kins, Edwards, & Cook, 1973; Kroll, Bee, 
& Gurski, 1973) have established that be- 
tween-modality shifts (visual to auditory 
or auditory to visual presentation) can lead 
to dramatic release from PI effects. The 
results of several within-modality shift 
studies have also been reported. Within 
the visual modality, Shearer and Wickens 
(unpublished study cited by Wickens, 1972) 
found that a shift from lower- to upper-, 
or upper- to lowercase letters did not pro- 
duce a significant release effect. Shifting 
slide background area (Turvey & Egan, 
1969), and the color of the slide background 
(Reutener, 1972), were, however, found to 
be effective. Somewhat surprisingly, the 
effect of shifts within the auditory modality 
have not hitherto been reported. Bearing 
in mind the results obtained by Craik and 
Kirsner (in press) and Hintzman et al. 
(1972), it seems quite possible, for example, 
that change in speaker's voice would lead 
to a release from PI effect. The present 
study was designed to investigate this 
possibility. 


METHOD 


Design. All Ss received four triads of unrelated 
words presented auditorily in the Brown-Peterson 
distractor task. There was an experimental and a 
control group of Ss. Over Trials 1 to 3, half of the 
Ss in each group heard the words spoken in a female 
voice and half in a male voice. On Trial 4, Ss in the 
control group heard the words spoken in the same 
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voice as on previous trials. The Ss in the e 
mental group heard the words spoken in the alter 
nate voice on Trial 4. 

Subjects. The Ss were 120 volunte« 
male undergraduate students, who were paid fi 
their services, Sixty Ss were assigned randomly ti 
each group and the Ss were tested individually, 

Materials, "Thirty unique 12-word sets were cı 
structed by sampling without replac 
pool of 600 common two-syllable nouns 
was divided arbitrarily into four word-triads. 
word sets were yoked across experimental conditioi 
such that each set was used four times once for 
each voice condition within each group. 

Procedure. The word lists were recorded on @ 
Brenell STB 2 tape recorder and subsequ 
presented to S via a loudspeaker. Each | 
tested in a small soundproof cubicle. 
procedure was as follows. Each trial began when 
the tape recorder w witched on and 5 heard & 
nted at the rate of one word 
per sec, s then switched off and the 
“click” made by the switch was the S's cue to begin 
the distractor task. This task was that of reporting 
aloud the sum of each of a series of pairs of random 
single digits from a sheet of these placed on a table? 
in front of the S, The addition t was paced at 
a Lsec. rate by the click of a metronome, After 
. addition, E signaled with a pencil tap that 

to attempt oral recall of the three to-bes- 
remembered words, The S was allowed 9 sec. for 
recall, after which a further pencil tap signaled that 
the next set of words was about to be presented, 
There was no intertrial interval, and each complete 
trial sequence lasted for 30 se 

All Ss were instructed that the experiment in- 
volved two “unrelated” tasks, an arithmetic task 
and a memory task, and that performance on both 
Ss were given an initial 
practice at the distractor task alone. They were 
then given seven lead-in trials presented in the 
same manner as described above. Lead-in trials 
comprised triads of one-syllable common words and 
all Ss received the same words. Prior to the lead-in 
trials the Ss were informed that the words would 
sometimes be spoken in a male voice and sometimes 
in a female voice, that changes in voice occurred at 
"random," but that all they had to do was to re 
member the words. The seven lead-in trials were 
in fact a random sequence with respect to voice. 
Between the lead-in trials and the four experimental 
trials there was an unfilled interval of about 2 min. 


most]: 


RESULTS 


Preliminary analyses indicated that there 
were no reliable differences as a function of 
voice (male or female) and the data were | 
pooled over this variable. Figure 1 shows 
the percentage correct recall for the experi- 
mental and control groups. One point was 
given for each word recalled correctly, re- 
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ga s of its output position. The figure 
sl iat recall declined rapidly in each 
ver the first three trials. On the 


trial, however, the experimental 

showed a substantial recovery in 
mance, whereas performance in the 
ol group remained essentially the same 
à was on Trial 3. 

parate analyses were carried out on 

d. a from the first three trials and on data 
(^on Trial 4. Considering Trials 1 to 3, 
' — results of an analysis of variance showed 
t the only significant effect was that due 
trials, F (2, 236) = 33.15, p < .001. 

he F ratios associated with groups, and 
oups X Trials, were less than unity. 
hese results indicate that the build-up of 

I over Trials 1 to 3 was reliable and did 
not differ with respect to groups. The 
result of an uncorrelated ¢ test carried out 
on data from Trial 4, however, confirmed 
that the superiority shown by the experi- 
mental group was highly significant, ¢ (118) 
= 3.95, p < .001. 

Following the procedure suggested by 
Wickens (1970) the amount of release from 
.l was assessed by dividing the difference 
between the experimental and the control 

oups on Trial 4 by the amount of loss 
shown by the control group from Trials 1 
to 4, Use of this procedure gave a per- 
centage recovery of 67%. Relative to 
previous findings (Wickens, 1972, Figure 
3), this represents a substantial amount of 
release. 

^fter recall of the final triad, each S was 
asked which voice or voices had been used 
over the last four trials. Somewhat sur- 
prisingly, only 33 Ss in the control group, 
and 29 Ss in the experimental group, re- 
ported accurately the voice in which the 
final triad had been presented. Since there 
were 60 Ss in each group, these figures are 
very close to a chance level of performance, 
and many Ss commented that they paid no 
attention to voice, or were guessing when 
they reported on which voices they had 
heard. In this context it is of interest to 
note how Ss' reported awareness of the 
change in voice relates to the release effect. 
Of the 60 Ss in the experimental group 22 
reported the change in voice from Trials 3 
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Ficure 1. Percentage correct responses for 
control and experimental groups, 


to 4 accurately. Taking an increase in the 
number of words recalled by the S from 
Trials 3 to 4 as an index of “individual 
release," 12 of the 22 Ss who noticed the 
change and 16 of the 38 Ss who did not 
notice the change showed individual release 
effects. These results imply that the re- 
covery in performance shown by Ss in the 
experimental group on trial four did not 
depend upon Ss’ awareness of the change 
in voice. 
DISCUSSION 


Clearly, the results of the present study show 
that change in speaker's voice lead to a dra- 
matic recovery from the effects of PI in the 
Brown-Peterson distractor task. Thus speak- 
er's voice may be added to the already large 
list of attributes which Ss apparently encode 
in short-term memory for verbal items. 

'The magnitude of the release effect obtained 
runs counter to the general finding that 
“physical,” as opposed to semantic shifts, do 
not typically produce large release effects 
(Wickens, 1972). Why should change in 
speaker's voice be an exception to the more 
general result? One possibility is that the Ss 
in the present study were "set" to encode 
items in terms of voice. That is, the Ss were 
informed beforehand that the triads might be 
spoken in either a male or a female voice and 
received several lead-in trials in which they 
heard each of the two voices. In most previ- 
ous release from PI studies, with the exception 
of some between-modality shift experiments 
(e.g, Hopkins et al., 1973), the Ss were not 
informed beforehand that any change in the 


866 


nature of the to-be-remembered material would 
occur. 

However, as Underwood (1972) has pointed 
out release effects in general may to some 
extent reflect "priming" the S, implicitly if not 
explicitly, to encode the material along the 
specific dimension being investigated. We 
may well be "overloading memory" if we as- 
sume that 5s habitually or automatically en- 
code verbal materials along all the many di- 
mensions so far shown to yield release effects. 
The more cautious conclusion, of course, is 
simply that Ss can utilize such dimensions to 
reduce the effects of PI when given the 
"opportunity" to do so, and, therefore, that 
the dimensions may be represented, in some 
sense, by different memory codes. In view of 
Underwood's point, it may be worth empha- 
sizing that this conclusion is independent of 
any conclusion with respect to the possible 
influence of priming the use of particular 
memory codes. Underwood's argument im- 
plies, rather, that the employment of such 
codes may, to a large extent, be "optional" 
rather than "automatic" (cf. Wickens, 1970). 
If this be so, then an urgent task for further 
research will be that of investigating the con- 
ditions that tend to favor the use of one en- 
coding strategy rather than another. Reutener 
(1972), for example, found that shifting the 
color of the slide background was effective in 
producing release from PI when the stimulus 
materials were arabic numbers or spelled out 
numbers, but not when the materials were 
words drawn from a taxonomic category. On 
the basis of this result, Reutener suggested 
that a "physical" shift may be effective only 
when the to-be-remembered material is not 
words, While such a conclusion is also in line 
with results reported by Turvey and Egan 
(1969) and Shearer and Wickens (unpublished 
study cited by Wickens, 1972), the results of 
the present study do not fit into this pattern. 
The results of the present study, rather, sug- 
gest that 5s may also place more reliance on 
"physical" codes when the to-be-remembered 
material consists of relatively “unrelated” 
words. Nevertheless, one plausible factor 
which may affect the extent to which Ss 
utilize "physical" forms of coding may well 
be the degree of meaningfulness which Ss can 
extract from the stimulus materials. 

That the Ss in the present study were not 
able to report accurately which voices they 
had heard in the final trials is somewhat sur- 
prising in view of the results obtained by 
Craik and Kirsner (in press). These authors 
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found that Ss could assign speaker's voice 
65-75% accuracy after a 2-min. inter 
This discrepancy may be related to the fae 
that whereas Ss in the Craik and Kirsn 
study identified presentation voice in the com 
text of the presented words, Ss in the presen 
study were required to recall presentatio 
voice in the absence of the words. We suggi 
that the inability of our Ss to report accuratel 
Which voices they had heard in the final tri 
is in good agreement with the notion that tht 
auditory information in speaker's voice was 
retained in some literal or representatio 
form. Our data may thus be viewed as adding 
to the already substantial body of evidence 
(Craik & Kirsner, in press; Hintzman et aly 
1972) which indicates that auditory informa: 
tion may persist for much longer than many 
theorists have maintained (e.g., Crowder 
Morton, 1969). Further, the present study) 
extends these findings by demonstrating tha 
such information can to a large extent elimi- 
nate the effects of PI in short-term memory. 
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THE COMBINED EFFECTS OF STIMULUS AND RESPON & 
CONDITIONS ON THE DELAY IN IDENTIFYING 
THE PRINT COLOR OF WORDS! 


RAYMOND L. MAJERES? 


Western Illinois University 


The interaction between stimulus and response conditions in the Stroop inter- 
ference effect was studied in two experiments. Response conditions in Experi- 
ment I required college students (N — 36) to identify colors by using verbal 
responses (color names, color associates, or single letters). In Experiment II, 
Ss (N = 36) identified the colors by writing the first letter of the responses 
used in Experiment I, Stimulus conditions in both experiments were color 
only, color names in nonmatching print colors, and color-associated words in 
unrelated print colors. In line with predictions from Morton's word recogni- 
tion model, the Stimulus X Response interaction w ignificant in both ex- 
periments (p < .01). Differences between the experiments indicate that the 
type of response (verbal or motoric) influences the delaying effect of the words. 


Naming the color of printed words (e.g., 


of both color and word, with inter crence 
such as friend, sky, or run printed in red) arising from difficulties in respons: selec- 
takes more time than naming the color of tion (see also Dyer, 1973). 
colored patches (Fox, Shor, & Steinman, Studies by Treisman and Fearnley (1969) 
1971; Klein, 1964). This effectis especially and Hock and Egeth (1970) have ques- 


strong when the words are color names 
(Stroop, 1935). It is as if the highly de- 
veloped systems involved in reading oper- 
ate autonomously and interfere with simple 
color naming. 

Klein (1964) used different kinds of 
verbal material in addition to color names 
as potential sources of distraction. It was 
found that a variety of verbal materials, 
including nonsense syllables, could delay 
color naming. The nature of the words 
in terms of their semantic relationship to 
the colors (other color names, color-related 
words) and their familiarity (word-count 
frequency) influenced the degree of delay. 
These particular results have been repli- 
cated, and a variety of word characteristics 
have been found to influence the delay 
(Dyer, 1973). Both Stroop (1935) and 
Klein emphasized the role of response 
competition, involving central as well as 
peripheral aspects. Implicit in their em- 
phasis was the initial parallel processing 

1 This research was carried out while the author 
was a postdoctoral fellow supported jointly by 


Educational Testing Service and National Institutes 
of Health Grant MH-08260. 

* Requests for reprints should be sent to Raymond 
L. Majeres, Department of Psychology, Western 
Illinois University, Macomb, Illinois 61455, 


tioned this assumption, pointing ou! that 
the task as used confounded stimulus and 
response effects. They attempted to elimi- 
nate this confounding by devising tasks 
that did not require the overt naming of 
colors, but rather involved judgments of 
perceptual match or a match with “target” 
stimuli held in memory. Treisman and 
Fearnley suggested that the critical feature 
of the Stroop effect is that a comparison 
must be made across hypothesized word 
and color “analyzers,” and such compari- 
sons are less efficient than those compari- 
sons made within an analyzer, Also, 
"across-analvzer" comparisons were be- 
lieved to be more susceptible to interference 
effects from values of the irrelevant at- 
tribute. However, Egeth, Blecker, and 
Kamlet (1969) found that an interference 
effect can be obtained in "within-analyzer" 
comparisons. In their task the responses 
were verbalized and the response words 
(same and different) were printed in the 
different ink colors. That is, the identity 
of the set of words used in the stimuli to 
the set of response terms did produce à 
delay, even though the task involved within- 
attribute matching. 

A few other studies have provided data 
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on \iual effects of both stimulus and 
res -> variables. Most of these studies 
we signed to investigate language orga- 
niz n in bilinguals (Dalrymple-Alford, 


19 Preston & Lambert, 1969). Preston 
an Lambert considered initial identifica- 


ti is one of the factors in the observed 
in ‘ference effect, but output processes 
W considered to be the major factor in 
p ducing the interference effect. Dal- 


;ple-Alford, in a study using Arabic- 
;lish bilinguals, introduced the possi- 
tv that the stimulus word primes all 
rds associatively related to it. That is, 

t only does the stimulus word prime the 

rong response, but it also primes associa- 
‘ively related responses. There is more 
recent evidence that this priming may 
extend beyond simple associational pro- 
cesses and probably involves the word's 
semantic category as well (Warren, 1972). 
This is in contrast to Hock and Egeth's 
(1970) interpretation, where the responses 
were considered to sensitize Ss to the 
stimulus word. It may be that the stimu- 
ius word and the response both alter the 
availability of some central linguistic unit. 

Morton (1969) has suggested that the 
ivailability of such mediational units, 
called logogens, is affected by stimulus in- 
formation (seeing a color, seeing a word, 
hearing a word, etc.) and by the experi- 
mental context. In the Stroop (1935) 
paradigm, the experimental context refers 
primarily to the instructions to identify 
colors, which presumably increases the 
availability of (primes) color names, color- 
related words, and perhaps entire color 
categories (Warren, 1972). A third factor 
affecting the availability of the logogen to 
a serial output buffer is the response. Re- 
sponse repetition (overt or covert) is be- 
lieved to lower the thresholds of the re- 
spective logogens. Hence, in the Stroop 
task, both stimulus and response conditions 
should influence logogen availability. 

For example, the use of color names both 
as stimulus words (printed in incongruent 
colors) and as response terms (as names of 
the print colors) should provide the great- 
est competition in terms of the greatest 
number of incompatible logogens reaching 
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threshold: color identification is required, 
which influences availability of color names 
in general; color names are irrelevant 
sources of stimulus information, which in- 
creases the availability of incompatible 
color names; and color-name responses are 
required, which lowers the thresholds for 
color names. Likewise, using color associ- 
ates as response terms, such as saying fire 
for red, should have a direct priming effect 
on color associates and not on the color 
names. If color associates are in the stimu- 
lus card as irrelevant, incompatible infor- 
mation and are also used as responses, the 
logogens for color associates should be 
highly available (via stimulus information), 
and their thresholds should be lowered 
(via response requirements), Such condi- 
tions should produce a dramatic delaying 
effect analogous to the original Stroop 
(1935) task. However, since the context 
is still color identification, which has its 
primary effect on the availability of color 
names rather than of color associates, the 
effect may be somewhat reduced. 

Using single letters as vocal responses 
should not alter the threshold for color- 
associate or color-name logogens. How- 
ever, stimulus words (color names and 
color associates) and context conditions 
should produce delaying effects similar to 
those found in the original Stroop (1935) 
task, with color names producing more 
delay than color associates. But the mag- 
nitude of the delay may be reduced, since 
single-letter responses should not alter 
logogen thresholds, except possibly by 
verbal mediation. 


EXPERIMENT I 


An initial experiment was designed to 
demonstrate the joint influence of both 
stimulus and response conditions on the 
delay in color naming, using procedures 
directly comparable to those used by 
Stroop (1935). Both the kind of words in 
the stimulus (color names, color-related 
words, and color only with no verbal dis- 
traction) and the kind of color-identifying 
responses (conventional color names, color- 
associated words, or arbitrary letters) were 
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TABLE 1 
COLOR IDENTIFICATION TIMES (IN SEC.) FOR T) PONSE GROUPS ON THREE 
STIMULUS CONDITION ^ E 
= Trials | Bs: 
Type of interference M en 
1&2 3&4 5&6 7&8 | 
SUR Seen 19.6 19.9 T uo Us | 1 
Color associates 27.3 26.1 25. 25.2 26. | 3 
Color names 374 35.6 32.8 32.6 34.6 5 
proce da 342 33.5 33.7 314 332 1 
Color associates 47.7 46.9 43.3 41.9 45.0 5 
Color names 46.2 45.0 43.0 40.2 43.6 4 
hs SL 
MET Ane st 41.1 38.3 ds yaa 32 d 
l ciat 48.5 43.4 Š à 2. 
Color Sas ET 54.4 49.7 43.4 41.0 47.1 ] 
Note, Abbreviations: N = color names, A = color-color associates, SL = single letters. 
systematically varied to investigate their groups. In the first group, Group N, Ss id. tified 
joint influence the colors on the three stimulus cards using «vene 


Method 


Subjects. Thirty-six students (18 males and 18 
females) from a private and a state college in New 
Jersey volunteered to take part and were paid $2 
for a 1-hr. session. 

Materials. Three sets of stimulus cards were 
prepared, as described below. 

Card 1. The words red, green, blue, and orange 
were printed in lowercase letters (pica) by a multi- 
lith-offset process on 83^ X 11" sheets with eight 
rows of five words each. The words were printed in 
incongruent ink colors (words never congruent with 
ink color) with each color appearing 10 times in a 
quasirandom order (doublets were avoided within 
rows). Each word appeared about the same number 
of times (4 times) in each of the three possible in- 
congruent colors. This was the color-names stimu- 
lus condition. 

Card2. Using the same format as in Card 1, the 
words fire, sky, grass, and juice were printed in 
colors incongruent with the colors suggested by 
these words: red, blue, green, and orange, respec- 
tively. This card was the color-associates stimulus 
condition. The color-associated words and the 
color-names were quite comparable in terms of 
Thorndike—Lorge (1944) word-count frequency, with 
all rated AA except orange (A) and juice (37). 

Card 3. Sets of five asterisks (approximating the 
average length of the words) were printed with each 
set in one of the four colors in the same format as 
was used on Cards 1 and 2. This was a noninter- 
ference or color-only stimulus condition, in that no 
words were printed on this card. There were two 
different forms of each of these three stimulus cards, 

Procedure, Equal numbers of male and female 
Ss were assigned to three different experimental 


tional color names (red, blue, green, and orang. . In 
the second group, Group A, Ss identified the colors 
by using color-associated words, saying fire red, 
sky for blue, grass for green, and Juice for orange. 
In the third group, Group SL, the identifying re- 
sponses were single letters: v for red, x for cicen, ¥ 
for blue, and z for orange. All Ss were given pr 
tice trials in serially naming the 40 color-only items 
using their particular set of identifying responses to 
a performance criterion of 60 sec. This practice 
was designed to ensure that the responses were 
sufficiently familiar to permit smooth and accurate 
responding. No S in the N or A groups required 
more than two practice trials to reach criterion. 
Of course, Ss in the single-letter (SL) group typi- 
cally required more extensive practice to reach the 
criterion since the labels they used were novel and 
arbitrary. 

After the initial training trials, Ss in all three 
response groups identified colors on the three dif- 
ferent stimulus cards. There were eight trials on 
each of the three stimulus conditions. Presentation 
of each of the three stimulus cards constituted à 
trial. The color-only stimulus condition always 
preceded the other two stimulus conditions on each 
trial. The order of the color-associates and color- 
names cards was alternated between trials. Half of 
the Ss began with a color associates-color names 
order, and half began with the reverse order. 

A stopwatch was used to obtain performance 
times, and a 30-sec. interval between cards provided 
Ss with a brief rest and enabled the recording of 5s 
performance times. The Ss were instructed to 
correct all errors so that any errors would be re- 
flected in the time score. The number of uncor- 
rected errors was too limited to permit a meaningful 
analysis. 


IMULUS AND RESPONSE CONDITIONS AND THE STROOP EFFECT 


Res 

i mance times on each stimulus card 
we lyzed using a 3 X 3 X 2 X 8analy- 
sis ariance design with response condi- 
tior (color names, color associates, and 
sir letters), stimulus conditions (color 
na es, color associates, and color only), 
se and trials (eight) as factors. Response 
co ditions and sex were between-Ss varia- 
L and stimulus conditions and trials 


within-Ss variables. 

‘lean times to identify the colors under 

: different experimental conditions are 

ven in Table 1. Response conditions, F 

^, 30) = 17.01, p < .001; stimulus condi- 
ions, F (2, 60) = 190.55, p < .001; and 
heir interaction, F (4, 60) = 13.48, p < 
.001, were significant. There were signifi- 
cant trial effects, F (7, 210) = 28.11, p < 
.001; as well as significant Trial X Re- 
sponse, F (14, 210) = 4.48, p < .001, and 
Trial X Stimulus, F (14, 420) = 4.56, p < 
.001, interactions. Neither main effect for 
sex nor any of the other interactions were 
significant. Since the Stimulus X Re- 
ponse interaction was significant, the sig- 

ificant main effects are interpreted within 
at context. 

Stimulus X Response interaction. The 
prediction of a Stimulus X Response inter- 
action was confirmed. The degree of delay 
in color naming did depend on the particu- 
lar combination of stimulus and response 
terms (Figure 1). Tests of simple effects 
indicated that the two interference condi- 
tions (color names and color associates) 
did produce significant delays in color iden- 
tification, and this was the case for each of 
the three response groups (Newman-Keuls, 
ps < 01). Differences between the color- 
names and color-associates stimulus condi- 
tions were significant (Newman-Keuls, ps 
< .01) for both the color-names response 
group (34.6 vs. 26.0) and for the single- 
letters response group (47.1 vs. 42.9), but 
this difference was not significant for the 
color-associates response group (43.6 vs. 
45.0). 

Practice effects. With respect to the dif- 
lerent response conditions, the practice 
effects were more marked for the newly 
learned conditions (identifying colors with 
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Figure 1, Mean vocal identification times for 
three stimulus conditions by three different response 
groups. 


color associates or with arbitrary single 
letters) than for the already well-learned 
color-names response condition (Table 1). 

The effects of practice were much greater 
for the two interference conditions than 
for the color-only condition, An analysis 
of variance with response conditions and 
sex collapsed indicated that this effect was 
reliable, for the Stimulus X Trials inter- 
action, F (7, 245) = 7.92, p < .001. Prac- 
tice not only improved the speed of color 
identification per se, but also improved the 
efficiency with which the delay produced 
by the words was overcome. However, 
the nature of the Stimulus X Response 
interaction was not significantly affected 
by practice. 


EXPERIMENT II 


Pritchatt (1968) argued that the Stroop 
(1935) effect is primarily a verbal phe- 
nomenon. Using a simple motor response, 
tapping colored keys, it was found that the 
magnitude of the Stroop effect could be 
reduced but not eliminated. Pritchatt rea- 
soned that sufficient practice may eliminate 
the Stroop effect by reducing the role of 
verbal mediation. 

Also, according to Morton's (1969) model, 
vocal responses both increase the word's 
availability because of auditory stimulus 
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TABLE 2 


COLOR IDENTIFICATION TIMES (IN 


C) FoR THR 


E Grours UNpER Tux 


YrIMULUS CONDITIONS IL 
Trials ; 
Type of verbal interference — =- - M eel 
1&2 3&4 5&6 7&8 
Response Group N 
None 28.4 25.8 25.0 26.6 
Color associates 36.7 31.0 29.6 32.5 1 
Color names 41.9 34.6 33.8 37.3 3 
Response Group A | 
None 37.1 36.9 35.5 32.3 | 
Color associates 54.9 44.9 43.8 41.3 | 12 
Color names 53.5 46.5 42.6 40.9 | ^4 
Response Group SL | 
None 41.2 38.8 35.4 35.5 37.7 | 
Color associates 46.6 42.7 40,3 36.5 41.5 | 1 
Color names 57.4 48.2 44.5 39.5 47.4 | 7 
Ice bitevtelidnis Nie UNE UU MUN CET ingle letters. XXIX 
information and lower the word's thresh- names) motor response may result . : a re- 
old because of response repetition. A duction or elimination of the media‘. role 
written and abbreviated response, such as of the color name with practice, "Th result 
the first letter of each of the responses of of such conditions should be a reduction in 
the first experiment, should eliminate the the interference effect. However. the con- 
auditory stimulation and also eliminate text, the requirement for color identifica- 


other feedback from overt speech mecha- 
nisms, replacing them with a modest 
amount of visual and kinesthetic informa- 
tion. 

Furthermore, as suggested by Pritchatt 
(1968), using a simple, mediated (by color 
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FIGURE 2. Mean written identification times for 
three stimulus conditions by three different response 
groups. 


tion, would be unchanged, and covert re- 
sponding also should lower word thresholds 
(Morton, 1969). The net result should bea 
general pattern of results similar to those 
obtained in Experiment I (a significant 
Stimulus X Response interaction), how- 
ever, practice may eliminate the Stroop 
effect by the elimination of verbal media- 
tion. In this experiment, simple verbally 
mediated “motor” responses replaced the 
vocal responses of Experiment I. The 
design was analogous to the first experi- 
ment except for the response requirement. 


Method 


Subjects. Thirty-six volunteers (18 males and 
18 females) from a New Jersey state college served 
as 5s and were paid $2 for a 1-hr. session. 

Procedure, "The major modification of the pro- 
cedure was to require a written response rather than 
a vocal one. This was done by having Ss in three 
response groups write the first letter of the identify- 
ing response that was used in the first experiment 
below each item. 

All Ss were given training trials where they 
vocalized the correct responses just as in the first 
experiment. The Ss then were given one written 
practice trial of 40 responses to colors only, and then 
the entire experimental series was given. Each 
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was ie sheet at a time and was timed from 


the f the last response. Errors were to be 
corr »y crossing out the incorrect letter. Un- 
corre errors, as in the first experiment, were 
rare «| insufficient for a meaningful analysis, All 
Ss w ested individually. 

Re 


in the first experiment, a 3 X 3 X 2 
inalysis of variance design was used. 

n times for the different experimental 
(ditions are presented in Table 2. The 
lysis of performance times yielded sig- 
icant main effects for the stimulus con- 
tions, F (2, 60) = 101.05, p < .001; re- 

onse groups, F (2, 30) = 7.02, p « .01; 
nd trials, F (7, 210) = 53.78, p « .001. 
\ significant Stimulus X Response interac- 
tion was found, F (4, 60) = 5.00, p < .01; 
as well as significant Stimulus X Trials, F 
(14, 420) = 7.51, p < .001, and Response 
X Trials, F (14, 210) = 1.89, p < .05, 
interactions. 

Unlike the first experiment, there were 
significant Stimulus X Response X Trials, 
F (28,420) = 1.94, p < .01, and Stimulus 
X Response X Sex, F (4, 60) = 3.73, p < 

^, interactions. This latter interaction 
vas due to females in the color-names re- 
ponse group doing significantly better than 
females in the single-letters response group 
on the color-names stimulus condition, F 
(1,30) = 11.45, p < .01; this was the only 
such difference to reach statistical signifi- 
cance. The sex main effect and all other 
interactions were not significant. 

Stimulus X Response X Trials interac- 
tion. The significant Stimulus X Response 
X Trials interaction was the result of a gen- 
eral decrease in the magnitude of the dif- 
ferences among the groups over trials 
(Table 2). The effects of practice did not 
eliminate the delay produced by the words 
by the final trial (Trial 8) for the color- 
names (first letter of color name written) 
group (Newman-Keuls, p < .01). How- 
ever, the delay produced by the color 
associates was not significant on the last 
trial. Similar results were found for the 
single-letters group. There was still a sig- 
nificant delay in color identification for the 
color-names stimulus condition (Newman- 
Keuls, p < .05), but the delay was non- 
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significant for the color-associates stimulus 
condition. For the color-associates re- 
sponse group, there were significant inter- 
ference effects for both stimulus conditions 
(Newman-Keuls, p < .01). 

Stimulus X Response interaction, Again, 
as in the first experiment, the predicted 
interaction of stimulus and response condi- 
tions was found (Figure 2). Tests of simple 
effects indicated that the two interference 
conditions (color names and color associ- 
ates) again produced significant delays in 
color identification for each of the three 
response groups (Newman-Keuls, ps < .01). 

Differences between the color-names and 
color-associates stimulus conditions were 
significant (Newman-Keuls, ps < .01) for 
both the color-names and for the single- 
letters response groups, but this difference 
was not significant for the color-associates 
response group. 


DISCUSSION 


Interference effects. For each of the three 
vocal response groups of Experiment I, times 
to identify colors on the color-word cards were 
significantly greater than on their respective 
color-only cards. Hence, the words signifi- 
cantly increased the time required to identify 
the colors under all three response conditions. 
An interference effect was obtained indepen- 
dent of the type of identifying response, How- 
ever, the amount of interference appeared to 
be related to the type of response and to the 
type of verbal material on the stimulus card. 

The results from the second experiment were 
remarkably consistent with those of the first 
experiment. Again, the stimulus words (color 
names and color associates) produced signifi- 
cant delays in color identification for all three 
response groups. The delaying effects of color 
associates, first reported by Klein (1964), and 
color names were replicated and extended to 
several other response conditions. 

Maximum delay in color naming (78% in- 
crease over control conditions) was found when 
the distracting stimulus words were color 
names and the response terms were also color 
names. Similarly, in Experiment II, the 
analogous conditions (color-names stimulus 
conditions and color-names mediated response 


‘conditions) produced the greatest amount of 


delay, but the magnitude of the effect was 
reduced (only 40% increase over control 
condition). 


x] 
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The next most interfering condition was 
expected to be the color-associates response 
group under the color-associates stimulus con- 
ditions (Experiment I). This set of conditions 
was found to produce about the same delay 
as the color-names stimulus condition when 
using color associates as responses (color as- 
sociates, 35.5% vs. color names, 31.4%). In 
Experiment II, the color-associates stimulus 
condition using the color-associates mediated 
responses produced similar results (30.2% vs. 
29.4%). Although the effect of having color 
associates in both stimulus and response con- 
ditions did not result in the pronounced effect 
characteristic of the Stroop conditions, the 
effect was large enough to eliminate the 
usually observed difference between color- 
names and color-associates stimulus conditions. 
The results indicate that the traditional Stroop 
(1935) phenomenon is a product of both the 
particular stimulus words and the response 
terms. 

Practice. Practice in both experiments re- 
duced the delay in color identification produced 
by the words. There was improvement in 
color identification per se, but, in addition, 
there was improvement in overcoming the 
interference. In the color-names response con- 
dition of Experiment II, only the color-names 
stimulus condition still produced significant 
interference by the final trial. The delay for 
the color-associates stimulus condition was 

not significant. A similar result was found 
for the single-letters response group. This re- 
sult is interpreted as evidence that context, 
vocal color identification, maintained the 
availability of color names in some sustained 
fashion, which is a feature of Morton's (1969) 
“context system." 

The spoken responses in Experiment I, in 
contrast to Experiment II, did not result in 
comparable practice effects. Presumably, the 
activation or involvement of the speech system 
resulted in an attentional bias for verbal stimuli 
which was difficult to overcome and which pro- 
duced significant interference for color-names 
and color-associates stimulus conditions for all 
three response groups. While in Experiment 
II, with the motor response of writing, only the 
delay produced by the color names resulted in 
sustained delaying effects, probably due to the 
color identification context. Since the color- 
associates stimulus condition only produced a 
sustained delay in the color-associates medi-' 
ated response condition, this suggests that the 
thresholds of color associates were lowered by 
this response requirement, thereby preventing 
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the elimination of color-associate mediatigi 
with practice. This interpretation is supported 
by the finding that color-associates produce 
no significant delay (Trial 8) in the other twé 
response conditions, color-names and single 
letters conditions. 

The results support the interpretation of the 
Stroop (1935) phenomena in terms of verbal 
mediation processes (Pritchatt, 1968) which 
are affected by experimental context, stimuli 
priming (Dalrymple-Alford, 1968; Morton, 
1969), and response priming (Hock & Egetlij 
1970; Morton, 1969). 
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JATCHES IN THE SIMULTANEOUS MATCHING TASK! 
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State University of New York at Buffalo 


The study compared the effects of string lengths and familiarity on reaction 
time (RT) to physical and analog matches in an attempt to determine the 
nature of a size variation process assumed to occur during analog matches. The 
results replicated prior findings that "same" RTs to familiar (word) strings 
were faster than to nonfamiliar strings, and that this advantage increased over 


string length, 


For nonfamiliar strings analog matches were slower but this 


difference was not influenced by string length. Thus, adjusting for size does 
not seem to occur in the matching or decision stages which appeared to be 
sequential. However, same RTs to analog and physical matches did not differ 
when familiar stimuli were used. This suggests that, if size variation is to be 
postulated as an early processing stage, familiarity effects must also occur 


quite early. 


As has previous research (Eichelman, 
1970), the present study used Sternberg’s 
(1969) methodology to investigate the in- 
formation processing involved in the simul- 
taneous matching task. Posner (1969) 
concluded that stimuli can be matched on 
at least two levels, one related to physical 
ittributes, the other dependent on past 
xperience (e.g., the name of the stimulus). 

'osner and Mitchell (1967) had found .Ss 

uld respond "same" to physically identi- 
ıl stimuli (AA) much faster than to stimuli 
dentical in name only (Aa). In addition, 
they found matches of stimuli identical in 
form but not size (Cc), analog matches, 
to be slightly slower than physical matches, 
but faster than "name" matches. Posner 
(1969) indicated that analog matches are 
based on physical attributes and, in addi- 
tion, involve a size variation operation. 

'The purpose of this research was to in- 
vestigate the effects of familiarity and 
string length on analog matches in order 
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to see if the size variation process is in- 
fluenced by past experience (familiarity), 
and to explore the relationship of size varia- 
tion and the matching processes. For ex- 
ample, if the size variation process is a 
global normalizing operation (Neisser, 1966) 
which occurs prior to any stimulus segmen- 
tation or analysis, it should be uneffected 
by string length or familiarity. 

In addition, the study provided addi- 
tional evidence relative to the effects of 
familiarity on physical matches. Although 
Posner and Mitchell (1967) found no famil- 
iarity effect on physical matches, more 
recent work (Eichelman, 1970) has shown 
that past experience does influence physical 
matches when letter strings (more than one 
letter) are used as stimuli. Eichelman 
(1970) found reaction time (RT) to phys- 
ical matches was much faster when the 
letter strings formed words. Because phys- 
ical matches were still faster than name 
matches he concluded that the increase in 
RT of physical matches with words did not 
reflect reading or naming the words. Eichel- 
man (1970) also found the familiarity 
effect increased over string length. While 
maintaining the physical match was based 
on visual processing, he concluded that 
familiarity can effect processing stages re- 
lated to the physical characteristics of the 
stimuli, In the present study it was hoped 
that the relationship of familiarity to the 
size normalizing operation could provide 
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FIGURE 1. Examples of stimuli corresponding to 
print type and conditions as listed in Table 1. 


evidence regarding how familiarity influ- 
ences physical matches. For example, if 
the evidence from unfamiliar strings indi- 
cates that size normalizing is done sepa- 
rately for each unit (letter) to be matched, 
the effect of size variation on word matches 
could indicate if the units to be matched 
are larger than single letters. 

The present study compared physical and 
analog matches for word and nonword 
letter strings of 3 and 5 letters. In addition 
single letter (nonword) matches were used 
to provide a measure of the effect of the 
size variation operation with single letter 
stimuli. In addition, name matches were 
included to try to replicate Eichelman's 
(1970) results and thus test his conclusion 
that physical matches of words do not 
include reading or naming the words. 


METHOD 


Design. The study used an incomplete 3 X 3 x2 
X2 X 2 design. There were three types of matches; 
physical, analog, and name matches. String lengths 
of 1, 3, or 5 letters were used. The strings were the 
same or differed in one letter pair, and the letter 
strings were either words or nonwords. The design 
was incomplete because the single letter stimuli 
were never words, In addition separate scores were 
obtained for each condition for the first and second 
half of the test trials, 
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Subjects, The Ss were 12 male ai 
psychology student For half the .5 
sponse key was assigned to "same" ri 
for the remaining half it was assigned t 
responses, Tray order was also co: 
across .Ss. 

Apparatus. The stimuli were proj 
Kodak 900 Carousel slide projector 
mechanism connected to the project« 
cuits controlled the stimulus durati. 
stimulus interval. The opening of 
triggered a Hickock timer and the tim« 
by the depression of one of two re 
operated by the index and middle | 
preferred hand. A Hickock printer r 
response and the reaction time, The 
projected onto a screen 2 m. in front of 

Materials. The stimuli were lettered 
white paper using a Leroy lettering set 
of the stimuli were used in slides suc 
stimuli appeared as white letters on à 
background. 

There were 10 examples of stimuli for « 
conditions, Eight stimuli were used fo: 
trials; two were used for the practice tria 
ences in case were used to obtain nam: 
With the lettering set used, upper and | 
forms for the letters C, O, S, V, W, X, and : 
only in size. These letters occasionally o 
name match stimuli. Differences in ti ize ol 
printing were used to create analog match stimulis 
All conditions had stimuli printed in each of the 
possible cases and sizes. The four variations im 
printing for each of the type of matches are ci scribed 
in Table 1. Examples of the stimuli corresponding to 
print variations are shown in Figure 1. Tho visual 
angles for small lower and upper case letters were 
about 24’ and 36’, respectivel The visual angles 
for the large lower and upper case letters were about 
36’ and 48’, As can be seen from Figure 1, the visual 
angles for letter strings depended in part on the 
size and case of the letters used. The range of 


TABLE 1 
Tyee or PRINTING UsEp ror THE Four STIMULUS 
EXAMPLES PER CONDITION 


Type of match 

Position of |” Physical Analog Name 

Size | Case | Size | Case | Size | Cas 
Example 1 Tppet 
Top Large | Upper | Large | Upper | Large | Upper 
Bottom | Large | Upper | Small | Upper | Large | Low 
Example 2 er 
Top Large | Lower | Large | Lower | Large | Lowel 
Bottom | Large | Lower | Small | Lower | Large | Upp 
Example 3 T 
Top Small | Upper | Small | Upper | Small | Upper 
Bottom | Small | Upper | Large | Upper | Small | Low 
Example 4 et 
Top Small | Lower | Small | Lower | Small | Lower 
Bottom | Small | Lower | Large | Lower | Small | Upt 
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TABLE 2 
PERCENT ERRORS PER CONDITION 


Type of match 


mulus type Physical Analog Name 
Same Different Same Different Same Different 
igle letter 78 3.91 1.56 547 4.69 3.91 
letter word 78 3.91 1.56 6.25 6.25 7.03 
letter nonword 3.91 7.03 547 10.16 28.12 11.72 
letter word 4.69 17.97 3.12 10.94 8.59 11.72 
letter nonword 5.47 21,88 4.69 26.56 19.38 19.53 


il angles for the three and five letter strings 
: 1°12’ to 2°24’, and 2° to 4°, respectively. The 
ht of the pairs ranged from 1°12’ to 2°24’ in 
ial angle. There were two stimuli for each 
'riation in printing; one of these appeared in the 
vst half of the test trials, the other appeared in the 
second half. Print type for the practice stimuli 
were chosen in a random fashion. 

For the single letter conditions, all letters of the 
alphabet, except A and I were used. For the dif- 
ferent stimuli conditions the letters were randomly 
paired. For both the word and nonword conditions, 
common three and five letter words were selected 
(A or AA words taken from Thorndike & Lorge, 
1944). For the different word stimuli condition a 

ngle letter was changed. The letter change was 

ne such that both strings would be words of ap- 
yroximately equal frequency. The nonword strings 
were made by randomly arranging the letters of the 
words to form meaningless letter strings. The dif- 
ferent nonword stimuli were obtained by changing 
a single letter. The position of the different letter 
was varied over the eight stimulus examples for 
cach condition, All stimuli were randomly assigned 
to one of the three match condition: 

Plhere were a total of 240 test slides which were 
divided into 4 trays of 60 slides each. Each tray 
had two examples of each condition, and the order 
of tray presentation was completely counterbalanced 
across Ss, 

Procedure. The S was seated in front of a table 
on which were two response k The S was in- 
structed to depress one “key” if the stimulus pairing 
was "same," and the other if the pairing was 
“different.” A "same" response was defined as 
indicating the paired letters had the same name. 
That is, S responded “same” when the letter strings 
spelled the same thing. One half the Ss used their 
index finger to signal "same" and their middle 
finger to signal “different,” while the remainder re- 
sponded in the reverse manner. Each S was in- 
structed as to the nature of the task and was given 
at least one session on the 60 practice slides. Both 
speed and accuracy were stressed. Additional 
practice was available to S, and a few so requested. 
The slides were shown in a dark room, with the 
slide screen about 2 meters in front of S. Each slide 


remained on the screen until the S responded, 
There was a 1 sec. intertrial interval and several 
seconds elapsed between each block of 60 trials 
while Æ changed the slide trays. 


RESULTS AND DISCUSSION 


Each .S received four examples of each 
stimulus condition in each half of the 
trials. Median RTs were obtained for each 
S based on the correct responses to the 
four examples, and the analyses of RT 
were based on this data. The percent 
errors for each condition are shown in 
Table 2. It should be noted that in some 
conditions the median RTs were frequently 
based on fewer than four correct responses, 
but the error rates for the same responses 
to the physical and analog matches were 
low, and performance in these conditions 
was the basis for the major conclusions 
from this study. 

Same reaction times. The main com- 
parisons involved the physical and analog 
matches. One main analysis assessed the 
effects of size variation, test half, string 
length, and familiarity. Since there were 
no words of single letter length, only string 
lengths three and five were included in this 
analysis. The summary for this analysis 
is provided in Table 3. 

Practice had no effect on RT within this 
study. Although RT decreased slightly 
over halves, the effect was not significant. 
None of the interactions with test half 
even approached significance and test half 
will not be considered further. 

As is clear from Figure 2 and Table 4, 
the RTs increased with string length. In 
the main analysis (excluding single letter 
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TABLE 3 


SUMMARY OF ANALYSIS OF VARIANCE OF “SAME” REACTION TIMES AS A FUNCTION OF Siz! 
Test HALF, STRING LENGTH, AND FAMILIARITY 
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AIATION, 


Source of variance Been ieee or Error term F > 

Size variation 34677.42 5253.19 6.60 05 
Test half 51855.83 17705.98 2.93 25 
Size X Half 4662.33 10390.86 <i 

String length 1054408.09 48108.10 21,92 001 
Length X Size 13887.15 4825.61 2.88 25 
Length X Half 20331.48 «1 

Length X Size X Half 2865.80 «1 

Familiarity 22680.37 46.17 001 
Familiarity X Size 65040.94 13392.32 4.86 05 
Familiarity X Half 9487.98 8759.17 1.08 25 
Familiarity X Size X Half 209.34 10981.77 «1 

Familiarity X Length 166795.67 7043.09 23.68 001 
Familiarity X Length X Size 2.34 4008.10 «1 | 
Familiarity X Length X Half 5752.27 1667.05 «1 | 
Familiarity X Length X Size X Half 157.03 10129,49 «1 


^ There is 1 df for all sources of variance. 
» There are 15 df for all error terms, 


stimuli) RTs increased with string length 
and decreased with familiarity. The inter- 
action of Length X Familiarity reflects the 
fact that the increase in RT over length 
was less for words than nonwords. How- 
ever, a separate analysis indicated the in- 
crease in RT over length for words alone 
was significant, F (1, 15) = 12.15; p < 
.005. The analyses of nonword stimuli 
(including single letters) indicated a highly 
significant and linear increase in RT over 
length, F (2, 30) = 41.68; p < .001. The 
linear component accounted for 98% of the 
variance, and the increase was about 76 
msec, per letter. The linear trend is conso- 
nant with the hypothesis that a sequential 
letter-by-letter matching process occurs for 


TABLE 4 


MEAN Reaction TIME ACCORDING 
TO CONDITION 


Type of match 
Pairs Physical Analog Name 

Non- Non- Non- 
Word | word | Word | word | Word Soni 

Different 
Single letter 762 779 763 
Three letter | 819 | 834| 867 877 | 901 | 940 
Five letter 952 | 056 | 897 | 1,117] 969 | 1,042 

Same 

Single letter 652 701 780 
Threeletter | 722 | 766 | 728 | 837| 709 | 1,130 
Five letter 814 | 961| 790 | 1000 | 918 |1,194 


the nonword stimuli. With only t» > string 
lengths, the linear trend for word -timuli 
could not be assessed. These res: s sup- 
port the findings of Eichelman (10. ') that 
familiarity reduces RT and this advantage 
increases over string length. 

Size variation effects were examined to 


determine the nature of any size normaliz- 
ing process. According to the main analy- 
sis the size differences increased R but 
the size difference did not interact with 
string length (analysis excluded single letter 
stimuli). For just nonword stimuli (all 
string lengths) the disadvantage of analog 
compared with physical matches did not 
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FIGURE 2. “Same” reaction time as a func aan 
of number of letters per string, type of match, an 
familiarity. 


increas r string length, F <1. For 


word uli alone, analog matches were 
not slov. -r and the interaction, F (1, 15) = 
2:01; 7 < .25. 

These results provide no support for the 


hypothesis that size normalizing operations 
are done sequentially on each letter prior 
to or during the matching process, which 
seems to be sequential. However, the idea 
that the whole stimulus is "normalized" 
prior to segmentation or analysis is con- 
tradicted by the significant interaction ob- 
tained between Familiarity and Size Varia- 
tion. The interaction reflects the fact that 
size variation only affected performance on 
nonword stimuli. The analysis for non- 
word stimuli (including single letter stimuli) 
showed analog matches to be slower than 
physical matches, F (1, 15) = 13.11; 5 « 
.005. There was no difference between 
analog and physical matches for the word 
stimuli, F < 1. 

Different reaction times. 'The means are 
presented in Table 3. For all conditions 
and string lengths the different slides dif- 
fered in only a single letter. 

s Posner and Mitchell (1967) found in 
their study, RTs to a different single letter 
pair for all matches were the same and 
were about equal to the "same" name 
matches. This is in accord with the con- 
clusion of Posner and Mitchell (1967) that 
Ss must check all the way to the name 
level when a physical difference exists. In 
addition, the difference in RT between 
physical and analog matches was less with 
"different" than with "same" single letter 
stimuli, This supports the previous con- 
clusion that size normalizing does not occur 
completely prior to any stimulus analysis. 

Because only one type of different con- 
dition was used in the present study, the 
"different" RT data are of limited use. 
For example comparisons of "same" and 
"different" RTs with physical or analog 
matches are difficult to interpret since the 
different response may include name match- 
ing when a physical difference occurs. It 
was noted earlier that different single letter 
stimuli are probably processed to the name 
level in all conditions. However, it is 
difficult to say if the different pair is always 
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processed to the name level in the letter 
strings. The position of the different pair 
may determine the level of processing. 
For example, if processing is sequential 
and the previous several letters have been 
physically identical, a physical difference 
may be sufficient to initiate a "different" 
response prior to any name analysi 

Determination of where in processing to 
place a size normalizing operation was 
more difficult than expected. The fact 
that size variation influenced RT to single 
letters and nonwords but not words pre- 
cluded placing the operation prior to any 
stimulus analysis. The fact that size dif- 
ferences did not increase with string length 
indicates that size normalizing did not 
occur during the sequential matching of 
letters. Placing the operation later in a 
response selection stage is intuitively un- 
appealing and would still leave unexplained 
the lack of effect of size variation on word 
stimuli. 

Although the physical and name match 
data in this study replicate Eichelman's 
(1970) results, the analog match data sug- 
gest a complex explanation will be required 
to explain RTT differences in matching word 
and nonword stimuli. The main question 
is whether words in a physical match con- 
dition are matched on the basis of physical 
characteristics and/or on some other basis, 
such as name information, —Eichelman 
(1970) concluded that physical character- 
istics of words are involved in the match- 
ing decision. The faster RT with words 
could be explained by an increase in the 
size of the unit to be matched, However, 
if familiarity only increases the size of the 
unit to be matched (for example, making a 
three letter word string equivalent to a 
single letter) the size variation should have 
the same effect on words as on nonwords 
and single letters. 

On the other hand, Eichelman (1970) 
argued quite effectively against the claim 
that words are being read and matched at 
the name level. He found name matches 
to be slower than physical matches; in the 
present study name matches were also 
slower than ph | matches, for word 
stimuli, F (1, 30) = 14.86; p < .005. 
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The following discussion will suggest two 
possible approaches to integrate the effects 
of physical characteristics and more ab- 
stract characteristics related to words into 
a model of the matching process. 

In order to retain a sequential stage 
model, it could be postulated that a word- 
nonword decision is made at an early stage. 
Subsequent processing of words might be 
done in a different manner (or place) from 
that of nonwords or letters. (See Bieder- 
man, 1972, for a discussion of contingent 
processing.) Kimura (1967) presented evi- 
dence that speech stimuli are processed in 
the left hemisphere, auditory patterns in 
the right. More recently, Cohen (1973) 
found processing differences between hemi- 
spheres with visually presented linguistic 
material. Johnstonand McClelland (1973), 
comparing perception of words and letters, 
suggested that different systems may exist 
for processing letters and words. In the 
contingency model suggested here, physical 
matching of the whole word would be done 
with size variation filtered or ignored. If 
a physical match were not obtained, further 
processing would be done at the name level. 
Processing of nonword stimuli would be 
done leiter-by-letter, with failure to ob- 
tain a physical match leading to matching 
at the name level. Filtering of size varia- 
tion would not apply to nonword stimuli 
(including single letters). 

On the other hand, parallel processing of 
physical and abstract characteristics of 
stimuli might be suggested. A redundancy 
gain between processing physical and ab- 
stract characteristics might account for 
RT differences between words and non- 
words (Biederman & Checkosky, 1970). 
An alternative is an interactive parallel 
model In this type of model a same- 
different decision would be based on a 
composite of information from physical and 
abstract comparisons. If processing of ab- 
stract characteristics is faster, or takes 
priority, the "same" decision for physical 
and analog matches would be made more 

quickly for words than nonwords, which 


* This model, based on Thomas (1969), was sug- 
gested by Webb Stacy. 
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do not have abstract "word" 'racter- 
istics. "Same" decisions might slower 
for name matches because of i ference 
from physical comparisons indi: ig that 
differences exist. One remainin, »>roblem 
is to specify why information a t phys- 
ical sise differences would not in re with 
a "same" response for words. 

Clearly the development of equate 
model will require additional ex. menta- 
tion. The results of the pr: study 
suggest an adequate model wi ve to 
take into account the fact tha; — hysical 
characteristics relating to lette ipe in- 
fluenced word and nonword mat: while 
the physical characteristic of size i ienced 
only the matching of nonwords single 
letters. 
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Varieties of the Müller-Lyer illusion were scaled for perceived similarity to the 


standard Müller-Lyer configuration, 


Five variants differing in scaled simi- 


larity were pretested for illusion magnitude. All Ss then received practice with 


the s 


andard configuration during which ill 
This was followed by a posttest on the ini 
decrement was monotonically related to perc 


sion decrement was obtained. 
lly tested variant. Transfer of 
ved similarity. These data indi- 


a 


cate central processing components in the Miiller-Lyer illusion. 


Since the first systematic studies of visual 
'ometric illusions were undertaken by 
)ppel in 1854-1855, numerous theoretical 

explanations for these perceptual distor- 
tions have been offered. These theoretical 
treatments may be divided into two gen- 
eral classes: (a) theories based directly on 
the structural properties of the optical and 
nervous system; and (b) theories based on 
the way visual information is processed in 
he higher centers. 

Structural theories contend that the il- 
losory distortions arise from physiological 
sources associated with the normal optical 
and neural processing of visual inputs. 
Thus Chiang (1968) suggests that simple 
degradation of the optical array due to 
diffraction of the light as it passes through 
the crystalline lens may produce some in- 
tersecting line illusions such as the Müller- 
Lyer, Zoellner, or Poggendorff. Coren 
(1969) has demonstrated that this source 
of distortion can account for some 15% of 
the Poggendorff illusion with the other 85% 
of the illusion presumably caused by other 
factors. Another structural source of il- 
lusory distortion has been suggested by 
von Békésy (1967) and Ganz (1966) who 
contend that lateral inhibitory interactions 
Operating on converging or intersecting line 
> of 


! We gratefully acknowledge the a: 
Richard Fraenkel in the collection and anal 
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elements cause some of the perceived con- 
tour displacements and distortions. Coren 
(1970) and Girgus, Coren, and Horowitz 
(1973) have shown that removal of such 
converging line elements results in a reduc- 
tion in the magnitude of several classical 
illusions, although significant distortions 
still remain in all cases. 

Process theories attribute the existence of 
illusions to the cognitive processing of the 
perceptual inputs. Thus, Gregory (1963) 
and Gillam (1971) have shown that per- 
spective cues which inappropriately evoke 
constancy scaling me isms contribute 
to the formation of some classical illusions, 
Coren (1971), Girgus, Coren, and Agdern 
(1972), Massaro and Anderson (1971), 
Pressey (1967), and Restle and Merryman 
(1968) have all found evidence that com- 
parative judgmental components, such as 
cognitive contrast play a part in illusion 
formation, especially in configurations with 
unequal sized elements such as the Ebbing- 
haus figure. On the other hand, Carr 
(1935), Coren and Girgus (1972a), and 
Erlebacher and Sekuler (1969) have pro- 
vided evidence that confusion of test with 
inducing elements seems to contribute to 
illusion magnitude in some configurations. 
A recently revived theoretical position 
maintains that some illusory distortions 
are based on information from scanning eye 
movements over the figures or information 
extracted to guide such movements (Binet, 
1895; Coren & Hoenig, 1972a; Van Bier- 
vliet, 1896). 
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As early as 1902, Judd observed that, 
with free inspection over a period of time, 
the magnitude of the perceptual distortions 
in the Miiller-Lyer illusion diminishes. 
This has since been repeatedly confirmed 
both for the Miiller-Lyer illusion and for a 
number of other configurations. Although 
the precise mechanisms for this decrease in 
illusion magnitude are not fully understood, 
it seems unlikely that the peripheral struc- 
tural components, such as optical aberra- 
tions from the crystalline lens, could be 
responsible for such an illusion decrement. 
These factors would not be expected to 
show changes which would lead to reduc- 
tion in illusion magnitude simply as a 
function of inspection of the stimulus for 
a few minutes. There are, however, some 
more central structural components that 
have been proposed specifically to account 
for illusion decrement. These include cor- 
tical satiation effects (Kohler & Fishback, 
1950) and adaptation of specific feature 
analyzers (Coltheart, 1971). 

There is sonie evidence that these central 
structural components are probably not 
responsible for decrement. Köhler and 
Fishback (1950) imply maximal decrement 
in illusion configurations where there are 
converging line segments; yet, Girgus et al. 
(1973) have shown that the rate of illusion 
decrement is independent of the number 
of converging line elements. Both Köhler 
and Fishback (1950) and Coltheart (1971) 
predict the maximum reduction in illusion 
strength under fixation conditions; yet 
several investigators have shown that free 
inspection leads to a larger reduction in 
illusion magnitude than fixation (Coren & 
Hoenig, 1972b; Festinger, White, & Allyn, 
1968). 

There is some further indirect evidence 
that decrement of illusion magnitude in- 
volves process rather than structural com- 
ponents: (a) the rate of decrement responds 
to traditional learning variables such as 
the spacing of trials (Dewar, 1968; Mount- 
joy, 1958) ; (b) the amount of illusion decre- 
ment is cumulative over days and weeks 
(Judd, 1902); (c) direct manipulation of 
factors which should effect structural mech- 
anisms of decrement such as the number of 
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converging and intersecting lin ments, 
do not appear to alter the rat Ilusion 
diminution with inspection (Gi et al, 
1973). 

Although these results sugg« iat the 
decrease in illusion magnitud h free 
inspection is due to process c nents, 
they are not conclusive. Ther: tech- 
nique, however, emanating f tradi- 
tional learning studies, which m rovide 
a tool for answering this questioi more 
convincing manner. Consider ) who 
has been inspecting an illusion co ration 
such as the Müller-Lyer, and ha: ven al- 
lowed to observe it long eno: for a 
reasonable amount of decremen take 
place. If we now present hi: ith a 
variant of the inspected illusio: which 
differs in some respects from the © sinally 
viewed form, it is not unreasonal; — to ex- 
pect some transfer of decremen! o this 
new configuration. Some evidence or such 
transfer of illusion decrement m: be in- 
ferred from Parker and Newbigginy |1963). 
This transfer, or generalization, would be 


expected on the basis of both st:uctural 
and process explanations. Howe» the 
parameters which would affect the amount 
of transfer are different for the two classes 
of illusion theories. Structural explana- 
tions would predict that the amount of 
transfer of illusion decrement would depend 
on the number of common elements in the 
two stimuli. Thus the greater the number 
of formal configurational similarities, the 
greater the expected amount of transfer. 
A process explanation would contend that 
the amount of decrement transferred to 
the new variant would depend upon the 
observers cognitive assessment of the simi- 
larity between the configurations. The 
more similar the figures appear, the greater 
the amount of transfer of the reduction in 
illusion magnitude, regardless of the num- 
ber of elements in common. Thus the 
amount of transfer or generalization of illu- 
sion decrement seems like a useful means 
of clarifying the theoretical nature of illu- 
sion decrement. 


METHOD 


„Scaling of stimuli. Since this experiment is de- 
signed to explore whether illusion decrement tran 
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fers more to perceptually similar configurations 
than to dissimilar ones, it is first required to estab- 
lish a scale of similarity among illusion configura- 
tions. Ten variants of the Miiller-Lyer figure were 
used to obtain such a metric, They consisted of 
the standard configuration (Figure 1A), a variant 
in which the wings are separated from the shaft by 
a small gap (1B), a standard configuration with the 
horizontal shafts removed (1C), a dot form of the 
- illusion (1D) which was introduced by Coren (1970), 
an all horizontal version of the illusion (1E) intro- 
duced by Piaget (1969), a figure with squared off 
wings (iF), one with half circles as wings (1G), 
_ another with appended full circles (1H), after Parker 
E and Newbigging (1963), and two configurations 
^. from Sanford (1898) that use only half of the 
4 arrowheads, either symmetrically (1I) or asym- 
— metrically (1J) oriented. Each configuration was 
drawn on a 20 X 25 cm. piece of stiff paper with 
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Variants of the Müller-Lyer figure scaled for similarity to the 

indard figure. (Forms A through E are used in the main experiment : [A ] 
standard form, [B] Exploded form, [C] Shafts removed, [D] Dot form after 
Coren, [E] Line form after Piaget, [F 
CH] Full circles after Parker and Newbigging, [1] Symmetrical, and [J] 
Asymmetrical incomplete forms after Sanford.) 


J Squared wings, [G] Curved wings, 


1-mm. wide black lines. The horizontal shafts (or 
distance between the shafts when the shafts were 
not present) were always 8 cm, in length. Each of 
15 Ss was asked to judge the similarity of each con- 
figuration, presented one at a time, to the standard 
configuration (1A) which was continuously dle. 
Judgments were made using a 7-point rating scale 
labeled (1) extremely similar, (2) very similar, (3) 
fairly similar, (4) medium, (5) fairly different, (6) 
very different, (7) extremely different. The mean 
of the ratings per stimulus configuration can be 
used to indicate the perceived similarity between 
the configuration and the standard version of the 
illusion. The obtained ratings were used to select 
four variants of the illusion reliably rated as dif- 
fering from each other in their similarity to the 
standard configuration. Those selected were the 
exploded form (1B) with a mean rating of 1.47, the 
configuration without shafts (1C) with a mean 
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rating of 3.47, the dot form (1D) rated 4,60, and 
the Piaget form (1E) whose rated similarity was a 
distant 6.53. T tests indicated that the mean 
rating of each of these configurations was signifi- 
cantly different from that of each of the other con- 
figurations (p < .05). It should be clear that this 
particular scaling procedure gives only a one-dimen- 
sional representation of the psychological space. 
However, since the theoretical issue involves only 
the question of the psychological distance between 
the standard form and the respective variants, this 
should be sufficient. 

Subjects. Fifty undergraduate volunteers served 
as Ss. Each was randomly assigned to one of the 
five variants of the Müller-Lyer illusion used in this 
study (the standard configuration plus the four 
variants described above). 

Stimuli and apparatus. The five (Müller-Lyer) 
configurations—1A, 1B, 1C, 1D, erved as 
stimuli. All of the figures were constructed with 
1-mm. wide black lines. The reflectance of the 
lines was 5.1% and of the background was 74.3%. 
"The test shafts (or the distance between the wings) 
were 8 cm, long, which corresponded to a visual 
angle of 12.0. When angles were present, they were 
45? from the horizontal; when wings were present, 
they were 2 cm. long. For each configuration, the 
apparently longer half of the illusion was made 
adjustable through the use of a tongue and groove 
arrangement. Readings were taken in mm. from a 
scale affixed to the back of the adjustable half of 
the illusion. 

Procedure, Each S first made two settings to 
apparent equality for one of the five variants of 
the illusion, depending upon his assigned condition. 
Settings were balanced as to starting position, half 
starting at clearly longer and half at clearly shorter, 

Following this, all Ss then inspected the standard 
form of the Müller-Lyer figure (1A) for 5 min. with 
free eye movements, During this period Ss made 
settings of apparent equality at 1-min. intervals. 
Previous investigations have shown that this pro- 
cedure leads to a sizable reduction in illusion mag- 
nitude (Coren & Girgus, 1972b; Girgus et al., 1973). 
Following the inspection period, all Ss made a 
two settings of apparent equality on the in ially 
tested illusion. variant. Any diminution in the 
strength of the nonpracticed illusion configuration 
between the pretest and the posttest may be taken 
as an indication of the amount of illusion decrement 
that has transferred from the standard configuration 
to that particular variant, 


REsULTS AND Discussion 


Since we are interested in the amount of 
transfer of illusion decrement from the 
standard Miiller-Lyer configuration to other 
variants of the illusion, we must first verify 
that illusion decrement has in fact taken 
place. An analysis of variance performed 
on the judgments made during the 5-min. 
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inspection period with the stan rd form 
of the illusion reveals that th: unt of 
illusion decrement obtained aci the five 
groups is statistically significant — (5, 225) 
= 9.96, p «.001. However, : is no 
significant difference among th: oups in 
amount of decrement and no nificant 
interaction. The mean amou: decre- 
ment was 7.0 mm. This m: tude is 
quite comparable to values rted in 
other studies of illusion decre: t using 
thestandard configuration (Core; — Girgus, 
1972a, 1972b; Girgus et al., 1 Re- 
gression lines were fitted throu, he plot 
of illusion magnitude against tin or each 


experimental group. The mea: 
these lines was 1.4 mm/min; 
among the five slopes was only .4 | 
and tests on all possible paired c 
yielded no significant group d 
These results support the conten on that 
the magnitude of illusion decre: cnt ob- 
tained on the standard figure is equi across 
conditions. 

The mean amount of illusion per test 
configuration for the pretest and the post- 
test is shown in Table 1. In order to assess 
the amount of transfer from the standard 
form of the illusion to the five test con- 
figurations, an analysis of variance was 
performed on these data. The results of 
this analysis indicate that there is a sig- 
nificant pretest-posttest difference, thus 
demonstrating that transfer of illusion dec- 
rement has occurred F (1, 45) = 73.71, P 
X .0l. In addition a significant interaction 
between pretest-posttest differences and 
illusion form indicates a difference among 
configurations in the amount of transfer, F 


ope of 

range 
m/min 
erences 
‘rences: 


TABLE 1 


ILLUSION MAGNITUDE OF Five VARIANTS OF THE 
MÜLLER-LYER PRE- AND POSTINSPECTION OF 


THE STANDARD CONFIGURATION A 
Musion Preinspection | Postinspection Transfer 
Es ee Rae 
A 21.9 12.4 9.5 
B 15.9 8.9 7.0 
B 19.2 15.4 3.8 
D 6.6 4.6 2.0 
E 8.8 8.5 3 


* Forms are labeled as in Figure 1. 


n 
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(4, 4 0.00, p < .01. A significant 
main t for illusion variants also exists, 
F (4 ‘) = 12.60, p < .01. This main 
effe lects differences in the initial mag- 
nitu f illusion obtained for the various 
fori ind it is correlated with the number 
of « erging and intersecting line elements 
in stimulus as has been reported else- 
wi (Girgus et aL, 1973). A quick 
gl. ce at the table also verifies that the 
m nitude of decrement transferred from 
tH standard to the test variants is inde- 
p «dent of the initial magnitude of illusion. 


‘he most theoretically relevant com- 
is the amount of transfer as a 
nction of the perceived similarity among 
ae variants. A process theory would 
redict that as rated similarity between 
inspection and test figure decreases the 
amount of decrement transferred should 
also decrease. The amount of transfer 
from inspection of the standard is com- 
puted from the pretest-posttest differences 
ind plotted against rated similarity of the 
est configuration to the standard in Figure 
\, It is clear that less transfer is found 

r more perceptually dissimilar stimuli. A 

‘ear trend through these points confirms 
this, F (1, 45) = 13.04, p < .01. 

It is important to assess whether these 
results based upon perceived similarity be- 
tween the test and the standard configura- 
tions predict the amount of illusion decre- 
ment transferred with any more accuracy 
than would a formal analysis of the con- 
tour and pattern properties of the configu- 
rations. Two methods of assessing the 
formal similarity of the illusion variants 
were used to answer this question. In the 
first, the standard form of the illusion was 
superimposed on a 40 X 200 matrix of 
squares, each of which subtended 1 square 
mm. The standard figure was found to 
overlap 320 units on this matrix. Each 
successive variant was then superimposed 
on the matrix. The number of units held 
in common between the standard and the 
variant then serves as an index of the 
similarity between the patterns. The sec- 
ond scoring procedure simply provides a 
count of the theoretically relevant pattern 
elements present in the illusion figures. 
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Ficure 2. Transfer of illusion decrement 
sured as illusion reduction in the test figur 
practice with the standard form : 
(A) perceived dissimilarity of test and standard 
form, (B) number of missing contour unit a d © 
number of missing pattern elements, ee 
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"These are converging lines, horizontal shaft, 
and closed angles. All of these properties 
are contained by the standard form of the 
illusion, and the number contained by each 
variant may serve as an index of formal 
similarity. Figure 2B plots the degree of 
transfer as a function of the number of 
missing contour units, and Figure 2C plots 
the transfer as a function of the number of 
missing pattern elements. Visual inspec- 
tion of these figures seems to indicate that 
neither of the two scoring schemes produces 
as good a fit to the data as does rated dis- 
similarity. This is confirmed if we compute 
the product-moment correlation coefficient 
for each method of presentation. For rated 
dissimilarity the correlation with transfer 
of decrement is —.751, for missing contour 
elements it is —.448, and for missing 
pattern elements it is —.518. Although 
each of these is significantly greater than 
zero with p < .01, the coefficient for the 
rating procedure is significantly greater 
than either of the other two procedures 
with ¢ (47) = 3.47, p < .01, and t (47) = 
3.01, p < .01, respectively. This result 
seems to warrant the interpretation that 
perceived similarity among configurations 
is a better index of the amount of illusion 
decrement transferred from one configura- 
tion to another than indices based upon 
formal properties of the figures involved. 
The data presented above seem to be at 
variance with predictions from structural 
interpretations of the diminution of illu- 
sions with inspection, such as satiation 
(Köhler & Fishback, 1950) or adaptation 
of specific feature analyzers (Coltheart, 
1971). These theories would predict that 
transfer would be a function of the formal 
similarity of the stimuli, such as the num- 
ber of common elements in inspection and 
test forms. In the Miiller-Lyer figure, the 
most theoretically important elements are 
the lines which define the angle between 
the wings. For satiation theory, this is the 
region in which the maximum effect is con- 
centrated, and for specific feature adapta- 
tion models, these are the stimuli for the 
feature detectors which are held to fatigue 
with inspection. These critical angles are 
clearly present in Figures 1A, 1B, and 1C 
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thus leading to the predicti hat the 


amount of transfer should be « | among 
these three targets. Contrary his, we 
find a decrease in the amount transfer 
as the rated similarity decr These 
differences are substantiated he sta- 
tistical significance between ljacent 
means (¢ = 2.98 and 3.82, resi vely, $ 
< .01). Since the structural « should 
confine themselves to effects | ning to 
angles or slanted lines, we wou! pect no 
transfer to Forms 1D and 1} ce they 
lack any converging line eleme: In ad- 
dition, they are virtually dev: of any 
elements in common with th andard 
configuration. Although the Pi et from 
(1E), shows no significant amour + of trans- 


spected 
n ele- 
uration 
to the 
figures 


fer, (252) = .36, this would bx 
both on the basis of missing co: 
ments and the fact that this con 
is the lowest in perceived similari 
standard configuration of all of : 


tested. We do, however, find si nificant 
transfer of decrement for Coren’s dot form 
of the illusion (1D), which also contains 


no line elements in common with the 
standard form, ¢ (252) = 2.39, < 01. 
Taken together, these data make it un- 
likely that decrement of the Miiller-Lyer 
illusion under free viewing conditions is due 
to satiation of cortical fields or adaptation 
of specific feature analyzers. 
Alternatively, it might be suggested that 
decrement is due to a shift in the way in 
which the figure is judged. According to 
this point of view, during the decrement 
procedure, the O begins to pay more careful 
attention to the test extent and to igrore 
the inducing components. Such atten- 
tional restriction has been shown to reduce 
illusion magnitude in the Müller-Lyer 
(Benussi, 1912; Coren & Girgus, 1972). 
If such a restriction of attention were re- 
sponsible for decrement, however, On 
might expect that such a set would carry 
over equally to all alternative forms of the 
illusion, This would then predict an equal 
amount of transfer of illusion decrement 
for all of the configurations, rather tha" 
the generalization curve obtained. 1f, 0? 
the other hand, such a judgmental set was 
only applied as a function of the perceive 


similarity -i the test to the inspection figure, 
such a mechanism could explain these 
results. 


The one thing that does seem to clearly 
emerge from these data is the fact that 
illusion decrement almost certainly reflects 
a process mechanism, responding to central 
cognitive factors, rather than a structural 
mechanism, responding only to the formal 
configurational aspects of the stimulus 
array. This is confirmed by the fact that 
perceived similarity between practice and 
test stimuli seems to act as a very accurate 
predictor of the amount of transfer or 
generalization of illusion decrement from 
a practice to a test form. Although this 
result does not serve to pin down the 
actual mechanism involved in illusion dec- 
rement, it certainly serves to limit the 
class of explanations that are most likely 
apt to be valid. 
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RECOGNITION LATENCY FOR A SUBJECTIVELY ORGANIZED LIS! 


DOUGLAS J. HERRMA 


University of Delaware 


A previous study showed that after 5s memorized a list organized into category 
groupings, recognition latency (RL) to double-word dis 
faster to DW Ds representing the same category, HORSE-B! 
LION-RUSSIA. 
the same when the words in a DWD were both old, one old and one new, or 
In contrast, the present experiment showed that after learning a 
list of unrelated words organized into subjective groupings, the representation 
of groupings in DWDs affected RL only when both words in a DWD were old 
Effects of subjective organization on recognition were hypothesized to dilfer 
from those of category organization because the semantic bases of subjective 
organization are too diverse to influence the recognition retrieval process. 


representing different categories 


both new. 


Recent research has shown that organization of 
items during learning affects the latency of sub- 
sequent recognition. For instance, McLaughlin and 
Herrmann (1972) and Herrmann and McLaughlin 
(1973) have shown that after having memorized 
words in category groupings, e.g., animals, countries, 
Ss recognized a double-word display (DWD) faster 
when the words in a DWD represented the same 
category, e.g., HORSE-BEAR, than when they were 
from diflerent categories, e.g., LION-RUSSIA. This 
result was essentially the same for old responses 
(both words in a DWD from the list), new responses 
(both words not from the list), and mixed responses 
(one word in a DWD from the list). 

It is often assumed that organizational processes 
involving relatively unstructured materials such as 
a list of unrelated words (Tulving, 1968) are the 
same as for highly structured materials, such as a 
categorized list. For example, a subjective group- 
ing may be similar to a category grouping in that 
both list organizations reflect a semantic organiza- 
tion (cf. Schwartz & Humphreys, 1973). Conse- 
quently, one might expect subjective groupings to 
affect recognition latency in the same manner as 
that observed for category groupings. Alterna- 
tively, if subjective groupings are not equivalent to 
category groupings in organizational characteristics, 
a different pattern of results might be expected. 
The purpose of the present research was to investi- 
gate these alternatives. 

Method. "Twelve students participated as .Ss in 
partial fulfillment of a course requirement. 

A three-channel tachistoscope (Scientific Proto- 
type, Model GB) presented a white pre-post expo- 
sure field in the first channel, a fixation field in the 
second channel, and the DWD in the third channel. 
The two words in the DWD were typed in upper- 
case and centered one above the other. The re- 
sponse apparatus consisted of three levers on which 

1 E , 
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lays (DWDs) wa 
R, than to DWD- 
This result was essentially 


S rested his index, middle, and ring fins 
cies were measured to an accuracy of = 
Each S was run individually in two 
the first session .S memorized a list of ` 
words, each of which was printed on 
card. Each of three lists, A, B, and C, 
ized by four Ss. Each word on a list we 
of a different taxonomic category in th« 
Montague (1969) category! norms, and 
categories were used across the thre 
studying the list, S sorted the words int 
which were conceptually similar. At le: 


no more than seven groupings were allow ed j 
Mandler, 1967). The S sorted and then s: idied th 
list (for 3 min.) until he recalled all 18 words (within 


1 min.) three times in succession and until there Wi 
perfect agreement in composition of sorting for thr 
sorts in succession, : 

The second session, scheduled at .5's convenientt 
began approximately 24 hr, after the first sessi@l 
for 9 Ss and 48 hr. later for 3 Ss. Due to tht 
variable delay between sessions, the 5 began thig 
second session by restudying the list. There 
no differences in performance as a function of | n 
between sessions. After restudying and resort 
the words, S was seated before the tachistoscope @ 
given recognition-test directions as described 


the list which S memorized, e.g., Lis 
DWDs one word came from 5's | 
in mixed DWDs and both word: 
came from one of the other lists 
List B, and from the remaining list in Block 2, € 
List C. Thus, all old words were presented twi 
across the blocks, but nonmemorized words we 
presented only once across blocks. A diferen 
random presentation of conditions was used for 


For half of the old DWDs, the two words if 
DWD had been studied together in a groupl 
during acquisition, i.e., the words in a DWD onig 
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nated e same study grouping (SG-DWDs). 
For : uning half of the old DWDs, the two 
DWD originated from different study 
(DG-DWDs). Similarly, half of the 
d mew DWDs were SG-DWDs and half 
;DWDs, wherein grouping origin was 


desi; :d by the category to which nonmemorized 
wor clonged and by which categories were repre- 
sen! in S's groupings in acquisition, For example, 
if memorization § had grouped the words 


W! .sKEY and SUGAR, then he was considered 
to ve grouped the categories alcoholic beverage and 
a vance for flavoring food. Since it was possible 
ti ! might organize nonmemorized items differ- 
v from memorized items, despite both kinds of 
ts being from the same categories, a comparison 
made of the groupings of List A, by the 4 Ss 

o learned that list, to the groupings of List B 

| List C, by the 4 Ss who learned each of those 

ts, It was found that there was a significant 

ndency for the same categories to be represented 

| a grouping on all three lists, p < .001 (see 
Herrmann, 1972). For example, regardless of which 
list he learned, 5 tended to place an alcoholic beverage 
and a substance for flavoring foods in the same 
grouping. 

Results and discussion, The latency data for 
correct recognition responses (96.3% of all re- 
sponses) are shown in Table 1. The table presents 
the mean of median correct reaction times (RT's) 
for the three response types as a function of the 

rouping origin of DWDs, i.e., SG and DG. The 
ins were based on the three RTs per condition 
ithin a presentation block. The latencies in the 

thle have been averaged over blocks, since the 
inalysis of variance indicated that blocks did not 
significantly affect latency or interact with the other 
experimental variables, ps > .05. Thus, the repe- 
tition of old words and the lack of repetition of new 
words across blocks did not differentially alter the 
cllect of subjective organization on old, mixed, and 


new latencies. 

Inspection of Table 1 suggests that grouping origin 
of words in a DWD affected old latencies but did 
not affect mixed and new latencies. The analysis of 
variance and simple effects tests confirmed that old 
RT to SG-DWDs was less than RTs for all other 
kinds of DWDs (p < .01), but that all other RTs 
did not differ significantly from each other, Thus, 
these data show again that the relationships used 
during learning affect recognition (cf. Herrmann & 
McLaughlin, 1973). 

The results for subjective organization differ from 
the effects of category organization on recognition 
latency. Herrmann and McLaughlin (1973) found 
latency was faster to same-category DW Ds than 
to different-category DWDs for new and mixed 
DWDs as well as for old DWDs. In contrast, in the 
present study, latency was faster to same-grouping 
DWDs than to different-grouping DWDs for old 
DWDs but not for new and mixed DWDs. This 
contrast in results for category groupings and sub- 
jective groupings is somewhat surprising since both 
types of organization have a semantic basis. Ap- 
parently, the semantic organization in subjective 
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TABLE 1 


MEAN OF MEDIAN CORRECT REACTION TIMES (IN MSEC.) FOR 
, AND new RESPONSES AS A FUNCTION 
OF GROUPING ORIGIN 


Grouping origin 
Response type e 


Same grouping | Different grouping 


Old | 1,584 1,797 
Mixed 1,945 1,906 
New 1,835 1,861 


groupings, in the absence of learning not suffi- 
cient to affect recognition latency. 
groupings to affect latency, it seems necesse 
the grouping be memoriz .e,, contiguity during 
learning is crucial for the subjective-organization 
effect on recognition latency. 

Regardless of contiguity's role in the effect on old 
latencies, it is puzzling that the semantic informa- 
tion alone in subjective groupings did not have some 
effect on mixed and new latencies. Models of se- 
mantic memory, c.g. and Quillian (1969), 
predict that succ 1 of related items 
will take less time than retrieval of unrelated items. 
The lack of an effect from the semantic basis of 
subjective groupings suggests that subjective group- 
ings differ qualitatively from common taxonomic 
categories. Subjective groupings in the present 
study were apparently based on several semantic 
factors, in contrast to the single categorical basis of 
category groupings. For example, many Ss placed 
alcoholic beverages, substances for flavoring foods, and 
kitchen utensils in a “foods” grouping. Note that 
the group included “things to eat" as well as “things 
to eat with," Thus, a grouping was based on both 
taxonomic relationships and use relationships. Be- 
cause of the diverse semantic bases of subjective 
groupin emantics. may have no influence on 
recognition retrieval processes when a subjectively 
organized list is involved. This conclusion does not 
deny that semantics may influence the formation 
and initial storage of subjective groupings. 
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THE EFFECT OF CUE-CRITERION FUNCTION FORM 
ON SINGLE-CUE LEARNING! 


L. F, W, pe KLERK? AND A. G. VROON 
U niversity of Leiden, Leiden, The Netherlands 


This investigation is concerned with the effect of the form of the functio: 
relating cues to a task criterion upon single-cue learning. 
forms were used (one in each condition): a J-shaped, a reversed J-shaped, 
The results indicated that (a) performance 
gradually improves with training; (b) a U-shaped function is more difficult to 
learn than a reversed J-shaped function, and the latter is more difficult thar 
a J-shaped function. This result is in agreement with the progression hypothe 
sis; and (c) the relative variances decrease a 
result does not support the view that single-cue learning is to be conceived of 
as a two-process theory, which would predict that the relative variar 


and a U-shaped function form. 


decreases and then increases. 


A single-cue learning experiment requires the 
S to infer a value of a criterion variable, Y; 
(1... x») from that of a cue variable X; 
(i =1... m). In fact, S has to learn to utilize 
the (real valued) function which determines the 
assignment of Y; to X; The dependent variable 
for learning is S's prediction of Y; say f; when 
confronted with X;. 

It has been commonly assumed that S actually 
processes information in reaching a solution. The 
underlying assumption is that § is sampling and 
testing hypotheses in acquiring the function relating 
criterion to cue values, There is abundant evidence 
that the universe of hypotheses from which S is 
sampling is hierarchically structured. Studies in 
single-cue learning have shown that nonlinear func- 
tions are more difficult to learn than linear functions 
(eg. Brehmer, 1971; de Klerk & Oppe, 1972). 
Positive linear functions are easier to learn than 
negative linear functions (e.g, Björkman, 1965; 
Brehmer, 1971; Eisler & Spolander, 1970). 

According to the theory of single-cue learning, 
proposed by de Klerk and Oppe (1972), S starts 
constructing "simple hypotheses" and progresses— 
if necessary—towards more “complex hypotheses.” 
Their study provided preliminary evidence for this 
so-called progression hypothesis. The results indi- 
ed that when the task required the S to learn a 
quadratic cue-criterion relation, linear processing 
preceded quadratic processing. This effect appeared 
to be highly influenced by the criterion variance. 

"The present investigation examines the effects of 
three different quadratic cue-criterion function 
forms upon single-cue learning. The cue which Ss 
had to learn to depend upon was related to criterion 
in either (a) a J-shaped manner; (b) a reversed J- 
shaped manner; or (c) a U-shaped manner. If the 
above mentioned results are valid, they will lead to 
the following predictions. When the cue-criterion 
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Three function 


a function of training, This 


ce first 


function form is a J arc, then it is ! 
assume that S starts constructing à [^ 
function, When a reversed J-shaped Í 
is to be learned, S is assumed to sta 
negative linear function, Since it is © 
a positive linear function than a nt 
function, it is to be expected that a J- 
more accessible and easier to learn tha 
J-shaped arc. When the cuegpriteri: 
form is U shaped, Sis supposed to start 
two linear functions. Therefore, this 
function is expected to be more difficult 
than the other two forms. 

Method. The paradigm used in the present stu 
comprised three events that occurred in the learning 


value of Y to the subject, The cue values were 
presented on a scope display. On this display, twa 
marks were present which indicated the endpoin! s 
of an imaginary horizontal line. On every trial 4 
dot was presented on this scale. This dot d ated: 
the value of X. The values of X could vary front 
0 (i.e., the origin of the scale) to 100 (i.c., the right, 
end point of the scale), 1 

Following each presentation of the cue, Ss had to 
respond by turning a special knob to the estin! 
(Y) of the function value, the scale being grad 
from 0 to 100, 4 

The Ss received feedback after each setting of the 
knob by presentation of the correct value of the 
criterion, Y, as a number on a special numb ý 
display. The value of Y was computed accordin d 
to Y = f(X) + e, in which e is a random compe 
nent, the value of e being obtained by rando n 
sampling from a normal distribution with a meat 
of 0 and a variance of 16. This variance is referred 
to as the criterion variance, s¿?. 

Three different experimental conditions W@ 


created by using three different functions. Thes 
functions were: 
Y = 015X — .8X + 21, u 
Y —.015X* — 2,2X +91, E 


Y = 032X? — 3.2X + 91. [3] 


riterion function form represented by 
1 is a J arc, the form that is represented 
ition 2 is a reversed J arc, and the form of 


Fur n3isa U arc. 
I) sing learning, S was sitting at a table in front 
of scope display. On this table were mounted 


th sponse knob (provided with a scale), the 
nu- ber display, and a microswitch. Each learning 
t started with the presentation of the cue. Then 
ıs requested to turn the response knob into the 

red scale position. After this response S could 

t the presentation of a number on the number 
play (ie., feedback) by depressing the micro- 

itch, Twenty such learning trials constituted a 

irning run. he test, which followed upon each 

arning run, consisted of seven different cue values 
which were equally spaced along the .Y continuum, 
during every test, the same seven cue values were 
presented 10 times without feedback. In every test 
the cues occurred in different random orders. 

Presentation and recording was done automati- 
cally by means of punched tape. The output tape 
contained the specific values of X that were pre- 
sented during testing and the corresponding values 
of Y. 

The 5s were told that Y was a function of X plus 
or minus a random deviation, and that it was their 
task to discover this function and to use it as ac- 
curately as possible during testing. Each S worked 
under one condition. The Ss made 10 learning runs 

ilternating with 10 test rur The whole expe 

ment took about 2 hr. per subject. The Ss 
paid. 

Twenty-four Ss participated in this study. They 
were randomly divided into three groups with eight 
5s in each. The Ss were unfamiliar with this type 
of experiment, 

Results, For each S and for each run the value of 
D was computed according to D = = Yon, 
where n is the number of trials, 

^n analysis of variance was performed on these 
D scores, The results showed significant main 
effects of function form, F (2, 23) = 10.86, p < .001, 
and run, F (9, 23) = 25.31, p < .001. The inter- 
action between function and run was not significant, 

"The results of this analysis of variance indicated. 
that Ss’ performance improved as a function of 
learning. In Figure 1, mean D scores are plotted 
against the number of test runs for each of the three 
conditions. Performance gradually increased as 
training progressed and the overall performance was 
largely dependent upon the function form. 

For each S the variance in the responses for each 
of the seven cues was computed per test. These 
response variances, sj’, were then divided by the 
criterion variance, se. 

The overall analysis of variance that has been 
carried out on these relative variances indicated 
significant main effects of function, F (2, 108) = 
132.50, p < .001; run, F (9, 108) = 24.20, p < .001; 
and cue, F (6, 108) = 9.09, p < .001. Of the inter- 


e 


NX, 


RUNS 


Plot of D as a function of test runs for 
each experimental condition, 


FIGURE 1. 


actions only those of Function X Run, Æ (18, 108) 
= 6.53, p < .001; and Run X Cue, F (54, 108) = 
2.05, p < .01, were significant. 

"The mean relative variances are presented in 
Table 1, Within every condition the mean variance 
response system exceeded the criterion 
variance. 

Discussion and conclusions. The present results 
provide several conclusions. First, the data clearly 
show that performance gradually improved over 
test runs. The rate of learning was approximately 
the same for all three functions, The D measures, 
which reflect the accuracy with which the functions 
are used, demonstrate that the extent to which Ss 
learn and utilize relations depends largely upon the 
task function form, A U-shaped function is more 
difficult to learn than a reversed J-shaped function; 
a reversed J-shaped function is more difficult to 
learn than a J-shaped function. These findings are 
in agreement with the conjectures that were derived 
from the progression hypothe: 

A second conclusion is that the above mentioned 
differences between functions remain approximately 
constant over the range of test runs, Thus, the 
differences between functions do not decrease as 
training progresses. This is in agreement with the 
results of Brehmer (1971), who found that 5's skill 
in using a function depends largely upon the par- 
ticular function form S has to use. Especially when 
the task requires S to learn a U-shaped function 
form, performance appeared to be rather poor. 

A third conclusion is that the relative variance: 
expressed by the s,7/s;* ratios, decrease with train- 
ing. The results indicate that the extent to which 
these ratios decrease is significantly influenced by 
the form of the function that is to be learned. 
Within every condition, the variance in S's response 
system exceeds the variance in the task system. 
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TABLE 1 P 
MEAN RELATIVE VARIANCES PER RUN FOR EACH FUNCTION 
Run 

action EE — 

1 2 3 4 5 6 7 8 | 10 

J shaped 10.9 6.2 24 19 2.5 2.2 2.0 1.8 | mi 

Reversed J shaped 20.0 11.5 nS 3.6 $5.4 45 2.5 37 TA 

U shaped 25.6 18.4 16.1 13.1 10.3 10.3 13.5 13.3 9,3 
This finding is inconsistent with the results of given. The minimum value of s;? pr: depends 
Björkman (1968), who found s,!/s? ratios which upon the function form that is to | rned. If 
were below unity, especially at the end of training. this asymptotic value of s,? exceeds 1! ue of sdy 

Björkman (1968) suggested that a single-cue then the relative variance will only : ase as a 
probability learning task required the S to learn function of learning. Since in the pr: tudy the 
(a) the function relating criterion to cue values value of sê is relatively small (i.c siderably 
(i.e., functional learning) and (b) the distribution smaller than in Bjórkman's, 1968, stu ^is might 
around this function (i.e., cue probability learning). be the case here. 

This two-process theory predicts that the relative 
variance should first decrease (as S learns the func- REFERE 
tion) and then increase (as S learns the distribution — Bjyüakwaw, M. Learning of linear function: parison bes 
of feedback values). Since the present results show en S Univ papaline diops: NA Re 33 oe 
s > : y Stockholm c 
only that the relative variances decreases, they do s, 1 = 
not offer support for this theory. This might be bed effect of training and mur stimuli o 
z er Sá d sponsi ; ne 
due to two different factors. First, it is possible chological R ep. 2) Umea, Suedets “Universit ced, 1968, 
that 10 successive runs of 20 learning trials only — Beim, B elie ability to use functio». ‘ules. Psy- 
-S ; M i chonomic Science, 9-260, 
cover the first stage of the learning proc ess and not pe KLERK, L. EW Wek Orre, S. Rüunctlonal cur ning under 
the second stage (i.e., cue probability learning). different, probabilitic, feedback conditions. — J:ychonomtes 
n ann m z cience, —380. 

Second, due to S's inability to use a quadratic Ester, H., & SroLANDER, K. On the sign o iope in the 
function correctly, there always will remain a certain keamning of linear functions. Scandinavian J nal of Ps 
amount of (error) variance in S's response system, AOTER E618 
even when a sufficient number of learning trials is (Received May 21, 1973) 
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Apparent rate grows much more rapi 
as words per minute increase (expo 
comprises three variables, howe 
and the durations of the pauses. Wh 
to him) he halves the number of p 


slightly, Conversely, the listener is much more impressed by changes 


lation rate than in pausing (exponen 
processes of speaker and listener are 
in the two variables have more com 
pressed in units of apparent magnit 
mapping autophonic into extraphonic 
rate is one of compre: 


: the number of paus 


ques, Université de Paris III, Paris, France 


idly for the speaker than for the listener 
nents 2.6 vs. 1.6). The rate of speaking 
articulation 
en a speaker doubles his rate (as it seems 
auses, altering the other variables only 
narticu- 
ts 1.5 and —.23). Thus, the perceptual 
somewhat complementary, and changes 
parable effects on the listener when ex- 
ude for the speaker. The net effect of 
magnitudes in the transmission of speech 


ion, both for overall rate (exponent .6) and for the com- 


ponent articulation rates (.44) and pause frequencies (.21). 


A speaker may alter his rate by changing 
the number of pauses, the articulation rate, 
or the durations of pauses, in various 
degrees. The autophonic scale that de- 
cribes the speaker's perception of his own 
ate has a slope of 2.6 when expressed as a 

anction of words per minute (in log-log 
oordinates) and slopes of —1.1, 3.4, and 

4.3 when plotted as a function of the 
respective components of these overall 
rates (Lane & Grosjean, 1973). It follows 
that when a speaker varies his rate of 
reading a known passage, he primarily 
adds or subtracts pauses (at strategic 
syntactic locations); he alters articulation 
rate and pause duration much less. The 
same is true for spontaneous speech: 
"Increase of speed in talking is due largely 
to the closing of gaps and to the heightened 
continuity with which movements per- 
formed at relatively constant rate succeed 
each other [Goldman-Eisler, 1968, p. 26 ]." 
An equivalent way of describing the rela- 
tive importance of these variables for 
autophonic rate is to point out that as 
reading rate increased by a factor of 2.7 
(close to the total operational range for 


1 Requests for reprints should be sent to Harlan 
Lane, Department of Linguistics, University of 
California, San Diego, La Jolla, California 92037. 


reading sentences), the three component 
variables changed by factors of 10.3, 2.1, 
and 1.6, respectively. 

How the three components of rate control 
the listener's perception is unknown. We 
do know from the autophonic study (Lane 
& Grosjean, 1973), in which a representa- 
tive speaker's productions were also played 
back to listeners for numerical estimates, 
that apparent rate grows much more slowly 
for the listener than the speaker as words 
per minute (wpm) increase (slopes of 1.5 
vs. 2.6). This difference in slopes indicates 
that the speaker's perception of his own 
rate is not based solely on auditory cues. 
Therefore, pause frequency, articulation 
rate, and pause duration may not have the 
same relative importance for the listener's 
perception of rate as they do for the 
speaker ; it may matter a great deal whether 
it is your own tongue that is wagging. In 
the present experiment, then, the two vari- 
ables that proved most salient in the per- 
ception of autophonic rate were decoupled, 
and each was varied systematically in a 
5 X 5 array of rates presented to listeners. 
We examined the relative contributions of 
pause frequency and articulation rate to the 
listener's extraphonic perception of rate, 
the invariance of the cue hierarchy across 
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5 
T 
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WORDS PER MINUTE (LOG SCALE) 


Ficure 1. The extraphonic scale of reading rate. 
(Each point is the geometric mean of 68 rate esti- 
mates, 4 by each of 17 listeners. The 25 values of 
words per minute reflect 5 levels of articulation rate 
at each of 5 levels of pause frequency.) 


modalities, and the reproducibility of the 
extraphonic scale with somewhat different 
procedures and stimulus ensembles. 


METHOD 


Seventeen graduate students, native speakers of 
English with no reported speech or hearing defects, 
served in groups of three or four, while seated in an 
audiometric room for 1} hr. They wore Opelem 
binaural headsets supplied in parallel by an Opelem 
2M-70 studio tape recorder. 

The method of magnitude estimation (Stevens, 
1956) was used to measure the perception of speech 
rate. The Æ played a recording of the experimental 
passage, read with a midvalue of number of pauses 
(7 pauses or 9.4 syllables/run) and of articulation 
rate (4.5 syllables/sec). The Æ assigned the 
numerical value 10 to this standard rate (146 wpm). 
Next, he played 100 recordings of the passage, and 
the listener assigned to each a number proportional 
to its apparent rate. In all, 25 rates of. reading by 
the same speaker, ranging from 85 to 225 wpm, were 
presented four times, in irregular order. Each rate 
corresponded to a combination of a level of articula- 
tion rate and a level of number of pauses, shown on 
the coordinates of Figure 2. (The length of the 
pauses was held constant, with tape splicing, at a 
mean value of .56 + .01 sec. Each articulation rate 
was constant within 1% over the five levels of pause 
frequency.) The same passage employed in the 
extraphonic study and comparable ranges of the 
variables measured there were adopted in the 
present study. However, the rates presented here 
are somewhat lower (average wpm 146 vs. 165), 
thus higher pause frequencies (9.6 vs. 6.3) and lower 
articulation rates (4.5 vs. 4.8 syllables/sec) are 
naturally encountered. 


SHORT REPORTS 


The experimental passage contained the folli 
51 words of text (including contractions), compris 
75 syllables (the number at each pause emplacem 
shows the lowest pause frequency that entai 
pausing there; all higher pause frequencies also 
used that emplacement): 


as far /24/ as I know /7/ I'm a fairly /24/ noi 
/13/ fifteen year old /3/ neither a complete /1 
psychological /24/ case /7/ nor a cut /13/ abo 
/24/ the others /1/ I listen /24 
Luxembourg /7/ my hair /24/ fails forward /18; 
in the fashionable /24/ style /3 
/13/ polo neck /24/ sweaters /7 
/24/ consider myself /13/ a great /24/ pop fan. i 


The hierarchy of pause emplacements is that ob- 
served in the autophonic experiment of Lane andi 
Grosjean (1973). 


RESULTS AND DISCUSSION 


The extraphonic scale of reading rate is 
shown in Figure 1. The magnitude estima- 
tions are well fit by a straight line with 
slope of 1.7, in logarithmic coordinates. 
(The mean and SD of the 17 individual: 
slopes are 1.7 and .56., respectively) 
The scale is substantially the same as that 
obtained with five rates of reading in whic 
pausing and articulation rate were normally” 
coupled, not varied independently (slop 
= 1.5, SD = .37; Lane & Grosjean, 1973). 
A slope of 1.6 is found in the latter study” 
over the range of words per minute em- 
ployed here; moreover, moderate levels of 
the two variables in the present experiment 
(3 X 3 subensemble) yield a slope of 1.62. 
We conclude that the extraphonic scale 0 
reading rate is reasonably invariant over 
these differences in stimulus array and 
procedure, with a slope (exponent in linear 
coordinates) of 1.6. 

The contributions of pause frequency an 
articulation rate to perceived reading rat 
may be examined in three different ways: 
of which give the same general picturi 
The perceived-rate solid in Figure 2 sug- 
gests that the two components of overall 
rate contribute about equally over the 
ranges employed (approximately those €n 
countered by Lane & Grosjean, 1973, ! 
actual productions). An analysis of và 
ance of rate estimates confirms that pau! 
frequency and articulation rate are equal 
important: the two mean squares are 3,2 


SHORT REPORTS 
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oc O «d rm c c 
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FIGURE 2, 


Numerical estimates of apparent rate as a function of two compo- 
nent variables, pause frequency and articulation rate. 


(The 25 line inter- 


sections plot the geometric means of 68 rate estimates, 4 by each of 17 listeners.) 


and 2,919, respectively; F (4, 64) — 98 and 
109, p «.01. The between-Ss mean 
quare is 136, ns. The interaction of pause 
requency and articulation rate, evident 
n the graph, is statistically significant, 
nean square = 136; F (16, 256) = 23, 
p < .01. Eighty-four percent of the inter- 
action sum of squares is associated with 
the one df corresponding to the bilinear 
component) This suggests that the two 
cues combine multiplicatively to yield the 
net impression of overall rate. Thus, an in- 
crease in articulation rate may amplify 
the effects of pause frequency on apparent 
rate. The same result would be obtained, 
however, if the listener combines the two 
cues linearly under conditions in which the 
estimates are not linearly related to appar- 
ent rate, or if the weights assigned to the 
cues for integration vary as a function of 
stimulus magnitude. 

A third way to evaluate the relative 
contributions of pause frequency and artic- 
ulation rate, and to compare their im- 
portance for the speaker and the listener, 


? The authors are grateful to Norman H. Anderson 
for calling this aspect of the results to our attention 
and for aiding us in applying his theory of func- 
tional measurement (Anderson, 1973). 


is to examine the rates of growth of auto- 
phonic and extraphonic perceived rate as a 
function of the two components and overall 
words per minute. A speaker instructed to 
double his apparent rate will change pause 
frequency by a factor of 2, articulation 
rate by 1.2, and overall rate by 1.3 (since 
the autophonic slopes are — 1.1, 3.4, and 
2.6). A listener would estimate, however, 
that the changes were, in fact, 1.2, 1.3, and 
1.5 to 1, respectively (since the extraphonic 
slopes found in this study are —.23, 1.5, 
1.6). Consequently, equal changes in 
pause frequency and articulation rate have 
more comparable effects on the listener 
when those changes are measured in psy- 
chological units along the autophonic scale. 
Extraphonic rate grows approximately six 
times faster as a function of articulation 
rate than pause frequency, expressed in 
physical units. However, the rates of 
growth as a function of the corresponding 
apparent (autophonic) magnitudes are 
more similar: .44 and .21, respectively 
(absolute values). Thus, the perceptual 
processes of speaker and listener are 
somewhat complementary. The net effect 
is one of compression: extraphonic rate 
grows as the 1.6/2.6 — .6 power of auto- 
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phonic rate. It follows that speakers and 
listeners will not agree on what constitutes 
a doubling of rate (or halving, tripling, 
etc.), and that speakers will always fall 
short when trying to imitate the rate 
changes of a model. 
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AROUSAL AND RECALL: 
EFFECTS OF NOISE ON TWO RETRIEVAL STRATEGIES! 


STEVEN SCHWARTZ? 


Northern Illinois University 


In a test of the notion that arousal affects the retrieval strategy of pigeon- 
holing by decreasing the separation between risky and cautious criteria, 75 
Ss heard short meaningful stories accompanied by either No, Medium, or 
High noise and were asked to recall the characters’ names. Common names 
were better recalled than rare names in each noise condition. Using a dei 
theory analysis it was found that, in the No Noise 


risky criterion for common nam 
noise, however, Ss employed a 


interpreted as supporting the hypothe: 
of information for retrieval. 


Drawing on his work in perception, Broadbent 
(1971) has identified two memory retrieval strate- 
gies—filtering and pigeonholing, According to 
Broadbent, filtering occurs when one adopts a 
“stimulus set," choosing items to be recalled on the 
basis of some common feature (e.g., acoustic simi- 
larity) and ignoring those items in memory without 
this feature. Filtering, therefore, involves the 
grouping of input on the basis of physical char- 
acteristics. Pigeonholing, on the other hand, occurs 
when one adopts a “response set” selecting from a 
large number of items (e.g., a list of words), those 
constituting a subvocabulary (e.g, the names of 
colors). Thus, filtering leads to stimulus lectivity, 
whereas pigeonholing results in response selection, 
When an operating characteristic is derived relating 
the probability of a correct response to the proba- 


! The author is indebted to D, E. Broadbent for his con- 
structive criticism and to a number of students and colleagues 
whose comments greatly improved this article. Any errors in 
the final product are, of course, the responsibility of the author, 

? Requests for reprints should be sent to Steven Schwartz, 
Department of Psychology, Northern Illinois University, 
De Kalb, Illinois 60115. 
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nd a cautious criterion for rare names. 


milar criterion for recall of both common and 
rare names, but sensitivity increased for common 


A possible mechanism for arousal's action, as 
well as the theoretical implications of these findings, was discussed. 
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cision 
condition, Ss employed a 
In 


ames, The results we 


1 affects the accessibility 


is that arous 


bility of an incorrect response (false alarm), pigeon 
holing produces an increase in the number of correg 
responses as the false-alarm rate increases, where ag 
filtering produces a change in the number of correct 
responses with a constant false-alarm rate. : 
decision-theory terms, filtering affects d', the dil- 
ference between the correct response rate and t 
false-alarm rate, whereas pigeonholing affects f, th 
critical value of likelihood ratio above which 
responds positively. 
Despite recent interest in applying decis 

measures to memory, the effects of arousal on d^ am 
B as estimated from recall data have not beet 
explored. On the basis of vigilance research, ho! 
ever, there is reason to believe that arousal affec 
pigeonholing but not filtering. Broadbent anc 
Gregory (1965) found that, in a vigilance situatio 

arousal results in a change in the number of 
sponses assigned to intermediate confidence level 
That is, aroused Ss were more certain that a sig! 

was presented or more certain that no signal Wå 
presented than unaroused Ss. Such a mechani 
may produce contradictory effects on pigeonholing: 


SHORT REPORTS 


Specifi hen conditions favor a cautious cri- 


terion ( gh) a change in the number of not sure 
respon. iay improve performance, whereas, when 
conditio favor risky responding, arousal may im- 


rmance. 

sal's effect on recall is substantially similar 
ct on vigilance, contradictory results would 
¿ted depending on the value of 8 at the out- 
order to assess this hypothesis, it is nece 
know in advance when Ss will adopt a risky 
on and when they will adopt a conservative 
oi so that the discrepant effects of arousal may be 
out. Fortunately, such a situation is 
c established. The present study employed a 
D ication of a technique devised by Ingleby 
9), In that study, Ss were asked to recall the 
names of characters in short stories, Probe 
mes were suggested, and Ss were asked to indicate 
he name was correct and to give a rating of con- 
lence in their judgment, Although d" was actually 
eater for rare names, recall for common names 
vas superior because Ss employed a riskier criterion 
(8 was lower) for common names, ven a similar 
experimental task, the major hypoth of the 
present study was that arousal would affect pigeon- 
holing by producing a decrease in the separation of 
criteria (8) for rare and common names while d’ 
remained unchanged, It was hypothesized that 
8 for rare names would decrease, thus reducing the 

idvantage of common names. 
Method. The Ss were 75 introductory psychology 
udents who received course credit for participa- 
ion. AM Ss reported having normal hearing; Ss 
irticipated in groups of five, E nental ses 
os were conducted in the university language 
|.horatory. The room was furnished with desks 
separated. by side panels, ch desk contained a 
set of monophonic earphone: 
l'hree short meaningful stories were used in the 
present study, The stories, which ranged in length 
from 375-525 words, were chosen from past issues of 
the New Yorker magazine, Each story had four 
characters, two of whose names were judged common 
and two of whose names were judged rare based on 
the frequency of their occurrence in the Chicago, 
Illinois, telephone directory. The mean frequency 
of common names was 920 (of about 900,000 tele 
phone subscribers); the mean frequency for rare 
names was 43. The stories were recorded and 

presented on tape. 

Upon entering the language laboratory, each S 
vas seated at a booth, asked to put on his earphones, 
and given an instruction and answer booklet, The 
instructions informed S that the experiment 
concerned with memory under various condi 
All Ss were told that they would hear brief stories 
over their earphones and that a transcript of each 
story appeared in the answer booklet. Blanks in 
in the transcripts corresponded to character names. 
Corresponding to each blank was a suggested name 
and a rating scale. During the initial story presen- 
tation, the room was kept completely dark in order 
to discourage Ss from making entries in their answer 
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point is the mean 8 for the five groups in each condition.) 


booklets. At the conclusion of each story, the 
lights were turned back on, and Ss were required 
to read the transcripts and decide whether the name 
suggested was correct or incorrect and to give a 
confidence rating in their judgment on a 5-point 
le ranging from 1 (certain this is the correct 
name) to 5 (certain this is not the correct name). 
For each story, two of the suggested names (one 
common and one rare name) were correct, while the 
remaining common and rare suggested names were 
incorrec Each .S, then, had three occasions to 
judge each of four types of suggested names: a 
rare name that was correct, a rare name that was 
incorrec common name that was correct, and a 
common name that was incorrect. 

The Ss were randomly assigned to one of three 
noise conditions (No, Medium, High). The No 
Noise group heard the stories on tape, recorded by 
a male graduate student and broadcast at about 
90 db. (A). (The abbreviation db. (A) refers to 
readings made from the A scale of a sound-level 
meter, where high frequencies are weighted more 
than low frequencies.) The Medium Noise group 
heard the stories accompanied by a eontinuous white 
noise, broadcast at about 65 db. The High 
Noise group heard the stories and 85«db. (A) con- 
tinuous white noise. For all Ss, the earphones were 
silent during the period from the end of one story 
to the beginning of the next. 

Results. Due to the small number of judgments 
each S was asked to make (six for name type), 
there was little evidence from false positives for 
some Ss in the "certain" category. For this reason, 
each session's data were pooled across Ss. Since Ss 
participated in small groups of five per session, ed ch 
noise condition required five sessions, For statis 
cal analyses, then, the unit was ions and each 
noise condition had a sample of five. 

Following the procedures described by Broadbent 
(1971, pp. 70-76), 8 for rare and common names 
was calculated for all rating categories combined. 
As Figure 1 illustrates, with increasing noise, fj 
decreased for rare names, but increased for common 


3 Differences in the resistance of the earphones made the 
sound level estimate only approximate. The actual level for 
an individual S may have varied by about 3%. 
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groups in that. condition.) 


names, An analysis of variance performed on these 
data confirmed the significant Noise X Name inter- 
action, F (2, 12) = 4.50, p < .05, An a posteriori 
test on the mean 8 scores (Winer, 1962, pp. 309-313) 
indicated that although 8 was lower for common 


- names in the No Noise condition (p < .05), à did 


not differ for rare and common names in the two 
noise conditions, 

Despite the decrease in the separation of s with 
noise, performance measured by hit rates did not 
deteriorate for common names relative to rare 
names. An analysis of variance of hit rates in each 
confidence level indicated that the hit rate for 
common names was higher than for rare names at 
each noise level, F (1, 12) = 7.71, p <.05, 

Since g for rare names did not differ signif- 
cantly from d for common names in noise, it seems 
plausible that changes in d' may have been respon- 
sille for the continued superiority of common names, 
Changes in d' are depicted in Figure 2. A» indi- 
cated, the mean values for d' were 1.03 and 1.24 for 
rare and common names, respectively, in the No 
Nowe condition, but 1.08 and 1.69 in the High 
Noise condition, Only the latter difference. was 
significant, 4 (4) = 3.0, p < 05, 

Discwision.— The results of the present study were 
in accord with those of Ingleby (1969), Ia the 
No Noe condition, the condition most directly 
comparable, common names were recalled with 
greater accuracy than rare names, despite the fact 
that d" did not differ for the two types of names, 
The advantage of common names was due to the 
lower value of Ø for these names. The introduction 
A white-noise-imduced arousal led to an increase in 
A (and pigronholing) for common names and a 
decrease in A for rare names, This result supports 
the hypothesis that arousal improves pigeonhoting 
when conditions favor cautious responding, but 
ee Végronboling when risky responding is 
favored. 


dicted changes in overall responding. In 
arousal condition, common names were reg 
with greater accuracy than rare names. 
periority of common names in noise was duei 
increase in d', which tended to counteract 
verse effect of noise on 8. Thus, it would 
that, for common names at least, arousal 
filtering as well as pigeonhol 
‘The apparent increase in filtering with incre 
arousal noted for common names is in line 
the results of a study by Hörmann and Ost 
(1966). They reported a decrease in "elus 
in free recall with noise, although similar m 
of words were recalled in the quiet and noise 
tions, To the extent that clustering depe 
pigeonholing, noise-induced arousal operated 
manner similar to the findings of the present 
That is, arousal seemed to make filtering 
important than pigeonholing as a retrieval 
Arousal, therefore, will only lead to improved: 
if the material may also be recalled by fill 
The actual effect of arousal on recall would be 
result of the interaction of a number of facto 
cluding the nature of the material to be recalle 
criterion adopted, and the efficacy of filtering 
It should be noted that the failure to fi 
increase in filtering for rare names in the p 
study may have been due to inherent differ 
discriminability. Thus, although the signalto " 
ratio was quite good, it is possible given the 
dynamic range of speech that some loss of acon 
information occurred due to masking. Such 
ing may have affected common and rare 
differentially because com names po 
phonemic characteristics resistant to maski 
because stich names contained common lette 
features which were likely to be selectively 4 
amined by an aroused S (cf. E brook, 19 
In any event, an explanation based solely on 
ing would be hard pressed to account for the el 
in pigeonholing, since 4 decreased for n 
names, whereas 8 increased for rare namos, 
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VERBAL DISCRIMINATION LEARNING FOR BILINGUAL LISTS! 


axo DONALD H. KAUSLER * 


University of Missouri 


The standard decrement in verbal discrimination learning for an interpair 
relationship between right (R) and wrong (W) items was found when the ro- 


lated words were bilingual translations of one another, 


ment occurred only when the R item 


lish) and the W items from the secondary language (German). 


However, the decre- 
^ were from the primary language (Eng- 
The results 


were interpreted in terms of the igo tagging of different set» of features 


for primary language words as 
R items, 


veral studies. (eg, Eberlein & Raskin, 1968; 
kerson & Kausler, 1969; Zimmerman, Shaughn- 
;, & Underwood, 1972) have demonstrated a 
rement in verbal discrimination (VD) learning 

obclency for lists containing interpair associative 

iatedness between the right (E) and wrong (W) 
ems, The featuretagging hypot of VD learn 
ing (Kausler, 1975, in press) attributes the decre 
me to bp of tagged 
requency of response 
the related R and W items, False recognitions of 
W items as being correct occur. when 
the semantic features of t items were t 
vhewhere in the list via the rehearsal and 

í their related R lenm, By contrast, the same 
dies failed to find a comparable decrement for 
ts containing intrapalr asociative relatedness 
tween the R and W items, Ir pees err demi 
pensat: process operates the intrapair. 
: diim whereby S loce Wis promulag aud 
(cing on those features of R items that are 
ace MEE ae bs. — 
graphio phonemic features; , in press. 

The cited studies employed standard 
word associates as the means of manipulat 
mantie relatedness, The present study the 
grovrality of the disparity between interpair and 
intrapair conditions by employing a different form 
E — A vin, 

terpair and intrapair experimental 
flatesd R and W items were English-German trame 
lations of one another. For the control comfition 
the bilingual Heme were semantically 
‘The Se in each condition were students pearing the 
completion of second year college German. 

A question of further interest concerned the dil- 
ferential effecta on VD performance of 
secondary language words a» R items. 
tance of the language variable was suggested by the 
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s of an eather Kamie & ry 
) with Spanbh. bilingual Mas 
Spanish heir primary language, English thei 
semdary i Te experimental conditions were 
IU Ken Sr ee he Em th p 


items than for secondary language words as 


formulated within the context of a Wi — Ry, Wy 
= Ry transfor paradigm (Le, the R items of List 
2 were related to those of List 1, whereas the W 
items of the two lists were unrelated). Significant 
mediated facilitation was found, relative to a Wy = 
Ky, We ~ Ry control condition, when the Ry items 
words and the Ky’ items 

secondary language translations, but not 
the interiit relationship was reversed. A 
effect in the present study would in« 


were 
were t 
when 


‘The Se were 00 students in fourth semester German 
classes at the University of Missouri. wre 
qaid for thelr participation dn the stinty, amd all of 
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with English words as R items, and 6 pairs with 
German words as R items) across all conditions 
prevented the application of the simple rule of 
always selecting words of one language as R items. 
For the intrapair condition the R-W pairings were, 
of course, cross-language translations of one an- 
other. For the interpair condition the R-W pair- 
ings were formed by re-pairing randomly the 
English-German translations within the separate 
sets of the intrapair condition. For the control 
condition either the English words or the German 
words of the experimental conditions were replaced 
by other members of the total word pool. Finally, 
interitem relatedness, both semantic and ortho- 
graphic, other than that of direct experimental 
interest, was minimized as much as possible in 
terms of E's judgment. 

Each S$ received 10 trials by the anticipation 
method, with the first being a guessing trial. The 
rate of exposure was 3:3 sec., and the intertrial 
interval was 6 sec. Standard VD control pro- 
cedures were employed. That is, the pairs were 
arranged in different serial orders across trials, the 
R items appeared equally often on the spatial right 
and left, etc. The experimental list was preceded 
by 3 trials on a short practice list conforming to S's 
specific list condition. 

Results and. discussion. Means and standard de- 
viations, respectively, for total errors summated 
across trials and language sets were 18.60 and 9.73, 
15.65 and 6.58, and 13.10 and 5.13 for the interpair, 
intrapair, and control conditions, respectively. Er- 
rors were analyzed separately for the two within- 
list sets. Considering the English set first, the 
means and standard deviations, respectively, were 
11.45 and 8.15, 8.10 and 4.70, and 6.40 and 2.93 for 
the interlist, intralist, and control conditions, re- 
spectively. Comparable values for the German set 
were 7.15 and 3.03, 7.55 and 3.28, and 6.70 and 3.25. 

A 3 X2 mixed analysis of variance revealed a 
nonsignificant main effect for list condition, F (2,57) 

= 2,77, p < .10, a significant main effect for lan- 
guage set, F (1, 57) = 8.94, p < .01, and a sign 
cant List Condition X Language Set interaction 
effect, F (2, 57) = 7.76, p < .005. In view of the 
significant interaction effect, the effect of list con- 
dition was determined separately for the two lan- 
guage sets. For the English set the between- 
condition effect was significant, F (2, 57) — 4.13, p 
< .02. A Duncan test yielded a significant com- 
parison between the interpair and control condition 
means (b <.01), Neither the interpair-intrapair 
comparison nor the intrapair-control comparison 
reached statistical significance (ps < .10 and > .10, 
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respectively). Given the pronounced 
of variance apparent between list « 
between-condition effect was also analy i 
of the median test. The between-ce 


rogeneity 
ions, the 
y means 
ion effect 


was again significant, x? (2) = 7.93, .02, but 

only the contrast between the m of the 

interpair and control conditions reach nificance, 

x (1) = 4.95, p < .05. Finally, for German 

set the between-condition effect fel short of 
istical significance, F (2, 57) < 1. 

Thus, in agreement with previou lies, the 
present results indicated a decrement D learn- 
ing for a list containing an interpai: :tionship. 
but not an intrapair relationship. ver, the 
generalization to bilingual relatedness estricted 
by the fact that the decrement occurr: hen the 
R and W items were primary and s jary lan- 


One 
mmetry 
recipi- 


guage words, respectively, but not vic: as 
possible explanation for the bilingua! 
is that different sets of features are the 


ents of frequency of response cues wi primary 
and secondary language words functio t items, 
Semantic features of primary word item «re likely 
to be processed and tagged via th: chearsal 
during feedback interva Since these |--:ures are 
shared by their secondary language co: -terparts, 
false recognitions of the latter are possible when 
they are encountered (as W items) during later 


anticipation intervals. On the other hand, phonemic 
and orthographic features of the less familiar sec- 
ondary language words are likely to be processed 
and tagged when they serve as R items. Since 
these features are not shared by their primary. 
language counterparts, false recognitions of the 
latter are unlikely to take place when they are en- 
countered during later anticipation intervals. 
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-TITION EFFECTS IN ICONIC AND VERBAL SHORT-TERM MEMORY ! 


DEREK BESNER,? J. K. KEATING, LESLIE J. CAKE, anp RICHARD MADDIGAN 


Memorial University of Newfoundland, St. John's, Newfoundland, Canada 


The Sperling 1960 procedure of full and partial report was used with tachis- 
toscopically presented arrays of 100-msec. duration in two separate experi- 
ments. A digit slide was repeated 48 times per ion, with 48 slides of non- 
repeated series (digits in Experiment I; letters in Experiment l) interpolated 
between repetitions. The results of both experiments demonstrated better 
recall of the repeated array. It was concluded that Turvey's 1967 observa- 
s dillerentiate verbal and iconic memory may be 


tion that repetition effi 
incorrect. 


perling (1960) viewed the difference between 
and full report of tachistoscopic arrays as 
iecting qualitative as well as quantitative memory 
fects, That is, recall under full report was inter- 
preted as being from a verbal short-term store, 
hile recall under partial report was assumed to 
flect a sensory (iconic) trace. Given the existence 
of a separate iconic memory store, an important 
question concerns the kind of operations that might 
serve to illustrate different properties of these two 
memory systenis. 
In support of this notion Turvey (1967) reported 
two experiments which indicated that partial report 


all of a repeated 3 X 5 digit matrix was no 
rent from that of nonrepeated matrices inter- 
olated between the successive repetitions, These 


‘sults were considered to be consistent with the 
‘ea that a sensory trace should show no effect of 

petition, sir it deals with the retrieval of a 
rclatively unencoded stimulus, while other memory 

tems yield results like the “Hebb effect” (Hebb, 
1961). 

It has recently been suggested however (Dick, 
1971) that partial-report techniques are more a 
source of methodological artif: than evidence 
supportive of a sensory trace. Dick's findings taken 
in conjunction with Turvey's (1967) suggest that a 
more reasonable converging operation would include 
a fullreport condition. If Turvey’s results are 
correct, one would expect a report by repetition 
interaction, in which a repetition effect occurs in 
whole report, but not in partial. 

Two experiments are reported here in which both 
full- and partial-report techniques have been used. 
The second experiment was a replication of the 
first, with the exception that the nonrepeated slides 
were composed of letters rather than digits, as in 
Turvey's (1967) Experiment IL. 

Method. The Ss in Experiment I were five male 
students and one female student solicited from the 
Department of Psychology at Memorial University. 
The Ss in Experiment II were three males and three 
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females from the same population. All Ss had no 
previous tachistoscopic experience. 

Materials in both experiments were presented by 
means of a three-field tachistoscope, Scientific Pro- 
totype Model GB. The stimulus materials con- 
sisted of 96 slides containing three rows of five 
digits each. These were always dark letters on a 
white background. No items were repeated more 
than once within a single array, and never within a 
single row. In Experiment I the numerals 1-9 were 
used: in Experiment II the interpolated slides con- 
tained letters instead of digits—vowels and the 
letters Y and Q never appeared. 

The exposures were of 100-msec, duration, fol- 
lowed by a 100-m: delay, at which point a 50- 
msec.-duration partial-report cue appeared, Preand 
post adapting fields were white light, as was the 
interval between array offset and cue onset, Mea- 
surements with a Macbeth illuminometer indicated 
that the stimulus field had a background value of 
5.8 ftL., while the letters were .2 ftL. Pre and post 
fields were 5,0 ftL. each, while the partial-report cue 
field had a background value of 5.0 ftL. and the 
bar marker, .2 ftL. The cue for partial report was 
a }-in. bar marker which appeared ¢ in. to the right 
of the row to be reported. The total array sub- 
tended a visual angle of 1.25? horizontally and .75* 
vertically, while the average letter was .16° h 
and .14? wide. The intertrial interval was approxi- 
mz 


S received. 96 preexperimental training 
trials on Day 1. This involved introducing Ss to 
the general procedure, establishing facility with the 
poststimulus cuing procedure, and setting the pace 
which would be used throughout the experiment. 
Repeated slides were not used during the training 
phase. The Ss were given two kinds of report, full 
and partial. Under full-report conditions, 5s were 
told to write down as many of the items as they 
could remember, and to place them in the pos 
that they occupied in the display. In the partial- 
report condition, Ss were told to write down only 
those items that appeared in the cued row, again in 
their correct spatial location. The partial-report 
condition was a Sperling (1960) arrangement in 
which cues were quasi-randomly presented, indi- 
cating top, bottom, or middle rows of the display. 
The only constraint was that each row be cued an 
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TABLE 1 
MEAN PERCENTAGE RECALL 


Whole report Partial report 


Experiment 


Non- > Non- 
Repeated repeated Repeated. repeated 
I 27.80 21.45 32.4 25.4 
iL 37.4 20.81 42.2 26.75 


Note, Partial report scores are calculated on the basis of 
number correct out of 5; whole report scores are calculated on 
the basis of number correct out of 15. 


equal number of times within a session. Type of 
report was counterbalanced across Ss and testing 
days. Thus half of the Ss received partial report 
first on Day 2, and second on Day 3. The remain- 
ing half received partial report second on Day 2, 
and first on Day 3. 

On the second and third days, Ss received 96 
trials/day. In Experiment I the repeated slide 
was presented every second trial; every other trial 
consisted of a random digit slide. In Experiment 
I the interpolated slide consisted of letters. With- 
in a daily session, Ss received 48 presentations of the 
repeated slide and 48 presentations of the random 
slides. Under each type of slide, report was half 
partial, and half full. 

Recalled items were scored as correct only when 
the items were in their exact spatial location. Re- 
call was by means of a written response on a gridded 
sheet which corresponded to the 3 X5 display 
matrix. Response sheets were turned over before 
the start of a new trial. Reporting five items 
correctly under partial report gave a score of 100%; 
the same number of items under whole report gave 
a score of 33.33%, 

Results. One S failed to complete the second 
experiment— consequently, her data were not in- 
cluded in the analysis, The results were analyzed 
bya2X2X6 within-Ss analysis of variance in 
the first experiment and a 2 X2X5 within-Ss 
analysis in the second. The respective factors were 
report (whole or partial), treatment (repetition or 
nonrepetition), and Ss, Means for both experi- 
ments can be seen in Table 1, 

In Experiment I the analysis revealed a main 
effect of repetition, F (1, 5) = 13.52, p< .025, no 
effect of report, F (1, 5) — 1.75, 2 > .10, and no 
interaction (F < 1). The results of Experiment II 
mirrored those of Experiment I. Repetition was a 
main effect, F (1, 4) = 20.45, p < .025, there was no 
effect of type of report F (1, 4) = 1.59, b >.10, 
and there was no interaction (F « 1). 

A check on the possibility that the repeated 
matrix was easier to recall in and by itself yielded 
negative results. This was done by taking percen- 
tage recalled on the first presentation of the repeated 
slide and comparing it with percentage recalled on 


SHORT REPORTS 


the first random slide. In Exper I the nom 
repeated slide was recalled sl , although 
this difference was not significant ller's A @ 
= .76, b > 10, two-tailed. In I ent IL the 
repeated slide was recalled slight! r than th 
nonrepeated one, again not signin , Sandler's 
A (3) = .31, p > .10, two-tailed, « though the 
guessing probabilities favor digits tters by & 


factor of better than 2:1, 
Questioning after Experiment | 
Ss with the exception of one fail 
presence of a repeated slide. A ri 
data excluding this one S left the p 
unchanged. In Experiment II all Ss 
ing that a repeated slide was p! 


Discussion. "These results fail to ate those 
of Turvey (1967). However, there : lear differ- 
ences in methodology; in particula contrast 
ratio between pre and post fields on one hand 
and the stimulus field on the other nuch less 
than in Turvey's. In addition, th« tial-report 


» stimulus: 
ort condi- 


cue was visual rather than auditory 
array was much smaller, and the ful! : 


tion was an added factor, Finally one delay 
of 100 msec. was used in contrast to Tu: ey's three. 
Some of these differences can perhaps be expected 
to reduce the amount of iconic memory available. 

Knowing whether or not the slide was repeated 


did not appear to systematically influence thé 
manner in which the input was processed, although 3 
absolute recall was slightly higher in Experiment IL 1 
That is, Ss in Experiment I who did not know that 
a slide was repeated appeared to behave in the 


same fashion as Ss in Experiment 1l who did. 
Furthermore, all Ss showed the effect in Experiment 
I, with the exception of one tied score in full report: 2 
in Experiment II all Ss demonstrated the effect 
under both report conditions. E. 

Since type of report was not a main effect, it is 
unclear how these results bear on the question of 
whether iconic memory is susceptible to repetition 
effects. In general, the existence of excess sensory 
information is inferred from the superiority of 
partial over full report, thus it seems likely that 
these experiments show only that repetition affects 
à postattentional verbal component rather than an 
iconic one. 1 

Finally, it may well be that the Sperling (1960) 
technique is an inappropriate one for this particular 
Question. Reporting items from a visual display 
entails knowing the names of the items, Retrieval 
of such information is presumably from a long-term 
store—thus, in such a paradigm, it would always be 
Possible to argue that repetition effects are depen- 
dent on previous excitation of name units in verbal 
memory rather than upon a precategorical sensory 
trace. 

It is thus suggested that an uncritical acceptance 
of Turvey's (1967) results, and their implications, 
would be premature, 
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RELATIONSHIP OF INTERVAL FREQUENCY COUNT 
TO RATINGS OF MELODIC INTERVALS 


THOMAS B, JEFFRIES? 


Los Angeles City Schools, Los Augeles, California 


This study tested Ss’ validity in rating the 24 ascending and descending 
melodic intervals of the octave, using 116 Ss’ ratings based on their judgments 
as to how often they had heard or experienced each of these interval types in 
music. (The ratings had been obtained in an earlier study.) An interval count 
was taken of the total number of times each of the 24 interval types occurred 


in the printed scores of 53 current and standard popular vocal piec 


The 


interval ranks determined from the total count for each interval were corre- 
lated with the interval ranks determined from Ss’ mean interval ratings. The 
results indicated a high degree of validity for Ss’ ratings. Other aspects of 


the interval count were discussed. 


Several studies have provided evidence that Ss 
can quite accurately judge the frequency with which 
words and letters appear in the English language 
; Attneave, 1953; Howes, 1954). The purpose 
ol ihe present study was to determine if Ss could 
rate the frequency of occurrence of the different 
melodic interval types heard in music in much the 
same way as Ss rated the frequency of occurrence 
of words and letters in English as described in the 
studies cited above. The problem for the study was 
to determine Ss’ validity in rating the 24 ascending 
and descending melodic intervals within the octave. 
The Ss’ ratings were based on their judgments as 
to how often they had heard or experienced each 
interval type in music. These rating results were 
reported in an earlier study (Jeffries, 1972). The 
criterion measure of validity used in the present 
study was an interval count taken by the author of 
the frequency of occurrence of the 24 interval types 
found in several popular vocal music scores, The 
melodic intervals ranked according to frequency of 
occurrence determined from the count were cor- 
related with the intervals ranked on the basis of Ss’ 
mean interval ratings as to how often they had heard 
9r experienced each of the intervals in music in 
order to discover with what degree of validity Ss 
had rated the intervals, 

__ Another objective of the study was to determine 
if the ascending interval types occurred in signifi- 


í os Requests for reprints should be sent to Thomas B. Jeffries, 
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cantly greater numbers in the interval count than 
the descending interval types. 

Jeffries (1972) asked 214 college students com- 
prising two groups to rate the 24 ascending and 
descending melodie intervals of the octave individ- 
ually on a 10-point scale, The 98 Ss in Group 1 
rated the intervals presented on tape on the basi: 
of how familiar each interval sounded, while the 
116 Ss in Group 2 rated the same intervals on the 
basis of how frequently they thought they had heard 
or experienced each interval type in music. Table 
1 presents the familiarity and frequency mean 
ratings for each interval determined from the rating 
results. 

Results of the study showed that Ss agreed 
closely in their familiarity and frequency mean 
ratings; however, an analysis of variance disclosed 
a highly significant difference between the ratings, 
indicating that Ss had rated the intervals on two 
distinctly different bases, Results also revealed 
that Ss rated the ascending intervals as being 
significantly more familiar and frequently heard as 
a group than the descending intervals as a group. 

In the present study the ranks for the 24 intervals 
determined from results of the author's interval 
count were correlated with the ranks for the same 
intervals determined from the familiarity and fre- 
quency mean ratings given in Table 1. 

Method. "The interval count consisted of a tally 
of the number of melodic intervals falling under 
each interval type found in the printed scores of 53 
current and standard popular vocal pieces con- 
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TABLE 1 


INTERVAL FREQUENCY Count, F; 
MEAN RATINGS, AND R) 


MILIARITY -FREQUENCY 
OGNITION ERRORS 


Ratings 
Frequency Recognition 
Interval] (Trae errors 
Familiarity | Frequency 
D M2 1,255 6.05 (13). | 5.35 (11) -— 
A M2 847 7.10 (5) 6.78 (1) 60 (11) 
A m3 409 7.08 (1) | 6.56 (4) 136 (6) 
D m3 392 7.39 (4) 6.33 (5) > 
D m2 375 7.07 (6) 6.03 (7.5) -— 
A m2 368 6.26 (12) 5.44 (10) 64 (10) 
A M3 198 6.82 (8) | 6.03 (7.5) 16 (7) 
D M3 160 6.38 (11) | 5.34 (12) — 
A P4 152 7.59 (2) 6.76 (2) 112 (8) 
D P4 104 5.60 (16) | 4.72 (14) — 
A P5 88 6.90 (7 6.23 (6) 100 (9) 
D P5 73 6.59 (9) 5.97 o — 
A M6 45 751 (3) | 6.69 (3 191 (4) 
A mo 33 5.29 (19) 441 (16 279 (1) 
A P8 26 6.58 (10) 5.15 (13. 26 (12) 
D P8 26 5.45 (17) | 4.57 MEA -— 
D mó 26 523 Q0) | 421 Hn zd 
D M6 21 4.23 (24) 3.82 (22 — 
A m7 17 5.33 (18) | 4.07 (3) 213 (3) 
Aad 13 5.17 (21) 4.17 (18) 265 (2) 
Dm? 11 4,93 (22) 4.00 (21) — 
Dad 6 4.91 (23) 3.69 (23 — 
A M7 4 5.65 (15) 3.68 (24; 154 (5) 
D M7 1 5.99 (14) | 4.05 (20; — 
Note. Abbreviations: A = ascending, D = descending, 


M = major, m = minor, a = augmented, 


x - and P = perfect, 
Ranks are given in parentheses, 


tained in two song collections (Today's Hits, un- 
dated; Today's Most Popular Hits, undated). In 
selecting popular songs as the source material for 
the interval count, it was assumed that Ss who 
took part in the interval-rating experiment perhaps 
were more exposed to this type of music generally 
than to other music styles, 

Ascending and descending intervals were counted 
from the vocal melodies, not from the piano ac- 
companiment parts. For each song analyzed, the 
count began with the identification of the first 
interval of the melody; then successive intervals of 
the melody were identified and counted one by one. 
The songs were written in a variety of keys. Melody 
sections marked with repeat signs were analyzed 
only once. Intervals considered as enharmonics, 
for example, the diminished fifth and the double 
diminished fourth, were tabulated in the count as 
augmented fourths and minor thirds, respectively, 
for purposes of limiting the classifications to the 24 
interval types. Compound. intervals, those greater 
than an octave, were not included in the count. 
Repeated tones of the same pitch were tabulated 
as prime intervals. In all, 6,341 intervals were 
counted from the scores, 

Each of the two song collections contained nearly 
the same number of songs. To determine internal 
consistency reliability for the count, the 25 interval 
types, including the prime interval ranked on the 
basis of the total frequency count obtained for each 
interval from one song collection, were correlated 


with the 25 ranked intervals from the other song 
collection (p = .96). 


Results and discussion, 


d Table 1 lists the 24 
ascending and descending i 


ntervals ranked on the 
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basis of the total count obtained 
The prime interval count is no 
this interval was not rated in tht 

The frequency interval order o 
related with the ranked interval 
frequency count (p —.83, p < 
was also correlated with the ! 
order of Table 1 (p = .70, p < 
ence in correlation was not signi! 

In the experiment, Ss rated ti 
vals significantly higher in famili 
than the descending intervals. 
was to determine if the ascending 
more frequently than the desco: 
the music analyzed. The data of ! 
from Table 1 show the compari 
tween the like ascending and des 
A binomial test was employed to 
in count yielding z = 3.65, p < 
significantly more descending tha: 
vals counted in the popular songs. 

Ortmann (1937) reported on : 
count taken from 126 songs of the 
posers Schumann, Brahms, and | 
In all, 15,459 intervals were counte: 
songs, 44% of the intervals counted 
while 56% were descending. In 
Schumann and Strauss, 48% of the 
ascending, 52% descending. For th: 
three composers taken together, [ 
intervals significantly outnumbered 
intervals. Here, a binomial test 
p < .001, 

The interval counts discussed above do not agree 
generally with Ss’ ratings in which the ascending 
intervals were rated significantly higher than the 
descending intervals. This difference could be at 
tributed to many factors. In considering the int 
val count findings, perhaps taking more ined 
counts from a variety of composers’ works coul 
providea more definitive picture regarding frequency 
of interval occurrence than was provided from these 
two counts alone, From another standpoint, it 1$ 
also possible that the author's interval rating 


ach interval, 
ided because 
iment. 
le 1 was cor 
based on the 
The latter 
rity interval 
This differ- 
p > .05). 
nding inter- 
nd frequency 
juestion now 
als occurred 
intervals in 
? rearranged 
n count be- 
uy intervals, 
or differences: 
indicating 
nding inter- 


adic interval 
mantic com- 
urd Strauss, 
In Brahms's 
c ascending, 
he songs O 
tervals were 
ounts of the 

descending 
he ascending 
ve 8 = 1.09, 


TABLE 2 

FREQUENCY COUNT ror LIKE ENDING AND 

Interval x 

Ascending 

M2 847 

m3 409 

m2 368 

M3 198 

P4 152 

PS 88 

M6 45 

mó 33 

Ps 26 

m7 17 

a4 13 

M7 4 


t — perfects 
Note, Abbreviations: M = major, m = minor, P = P® 
anda = augmented, 


NAT 


icked some sensitivity in measuring 


procedu 

the deg Ss’ familiarity and frequency experi- 
ence wit : intervals. Introducing refinements 
into the esting procedures might produce rating 
results e in agreement with the interval count 
findin s given than was found in comparing the 
inter ount results with the mean interval ratings 
given » Table 1. Both the questions of interval 
fre cy count and the interval-rating procedures 
new urther study. 


data provide some evidence that familiarity of 
m > intervals may have some bearing on interval 


le vz. Table 1 shows the total number of errors 
in iorval recognition for each of the 12 ascending 
n lic intervals made by 24 Ss, these data taken 
f n earlier study (Jeffries, 1967, p. 189). Table 


ts the same ascending intervals ordered accord- 
i o results of the interval count. Correlating 
two interval orders produced p = — .53, p < 
using a one-tailed test of significance, with these 
indicating a significant inverse relationship 
tween interval count and rate of error. The 
gher was the count for particular intervals, gen- 
erally, the lower the error rate for Ss learning to 
identify these intervals. Although the correlation 
figure above is not particularly high, it does provide 
some basis for considering further experiments to 
se effects that familiarity may 
have on interval learning. 
Jn summary, the results showed a close relation- 
pD between the interval count findings and .Ss' 
i -rval frequency ratings. This may point to the 
p sibility that people can judge quite accurately 
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the frequency with which they hear the various 
melodic intervals in music. 

Significantly more descending than ascending 
intervals were counted in the popular songs as well 
asin the art songs, These findings are not in agree- 
ment with Ss’ ratings that the ascending intervals 
were more familiar and frequently heard than the 
descending intervals. 

Of course, a host of questions was not touched 
upon in the study, such as those relating to the 
resolution tendencies of tones, melodic intervals 
heard in cadences, and the like. No doubt many 
such factors in one way or another influence the 
recognition of tones in sequence. While such phe- 
nomena may well be of interest to this study, none 

yas considered here because of the limited nature 
of the problem. Certainly many of these questions 
call for investigation. 
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THE TIME COURSE OF LATERAL ASYMMETRIES IN 
VISUAL PERCEPTION OF LETTERS! 


ARNOLD WILKINS ? anp ANNE STEWART 


University of Sussex, Brighton, Sussex, England 


A single letter presented at fixation was followed after a short or long interval 


by a second letter presented in the right or left visual field. The 
by a manual response whether or not the letter names were the s 
letter pairs with the same name and case, there 


indicated 
me. For 
were fewer errors at short 


intervals when the second letter in each pair was presented to the left visual 
field, although at long intervals right visual field presentations were more 
accurate. For letters with the same name and different case, there were no 
field differences at short intervals, but at long intervals there was a right visual 
field superiority. It is proposed that in the process of letter perception a pre- 
liminary nonverbal code is available in the right hemisphere until superceded 


by a verbal code in the left hemisphere. 


Cohen (1972) and Geffen, Bradshaw, and Nettle- 
ton (1972) found that the matching of letter pairs 
al Ate!) found tl 
Se This work was conducted while the first author held a 


$ tet Research Council studentship. 
*quests for reprints should be sent to A. J. Wilkins, 


with the same name and shape is faster when the 
letters are presented to the left visual field than 
Department of Neurology and Neurosurgery, Montreal Neu- 


tological Institute, 3801 University Street, 
Ce y et, Montreal 112, 


TABLE 1 
ERRORS AND LATENCIES FOR SAME axo DIF 
3 


Stat 
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ARITY, INTERLETTER INTERVAL (ILI), V! 
SECOND LETTER (VF), AND RESPONDING HAND 


ERENT RESPONSE 


AS A FUNCTION 
UAL FIELD OF 


Stimulus condition 
Same case Differ 
Measure Response condition Tyr ILIO 
Short ILI Long ILI Short ILI 
Left | Right | Left | Right | Left | Right 
VF VF VE VF VF 
Error Same 
(%) Right hand (Group 1) 0.0 9.4 10.9 0.0 14.1 
Left hand (Group 2) 0.0 78 7.8 0.0 7.8 
Different 
Right hand (Group 2) 1.6 34 34 1.6 94 
Left hand (Group 1) 0.0 0.0 31 16 1.6 | 
Latenc; Same | 
Yee Right hand (Group 1) | 732 735 667 707 783 716 
correct. Left hand (Group 2) 754 737 686 743 697 700 | 
responses | Different | 
(in msec.) Right hand (Group 2) | 649 685 681 657 661 706 60 
Left hand (Group 1) 799 749 730 763 757 168. | 


when they are presented to the right. They also 
showed that matching of pairs with the same name 
and different shape is more rapid for presentations 
to the right visual field than to the left. Since each 
visual half-field projects to the contralateral cerebral 
hemisphere, the above findings suggest that letters 
with the same name and shape are matched in the 
right hemisphere in terms of a spatial code, whereas 
pairs with the same name but different shape are 
matched in the left hemisphere on the basis of a 
verbal representation. 

Using a memory drum, Posner and Keele (1967) 
presented pairs of letters successively and varied 
the interletter interval. At short intervals pairs 
with the same name and shape were matched faster 
than those with the same name and different shape, 
but at intervals of 1.5 sec. the two types of letter 
pairs were classified at the same speed. Posner and 
Keele interpreted their results as indicating that at 
long intervals all matching was performed in terms 
of a verbal code, the spatial code having been 
allowed to decay. If their interpretation is correct, 
it is conceivable that a visually presented letter is 
available initially as a spatial code in the right 
hemisphere until such time as this code is translated 
into a verbal code in the left hemisphere. The 
following experiment provides evidence to support 
this view. 

Method. Sixteen right-handed female students at 
the University of Sussex served as unpaid Ss. They 
were unaware of the purpose of the experiment. 
The Edinburgh Handedness Inventory (Oldfield, 
1971) was administered after the experiment and, 
on the basis of this questionnaire, all Ss were 
strongly right handed. 

The stimuli were presented in a three-field. 
tachistoscope with fields of luminance 34 cd/m?, 
10.2 cm. high and 15.2 cm. wide, viewed binocularly 
ata distance of 54cm. The fixation field, which was 
illuminated continuously except when one of the 


stimulus fields was on, contained a 
agonal cross, 2 mm. high, at the cent 
The Ss were instructed to fixate the cro 
each trial, and the importance of thes 
was stressed. The stimulus fields each 
single letter approximately 4 mm. high, printed 
black Letraset 14 pt. Helvetica Light. 1n the fi 
stimulus field the letter was positioned in the cen 
of the field, and in the second field the letter aps 
peared 28 mm. (approximately 3° of visual angle 
to the right or left of center, 1 
A trial began when E prepared S by counting 
“Three, two, one," and, on the count of “One, 
triggered the onset of the first stimulus field. l€ 
tachistoscope was wired so that both stimulus field: 
lasted 100 msec., separated by an interval of eith 
50 or 990 msec. The .S was required to respond 
one key if the letters had the same name and on 
second key if the names were different. 
latencies were measured from the onset 
second stimulus field, and all errors were recorde 
Half the Ss (Group 1) used the index fingerof t 
right hand for a same response and the index fin 
of the left hand for a different response. The fes 
maining Ss (Group 2) used the reverse confi uratione 
Sixteen conditions resulted from the combinatig E 
of (a) the two interletter intervals, (b) the similari d 
or dissimilarity of the letter names, (c) the similari 
or dissimilarity of the letter cases, and (d) the tw 
visual fields in which the second letter of each pal 
could appear. The experimental conditions| are 
shown in Table 1, - The letters B, D, G, H, N, Q, 
R, and T, or their lowercase equivalents, were used 
once in each condition as the first letter of a Par 
so that the case of the initial letter was balanced 
across conditions, Each cell in Table 1 is thercfo 
based on eight observations from each of eight A 
The second letter in each pair was chosen as appre 
priate to the condition. For the conditions in which 
the letter names were different, the second lettet 


:roughoul 
struction: 


outained 
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was chos andom from the same population as 
the first the restriction that pairs such as 
"Dt" w honetic similarity, and pairs such as 
"rn," wi milar shape, were avoided. The 128 
letter p were presented after 10 practice trials, 
and th der of presentation was random and 
consta r all For half the Ss from each group 
the ini. ‘etter interval was switched so that pairs 
of let which were separated by a long interval 
for | he Ss were separated by a short interval 
for t! - remaining Ss. As a result, effects of inter- 
lett: terval cannot be attributed to possible con- 
Co! variation in stimulus discriminability. 

ts and discussion. The percentage error in 
ea« xperimental condition is shown in Table 1. 
A ilysis of variance of the error scores for same 
n ises over interletter interval, visual field, case 


«arity, and responding hand yielded a significant 

i ction between visual field and interletter in- 
4, F (1, 14) = 34.3, p < .001, MS, = .311; a 
ificant effect of case similarity, F (1, 14) = 8.50, 

< 05, MS, = .405; and a significant three-way 
teraction between case similarity, interletter in- 

:erval, and responding hand, F (1, 14) = 6.59, p < 

.05, MS. = .267. The analysis of variance is gen- 

erally considered a robust test, but in view of the 
highly skewed distribution of error scores, all the 
major effects obtained in the analysis were corrobo- 

rated by individual sign tests with Ss as N. 

For letter pairs with the same name and 
e.g., BB") there were fewer errors at short inter- 

iter intervals when the second letter in each pair 

1s presented to the left visual field (p = 004, sign 

|). At long intervals there were fewer errors 
when the second letter was presented in the right 
visual field (p = .004, sign test). Since the first 
kiter appeared at fixation and was therefore pro- 
jected to both hemispheres, these results suggest 
that at short intervals letter pairs which had the 
same name and case were processed in the right 
hemisphere on the basis of a spatial representation, 
whereas at long intervals a verbal match was per- 
formed in the left hemisphere. 

For letter pairs with the same name and different 
case (e.g, “bB"), there was no field difference at 
short interletter intervals (p = .754, sign test), 
although at long intervals fewer errors were made 
when the second letter was presented in the right 
visual field (p = .004, sign test), The absence of a 
right visual field advantage at short intervals is of 
interest because letter pairs with the same name 
and different case ostensibly required verbal pro- 
cessing at both short and long intervals. A right 
visual field advantage at both intervals might there- 
fore have been anticipated, Furthermore, inspec- 
tion of the error scores for these stimuli suggests 
that the significant three-way interaction obtained 
in the analysis of variance was partly due to the 
greater accuracy of the left hand at short intervals 
and the right hand at long intervals. Both this 
interaction and the absence of a right visual field 
advantage when the intervals were short point to 
the involvement of the right hemisphere, 1f the 
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retrieval of the name of the first letter was not 
always complete by the time the second letter was 
presented and the retrieval of two letter names was 
not able to proceed simultaneously, a nonverbal 
code of one letter would have to have been held 
(possibly in the right hemisphere) while the name 
of the other letter was retrieved. 

An analysis of variance of the error scores for 
different responses over interletter interval, visual 
field, case similarity, and responding hand yielded 
one significant effect, that of case similarity, F 
(1, 14) = 7.47, p <.05, MS, —.268. Evidently 
more errors were made when the pairs differed in 
case than when their case was the same. It is 
therefore possible that same and different judgments 
involved separate mechanisms (Egeth & Blecker, 
1971), and Ss occasionally responded different with- 
out retrieving the letter name if the letters were the 
same size and did not produce a spatial match. 

For each S the median latency of correct responses 
in each combination of the two levels of interletter 
interval, visual field, and case similarity was calcu- 
lated. The eight medians in each condition were 
then averaged, and the resulting means appear in 
‘Table 1. Analyses of variance based on the median 
latencies yielded three small effects, two of which 
tended to contradict the clear pattern which emerged 
in the analyses of errors. An analysis of the laten- 
cies for same judgments yielded a significant inter- 
action between case similarity and responding hand, 
F (1, 14) = 6.05, p < .05, MS, = 6,206; a similar 
anal for different judgments revealed an effect 
of case similarity, F (1, 14) = 7.78, p < .05, MS, = 
2,188, and a three-way interaction between inter- 
letter interval, visual field, and responding hand, /^ 
(1, 14) = 4.99, p < .0. 3,310. 

The effect of case similarity for different responses 
indicates that latencies were shorter for pairs of 
letters with the same case than for letters with 
different case, and this result is consistent with the 
view expressed earlier that letter size may have 
been occasionally used as a shortcut to the produc- 
tion of a different response. The interaction be- 
tween case similarity and responding hand may be 
attributed to the faster responses with the right 
hand for letter pairs with the same name and case 
and the faster responses with the left hand for 
letter pairs with the same name and different case. 
This effect and the three-way interaction between 
interval, visual field, and responding hand are open 
to many highly speculative interpretations. Instead 
of discussing these here, it will simply be noted that 
there was no evidence for a time-error tradeoff 
since the correlations between residual errors and 
latencies were positive (.22 for same responses and 
10 for different responses) Error scores were 
probably a more reliable index of performance than 
latencies, owing to the use of unpracticed Ss and 
the fact that the foreperiod included the variable 
interletter interval. Cohen (1972) and Geffen et al. 
(1972), who obtained field differences for latencies 
but not for errors, used a constant foreperiod, 
a relatively large number of trials, and, in the 
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case of Cohen's study, near-threshold stimulus 
durations. 

Although the apparatus did not admit observa- 
tion of eye movement, presentation of the first 
letter at the center of the visual field was probably 
sufficient to secure central fixation at the beginning 
of each trial. Drift in fixation may have occurred 
during the long interletter interval and if such drift 
occurred predominantly to the right, it would pro- 
vide an alternative account of the general superiority 
of the right visual field at long intervals. However, 
the differences in the field advantages which oc- 
curred at short intervals cannot be explained in 
terms of fixation drift. Furthermore, Geffen et al. 
(1972), who observed eye movements in a situation 
very similar to that of the present experiment, 
found that such movements were rare and that they 
were evenly distributed between the two visual 
fields. The occurrence of eye movement artifact in 
the present experiment was therefore unlikely. The 
development of consistent scanning strategies was 
also unlikely since Ss could never predict which 
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EFFECTS OF EVALUATION APPREHENSION ON MEMORY 
OVER INTERVALS OF VARYING LENGTH! 


RUSSELL G. GEEN è 


University of Missouri 


In a 2 X 2 X 3 factorial design, 240 female Ss 
Tor a single trial after being told that it was e 
evaluation of the list. The S was either obse 
the study trial. Recall was tested after 2, 15, or 45 min. 
after 2 min. for Ss who had been told th 
than for Ss told that the list was being 


finding indicates that evaluation app: 
that facilitates long- 
intervals. 


In a recently reported experiment, Geen (1973) 
found that female Ss who were observed during a 
single study trial in a paired-associate (PA) learning 
task subsequently showed poorer recall after 2 min. 
than nonobserved Ss, but superior recall after 45 
min. Since observed Ss also reported themselves as 
more "nervous" than nonobserved 5s, the results 
were interpreted as being consistent with the findings 
of others that arousal during trace formation facili- 
tates long-term retention, while hindering retention 
over relatively short periods (e.g., Kleinsmith & 
Kaplan, 1964). The results also conform to those of 
other investigators who report that the presence 

!This study was supported by Research Grant GS 2748 


from the National Science Foundation. 
*Requests for reprints should be sent to Russell Geen, 
Hall, University of 


Department of Psychology, McAlester 
Missouri, Columbia, Missouri 65201, 
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at their intelligence was being evaluated 
tested, but superior after 45 min, This 
rehension creates a condition of arousal 
term memory but hinders memory over relatively short 


visual field the second letter wo 
interval would occur, and whether 
be the same or different shape. 

In conclusion, the substantia 


cupy, which 
etters would 


ences in the 


lateral asymmetries at short ig intervals 
suggest that in the course of le »erception à 
nonverbal code is available in th hemisphere 
until superceded by a verbal in the left 


hemisphere. 
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studied a paired-associate list 
ither a test of intelligence or an 
rved or not observed by E during 
Recall was poorer 


of observers produces increased arousal level (e: 
Cottrell, Wack, Sekerak, & Rittle, 1968). Geen 
implied that being observed causes increased arousal 
by eliciting an emotional response associated with 
apprehension over being evaluated by the observe! 
The present experiment was designed to vary obser- 
vation and evaluation independently of each other 
in an orthogonal design. Half of all Ss were informe! 
that they were being evaluated and half were non 
In addition, half of all Ss were observed by E during 
the PA study trial and half were not. All Ss were 
then tested for recall after either 2, 15, or 45 min. 
Method. Two hundred and forty female under- 
graduates served as Ss in order to earn points 
applicable toa final grade in a beginning psychology 
course. The Ss were selected at random from a 
lists and scheduled by telephone. Scheduling ne 
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arranged t one ,5 arrived at the waiting room 
eyery 15 n Selore meeting each S, E assigned her 
to 1 of t! ' conditions by referring to a table of 
random bers. The E was a female approxi- 
mately t! ime age as S. 

Afte climinary introductions, Æ led S to a 
table i: experimental room on which had been 
placed alayette memory drum. After explaining 
that tl ‘periment involved "'single-trial learning," 
Einf d S that a PA list consisting of six syllables 


and iated digits would be shown once and that 
reci uld be tested later. To half of the Ss 
(ev: n condition), E then said that good recall 
in | irning has been shown in several studies to 
be ity correlated with intelligence and that the 
e nent was in fact one of a number of pilot 
st es being carried out to assess intelligence by 
tl means, To the remaining Ss (no-evaluation 
co dition), E said that the experiment was designed 


a: a test of how certain stimuli are recalled better 
ton others and that the session in which S was 
‘ng run provided for evaluation of one such list. 

The E then turned on the memory drum. A 
period of 10 sec. elapsed before the word READY 
appeared in the window. Two seconds later the 
first stimulus term appeared. The stimuli were six 
syllables of 0% association value (rov, CEF, QAP, 
JEX, LAJ, and DAX) first used by Kleinsmith and 
Kaplan (1964) paired with the digits 2-7. Each 
syllable appeared alone for 2 sec., followed by the 
yllable-digit pair for 2 sec., followed by a 2-sec. 

lank interval before the appearance of the next 

vilable. Two seconds after the final syllable-digit 
ir, the word stop appeared. The S had been in- 
sivneted to turn off the drum at this point. During 
tue study trial for one half of the Ss (observation 
condition), E stood just to the left of S and watched 
as 5 was presented the syllable-digit pairs. With the 
other half of the Ss (no-observation condition), Æ 
stood beside S at about the same distance as she 
stood in the observation condition, but kept her back 
urned to S as she busied herself with some papers 
on a desk.* 

When S had turned off the memory drum, Æ led 
her to the waiting area outside the experimental 
room and informed her that she would be called 
back when it was time for the recall trial. The E 
began timing the recall interval from the moment 
5 was seated in the waiting room, Each S was then 
given a short adjective checklist on which she was 
asked to mark each word that described her state 
of feeling at that time, on the ostensible grounds that 
emotions have effects on learning. In addition to 
several distractor items, the checklist contained the 
words RELAXED, FEARFUL, JUMPY, and CAREFREE. 
This was included in an attempt to obtain an inde- 
pendent measure of evaluation apprehension. The 
E collected the checklist when S had completed it 
and then informed § how much longer she would 


have to wait before the recall trial. In order that 
AONE RR ry 

. Geen (1973) found that the mere presence or absence of E 
either during as study tríal or the test trial had no effects 
Ou S's recall. Accordingly, E was present with S during both 


trials in the present study. 
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rehearsal of the items from the PA ta: k might be 
avoided during the recall interval, each S was then 
given a questionnaire on vocational interests, pre- 
sumably as part of a pilot study for another experi- 
ment. The Strong Vocational Interest Blank for 
Women was used. Depending on the length of her 
wait, S was given either the first part, the first two 
parts, or the entire blank.* When the recall interval 
was over, E interrupted S, took the questionnaire, 
and led S back to the experimental room. The Æ 
next turned on the memory drum, instructed S to 
get ready, then watched § and recorded her re- 
sponses. After a blank interval of 5 sec., the first 
syllable appeared. Each syllable appeared for 2 s c 
followed by a 4-sec. blank interval during which 
time .5 responded. The syllables appeared in the 
same order as in the study trial. After the recall 
trial had ended, E interviewed each .S to discover 
whether the experimental deceptions had been suc- 
cessful. The S was asked to state whether she 
attached any significance to whether or not E had 
watched her during the study trial, whether she 
believed that intelligence was being tested, and 
whether the amount of time spent waiting had any 
effect on recall. Evidence from S's answers that 
she had formed any hypotheses consistent with 
those of the experiment was considered grounds for 
classifying S as “aware” of the deceptions. The 
E then fully explained the true nature of the study 

and included a statement of the major hypotheses 
being tested. ‘ y p 

Results and discussion. 
aware of the true nature of the study. 

Significant differences across conditions were 
found in the number of Ss reporting themselves to 
be relaxed. No such differences were found in the 
case of the other adjectives. With an n of 60 in each 
group, the numbers of Ss in each checking the word 
relaxed on the checklist were: no observation -no 
evaluation, 42; observation-no evaluation, 28; no 
observation-evaluation, 10; observation-evaluation 
20. This pattern of responses would not have been 
expected on the basis of chance, oe AON 4.96. 
b < .05, Yates's correction applied. We thus Pd 
some evidence that being evaluated made Ss Je: > 
relaxed than they otherwise would have been 

Figure 1 shows the number of digits recalled by S 
in the 12 conditions of the experiment, An aek 
of variance revealed that two first-order TUR KE É 
were significant, the Evaluation X Time, F Q 228) 
= 13.39, p < .01, and Observation x ine 


No S reported being 


F (2, 228) - 3.24, PES 087 interactions, Th 
second-order interaction was not significant, F 2. 
228) = 2.34, p < 10. No other inter. ; Q, 


action or maj 
effect approac! nificance Dur nain 
e pr hed nificance, Post hoc ¢ tests 


further showed significant differences betwee; 
evaluated—2-min. and not-evaluated—2-min pat 
+ &roups, 


4 Most of the time, more than on 
E at [ox time, 5s er that the 
be relatively standardized during i 
structed each S not to communicate oine el] int 
the waiting period but to concentrat. 
tionnaire. The Ss were also seated i 
a way that they did not have face. 
other. 


€ on filing out i 
in the waiting room 
-to-face contact 
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t (38) = 2.88, p < .01, and the evaluated—45-min. 
and not-evaluated—45-min. groups, £ (38) = 2.26, 
P < .05. The difference between the observed— 
2-min. and not-observed—2-min. groups failed to 
attain statistical significance, £ (38) = 1.92, p < .10. 
The overall pattern of comparisons among means 
indicates that Ss who believed that they were being 
evaluated manifested superior recall to that shown 
by nonevaluated Ss after a relatively long waiting 
period, but poorer recall following a short period.* 
The mere fact of being observed during the study 
trial, independent of being evaluated, produced only 
à weak crossover effect of recall over waiting periods, 
with practically no difference as a function of being 
Observed or not observed after 45 min. The findings 
thus corroborate the author's previous report that 
socially produced arousal promotes memory over a 
relatively long period but hinders it over a relatively 
short period, and also indicate that the important 
variable in the process is apprehension over being 


evaluated. The delineation of evaluation appre- 
SES Eis 

*Of the 120 Ss 
a lack of belief that they we 


during 
is of the 
d from 


Among un- 
the mean numbers of digits recalled at 2, 15, and 


us revealing 


primarily 
that they 
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Some investigators h: 
reaction time task 
others have not. 
proposed. According to one, 
response retrieval which mani 
imbalance which follows 
to the other, the effect resul 
strength across response c. 
the availability of a mediat 
prior experiences to all stimuli 
the present study 


containing 


Hawkins, MacKay, Holley, 


Friedin, and Cohen 
(1973) have shown that choice 


reaction time (RT) 
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rst of these two interpretations. 


is inversely related both to stimulus (S) and responsi 

relative Írequency when these variables 
independently manipulated within a 2:1 
mapping task. Both effects were found to increà 
with declining S-R compatibility. These rest 
were interpreted in terms of an informal statistica 
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decision which locates both stimulus and 
response ive frequency effects within the re- 
sponse r« val, or selection, process. More specifi- 


cally, st lus relative frequency effects were 


attribute: to frequency-related differences in 
strengt! association (and thence, retrieval time) 
across ci R pairings, and response relative 
freque effects were attributed to frequency- 
relate ias in response retrieval. However, an 
inspi n of the particular S-R codes used in the 
Haw's et al study raises a serious question 
reg 1g the appropriateness of this analysis.? 
Ur il compatibility conditions, the two stimuli 
cov'only assigned each vocal-naming response 
were the upper- and lowercase forms of a single 
le One of the key assumptions underlying the 
model proposed by Hawkins et al. was that the 
rejuired response was directly associated with each 
of rhe nominal stimuli to which it was igned. 


It is equally plausible, however, that this association 

mediated by the familiar name common to the 
o stimuli forming each response category. Were 
is the case, the response relative frequency effect 

bserved by these investigators may not index 
retrieval bias but rather frequency-related differ- 
ences, across response categories, in the strength 
of association between mediating and required re- 
sponses. Accordingly, the response relative fre- 
quency effect might be expected to attenuate or 
disappear under conditions where stimuli within 

itegories do not share a common familiar name (cf. 

ertelson & Tisseyre, 1966; Hawkins & Friedin, 
1 Hawkins, Thomas, & Drury, 1970). The 
present experiment constituted a test of this 
implication, 

Method. The Ss were 14 men and 10 women 
enrolled in introductory psychology at the Uni- 
versity of South Florida. All Ss had normal or 
normal corrected vision, and all were naive with 
respect to RT research, Extra course credit was 
given for participation. 

Stimuli consisted of the letters D, d, Q, q, H, h, 
k, b, and p (Tactype No, 5418). The manner of 
stimulus presentation was as reported in Hawkins 
et al. (1973). 

All Ss were pretested on a 40-trial practice task 
in which a colored nonsense syllable, either green 
(p 50) or orange, was presented on each trial, 
The vocal responses during practice were the 
common names of the two colors. Based on per- 
formance on this task, Ss were blocked into slow 
(N = 12) and fast responders. One-third of the 
Ss in each block were assigned on a random basis 
to one of three levels of S-R compatibility. The 
task, under all levels of compatibility, was to vocalize 
one of three responses to each of six perceptually 
distinct stimuli, Stimuli and responses were paired 
in 2:1 fashion. The two stimuli within one response 
Category were presented with relative frequencies 
-50 and .10, respectively; those within another 
response category both were assigned a relative 
frequency of .10; and the remaining two stimuli were 
presented with a relative frequency of .10. These 
latter two stimuli, together with their respective 
eR eU. 


? The authors wish to thank Steven W. Keele and Lester E. 
Krueger who independently alerted us to this possibility, 
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response, were distractors, included to increase 
overall task difficulty and to minimize the likelihood 
Ss would employ a “positive set" strategy of the 
type observed by Hawkins and Hosking (1969) in 
two-response choice RT tasks. Of primary interest 
were the high- (HH) and low-frequency (LH) 
stimuli requiring a common high-frequency (.60) 
response and the two low-frequency (LL) stimuli 
requiring a common low-frequency (.20) response. 

All Ss were tested on a total of 240 trials during a 
single experimental session lasting about 1 hr. and 
occurring within 3 wk. of the practice session. 
During the experimental session, the identity of, 
the correct relative frequency of, and the response 
appropriate for each stimulus were made constantly 
available to S on an index card. The intertrial 
interval was 3.5 sec., with a warning tone of 1-sec, 
duration occurring immediately prior to stimulus 
onset. After each correct response, .S was informed 
of his RT in milliseconds, Following each incorrect 
response, S was told that an error had been com- 
mitted. Stimuli to which S incorrectly responded 
were not subsequently reported. Short breaks 
were given following each 40-trial block. Under 
all experimental conditions, Ss were instructed to 
respond as rapidly as possible while at the same time, 
inimizing errors, 

Eight Ss were tested under the high-compatibility 
condition where they were required to vocalize the 
namecommonly associated with each of the stimulus 
letters. Each of the four stimuli, D, d, Q, and q, 
served as the high-frequency alternative for two 
of the eight Ss. The two stimuli H and h served 
as distractors. Eight Ss were tested under a low- 
compatibility condition where they were required 
to respond Z to stimuli Dand d, V to Q and q, and J 
to distractors H and b. D, d, Q, and q each served 
as the high-frequency alternative for two of the 
eight Ss. The remaining eight Ss were tested under 
a low-compatibility condition where the response Z 
was required to either D or k, the response V was 
required to either Q or b, and the distractor re- 
sponse J was required to either H or p. As before, 
D, k, Q, and b each served as the high-frequency 
alternative for two of the eight Ss. 

Results and discussion, Table 1 gives the mean 
correct RT and percent errors (in parentheses) 
under each of the conditions of the experiment. 
These data were subjected to a mixed model analysis 
of variance with compatibility (high vs. low; vs. 
lows) and blocks (fast vs. slow Ss) treated 
between-5s variables and S-R pairing (HH vs. LH 
vs. LL) and level of practice (first through fourth 
block of 60 trials) treated as within-Ss variables. 
The overall analysis revealed that S-R compati- 
bility, F (2, 18) = 101.99, S-R pairing, F (2, 36) 
= 116.51, the Compatibility X S-R pairings in- 
teraction, F (4, 36) = 34.18, practice, F (3, 54) 
= 29.20, the Practice X Compatibility interaction, 
F (6, 54) — 8.06, the S-R Pairing X Practice inter- 
action, F (6, 108) — 7.78, and the second-order S-R 
Pairing X Compatibility X Practice interaction, 
F (12, 108) — 5.01, were all significant sources of. 
variance (p < .001). 

It will be recalled that the primary objective of 
the present study was to determine whether the 
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TABLE 


MEAN Cnoick REACTION TIME AND PERCENT ERRORS (IN PARENTHESES) Aci 
CONDITIONS AND LEVELS OF PRACTICE 


Stimulus-response compatibility 
Trial block High (Gain ELA | ' M 
Sa E 
HH LH LL HH PHOT IL | mu i LL 
601 648 652 754 866 1013 819 1670 
: (5) (1.8) (2.0) (1.1) (8.3) (20.7) (1.9) (30.1) 
< 573 604 602 678 775 916 786 1276 
* (4) (1.5) (1.8) (1.0) (6.8) (16.9) (2.1) (25.0) 
555 593 605 655 734 870 719 1164 
D» [25] (1.4) (2.0) (7) (5.4) (15.0) (1.6) (23.3) 
563 581 598 668 706 821 696 978 
I5 (.6) (1.5) (1.8) (.9) (6.0) (16.3) a | (19.8) 
573 607 614 689 771 905 755 1272 
SNR C5) (1.6) (1.9) (.9) (6.6) (17.2) | (a) ( (24.6) 
see the two ow erue e emule (20) repose io high-frequency 1 0) responie LN 
response relative frequency effect attenuates or dis-  .005. This interaction reflects the fac! that the RT 


appears when stimuli within response categories — differer 
do not share a common name in S's prior experience, 
Two analyses were performed which are relevant 


nce between LH and LL pairings declined 
with practice under the low-compatibilit y condition 


a where stimuli within response categories did not 
to this question, First, Fisher’s least significant share a common name, but not under the remaining 
difference (LSD) test, with the criterion of signifi- two compatibility conditions. Theanti ipatory bias 


cance set at .05, was used to assess the occurrence of — interpretation implies, to the contrary, that since 
response relative frequency effects under each of Ss were given equal practice on LH and LL pairings 
the three compatibility levels, This analysis re- — under all compatibility conditions, the practice- 
vealed that choice RT 


RT increased significantly be- — related declines in RT for these two S-R pairing 
tween LH and LL S-R pairings under both low- types should have remained equal across trial blocks, 
compatibility, but not under the high-compatibility, 


at yielding no decline in the response relative frequency. 
conditions (LSD = 34 msec.). Second, a comparison effect under any of the compatibility conditions. 
of the magnitudes of the response relative frequency The present results make it clear that the response 
effect across the two low S-R. compatibility condi- relative frequency effect is a complicated phe 


tions revealed that the effect w: nomenon which may prove to be difficult or im- 


eal possible to adequately characterize in terms of a 
prising cach Tesponse category did not share a single factor model of the type proposed by Hawkins 
common familiar name, F (1, 36) = 7.85, < 01. et al. (1973). 

These results would seem to be consistent with the 

Hawkins et al. (1973) view that the response relative REFERENCES 
frequency effect indexes anticipatory bias in re. BERTELSON, P, 
sponse retrieval and that such anticip. 
ments represent a feature of the 


s uS SSEYRE, F. Choice reaction time as 9 
A function of stimulus versus response relative frequency 

atory adjust- occurrence. Nature, 1966, 212, 1069-1070. 1 ency. 
information- Hawkins, H. L., & Frienin, B, D. The relative frequ 


i ; ffect ibility. ic Science, 19721 
processing sequence which extends across a variety 28. Sees Compatibility, Psychonomic Scie 


of speeded recognition tasks. However, a further PARES HU 3i HO SEINE. Kc Sumus probable 
E ME an a ime. 

feature of the present data indicates that such an Experimental Psychology, 1969, 82, 435-140, IN, 

account cannot, in and of itself, fully characterize EOM Coney MacKay, S. L. HoLLey, ERE 
> i i - D., & COHEN, S. L, tive fre y 

the findings qs here. Specifically, a subse- in choice reaction iA WI i Psychology 

quent partition of the variance in the original 1973, 101, 90-99. ual 

i B Hawkins, H, L., A JRY, K. B. Percepi 

analysis revealed that the magnitude of the Re- versus response be Gs B» & DR reaction time, Jour 

sponse Relative Frequency x Compatibility interac- of Experimental Psychology, 1970, 84, 514-517. 

tion declined with practice, F (6, 108) = 4.03, $ < (Received July 12, 1973) 
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OVERT REHEARSAL AND LONG-TERM RETENTION 


GARY F. N 


Ball Stat 


Recall scores from intentional and in 


JNIER,! JANE KESTNER,? JO ANN MEUNIER, anp DOUGLAS RITZ 5 


le University 


cidental learning groups were compared 


when rehearsal was made overt by having S write each stimulus as it appeared. 


Duration of writing time was varied 
of a stimulus was an index of rehea 
superior to incidental learners even th 


and the number of written replications 
al. Intentional learning Ss were clearly 
ough the groups rehearsed equally often. 


In addition, moderate increases in rehearsal time led to improved retention, 


but increases in rehearsal beyond this 


sentral aspect of most multistore models of 
żory is the notion that stimulus material is both 
intained in a limited capacity, temporary storage 

d transferred to a larger, more durable storage by 

rocess of reh l. And, generally, the greater 

amount of rehearsal given an item the greater is 

e probability that the item will be encoded into 
he more durable store (e.g., Atkinson & Shiffrin, 
1968). 

A testable implication of this conceptualization is 
that, all other factors constant, long-term retention 
of a given item should be a function of the amount 
of rehearsal it receives. Unfortunately, attempts 
to evaluate this statement have not provided con- 
sistent results: Using a variety of paradigms, previ- 
ous research has yielded evidence both supportive 
(e.g., Rundus & Atkinson, 1970) and contradictory 
(Stanners & Meunier, 1969) to the implication. 

‘he present experiment, thus, is an attempt to 

urther clarify the relationship between rehearsal 

id retention. 

\ “rehearsal” in this experiment consists of a 
written replication of the stimulus item. Although 
there is, of course, some controversy over just what 
a rehearsal is and how it is to be measured, this 
definition does not seem inconsistent with other 
frequently used conceptions and does have the ad- 
vautage of making the rehearsal process overt. 

Method. The Ss were 130 students from intro- 
ductory psychology classes at Ball State University, 
who received extra course credit for participation 
in the experiment. 

Stimulus items were 24 29% association value 
trigrams selected from the Witmer (1935) scaling. 
Two additional trigrams selected from the same 
group served as practice items. The stimuli were 
projected individually by a Kodak Carousel pro- 
jector onto a screen located approximately 2 m. in 
front of the seated S. Height of the individual 
letters on the screen was 7.62 cm., and each was 
presented for 1.5 sec. The Ss were run in groups 
of from three to five members each. Each S silently 
viewed the trigram on the screen while it was 
presented. Termination of this visual stimulus was 
the signal for S to start writing the trigram on a 
ESE ST 


? Requests for reprints should be sent to Gary F. Meunier, 
Department of General and Experimental Psychology, Bali 
State University, Muncie, Indiana 47306. 

? Now at the University of Notre Dame. 
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level did not affect recall. 


sheet of paper from a booklet provided by Æ. 
Writing continued until a verbal command of “Stop” 
was given by Æ. This command was also the signal 
for S to turn to the next sheet of paper and get 
ready for the next trigram which was presented in 
approximately 2 sec. Following presentation of all 
24 stimulus items, S was asked to recall as many of 
the trigrams as he could. This recall was written 
and terminated after 5 min. 

The Ss were divided into two groups dependent 
upon instructions given before the experiment, All 
were told that the intent of the experiment was to 
measure speed and accuracy of writing and how 
these changed with fatigue. In addition, Ss in the 
aware (A) condition were told that they would be 
asked to recall the trigrams after the experiment 
was over. No attempt was made to emphasize the 
importance of this recall over the writing task, 
"Those in the unaware (UA) condition were not told 
of the final recall until they had been presented each 
of the trigrams in the writing part of the experiment, 
Also, the duration of the writing task following each 
trigram was either 3, 9, or 15 sec. and was varied 
randomly with the constraint that each interval 
was presented equally often (eight times) within a 
on. All members of a given session were given 
either A or UA instruction, and a total of 65 Ss 
served in each of the two groups. 

Results. The results may best be considered in 
two parts, an analysis of the number of rehearsals 
of a trigram, and an analysis of the trigram recalls. 

One rehearsal resulted when all three letters of a 
igram were written in the correct serial order. 
results are presented in Table 1 where both 
means and standard errors are given for each of the 


TABLE 1 


ARD ERRORS (IN PARI 
RSALS PER ITEM AS 


AND STA: 
MBER OF REH] 
or RE 


Group 
3 9 15 
Aware Bt 2.42 GE 5.66 842 
(.05) (.10) (.16) 
Unaware 2,48 5.76 89 
(607) (.12) (.16) 
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i 


914 
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REHEARBAL INTERVAL 


Ficire 1. Proportion of items recalled as a function of 
rehearsal interval and S's awareness of final recall, (Abbrevia- 
tions: A = aware and UA = unaware.) 


six treatment combinations, An analysis of variance 
of these data confirmed the obvious conclusions from 
the table. The only significant effect was attribut- 
able to length of the writing interval with an in- 
creasing number of rehearsals given each item as 
the length of the interval increased, F (2, 236) = 
115.54, p < .001. The Ss in the A and UA groups 
did not differ in the number of rehearsals given nor 
in the number of rehearsals as a function of interval 
length. The similarity of the two groups is rather 
striking and implies that Ss in the A group did not 
deemphasize the importance of the writing task, 
In order to maintain consistency with the scoring 
of rehearsals, a correct recall was scored when all 
three letters of the trigram were recalled in the 
correct serial positions. The Ss were, however, 
instructed to recall the whole trigrams in any order 
they chose, and a graphical display of these results 
is presented as Figure 1. An analysis of variance 
confirmed the suggestions from this figure: Recalls 
were significantly higher for A than for UA Ss, F 
(1, 128) = 33.212, b. < .001, and recall probability 
changed with increasing rehearsal time, F (2, 256) 
= 8.530, 5. < .001, Thus, although recall was 
clearly superior for A Ss at each rehearsal interval, 
the same pattern of effects of the rehearsal intervals 
was found in both A and UA recalls, Increasing 
rehearsal time from 3 to 9 sec, increased recall sub- 
stantially, but additional increases in this time 
beyond 9 sec. did not materially effect recall. 
Discussion. Three aspects of these results seem 
particularly noteworthy. First, Ss rehearsed equally 
often regardless of instructions concerning the recall 
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task, Although it is possible t! 
rehearsing covertly (and thercl 
actual rehearsal time) this possib 
likely. The very short interv: 
mand to stop writing and th: 
next trigram would not allow m: 
rehearsal, and the equality of « 
A and UA Ss implies that A 
something in addition to writin 
of studies indicates that perfor 
when two activities are attemp 
Monty, Taub, & Laughery, 1965 
The second salient feature of 
cerns the relationship between an 
and retention. While the num! f 
increased by significant amounts v 
in rehearsal time, a corresponding ise in recall 
was found only when writing tim reased from 
3to9sec. The extra rehearsals all by lengthen- 
ing this time to 15 sec. did not prod increases in 
recall. While this breakdown in : ositive rela- 
tionship between memory and rehe ippears to. 
contradict a great deal of experin | literature 
(e.g,, Rundus & Atkinson, 1970), i not totally 
unprecedented. For example, Stanner> ind Meunier 
(1969) have also reported a limit tc ^e beneficial 
effects of rehearsal on memory, Mo l the studies 
that have reported a strong positi relationship 
between memory and rehearsal of individual items 
have not studied rehea intervals greater than 
10 sec, or retention intervals longer than 30 sec 
And, third, some process other than simple 
rehearsal must be invoked to account for recall 
differences between A and UA Ss. Although these 
two groups were nearly identical in terms of Tor 
hearsals given Ss yielded consistently superior 
recall scores. Thus, while the present experiment 
does not imply what this extra process may be, it 
does seem to clearly indicate its existence. 


reasing their 
loes not seem. 
cen E's com 
rance of the 
1e for covert 
hearsals for 
not doing 
irge number 
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IN CHILDREN'S PAIRED-ASSOCIATE LEARNING! * 


NIGHT * anp LEWIS P. LIPSITT * 


Brown University 


In a study of paired-associate learning in 80 third graders, both the mediated 
facilitation and interference effects previously demonstrated on List III by 
Wismer and Lipsitt were replicated under conditions in which the stimulus 
words of List III were replaced either with pictures of the stimulus objects or 
pictures of objects in the same class as the stimulus objects. The results indi- 
cated that third-grade children apparently have the ability to make implicit 


associations in a mediational chain. 


mer and Lipsitt (1964) demonstrated that 
fi -grade children exhibit both facilitation and 
i ference in a mediational paired-associate study. 
1 Ss were presented printed word pairs on three 
essive lists to arrange facilitation and inter- 
ference conditions on a within-Ss basis: (a) A-B, 
B-C, A-C for facilitation, and (b) A-B, B-C, A-C 
re-paired) for interference. One control (Ci) in- 
volved a new word in List II, so that the sequence 
was A-B, X-C, A-C, and another control (C2) in- 
stituted new word pairs on List III, Besides demon- 
strating the predicted facilitation and interference 
effects, error analyses revealed that the mechanism 
underlying both types of transfer was mediational 
in nature. 

Russell and Storms (1955) showed that adults can 
provide their own mediational links when these are 
not directly administered. The present study at- 
empted to determine whether young children can 
make implicit chains involving images and words. 
two levels of extended mediation were studied. In 
the first level, pictorial presentations (designated A’) 
cre introduced in List II in place of the printed 
words, e.g., instead of the word telephone appear- 
ing, a drawing of a telephone would appear. For 
Ss to respond to these pictures as if they were the 
printed words of List I presupposes an implicit 
nicdiational link between the image and the word. 

in the second condition, the List II picture (now 
called A”) was of a different object from A, but was 
in the same class as A. For example, if the stimulus 
word A was telephone, a picture of a microphone 
(A) was used in List IHI. This condition extends 
the mediational chain still further. 

Method, The stimulus words were typed on 4 X 6 
in. white cards, and the stimulus pictures were black- 
and-white line drawings on similar cards. Stimulus 
cards were presented manually, at a rate of 5 sec. per 
card, followed by cards containing both the stimulus 
and response words, On List III pairs, line draw- 
ings replaced the printed stimulus words. 


1 This study conducted at Brown University as a senior 
honor's thesis (Sims, 1965) by the first author under the direc- 
tion of the second. The authors wish to express their gratitude 
to Florence McGwin of the Summit Avenue School, Providence, 
Rhode Island, and Edward Staltare of the Pleasant Street 
School, Seekonk, Massachusetts, for their kind cooperation. 
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Each S was presented eight word-pair sequences, 
two sequences for each experimental condition— 
facilitation, interference, Ci, and C» The eight 
word-pair sequences were chosen for their high A-B 
and B-C associative strengths and low A-C associa- 
tive strength. Word-pair sequences were completely 
counterbalanced across Ss. For example, a bull-cow 
(A-B), cow-milk (B-C) chain appears as a List IIT 
facilitation pair in Group 1, as interference in Group 
2 where bull is paired with an extramediational 
word, and as a different type of control in Groups 
3 and 4, respectively (see Wismer & Lipsitt, 1964). 
Ss were 80 third-grade children, 40 in each 
t chain group (A^ or A"). Each counter- 
balancing group had 10 Ss, who were tested in- 
dividually in one session. Standard paired-associate 
learning instructions preceded 3 trials of List I, 3 
of List II, followed by repetitions of List III until 
Ss reached a criterion of 3 errorless trials (or 12 
trials were completed). Presentation order was 
randomized separately for each trial. 

Results. Table 1 shows that mean errors on List 
II varied with pair type. This effect was significant 
in a three-way analysis of variance with pair type 
asa within-Ss variable, F (3, 216) = 34,96, p < .01. 
As predicted, the mediated facilitative and inter- 
fering effects were greater in the A’ condition than 
in the A" condition, yielding a significant inter- 
action, F (3, 216) = 5.44, p < .01. Simple tests 
revealed that in Condition A" the facilitation pairs 
produced significantly fewer errors than Ci, £ (39) 
= 2.82, p < .01, but not fewer than Cs, t (39) 
1.70. Performance under the A’ condition on inter- 
ference pairs was significantly poorer than perform- 
ance on both controls, / (39) — 3.42 and 5.60, re- 


TABLE 1 


Mean ERRORS ON List III ror FACILITATION, 
INTERFERENCE, AND CONTROL CONDITIONS 
UNDER Two Li s OF EXTENDED 
IATION 


Extended mediation levels 


Condition 


Facilitation 
Control 2 
Control 1 
Interference 
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TABLE 2 


Percent Fixst-Triat Responses ON List II 
TO A’ AND. A” STIMULI 
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Condition A^ 


 reepalked | eal Other response 
B ie aA” (Correct | (labeled | $ TATUM 

response) | picture) | (including none) 
41.2 9,9 0 o 10.0 38.9 


Condition A” 


C re-paired| A Other Tesan 
B C | A’ | (Correct | (labeled | Other'respon: 

response) picture) (including none) 
64) 18 | 35 0 20.2 68.1 


spectively, p < .01. In Condition A”, facilitation 
pairs performance was again significantly better 
than performance on Ci, ¢ (39) = 2.22, p « .05, 
but not better than performance on Cs, £ (39) = AL. 
Interference-pairs performance was significantly 
poorer than Ca, £ (39) = 2.66, p < .01, but not 
poorer than Cy, £ (39) = 1.30. 

A significant Pair Type X Counterbalancing 
Group interaction, F (9, 216) = 2.36, p < -05, was 
due apparently to the relatively poorer performance 
on Counterbalancing Group 4, particularly on Ci 
and interference pairs, 

Analysis of first-trial errors (Table 2) indicated 
that in the A' condition the most frequent response 
to A’ was B, while in the A" condition the most 
frequent first-trial responses were either to name the 
stimulus or refrain from responding. Moreover, the 
A' condition elicited more than five times as many 
C (mediated) responses than A", an effect which 
undoubtedly contributed to the greater facilitative 
and interference effects under Condition A’, 


Discussion. Third-grade child: 
the ability to make implicit asso: 
tional chain. All results from th 
cate that they can make associat ic 
and its pictorial representation a 
tion to complete a mediational c! 
most frequent response of B to th 
of A’ suggests that they had asse 
which had previously been paired 

The results of the A" conditio: 
as revealed by the Chain Condi 
interaction, indicated that as the 
is lengthened, both facilitative 
transfer are attenuated. Further: 
sponse to A" was most often a la! 
rather than B or C. 

The difficulty of obtaining a p! 
evident. Wismer and Lipsitt's (19 
suggested that C. more clearly d 
tween facilitation and interference 
study, however, neither condition > 
cant difference between facilitation 
ing that even in the absence of exp 
training Ss may have been able to 
plicit B link, thus facilitating A~ i 
C: was significantly different from boi 
and interference in Condition A’, it 
sidered, at least in this study, the b 
Considering only this control, facilit; 
interference was obtained under Condit 


ndition ind 
tween a wo 


Second, , 

sentation 
i A’ with A, 
B. 
xpected and 


ide the im- 
ing. Since 
acilitation 
ay be con- 
ter control. 
on but not 
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At the empirical level, the present studies 
were concerned with the role that an associ- 
ation between two words plays in the recog- 

‘ition performance for the pair. At the 
héoretical level, the studies were prompted 
a theory which states that recognition 
cisions are mediated by frequency infor- 
mation (Underwood, 1971). This theory 
has evolved from work in which the unit of 
analysis was the single word ; the associa- 
tion between words has received little 
attention in the development of the theory. 
For the single word, the theory, as applied 
to recognition decisions between old and 
new words, simply assumes that these 
decisions are made by discriminations of 
phenomenal frequency differences between 
the old and new words. Associations enter 
this formulation in only a vague way. It 
is known that the background frequency of 
words (as indexed by word counts) has 
little if any effect on judgments of situa- 
tional frequency, i.e., frequency induced in 
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The purpose of the eight experiments was to ass 
between two words played in recognition decisions. 
associations established in the laboratory indicated that the a 
playing a small role, but that recognition performance on pairs of words was 
highly predictable from frequency information. 
iated words indicated that the strength of the a: 
al variable in recognition performance. 
Some unexpected findings included criterion 
; judgments as compared with recognition 
decisions, and criterion ‘differences in recognition tests on homonym pairs as 
compared with other classes of word pairs. 
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Northwestern University 


ess the role which associations 
The evidence on weak 
ciation was 


However, the use of strongly 
ociation per se was not 
A post hoc expansion of fre- 


the laboratory (Underwood, Zimmerman, 
& Freund, 1971). This must mean that 
the frequency information induced in the 
laboratory is kept distinct from background 
frequency and, therefore, in the vague 
sense noted above, situational frequency is 
somehow associated with the laboratory. 
context. 

The broad perspective of memory into 
which frequency theory fits assumes that 
various types of information may consti- 
tute a memory for an event. (Underwood, 
1969). Frequency information is simply 
one of these types, but a type that assumes 
(by theory) the major function in recogni- 
tion decisions involving single verbal units. 
The theory has never presumed that other 
types of information will always be irrelev- 
ant for recognition. Indeed, it has been 
concluded that when frequency information 
becomes invalid for recognition decisions 
the subject will turn to other types of 
information in memory to mediate his 
judgments (Underwood & Freund, 1970). 
A recent study by Broder (1973) shows a 
serious breakdown of the theory when 
frequency information and associative in- 
formation are in conflict. So, then, fre- 
quency theory can only be viewed within a 
larger perspective; the theory deals with 
one type of information in memory and 
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its role in recognition performance. It is 
a part of the theoretical-empirical task to 
establish the limits of the theory, limits at 
which the theory is no longer useful in 
predicting the outcome of recognition 
studies. The studies to be reported were 
undertaken to determine if associative 
information between words leads to a 
breakdown of the predictions based only on 
frequency information. 
There is a rapidly growing literature 
dealing with word recognition and the 
associati ve relationships among the words. 
"These studies are often spoken of as studies 
dealing with associative context. The 
outcomes of these experiments are pre- 
sumed to be relevant to the question of 
whether or not recognition involves re- 
trieval. The idea is that if an association 
influences recognition, retrieval processes 
are involved. The central method of study, 
with several variations, is that of removing, 
on the test, the items which occurred with 
the target word during study. If per- 
formance falls (as compared with the 
condition in which the context remains on 
the test) it is said to implicate the loss of 
retrieval cues. Context may be added also, 
but if this influences recognition, the 
theoretical interpretation is not clear with 
regard to the role of retrieval in recognition. 
An examination of the literature shows no 
consistency in the effect of associative 
context on recognition. This confusion 
may be sampled. Tulving and Thomson 
(1971) reported that both the addition of 
context and the removal of context de- 
graded recognition, Subsequently, Thom- 
son (1972) found little effect of context 
addition. "The data of Cofer, Segal, Stein, 
and Walker (1969) showed that removing 
context caused a fall in recognition scores. 
Light and Carter-Sobel] (1970) found that 
context change did influence recognition 
(performance fell) but in a further study 
Light (1972) reported that recognition for 
words embedded in sentences for study but 
tested alone gave recognition scores that 
were quite comparable to those obtained 
when the word was studied alone and tested 
alone. Ellis and Shumate (1973) showed no 
loss in recognition of stimulus terms from 
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a paired-associate list over 28 5 although 
the ability to recall the respo? ~ terms fell 
over this period. Wood (19 also con- 
cluded that the association eloped in 
paired-associate learning did influence 


^71) inter- 
that the 
pair was 


recognition, although Wolfor« 
preted his results to indk 
association between words i 
involved in recognition. 


It is not the purpose here io -itempt to 


evaluate the possible reasons the con- 
tradictory findings. The det of some 
of these studies will be consid «1 in con- 


ments to 
here that 
:e role of 
ions are 


junction with the specific ex; 
be reported. It is merely not 
the empirical facts concernin: 
associations in recognition d 
far from clear. 

One further 


background iment is 


required. It was noted earli that the 
studies to be reported were lertaken 
with frequency theory as the |--ckground 
orientation. This theory fits nic: iy into the 


conceptual notions of signal-det«- ‘ion anal- 
ysis. If, for example, an old and new word 
are given on a forced-choice tes: of recog- 
nition, the apparent frequency of the old 
item constitutes the signal, the apparent 
frequency of the new word, the noise 
Words which have the same input fre- 
quency will constitute a distribution of 
phenomenal frequency for old words, and 
the phenomenal frequency of new words 
will constitute a distribution of noise. Two 
presumably independent measures of be- 
havior may be obtained from signal 
detection analysis. One of these is dis- 
criminability or sensitivity between the 
signal and noise distribution (between the 
phenomenal-frequency distributions for old 
and new words in terms of frequency 
theory). The measure d’ is used to express 
Sensitivity differences. The other measure 
has to do with the criterion set py the 
subject for accepting old words and reject- 
ing new ones, Presumably the criterion 
represents a form of response bias which 
may differ among subjects and among con- 
ditions of an experiment, and it is measure’ 
by bela in signal-detection analysis. ^S 
will be seen, some unexpected difference 
the criterion settings or response biases 
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were foi However, while accepting 
the con: il notions of signal detection, 
it has ied unnecessary to use the 
measur ! signal detection (d^ and beta) 
to refi the behavioral manifestations of 
these cepts. Rather, other measures 
have ^ «n developed. „Nevertheless, the 
relai: ships between the measures used 
and ind beta will sometimes be reported. 

! "ght experiments to be described 
di themselves into two subsets. The 
fir uve were concerned with transient or 
W associations developed in the labora- 
1 between the two words in a pair. The 


raining three used pairs of words already 
sı ongly associated by cultural usage. 


EXPERIMENT I 


The first experiment was planned as an 
analogue of a previous study using single 
words (Underwood, 1972, Experiment I). 
In that study, comparative recognition 
decisions were evaluated in conjunction 
vith comparative judgments of frequency 

nder exactly the same conditions. The 

utcomes were statistically equivalent. 

irthermore, when errors were made they 
occurred for the same items for the group 
making recognition decisions and for the 
group making frequency judgments, The 
correlation between the error distributions 
was .74. This finding was deemed consis- 
tent with the theory that recognition 
decisions are based on frequeney informa- 
tion. In Experiment 1, reported below, 
pairs of unrelated words were used as the 
recognition unit, with frequency judgments 
and recognition decisions being made by 
different groups. The empirical question 
is whether or not the recognition decisions 
are predictable from judgments of fre- 
quency as was true when single words were 
the recognition unit. 


Method 


The general method may be described fir 
Ss were presented a series of pairs for study, 
pair being presented on a slide for 5 sec. Before 
presenting the pairs the Ss were given a booklet, and 
the cover sheet of the booklet contained the test 
instructions, These instructions were studied by 
the § before the list was presented so that he knew 
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xactly how he would be tested. One set of instruc- 
tions informed the S that he would be asked to make 
recognition decisions, both absolute recognition 
(yes-no) decisions for individual pairs, and com- 
parative or forced-choice decisions between two 
pairs, one old and one new. The two types of tests 
were illustrated on the instruction sheet, Another 
set of instructions, given to other Ss, informed them 
that their memories would be tested for frequency 
information, These instructions further indicated 
that there would be two types of test items, In one 
case, single pairs were shown and the S was required 
to fill in a blank with a number to indicate the 
number of times he thought the pair had been pre- 
sented, In the other case, two: pairs were shown and 
the S was required to encircle the pair which had 
been presented most frequently. Thus, all Ss 
received the same study list, and all Ss made both 
absolute and comparative judgments, but for one 
group the instructions emphasized correct. recogni- 
tion and for the other group, correct frequency 
information, 

Materials, The pool of words consisted of 240 
five-letter words with Thorndike-Lorge frequencies 
falling between 11-30. This pool consisted of 
approximately 90% of all five-letter words falling 
in this frequency range From this pool, 72 pairs 
were formed randomly subject only to the restric- 
tion that the two words in a pair did not have the 
same initial letter, The 72 pairs were used in the 
study list, with 48 pairs presented once, 12 twice, 
and 12 three times, The pairs were assigned ran- 
domly to these frequency categories, Pairs pre- 
sented twice were first assigned positions randomly, 
once in each half of the study list. Pairs presented 
three times were then assigned, h pair occurring 
once in cach third of the study list. Finally, the 
48 pairs occurring once were assigned randomly to 
the remaining positions, Two bufer pairs were 
assigned at the beginning of the list, and two at the 
end so there was a total of 112 positions in the study 
list, 

Test booklets, The test booklet consisted of three 
pages plus the cover sheet of instructions: There 
were 84 test items, 28 on each page. Each page 
included 16 items containing pairs that had been 
presented once for study, four that had been pre- 
sented twice, and four that had been presented 
three times. In addition, there were four pairs on 
each page which had not been on the study list 
(new pairs). Half of the items in each class (except- 
ing the new pairs) were tested by a forced-choice 
procedure, half by an absolute test. Thus, across 
the three pages, 24 pairs that had been presented 
once for study were paired with new pairs to pro- 
duce 24 forced-choice items, and 24 pairs were 
presented alone for yes-no decisions, Pairs pre- 
sented twice and three times for study were also 
equally divided between forced choice and absolute 


judgments. 


* This pool of words was formed by Carl P. 
Duncan, Its use is greatly appreciated. 
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For absolute recognition, the words yes and no 
followed the pair with the S instructed to encircle 
the appropriate response. For the judgments of 
absolute frequency, a blank, rather than yes and no, 
followed the pair with the S requested to write in 
a number to indicate the frequency with which he 
thought the pair had been presented during the study 
list. In the forced-choice items the S encircled the 
correct pair (recognition) or the pair which had 
occurred most frequently during study (frequency 
judgments). 

As noted earlier, new pairs were required for the 
forced-choice items. The 36 new pairs required 
(24 for the pairs presented once for study, six each 
for those presented twice and three times) had been 
formed randomly from the pool and were paired 
randomly with old pairs. On a single form only 
half the study pairs could be tested by a given 
method (forced choice or absolute judgment). 
Therefore, a second form was constructed in which 
the method of testing was reversed for the pairs. 
As a consequence, when considered across all Ss, 
each study pair was tested by a forced-choice pro- 
cedure (for recognition and for frequency informa- 
tion) and also by an absolute test (yes or no recogni- 
tion, and judgments of absolute frequency). 

On each page of the booklet the type of test item 
was randomized. "Thus, 5 might have three succes- 
s items requiring a forced-choice decision, then 
an item requiring an absolute judgment, then an 
item for a forced-choice decision, then three items 
for an absolute judgment, and soon. On the forced- 
choice items the old and new pairs were randomly 
assigned to the left or right position. The instruc- 
tions required that the 5 respond to all items on each 
page, guessing if necessary. 

Procedure and Subjects, The data were collected 
by a group procedure, As the initial step, the 
booklets were distributed to the Ss with instructions 
not to open the booklet. The experimenter informed 
the Ss that they would be presented a long list of 
pairs of words and that they should try to learn the 
pairs as pairs. They were further told that some 
pairs would be presented more than once. Next, 
they were asked to study the instructions in order to 
understand how they would be tested. Any ques- 
tions concerning the instructions were handled 
individually. The list was then presented for study. 
After the last slide was presented the Ss were asked 
to review the instructions (if necessary) and then 
proceed to the first test page. The testing was 
unpaced. 

"The Ss were tested in groups of varying size, but 
within each group both frequency-judging instruc- 
tions and recognition instructions were represented 
as well as were both forms. The booklets were 
randomized within blocks of four (two forms and 
two types of instructions) before they were assigned 
to the Ss. A total of 152 Ss was completed, 76 
having had the recognition instructions and 76 the 
instructions for frequency judgments. Within each 
group there were 38 for each form. Since the forms 
did not differ in any of the analyses, no further 
mention will be made of this balancing variable. 
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Results 

Pairs presented two and th: imes were 
included merely to make tl:  requency- 
judging task a reasonable on: rticularly 
for the absolute judgments he errors 
made on these pairs were fc number, 
The basic interest was for airs pre- 
sented once for study. The r« ‘is for the 
forced-choice tests will be coi red first, 

Forced-choice tests. There \ 24 items 
each of which consisted of a | resented 
once and a pair which was ı The Ss 
were required in one case to ircle the 
pair which had been presente: or study, 
and in the other case, to encir. the most 
frequently presented pair. Fo ‘he 76 Ss 


given recognition instruction e mean 
number of errors was 2.59 (10.8. `, and for 
the 76 subjects given instructi« o choose 
the most frequently presente: pair the 
mean number of errors was 3.^. (13.8%). 
The difference between the tw« ans Was 
not reliable, / (150) = 1.73, p > .05. The 
discriminability or sensitivity, ‘herefore, 
may be judged equivalent fo: the two 


groups. This result is the same as found 
previously with single words (U: derwood, 
1972). In the previous study it was found 
that the errors fell on the same items under 
both sets of instructions. In the present 
study the number of errors made on each of 
the 48 pairs presented: once for study was 
determined for each type of instruction. 
The product-moment correlation between 
the two arrays was .43, p < .01. 

Absolute tests. For these tests, each $ 
was given 24 pairs presented once and 12 
new pairs. In one case yes—no decisions 
were requested, in the other, absolute judg- 
ments of frequency. The data will be 
viewed in a number of ways. 

The frequency estimates may be handled 
in exactly the same way as are the recogni- 
tion decisions. If S indicated that a pall 
not presented for study had a frequency 9 
one or greater, it would be equivalent toa 
false alarm for a yes-no recognition dc 
cision. Similarly, if S assigned a zero to a 
pair that had been presented once, it wou : 
be equivalent to a miss in recognition 
"Throughout the studies to be reported, ! 
has been found that the sum of the miss? 
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and false alar 
measure o! 
This mea 
the pres: 


is a simple and meaningful 
riminability or sensitivity. 
ind d’ correlate highly. In 
‘periment the correlation for 


the 76 iven the recognition test was 
—.86, ior those making absolute fre- 
quency igments, —.91. These are some- 
what values than those to be re- 
porti later studies. In view of these 
relati ps, the sum of the misses and 
fals vms (M + FA) has been used as 
thc c measure of discriminability. 
mean M + FA for absolute recogni- 
tic 5 7.66, that for frequency judgments, 
6 Ihese means do not differ statistic- 
all (£2 1.20). However, these values 


al ae do not adequately reflect the outcome 
for the absolute tests, The full picture 
can be seen in Figure 1. This figure impli- 
cates criterion differences between those Ss 
who made recognition decisions and those 
who made frequency decisions, The Ss in 
making recognition decisions infrequently 
nade false alarms but correspondingly had 
'any miss The Ss in making frequency 

dements behaved in the opposite manner; 
tey made many false alarms (assigning a 
new pair a value of one or more) but had 
few misses. 

he difference in the criterion set by the 
two groups is reflected directly by the 
interaction (F = 69,66) as seen in Figure 1. 
An analysis of variance of data such as 
Figure 1 is based upon allows conclusions 
about both differences in sensitivity or dis- 
criminability and differences in the criter- 
ion or response bias. If there is a significant 
main effect between recognition and fre- 
quency judgments, differences in sensitivity 
would be indicated. A main effect of input 
frequency would simply indicate the overall 
balance between the tendency to produce 
false alarms and the tendency to produce 
misses. A significant statistical interaction 
resulting from a convergence of the lines 
between input frequencies zero and one 
(whether a crossover or not) indicates a 
criterion difference. 

The above technique for determining 
sensitivity and criterion differences seems 
Straightforward. However, some comments 
should be made about other measures used 
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FiGURE 1. Errors on p: of words as related to 


input frequency. The test included both frequency 
judgments (FJ) and recognition (Recog). 


to reflect criterion differences. Bela, the 
measure of the criterion setting used in 
signal-detection analysis, is the ratio of 
the height of the ordinate of the normal 
curve for the proportion of hits and the 
ordinate height for the proportion of false 
alarms. The beta distributions were severely 
skewed for the present data. Ignoring this 
skewness, the mean beża for recognition was 
4.95, and for frequency judgments, 2.44 
(t = 3.59). 

Another measure which could be used to 
reflect differences in the criterion is 
M — FA/M+ FA. Thus, if one S had 
two misses and eight false alarms, and 
another S had eight misses and two false 
alarms, sensitivity is equivalent but the 
criterion differs for the two (—.60 for the 
first, and +.60 for the second). Scores 
may vary from —1 to a +1, For the 
present data the mean values were .72 for 
recognition and .08 for frequency judgments 
(t = 7.44). This measure is curvilinearly 
related to beta. If zero false alarms and 
zero misses are assumed to be one miss or 
one false alarm, the rank-order correlation 
between the two measures (bela and 
M — FA/M + FA) is essentially perfect. 
The logic of changing zeros to a positive 
value (the value could be less than one, 
e.g., .1) is to avoid too many extreme scores 
(—1 and +1) which result when zero 
scores are entered. This simulates the 
situation for bela where no ordinate value 
is zero. 


The values for M + FA are essentially 
equivalent for the two groups. "Therefore, 
the criterion difference resulting from the 
formula M — FA/M 4- FA must come 
from the M — FA values. In effect, this 
is simply another way of calculating an 
interaction. For the present work, there- 
fore, any criterion difference between con- 
ditions seems given most directly by the 
interaction of the analysis of variance be- 
tween input frequencies and type of test. 
However one analyzes the data, it is clear 
that there were differences in the criterion 
setting for Ss given the recognition task 
and those given the frequency-judging task. 
As was the case with forced-choice tests, 
sensitivity did not differ for the two groups 
on the absolute tests. 

Item correlations. The number of misses 
for each of the 48 pairs presented once was 
determined for recognition and for fre- 
quency judgments on the absolute tests, 
The correlation between the two distribu- 
tions was ,63 (p < .01). This indicates a 
substantial relationship between phenome- 
nal frequency and recognition decisions. 
This relationship emerges in another anal- 
ysis. The mean frequency judgments for 
the 48 pairs presented once were correlated 
with the number of misses for recognition. 
The value was —.50 (p < .01). 

Correlations by Subjects. A given S made 
both absolute and comparative frequency 
judgments, or both absolute and compara- 
tive recognition decisions. The relation- 
ship between the two types of judgments 
may be examined, For frequency judg- 
ments the correlation for the 76 Ss between 
M + FA for 24 items and number of errors 
on the 24 forced-choice items was .65. For 
recognition, the corresponding value was 
59. Both values are reliable statistically 
(p < 01). 

Each S under frequency-judging instruc- 
tions made absolute frequency estimates 
for 24 pairs which had been presented once 
for study. For each S the mean and 
standard deviation of these judgments was 
determined and correlated with the number 
of errors made on the forced-choice fre- 
quency judgments. The correlation be- 
tween the mean frequency judgments and 
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number of forced-choice err 
but that between the stand: 
and number of forced-choic 
44 (p«.01). Thus, the | 
ability of phenomenal frequ: 
with a constant input freque: 
the  discriminability on 
decisions. 


Discussion 


The discussion of the findin: 
at this point. Overall, the resu! 
frequency judgments and recog» 
for pairs of words have much in 
these results could be interprete: 
with the idea that phenomenal | 
ferences play a major role in ro 
formance. Alternative interp: tions are 
undoubtedly possible, of course, it such in 
terpretations must include an «  unting of 
the commonality in errors produce. >y recogni- 
tion and by frequency-judging in- uctions. 

The finding that there was a dit! || once in the 
criterion set (on the average) for "ecognition 
judgments and for frequency jud ments Was 
quite unexpected. It was there! c. decided 
to undertake a further study, wit!) somewhat 
changed procedures, to see if the (ding could 
be replicated. 


rowed that 
ı decisions 


imon, and 
consistent 
juency dif 
nition per 


EXPERIMENT II 
Method 


In this experiment all Ss were given exactly the 
same study list as was given in Experiment L 
two changes, which occurred on the test, 
described. 

In Experiment I, cach S made both absolute and 
comparative judgments, In Experiment M, «a 
S made a single type of judgment on all items 
Therefore, there were four groups of Ss, One gro 
made only absolute frequency judgments, 
made only forced-choice frequency judgment: 
A third group made yes-no recognition d E 
and a fourth made forced-choice recogaition d 
sions, Of primary interest were the judgments m 
on the 48 pairs presented once for stud 

It will be remembered that in Experiment 1 only 
12 new pairs were used for yes—no recognition a 
for absolute frequency judgments. In Exper 
M, 48 new pairs were included. Since there 
also 48 pairs presented once, the number of 
rights and wrongs was more ly equali 
Experiment |l than was true in Experimer, 
These 48 new pairs also became the new pairs u 
forced-choice tests. cog 
The booklet for the forced-choice tests (rè the 
tion and frequency judgments) consisted os! 


nd two test sheets with 30 items 

1l of 60 test items, These 60 items 
48 pairs presented once (each 

ı new pair) and six items in which 
had been presented twice and six in 
been presented three times, Thus, 


 dnstruction «i 
on each for 
"consisted 
Wecompan 

- the stud; 
which i 


only ha tems presented twice and half those 

F present times were tested, The purpose in 
this w ep the total number of pairs to which 
the s vas exposed on the test equivalent for 
the í hoe and for the absolute tests, 

T! »klet for the absolute judgments was 
max | the instruction sheet and three test 
pa tems toa page, for a total of 120, These 
to of 48 new pairs, 48 pairs presented once, 
D ted twice, and 12 presented three times, 


‘her details of the procedure were exactly the 
$ for Experiment I. The Ss studied the in- 

on sheet which described how they would be 
te d, after which the list was presented for study 
a . Sesee. rate, Again, the test was unpaced. The 
{ocr instructional conditions were randomized in 
socks of four so that for each session cach was 
(bout equally represented. Sessions were continued 
until 36 Ss had completed the test for each instruc- 
tional condition. 


Results 


Forced choice. The mean number of 
rrors made by the 36 Ss on the 48 forced- 

ioice items under recognition instructions 

is 5.42 (11,305), and under instructions 
to choose the most frequent pair, 4.92 
(10.1%). The £ was .60, The correlation 
of errors by items was .54 (p < 01). 
Those results confirm those of Experiment I. 

Absolute tests. "The mean frequency judg- 
ments for the pairs presented 0, 1, 2, and 
3 times may first be noted, The means in 
corresponding order were .18, ,91, 1,74, and 
2.49, The relationship is linear. Again, 
however, in the analyses to follow, no 
attention will be given to the pairs pre- 
sented more than once since the errors on 
these pairs were few in number. As was 
done for Experiment 1, false alarms and 
misses were determined for the groups 
making absolute judgments. ‘The results 
are plotted in Figure 2. The interaction, 
although less severe than in Experiment 1, 
was still present, F (1, 70) = 5.99, p < .05, 
There was no appreciable effect of type of 
instruction (F = 2.18), indicating that dis- 
criminability was equivalent for both fre- 
quency decisions and recognition decisions. 
There was a main effect of input frequency 
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Ficure 2, Further evidence on the errors made — — 
on pairs of words as related to input frequency for — — 
frequency judgments and recognition. 7 


(F = 17.54) indicating more misses than 
false alarms when combined across both — 
groups. . 

The sum of the false alarms and misses 
correlated —.,93 with d’ for the frequency 
judgments, and —.94 for the recognition 
data. The difference in the criterion set by 
the two groups, as given in the intezaction 
in Figure 2, was not reflected in bea. dif- 
ferences, the F being less than one, 

Item correlations. “The number of misses 
was determined for each of the 48 pairs 
presented once for study. This was done 
separately for the 36 Sa making absolute 
frequency judgments (a miss being an 
assignment of zero) and for the 36 Sw 
making yes=no recognition decisions, The 
correlation between the two arrays was 
55 (p< 01). The correlation for the 
number of false alarms on the 48 new items 
for the two groups was 42 (p < 01). 

If frequency information is involved in 
recognition decisions, it should be possible 
to predict forced-choice recognition and 
frequency judgments from the absolute fre- 
quency judgments. In the forced-choice 
tests, two pairs were involved for cach item, 
one old (presented once) and one new, For 
each of these pairs a mean frequency judg- 
ment was available from the group making 
the absolute frequency estimates. It 
would seem that the greater the phe- 
nomenal frequency difference between the 
two pairs in the forced-choice item, the 
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fewer the number of errors which should 
be made on that item. However, the 
variability of the frequency judgments 
should also be considered. For whatever 
reason, items having the same input 
frequency differ in the variability of the 
frequency estimates when viewed across 
the 36 estimates from the 36 Ss. It was 
shown in Experiment I that Ss having the 
lower variability of frequency estimates 
with constant input frequency did better 
on the forced-choice items than did the Ss 
with the higher variability. The same 
outcome may apply to item variability 
when calculated across Ss. T herefore, it 
would seem that the clearest prediction 
would be that the greater the mean dif- 
ference in phenomenal frequency between 
an old and new pair in a forced choice 
item, adjusted for differences in the vari- 
ability of frequency estimates for those 
pairs, the better should be the performance 
on the forced-choice test, 

The mean frequency difference for each 
old and new pair appearing in the forced- 
choice items was divided by the square 
root of the sums of the variances of the 
frequency estimates for the two pairs. A 
high value should indicate good discrimin- 
ability between the two pairs. The correla- 
tion between these values and the number 
of errors on the forced-choice recognition 
test was —.61. For the forced-choice fre- 
quency judgments, the correlation was 
—.79. Clearly, the phenomenal frequency 
of items presented once, and the variability 
of the estimates reflecting phenomenal fre- 
quency, predict performance on the forced- 
choice tests with some accuracy. 

As one further bit of evidence on the 
relationship between frequency information 
and recognition performance, the absolute 
judgments made on the 48 pairs occurring 
once were combined across Experiments | 
and Il. Therefore, there were 74 Ss who 
had made absolute frequency. judgments 
and 74 Ss who had made yes-no recognition 
decisions. The apparent frequency of a 
pair was given by the mean of the frequency 
judgments, and for each pair a standard 
deviation was available, It would be ex- 


pected that the higher the mean apparent 
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frequency and the lower 
the better should be the p 
recognition. "The coefficici 
(M /e) was used to derive a 
for each pair. A high co 
indicate good discriminabi| 
values were correlated with 
errors on yes—no recogniti: 
pairs. The product-mom« 
was —.70. 


variability. 
ormance on 
f variation 
‘le measure 
ent should 
These 48 
number of 
^or the 4$ 
correlation 


Discussion 


that dis. 
s of words 
d to make 


Both experiments have «i 
criminability or sensitivity for 
was equivalent when Ss were 


recognition decisions and jude; -nts of free 
quency. This outcome held fo: oth forced: 
choice items and for absolu: judgments: 
Furthermore, the particular it on whieh 
errors were made in recognition ^ nded to be 
the same items on which the er: vere made 


when judging frequency and fo juency dif- 
ferences, Mean frequency judgm: ots for pairs 
combined with the variability oí ihose judge 


ments, predicted errors in rec 


ition pere 
formance on both yes-mo and 


iorced-choice 


tes Viewed as a whole, the results were 
entirely consistent with the th ory which 
Specifies that frequency information is the 


major attribute of memory inyolyed in pair 
recognition. In short, these two «tudies indi- 
cate that when pairs of words constitute the 
unit of analysis of recognition the outcome I 
much the same as when the individual word is 
the unit. 

An unexpected finding in both experiments 
Was that Ss making absolute frequency judg 
ments set a more lenient criterion than did 
those Ss making recognition decisions. The 
reason for this difference is not known. M 
examination of the changes in performance 
across pages of the tests showed only very 
slight decrements under any of the conditions 
of testing, and the differences in the criterion 
were present initially and remained relatively 
constant across pages, Perhaps backgroum 
frequency influenced the frequency judgment 
and not the recognition decisions although, 4$ 
noted in the introduction, other data woul 
argue against this possibility. By way 
looking ahead it may be noted that a later 
experiment will show that criterion difference 
were found to be present for different ce 
of pairs of words for the same S. For the b. 
being, therefore, the problem concerning t 


- 


» 5 


" 


criterion differences will be set 


‘reason | 


aside. 

The : udies reported have established 
nothin: erning the role of the association 
in rec m. They have merely shown that 
frequ heory does not break down when 
pair rds are used, pairs which presumably 
hay © associative relationship as a conse- 
que ! the study trial. To assess in a more 
an way the role of associations in 
re: n requires different types of studies, 


7 vediately following experiments repre- 
ne of these different types. 


EXPERIMENT II 


sume the S is presented a series of 
| 5s of words under instructions to asso- 
« ate the words in each pair. Let three of 
he pairs be identified as A-B, C-D, and 
E-F. On the test of recognition he is 
presented A-B (intact pair) and C-F 
(broken pair). His instructions are to 
respond yes to all pairs providing both 
words in the pair had been presented for 
study whether paired during study or not. 
If the number of misses is greater for the 
woken pairs than for the intact pairs, the 
ssociation would seem to be clearly impli- 
ited in recognition performance, If the 
isses are equivalent, it would be con- 
ided that the association is irrelevant for 
recognition performance. The purpose of 
Experiment ILL was to make this test. 

In addition to the two types of test items 
noted. above, two others were included in 
order to have pairs for which the correct 
response was no, One of these types con- 
sisted of one old and one new word, and the 
other type consisted of new pairs, 


Method 


Materials, The pairs of words were from the same 
pool as used in the first two studies, The S was 
Presented 54 pairs for study. Of these, 18 were 
tested as intact pairs, that is, both words were old 
and paired as they were for study (O-O). A further 
18 were re-paired words or broken pairs (0-0b). 
One word from each of the remaining 18 pairs was 
used in pairs in which a new word constituted the 
other member of the pair (O-N). Finally, the test 
included 18 new pairs (N-N pairs). ^ 

For purposes of balancing conditions against 
Possible pair differences, cach pair was used in all 
. conditions equally often. Originally, 72 pairs were 
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j A 
assigned randomly to four subgroups of 18 
each. Across four forms each subgroup was u 
once for each of the four test types as desci 
above. Thus, there were four different lists 
sented for study and four different test forms, 
only additional words required were the 18 u: n 
new words for the O-N pairs, These 18 words 
served this same function across all forms M 
of course, the old words differed for each form. TI e 
O-Ob test pairs never consisted of an intercha 
of the members of two pairs. In the presentation. 
list, each type of test item (except N-N of course): 
was represented in each successive block of three — 
pairs, On the test, item types were randomized 
within blocks and particular items within blocks: 
were random with respect to presentation position - 
during study, ‘ 
One further variable was introduced in the test, 
Half of the O-O pairs were tested in the order pre 
sented for study (A-B), and half were tested in 
reverse order (B-A), This was also true for the 
O-Ob pairs and for the O-N pairs, The latter: 
became N-O pairs when reversed, ‘The particular 
nine pairs to be reversed were determined randomly, 
Procedure ond Subjects, The pairs were presented 
for study at a 5-sec, rate by a slide projector, The 
instructions emphasized the learning of pairs an 
pairs and, although the Se knew their memories 
would be tested, the nature of the test was not 
specified before presenting the list, After the study 
of the pairs the Ss were given a two-page booklet, M 
pair» to a page, with the words yes and wo alter 
each pair, They were instructed carefully that yer 
was the appropriate decision if both words had been P 
presented for study whether paired together garing 
study or not. The instructions further empha y 
that if one or both of the words in a test pair had not. 
been presented for study, the appropriate response: 
was o, According to these rales there were 30. 
pairs for which the correct response was yer, and 46 
for which the correct response was wo, although the 
Sy were not told this, 3 
Subjects were tested in small groupe until 18 were 
completed for each of the four forms, Forme did 
not prove to differ statistically on the teat, and they 
did not interact with item types, The results will 
be considered for the 72 Ss combined, 


Results 


The overall results are shown in Figure 3, * 
where percent errors is used as the recogni- 
tion measure, An error would be a false 
alarm for N-N and O-N, and a miss for 
O-O and O-Ob, It is to be noted tha 
there were about 7% more misses on O-Ob. 
pairs than for O-O pairs. The overall- 
analysis showed a significant effect of item 
type (F = 37.58). A standard error of the 
mean difference derived from the error term 
indicated that any difference as large as 
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PERCENT ERROR 
» e 
o o 


N-N o-0 O-Ob 


TEST-PAIR TYPE 


O-N 


FrGuRE 3. Misses and false alarms (FA) for 
four types of test pairs. (Abbreviations: N refers to 
a new word, O to an old word, and b to a broken 
pair.) 


4.3% was reliable at p= .01. All dif- 
ferences between item types were larger 
than 4.3%, The fact, therefore, that type 
O-O results in fewer errors than type O-Ob 
may be taken as evidence that the associa- 
tion between the two words in a pair played 
a role in recognition, 

The misses on the O-O pairs were more 
frequent than were the false alarms on 
the N-N pairs. The M + FA measure of 
sensitivity for these pairs correlated with 
d' across the four forms at the following 
levels: —.93, —.94, —.91, —.95. 

The O-N pairs produced nearly 35% 
false alarms. A satisfactory accounting of 
this result was not found. Several sub- 
sidiary analyses were done to search for 
leads. The 72 Ss were divided into two 
subgroups based upon the total errors made 
on all item types. The relative relation- 
‘ship among the four item types were 
exactly the same for these two subgroups 
of good and poor Ss. There were two pages 
to the test form, and the Ss completed the 
first page before moving to the second, It 
seemed possible that testing effects might 
have differed for item types. However, an 
analysis showed no difference in perform- 
ance on the two pages and no interaction 
between pages and item types, both Fs 
being less than 1. 

It will be remembered that the position 
of the words in a pair during study was 
reversed for half the pairs on the test. This 
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change had essentially no eff. the 
pairs, the nonreversed paire produci 
20.7% errors, the reversed pas, 19.6% 
For the O-Ob pairs the co: sponding 
values were 25.8% and 27.9: But, ft 
the O-N pairs the false alarms e 38.0% 
and when reversed, (N-O) 31 "This i 
to say that when the old word the fir 
member of the pair, errors w nore fr 
quent than when the new w was thi 
first member. However, « for 
latter pairs, performance was poorer tha 
was found on any of the thre er Í 
types. 

It is difficult to see how c: on dif 
ferences could exist for the v is item 
types, since if the .S could classi ~ an item 
type he presumably could provido the cor 


rect response. Frequency th might 
predict good performance on the O-N pairi 
on the grounds that the two woi:!s would 
represent a contrast in apparent !cequency. 
and this contrast would lead to a correct; 
rejection. [n one sense the results for the 
O-N pairs are in contradiction to the notion 
that an association facilitates recognition. 
The two words in the O-N pairs were not 
associated during study just as the O-Ob 
pairs were not associated, If the lack of am 
association leads to a no decision for the 
O-Ob pairs it should do the same for the 
O-N pairs, but in the latter case the deci- 
sion would be correct. It is possible, of 
course, that this effect was present and kept 
the number of errors from being more fre- 
quent than was actually found. For the 
time being, the mechanisms responsi le. 
for the heavy error rate on the O-N pairs 
cannot be specified. 

One other fact will be reported. When 
words are paired randomly the resulting 
pairs should show a distribution defined in 
terms of the ease with which the words 
within the pairs could be associated. There 
is a possibility that a pair which is easily 
ociated when presented for study will 
also produce errors (false alarms) when It 
is used as a new pair on a test, This does — 
not seem to be the case. In the present 
experiment, the 72 pairs were used both Bie 
old pairs (0-O) and as new pairs (N-N)- 
The number of false alarms made on the 
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72 pairs wi 
did not « 
which o 


they were used as new pairs 
ite with the number of misses 
ed when they were used as old 


pairs (; 03). 
EXPERIMENT IV 

7 sults of Experiment HI were inter- 
pr demonstrating that an associa- 
tio veen two words, developed on a 
si! udy trial, facilitated recognition of 
th Or, as is sometimes said, associa- 
ti^ ntext aided recognition performance. 


I iment IV was another test of the role 
sociative context using a different 
ay. vach, although one that is similar to 
th se used by other investigators, particu- 
ly ly DaPolito, Barker, and Wiant (1972). 
“hese investigators presented a triad of 
words for study. On the test, the Ss made 
recognition decisions on single words, but 
other items were sometimes present in the 
display when the decisions were being made. 
Essentially any change at test produced an 
acrease in misses. The Ss were not in- 

ructed to associate the words in the triads 
caring study, and since the triads were pre- 
scuited for only 1 sec., associative formation 
bo ween the items would be minimal, Still, 
the results showed that when the order of 
the three items as presented for study was 
chonged on the test, a decrement was ob- 
served, suggesting either that item-order 
information was important or that associa- 
tions between the item and its position 
within the triad was a part of the memory. 
The method used by DaPolito et al. makes 
context (when defined as the presence of an 
item or items on which no recognition 
decisions are to be made) seemingly inci- 
dental. To some degree it would seem that 
S could ignore the context words on the 
test. The fact that context effects were 
found in spite of this seems to speak 
Strongly that context, whether known to 
be associative or not, does influence 
recognition. 

In the present experiment, Ss were pre- 
sented pairs of words for study under in- 
Structions to associate the words in each 
Pair, On the recognition test, either single 
Words or pairs of words were presented. 


In the latter case the recognition decisions 
were made only on the second word in the 
pair. Thus, the first word in the pair need 
not have been involved at all in the recogni- 
tion decision although the Ss were told that 
it might help them in reaching a decision 
on the second. Of course, the major 
variable was the nature of the first word in 
the pairs. 


Method 


A description of the seven classes or types of test 
items will be given first, expanding the symbol 
system used in describing the pairs in Experiment 
II. An old item is designated O, a new item, N, 
and (a) the test word was always the second word 
in a pair on the test, and (^) a word on the test 
always occupied the same position in a pair (first 
or second member) as it held during the study trial, 
"The seven types of items were as follows. 


Type O: old second word tested alone 

Type N: new word tested alone 

Type O-O: intact pair 

Type N ew pair 

Type N-O: old second word with new first word 


Type O-N : new second word with old first word 
Type O-Ob: two old words from different study 
pairs 


The words in the pairs were all four- 
nosyllabie words. They were drawn from 
a pool of 312 such words which constituted a random 
sample from the Thorndike-Lorge tables The 
change in the class of words (from the class used in 
the first three experiments) was necessitated by the 
fact that other experiments being conducted in the 
department were using the words from the earlier 
experiments. 

Study and test lists, Initially, 96 words were 
drawn randomly from the pool and formed into 48 
pairs randomly, These 48 pairs constituted the 
study list, hence the old pairs, for all Ss. A second 
sample of 84 words was drawn randomly and these 
84 words were always used as new words on the 
test form. 

For balancing purposes, the 48 study pairs were 
arbitrarily divided into four blocks of 12 pairs each, 
Across four test forms, each block of 12 pairs was 
used once for each of the four item types on the test 
involving old words in the second position (0, O-O, 
N-O, O-Ob).. This means, therefore, that on a test 
form there were 12 items representing each of the 
four types. The O-Ob pairs were never constructed 
by interchanging the members of two pairs, 

The assignment of new items to particular fune- 
tions on the test forms was done randomly, witha — 
different random assignment for each form, There 
were 12 Type O-N test pairs, the O words being — 
the first words in the study pairs which on the test 
became Type O, There were also 12 Type N pairs 
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Ficure 4, Misses and false alarms for various 
types of test items. (For pairs, the recognition 
decision was made only on the second word in the 
pair.) 


on the test, but 24 pairs were used to represent 
Type N-N. This made 48 true yes items on the 
test and 48 true no items, On the test, the seven 
item types were block randomized, and test posi- 
tion, relative to input position, was orthogonal by 
halves. There were 48 items on each page of the 
two-page booklet. 

The single items on the test always occupied the 
nominal second position of a pair. For example, 
the first four items on one of the test forms were as 
follows. 


YES 
YES 
YES 
YES 


NO 
NO 
NO 
NO 


1. spry-wile 
2. whey-ramp 
3. jest 
4. came-dell 


Procedure and Subjects, The 48 study pairs, 
preceded and followed by two filler pairs, were 
presented at a 5-sec. rate, Prior to presenting the 
pairs, the Ss were told only that pairs would be 
presented and that they should try to associate the 
words in each pair in preparation for a memory test. 
After the last pair had been presented, the booklets 
were distributed. The Ss were run in groups of 
varying size but in any group the four test forms 
were assigned randomly to the Ss. The instructions 
for the test involved five key points: (a) for a single 
word on the test form, the yes-no decision 
whether or not the word had been shown as a 
member of a pair; (b) for a pair of words, the deci- 
sion was to be made only for the second word, i.e., 
had it or had it not been presented as a second word 
in a pair on the study list; (c) the first word in a pair 
might aid in making the decision on the second; 
(d) an old word always occupied the same position 
on the test form as it had in the pair in which it 
appeared on the study list; and (e) all items were 
to be completed, guessing if necessary. The test 

_ was unpaced. 

Groups of Ss were tested until 68 Ss had been 

completed, 17 for each of the four forms. 
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Results 

The results, shown in Figu are qui 
unambiguous. There were o two levi 
of errors, differing about 8 ind the 
two levels were associated ^ hange 
context and with context co: icy. r 
the misses, omission of the fi ord (0), 


including a new first word )), 
pairing old words (O-Ob) 
about the same increase in t! 
errors when no change (0-O 
reference. False alarms we: 
spondingly influenced when an 
used as a first word (O-N). 
different ways of viewing th 
statistically all produced th 
clusions. Perhaps the most di^ t 
to ask about the differences amo 
types for the misses, F (3, 2 = 
b < .01, and among the thre: 
the false alarms, F (2, 134) = 8 
In both analyses, the deviati of one: 
condition from the others is large! v respon 
sible for the conclusion that reliable dif- 
ferences were present. J 
It will be remembered that i» Experiz 
ment III, Types N-O and O-N produced 
more errors than did Type O-Ob. In the 
present data, where S was asked to make 
decisions on a single word rather than on a 
pair as was true in Experiment 111, the 
numbers of errors for these types were: 
essentially equivalent. Also, in Experi- 
ment III the misses on Type O-O exceeded 
the false alarms on Type N-N, which was 
not true in the present experiment. This: 
is to say that the two techniques (judging 
single words in pairs versus judging pairs) 
produced differences in detail, but both lead 
to the same conclusion. The presence on 
the test of the full associative context 
present during study facilitated recognition 
Any changes made in the context cause 
an increase in misses, 


EXPERIMENT V 


Experiments III and IV gave evidence 
that an intact association between tw 
words in a pair is a positive factor in the 
recognition of that pair. The present 
experiment represents a further test of the” 
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role of a ciation in recognition, a test 
which es use of  paired-associate 
learnin 

It w ntioned earlier that when words 
are p randomly one would anticipate 
that of these pairings would consist 
of w vhich would be easy to associate 
anc rs which would be difficult to 
ass , with the bulk of the pairs lying 
in n the extremes. If an association 
he n two words influences the recogni- 
ti ! the pair, it might be reasoned that 
o irs on which few errors were made 
( oped stronger associations than did 
Į on which many errors were made. To 


this another way, during a constant 
i. riod of study, the strength of the acquired 
ssociation would differ for different pairs. 
If the strength of the association is involved 
in recognition, it must be predicted that 
pairs on which few recognition errors were 
made would be learned more rapidly as 
paired associates than would pairs on which 
many errors were made. Experiment V was 
a test of this proposition. 


Method 


Materials, From among the 48 pairs presented 
ce for study in Experiment 1, and tested by a 


TABLE 1 
Easy AND Harp Lr J 


socrATE (PA) Exe! eR 
VANT INFORMATION ABOUT. THE PAIRS 


Recognition Frequency 
errors judgments | cor- 
List — ——|—— | rect in 
Exper ES 
Der 
limentrr| M | M7e 
Kasy 
nymph-prune 1 4 195 67 
bacon-shrub à 6 111 58 
mound-alert 3 15 1:24 65 
onion-ivory 1 4 193 75 
daddy-waver 5 3 E 77 
hroom-folly 5 5 1.26 61 
laden-cargo 5 8 1,09 55 
drank-voter 6 6 99 76 
exalL-baron 6 3 1.05 62 
wedge-canon 7 6 97 53 
M 45 6.0 | 1.05 64.9 
Hard 
hatch-focus 25 21 80 | 84 | 45 
marsh-brute 20 13 176 196 | 23 
award-birch 7 6 $94 | 1.07 | 54 
dwelt-forge 17 4 9 1.27 19 
harsh-inner 16 17 igi | LO» | 44 
ce-chime 16 21 EE ‘97 | 44 
n-stra 16 18 186 | 197 | 43 
punch-ex 16 8 108 | 133 | 32 
Died" 15 10 EG 196 | 66 
ush-abide 15 11 i89 | 116 | 22 
M 17.3 12.9 .86 | 1.05 | 392 
| | 


Easy 


Hard 


vw 


= 


MEAN NUMBER CORRECT 
P 


3 
TRIAL 


FIGURE 5, Acquisition of paired-associate lists 
of 10 pairs each, identified as easy and hard by the 
number of recognition errors made on the pairs. 


yes-no procedure, 10 pairs on which few recognition 
errors were made were selected along with 10 pairs 
on which the most rec ognition errors were made, 
As paired-associ e lists, these will be called the 
Easy List and the Hard List. Both lists are shown 
in Table 1. Although these lists were selected by 
using the data of Experiment I, the same pairs 
occurred under much the same conditions in Experi- 
ment II, Therefore, the errors made in Experi- 
ment l| are also shown. The reliability of pair 
difficulty is by no means perfect, particularly for 
the pairs in the Hard List, although some regression 
would be anticipated. Nevertheless, the mean errors 
were substantially different for the two lists even 
when gauged by the errors made in Experiment II. 
Procedure and Subjects. The lists were presented 
for one study trial and five anticipation trials at a 
2:2-sec. rate on a memory drum. There were 
20 Ss, Half learned the E: asy List followed by the 
Hard List, and half learned in the reverse order, 


Resulls 


As may be seen in Figure 5, the list made 
up of pairs defined as easy by recognition 
errors was much easier to learn than the 
list made up of pairs on which many 
recognition errors were made. This was 
true regardless of the order in which the 
lists were learned and the overall difference 
across the five trials was highly reliable 
(F = 28.09). 

The mean number correct for each pair, 
summed across the five trials, is shown in 
Table 1. Only two pairs from the so-called 
Hard List fall within the range of number 
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of correct responses as shown for the pairs 
in the Easy List. Other information is 
included in Table 1. In Experiment I, 38 
Ss made absolute frequency judgments of 
these pairs, and in Experiment II, 36 Ss 
did the same. The mean apparent fre- 
quency as derived from these 74 Ss for 
each pair is shown in Table 1. The dif- 
ference in the mean frequency judgments 
for the two lists is significant, / (18) = 3.69, 
p «.01. Thus, the pairs in the Easy List 
had higher apparent frequency than did 
those in the Hard List. 

As shown earlier, prediction of recogni- 
tion errors from absolute judgments of fre- 
quency was quite good when means and 
variability were considered simultaneous 
by using the coefficient of variation (Mo). 
"These values also appear in Table 1 and it 
is to be noted that the two lists differ 
markedly on this measure. Rank-order 
correlations were calculated across the 20 
pairs. The M/o values correlated .86 with 
recognition errors and .56 with the number 
of correct anticipations in paired-associate 
learning. The correlation between recog- 
nition errors and paired-associate learning 
was .72. 


Discussion 


The present results indicate that the ease of 
associating two words in a pair is positively 
related to correct recognition. These results 
Support those of Experiments III and IV in 
showing that an association between the two 
words in a pair seems to be involved in recogni- 
tion performance. Insofar as apparent fre- 
quency has been shown to predict recognition 
errors 1t is not surprising that apparent fre- 
quency also has some predictive value for 
paired-associate learning, 

The evidence that an association between 
two words influences recognition must be 
kept in theoretical perspective, The major 
information used in recognition decisions may 
well be frequency information—apparent fre- 
quency of the pair as a unit and of the indi- 
vidual words, each as a unit. The difference 
between intact and broken pairs in Experi- 
ments III and IV was only 7-10%; perform- 

ance did not fall to chance when associative 
Context was removed, Thus, although the 
results of the experiments thus far indicate 
that an association plays a role in recognition, 
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the magnitude of the role is no!  npressive 
when overall recognition perí ince js 
considered. 
EXPERIMENT VI 

Two of the basic findings froo the pre- 
ceding experiments led to Expe ent VI. 
Experiments III and IV indica hat an 
association resulting from one | ntation 
of a pair of words adds inform | which 
aids recognition | performance Experi- 
ment V showed that pairs on h few 
recognition errors were made more 
rapidly acquired in a paired-as te list 
than were pairs on which many ro- gnition 
errors were committed. These s sug- 
gest that the associability of tw ords in 
a pair facilitated recognition pe! nance, 
or, that facilitation of recognition f a pair 
by the association existing bet n the 
two words was directly relat: o the 
strength of that association. A: vious 


projection from these facts is tha: if very 
strongly associated pairs of wor were 
tested for recognition, few if an errors 
would be observed. The present experi- 


ment deals with this possibility. 

Many of the pairs used in E periment VI 
were strongly associated as a consequence 
of cultural usage. Other investigators have 
approached the problem of associative 
context by the use of similar materials. 
Tulving and Thomson (1971) presented S 
strongly associated pairs of words as well 
as weakly associated pairs as indexed by 
published word-association norms. On the 
recognition test the S was asked to make à 
decision on each word of a pair independ- 
ently. Based on the results of this pro- 
cedure it would be concluded that the 
strength of an association had little in- 
fluence on recognition. There were 12% 
misses on the strongly associated pairs, 
and 21% false alarms on new associated 
pairs. For weakly associated pairs the 
values were 15% and 16%. However, 
these data came from only a small part. P 
an extensive procedure of testing which 
involved changing words in pairs, testing 
single words, adding new words to words 
which during study had been members © 
Pairs, and so on. It is possible that the 


| 
| 
j 


] 
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TABLE 2 
Critical Parks or Wongps: ExPERIMENTS VI-VIIT 


Ant | Conceptual associates Parallel associates Synonyms Homonyms 
day canary-bird cup-saucer complete-entire see-sea 
bad-~; horse-animal bed-sleep spoken-verbal pail-pale 
lost~{ minnow-fish bread—butter hidden-concealed plane-plain 
hard maple-tree income-money empty-vacant minor—miner 
dirty—-cican | python-snake lamp-light bungalow-house peek-peak 
boy- waltz-dance hammer-nail capsule-pill cent-sent 
buy pansy—flower needle-thread chill-cold weak—week 

red-color _ salt-pepper correct-right fare-fair 

€ murder-crime eight-nine wicked-evil sleigh-slay 

water-liquid scissors-cut signature-name tea-tee 
king-queen | apple-fruit spider-web starved-hungry stayed-staid 
long-short cancer-disease table-chair silent-quiet sale-sail 
slow-fast bracelet-jewelery ale-beer grief-sorrow. wholly-holy 
bottom-top silk-cloth army-navy careful-cautious mane-main 
far-near pliers—tool candy-sweet meat-meet 
hate-love emerald-gem dock—boat y need-kneed 
open-close gnat-insect hand-foot central-middle lode-load 
rich-poor east-direction nurse-doctor ancient-old course-coarse 
smooth-rough uncle-relative mail-letter rural-country. prey-pray 
under-over oil-fuel lock-key boulder-rock seen-scene 


varied test context, and testing the words 
in a pair independently, may not have 
allowed differences as a function of strong 
and weak associative context to manifest 
themselves. Thomson (1972) used strongly 
associated pairs in one experiment (Experi- 
ment I) and pairs with no preexperimental 
associative relationship in another (Experi- 
ment IV). The Ss made decisions on each 
word in a pair. The misses were 5% less 
for the strongly associated pairs than for 
those which were weakly associated, with 
the false alarms about equivalent for new 
pairs. These data, like the Tulving- 
Thomson data, suggest only a small facili- 
tating effect of the culturally established 
associative relationship. Such findings 
seem odd if one is to assume that associa- 
tive context is a powerful factor in recog- 
nition memory. Further investigation 
seemed necessary. 

In the present experiment the S studied 
pairs of highly associated words and pairs 
of initially nonassociated words. Recog- 
nition decisions were always made for pairs 
as pairs. 


Method 


Materials. From a variety of sources, 100 pairs 
of words were brought together. As may be seen 


in Table 2, there were 20 pairs in each of five classes, 
Four of these classes were assumed to represent pairs 
with strong, culturally: established, associative 
relationshi These four classes are the parallel 


synonyms, antonyms, and concept ual 
xiates. The fifth class, homonyms, represent 
There is no reason to believe that 
the two words in a homonym pair are associatively 
related in the sense that the words in pairs in the 
other classes are associated. The distinguishing 
property of the homonym pairs for the present study 
was that the acoustic-articulatory response to the 
two words in a pair would be equivalent. The conse- 
quence is that the frequency of this response would 
be double that for the words in the other types of 
pairs. It was believed that this property would 
yield information not only relevant for frequency 
theory, but also relevant to an understanding of the 
mechanisms of recognition in the associated pairs. 
No argument is to be made for the purity of the 
pairs in the classes, nor that the pairs are necessarily 
representative of all possible pairs which might be 
placed in these classes. But, taken as a whole, ea 
class is assumed to be distinctly different from each 
other class. 
In addition to the 100 associated pairs, 100 
nonassociated pairs were formed. In constructing 
these 100 nonassociated pairs, 100 additional words 
were chosen, these words varying in length, fre- 
quency, and form class. Some examples are: 
assail, degree, lid, oath, worthless, These words were 
paired randomly with one word from each of the 
100 pairs listed in Table 2. Half the time (within 
each class), the first word in the pair as listed in 
Table 2 was retained, and it occupied the first posi- 
tion in the pair, the second position being held by a 
neutral word. For the other half of the pairs, the 
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TABLE 3 


sAN Errors (M +FA) ror E Pairs AND G 
P Pas IN EACH or THE Five CLASSES 


Concep- 
Pair | Parallel | Syno- | Anto- | Homo- | thal as- 
types | “Stes nyms DH nyms | sociates 
E 1.14 


s 1.70 1.42 .95 
32 


1 
c 1,32 1:3 1.10 .83 


Note, The standard deviations for these 10 distributions 
varied between .94 and 1.39. Abbreviations: = misses; 


FA = false alarms; E = highly associated; C = nonassociated. 


second word in the pair was retained, the first posi- 
tion being occupied by the neutral word. For ex- 
ample, using the pair cup-saucer in the list of 
parallel associates, the nonassociated pair became 
cup-artist, and using bed-sleep, the nonassociated 
pair became ulter-sleep. Thus, the highly associated 
pairs (to be called E Pairs) and the nonassociated 
pairs (to be called C Pairs) always had one word in 
common, 

Study lists and test forms. Each S was presented 
50 pairs on the study list (plus two filler pairs at the 
beginning and two at the end). These 50 pairs were 
made up of five E. Pairs and five C Pairs from each 
of the five classes, "There were 100 pairs on the test 
forms consisting of the five E and C Pairs from each 
class presented for study plus five E and C Pairs 
from each class not presented for study. Of course, 
no word appeared in both an E Pair and a C Pair 
on the study list or test. For any given S, only half 
of the E Pairs and half of the C Pairs were used, 
Across four study lists and four test forms, each E 
Pair served as an old pair and as a new pair, and this 
was also true for each C Pair. This was accom- 
plished by dividing the pairs in each class into four 
subgroups of C and E Pairs of five pairs each and 
rotating functions (old or new) across forms, 

For the study list the 10 item types (five E Pairs 
and five C Pairs from each class) were block random- 
ized across the 50 positions, The test forms con- 
sisted of 100 pairs, 50 on each of two Pages, with the 
words yes and no appearing after each pair. The 
order of the pairs on the test forms w: 
but with a different random order bei: 
cach form. 

Procedure and Subjects, Prior to 
list for study, the Ss were fully informed concernin 
the nature of the test. They were told to try i 
associate the words in each pair on the study trial 

The list was presented at a 5-sec, rate by a ift: 
projector. After the list was presented the test 
booklets were distributed and the instructions for 
the test repeated, „The Ss were informed that they 
must make a decision for each pair, guessing if 
necessary. 
The Ss were run in small groups until 21 had 
completed each of the four forms. Since forms did 
hot interact with the variables of experimental 


interest, the data have been summed aer 
‘oss forms 
for the 84 Ss. at 


‘as random, 
ng used for 


presenting the 
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Results 

Data were available on 20 d ent types 
of pairs, each type represented | five pairs. 
These types were: old E Pairs i: ach of the 
five classes; new E Pairs fro: ich class; 
old C Pairs from each class d new C 
Pairs from each of the five cla For the 


lisses and 


initial analysis, the sums of t! 
nined for 


false alarms (M + FA) was c: 


the C Pairs and for the E ! for each 
clas. (It may be noted that orrelated 
.98 with M + FA when sumni :cross the 
five classes for the old and no E Pairs.) 
The means of the M + FA sc re shown 
in Table 3. It will be noted for only 
two of the five classes (para! issociates 
and synonyms) are there few rrors for 
the E Pairs than for the C P: For the 
other three, the direction reversed, 
although in none of the five : es is the 
difference between E and Pairs of 
appreciable magnitude. Nev eless, an 


analysis of variance showed t! lass was 


a significant source of vari: «, F (4 
332) = 7.87, p < .01, as was a). the inter- 
action between class and C Pairs, 
F (4, 332) = 418, p < .01, the dif- 
ference between E and C Pa» was not, 
E (1,83) = 2.25, p > .05. The findings 
indicate that, overall, associate! pairs are 
not better recognized than n^ iassociated 


pairs. : 

The measure of sensitivity, or discrimin- 
ability, M + FA, has indicated that across 
the five classes the discriminability of pairs 
of highly associated words was not better 
than the discriminability for pairs with the 
minimal associative relationship that might 
have been established on a single study 
trial. The interaction between type of 
pair and class, however, indicated that 
discriminability differed as a function © 
class of pairs. An examination of the 
results for each class in more detail 5 
indicated, 

In presenting the results for the first a 
experiments, it was pointed out that 
criterion differences could be detected by ? 
es ain type of interaction when two Co!" 
ditions, both represented by old and new 
Items, were plotted on the same grapi 

hese requirements were met in the present 
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Conceptual 
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(3 


o 12:16, 1 


FrGURE 6. Recognition errors on experimental (E) and control (C) pairs of 
words as a function of input frequency and class or type of pair. 


| for each class since there were old and 
n « E Pairs and old and new C Pairs. 
‘l ne data are plotted in Figure 6 for each 
cass, using input frequency (0 and 1) 
rather than old and new along the baseline. 

The critical fact exhibited in Figure 6 is 
that S may maintain different criteria for 
different classes of pairs even when those 
pairs are intermingled on the same test 
form. The results for the homonyms show 
that S has a very lenient criterion for 
accepting pairs which were not in fact on 
the study list. The marked interaction 
»etween input frequency and E-C Pairs is 
vident. The results for each of the five 
lasses of pairs have been analyzed separ- 
itely, Each F is based on 1 and 82 df, 
and an F of 3.96 is required for the .05 
level, 6.96 for the .01 level. Input fre- 
quency was significant for all classes, and 
except for the homonyms (which shows the 
smallest F, 9.51), more errors were made on 
old than on new pairs. The E-C difference 
was significant for only the antonyms 
(F= 9.5) and homonyms (F = 4.02). 
The interaction between the two variables 
was reliable for parallel associates 
(F = 6.65), synonyms (11.15), and, of 
course, for the homonyms (F = 59.67). 
'The interaction for the parallel associates 
and for the synonyms cannot be interpreted 
as representing a criterion difference for 
the E and C Pairs, since the differences in 
errors are small in magnitude for new (zero 
frequency) items. 

An examination of performance on the 
two test pages separately showed no 
systematic changes for any class of item. 


The differences in the criterion imposed 
for the different classes, therefore, was not 
a consequence of a gradual change or 
shift as the S$ made successive judgments. 

It will be remembered that one word in 
each C Pair was tied to a word from a pair 
in one of the five class There is no reason 
to expect that these pairs should differ in 
number of errors across classes unless the 
words in the different classes differed in 
recognition difficulty in some way. Table 
3 did suggest, however, that small differ- 
ences were present among the C Pairs 
representing the different cl s. Statis- 
tically, the M + FA values for the C Pairs 
for the five classes were reliable, F (4, 
332) = 4.00, p < .01. The greatest num- 
ber of errors occurred for the C Pairs 
derived from words making up parallel 
associates and synonyms, and the fewest 
from the conceptual associates. 


Discussion 

Four classes of natural associates have given 
no consistent evidence that the associative 
relationship facilitated recognition perform- 
ance. Overall, control pairs, where the asso- 
ciative relationship would be minimal after a 
single study trial, produced no more errors 
than did the associated pairs. Even if it is 
presumed that the Ss rehearsed the C Pairs 
more than the E Pairs on the study trial, it 
does not seem possible that the associative 
strength of the C Pairs would even approach 
that of the E Pairs. It therefore becomes 
apparent that no simple conclusion about the 
role of associations in recognition is possible. 
This is not to say that an interpretation of the 
findings of Experiment VI as seen in Figure 6 
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is not possible. A tentative interpretation will 
be given later. Before doing this, however, 
it will be useful to look at the results of some 
additional experiments. 


EXPERIMENT VII 


In this experiment Ss were again pre- 
sented pairs of items from the five classes 
or types of pairs used in Experiment VI. 
On the test, only one word from each pair 
was tested, along with single words (control 
or new words) which had not been pre- 
sented. The purpose of this procedure was 
not to test the effects of removal of associa- 
tive context. Rather, the interest was 
centered in the relative recognition per- 
formance under these “impoverished” con- 
ditions for the five different classes. The 
interest was in particular directed toward 
the homonyms. It had seemed clear from 
Figure 6 that a lenient criterion was in- 
volved in the judgments on the homonym 
pairs. This alone could result in few misses 
on the E Pairs. But still another factor 
would also keep the number of misses to 
a minimum. As discussed earlier, a homo- 
nym pair should double the frequency of 
the acoustic-articulatory response as com- 
pared with that for any single word in the 
pairs of the other classes. Insofar as fre- 
quency information enters into the recog- 
nition decisions for pairs, the misses should 
be less for the homonym pairs than for the 
other types. Now, whena single word from 
a homonym pair (as presented for study) is 
presented for a yes—no test, the acoustic- 
articulatory frequency should be the major 
basis for the decision. In fact, it should be 
of little consequence whether the pair or 
the single word is presented if frequency is 
the dominant attribute. Number of misses 
should be small in both cases. Consider 
next the case of a single new word being 
tested, a new word which is a nominal 
member of a homonym pair but a pair 
which was not presented for study. There 
seems to be no reason to expect that the § 
would perceive this word as having a homo- 
nym. Therefore, there should be no 
criterion problem that is specific to homo- 
nyms. Or, in terms of formal predictions 

for the experiment, new items (from 
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nominal homonym pairs) sh: not pro- 
duce more false alarms tha: zle new 
words representing the other c. s. And, 
as discussed above, misses sho: je fewer 
on the old words from hon: is pairs 
than on the old words from th« er pairs. 


Method 
Study lists and test booklets. The ^ esented 
50 pairs for study, 10 p: from ea the five 


classes. "There were also two filler at the 
beginning and end, On the test, one v rom each 
of the 50 pairs was printed in a bool ng with 
50 new words, representing one word each of 
the 50 pairs remaining in the five cla: For the 
50 pairs presented for study, half the (within 
each class) the first word in the pair w ted and 
half the time the second word in the pai tested. 
By using two study lists, and a single r« m, each 
word served once n old word and « sa new 
word. The study list was block rando: so that 
each type of pair occurred once in each k of five 


10 item 
s) were 
pect to 


trials. On the test form (two pages) 
types (five old and five new from eac! 
block randomized and were random wit! 
input order. 

Procedure and Subjects, Prior to pr: 
list of pairs for study, the Ss were inform 
study list consisted of a series of pairs, a! 
that on the test there would be single v. 


ing the 
hat the 
further, 


some 


from the pairs presented and some from wairs not 
presented. They would be required to make à 
yes-no decision on each word. The list was pre- 
sented by a slide projector at a 5-sec, rate. Imme- 


diately following the presentation of the last slide, 
the booklets were distributed, and the instructions 
for the test repeated. As was true in all of the pre 
vious studies, the test was unpaced and the S was 
required to complete all items on the first page 
before going to the second. Thirty-four Ss com- 
pleted each form for a total of 68. 


Results 


Figure 7 shows the basic results, plotted 
in the same manner as Figure 6. Two facts 
are to be noted. First, errors on the 
antonyms (both misses and false alarms) 
were considerably higher than for the other 
types. Second, the fewest misses were 
made on the homonym pairs. The statis- 
tical analysis shows that class or type of 
item, based on M + FA is reliable, F (4 
268) = 18.72, p < .01. More errors were 
made on old words (f = 1) than on new 
words (f = 0), F (1, 67) = 6.18, p < .05; 
and the interaction between frequency an 
type was also reliable, F (4, 268) = 6.52, 


ROLE OF ASSOCIATION IN RECOGNITION MEMORY 


p «.01l es of these effects are quite 
apparent gure 7. 

The r r of misses on the homonyms 
was sl! under 14%. The class of 
items the next fewest misses was 
conce; associates. 19%. A direct- 


showed this difference to be 
2.84, p < .01. This may 
be | eted to mean that input fre- 


67) 2 


qu s greater on the homonym pairs 
thi | the other types. The fact that 
thi re only slightly more false alarms 
(y » new items) for the homonyms than 
for ie corresponding items in three of the 
ot classes is interpreted to mean that a 


cr. rion difference for the classes was of 
li^ ie consequence in this study. 
Recognition of antonyms was clearly 
inferior to the recognition of the words in 
the other classes. There was a suggestion 
of this possibility in the previous experi- 
ment for the E Pairs (see Figure 6). A 
part of the difference seen in Figure 7 can 
be attributed to heavy testing effects for the 
antonyms. This is to say, performance 
leereased in accuracy from page 1 of the 
st booklet through page 2. This was 
nantified for each class of items by using 
-FFA, with 10 items on each page. 
‘| e errors increased for the parallel associ- 
ates (15.3% to 19.3%), for the synonyms 
(16.9% to 22.2%), and for the antonyms 
(20.796 to 31.5%). All three of these 
changes were reliable beyond the .05 con- 
fidence level. The homonyms showed an 
unreliable increase (14.9% to 16.5%) and 
the conceptual associates showed a slight 
decrease (17.6% to 16.2%). The reasons 
for these differential testing effects in this 
experiment are not clear. It should be 
noted, however, that as pointed out earlier, 
there were no systematic changes in test 
performance over pages for Experiment VI 
where pairs were tested. 


EXPERIMENT VIII 


This experiment will be briefly reported. 
lt had as its major purpose the examination 
of the possibility that some peculiarity of 
the words used in the homonym pairs led 
to the small number of misses in Experi- 
ments VI and VII. Itis, perhaps, becom- 
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FIGURE 7. Recognition errors on single words 


following pair presentation for antonyms (A), 
synonyms (S), parallel associates (PA), conceptual 
associates (CA), and homonyms (H). 


ing apparent that some theoretical im- 
portance is to be attached to the findings 
reported thus far for the homonyms. The 
test made in the present experiment was 
simply to present single words from the 
pairs in each class for study and then test 
these single words for recognition along 
with new words from the classes. The 
critical question is whether or not the 
outcome will be the same for single words 
from the homonym pairs as for the single 
words from the other classes, particularly 
the classes other than antonyms. The 
antonyms appear to represent a special 
problem. 


Method 


The same test forms were used as in Experi- 
ment VIL The study list consisted of 50 single 
words (plus two filler words at the beginning and 
end of the series), 10 in each class, representing 10 
of the pairs. These were in fact the same words as 
tested in Experiment VII. By using two different 
presentation lists, each of 20 words in each class 
served both as new words and as old words. 

The Ss were fully instructed concerning the study 
list and the method of testing. There were 24 com- 
pleted records for each of the two forms for a total 
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Ficure 8. Recognition errors on single words 
following presentation of single words from five 
classes of pairs. 


of 48, All procedures were exactly the same as in 
the previous studies, 


Results 


The essential results are shown in Figure 

8, which is plotted in the same manner as 
the other recent graphs. It is to be noted 
that the results for the single words from 
the homonym pairs fall well within the 
limits set by the parallel associates, con- 
ceptual associates, and synonyms. This is 
to conclude, therefore, that there seems to 
be nothing peculiar about the words in the 
homonym pairs when recognition perform- 
ance is observed for them as single words, 
Figure 8 shows that the single words from 
antonym pairs produced many false alarms, 
as well as more misses than did the words 
from the other classes. Performance on the 
antonyms was largely responsible for the 
reliable class effect, F (4, 188) — 8.46, 
P «.01. There is a Suggestion that a 
criterion difference might exist for the 
antonyms versus the other classes, but the 
interaction between old-new and type was 
less than one. It should be clear that the 
new items in the test were not, for any of 
the classes, the associated word as shown 
in Table2. The false alarms cannot, there- 
fore, be attributed to implicit associative 
responses elicited during the study trial 
which were represented in the new words on 
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the test. If the false alarm: are toj 
attributed to such implicit res: onses they 
arise from sources which cann: be easih 


identified. That some of the ! ‘se alarms 
could be attributed to this m: hanism is 


not to be denied, but it is no: apparent 
why these should fall more he: ‘ly on the 
antonyms than on the other c!. ses, 


antonym problem will 
further in this paper. 


not 


GENERAL Discussio 


This research was initiated to de‘: mine the 
role which an association betwee: a pair of. 
words played in recognition. iven the 


knowledge of this role (if such sisted) it 
would be possible to s the deg: to which 
frequency theory fails to accomods« it. 
major purpose of this d ssion is tc make this 
assessment. The basic evidence from the 
experiments will be summarized in! ially. 


1. Experiments I and II showed that parallel ) 
procedures in which recognition decisions were 
required in one case and frequency judgments | 
in the other produced the same outcomes. 
quantitatively. This was true for both abso- 
lute and relative judgments. Errors tended to 
fall on the same pairs for both types of judg- 
ments, and recognition errors were pr lictable 
from frequency judgments by combining the 
means and variance of the frequency 
judgments, 1 

2. The paired-associate study showed that 
pairs on which few recognition errors were 
made were more easily associated than were 
pairs on which many errors were made. This 
would seem to implicate associability as & 
factor in recognition performance over and 
above frequency. Experiments III and IV 
Supported this conclusion in showing that a 
pair of words, both shown on the study list 
but in different pairs, were less likely to be 
recognized than were intact pairs. " 

3. Experiment VI, however, in which pairs 
of culturally associated words were used, pro- 
vided no consistent evidence that recognition 
performance was better for associated pairs 
than for nonassociated pairs. If one assumes 
that the association in culturally associated 
words is not qualitatively different from the 1 
associations established in the laboratory - 
between initially neutral words, the findings - 
With the culturally associated words are In 
direct conflict with those summarized in point 
2 above. : 


ROLE OF 


4. Quite ectedly, criterion differences 
were found een frequency judgments and 
recognitio: sions, the criterion being more 


former. Recognition perform- 
nym pairs also reflected a much 
than was found for 


lenient f 
ance on | 


more erion 
contro! and culturally associated pairs. 
The ts 6f Experiments I and II make it 
diffic: rn abruptly away from frequency 
as t! discriminative attribute for pair 
recc ^ Therefore, the attempt has been 
mari: develop a coherent account of the 
bas: lings, using an expanded theory of 
fre cy discrimination. This expansion is 
pos c, and consequently must be viewed 
wit pticism, although the formulation as a. 
wh does rest on frequency theory as de- 
ve ped in recent years. Before describing 


th s account, it seems necessary to show how 
a^sociative information or associative strength 
letween words in a pair cannot be taken 
seriously as a direct explanatory mechanism 
for pair recognition, 

Experiment VI showed that associated pairs 
and nonassociated pairs produced equivalent 
recognition performance. It would be quite 

"sible to obtain such a result even if the 

cognition performance in the two cases was 

diated by quite different types of informa- 

a in memory. To be more specific, for the 
a sociated pairs the decisions may have been 
based exclusively on the associative relation- 
while the decisions for the nonassociated 
pairs may have been based on frequency 
information, Two considerations argue against 
the idea that the associative information is 
critical in the recognition of the culturally 
associated pairs of words. First, the new pairs 
for e associated pairs were also highly 
associated pairs. "Therefore, some information 
other than associative information would be 
required to distinguish between the old and new 
pairs. Otherwise, the Ss should have accepted 
àny associated pair as having been included in 
the study list. Still, it might be argued (and 
this leads to the second point) that the single 
presentation of the associated pairs so increased 
the momentary situational strength of the 
association for the associated pairs that they 
were distinguishable on this basis from the 
control pairs even though the latter were 
strongly associated by cultural usage. This 
does not seem to be a likely possibility. In 
periment II, 12 pairs made up of unrelated 
Words were presented twice during the study 
trial, and 12 pairs were presented three times. 
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On the yes-no recognition test there were 5.3% 
misses for the former pairs, 3.5% for the latter. 
Although comparing error rates from experi- 
ment to experiment is risky because of many 
possible differences in subject samples, number 
of words presented and tested, and so on, these 
values must be considered appreciably less 
than the misses for the culturally associated 
pairs presented once. It does not seem 
appropriate to conclude that a neutral pair 
of words, presented twice, has a stronger 
associative link between them than does a 
pair of culturally associated words, such as 
table-chair, presented once. Based on associa- 
tive strength alone, there is no reason to 
believe that the Ss should have made any 
errors on the culturally associated pairs of 
words. Associative strength, at least as the 
term is understood as a consequence of multiple 
trials in the laboratory, does not seem to be 
directly involved at all in the recognition 

decisions. "This conclusion was important in 

directing the explanatory efforts back toward 

frequency as the fundamental discriminative 

attribute for pair recognition. 

In a recent study (Underwood & Zimmer- 
man, in press), the evidence pointed toward 
the fact that each syllable of a two-syllable 
word gained a small amount of subjective 
frequency which was independent of the fre- 
quency information for the word per se. This 
is to say that the subunits of larger units may 
have frequency representation in memory. 
This abstractive nature of the assimilation of 
event frequency has long been known to be a 
characteristic of memory in the developmental 
history of the individual (Underwood, 1971), 
It now seems evident that this same feature 
obtains to some degree in the laboratory. The 
implication of this fact for frequency theory 
as applied to recognition of pairs is that the 
frequency information in memory will consist 
of information about each word, and also 
about the two words as a unit. 

Given that frequency assimilation for a pair 
may involve three event frequencies, it is 
reasonable to ask what the optimal frequency 
should be for each event if recognition errors 
are to be minimized following a single presenta- 
tion of a pair. Obviously, this arrangement 
would be one in which each unit has a subjec- 
tive frequency of one, and the pair as a unit 
has a subjective frequency of one, All three 
sources of information would lead the S to 
respond with yes. A reduction in the subjec- 
tive frequency for any one of the three events 
should reduce the likelihood of a correct 


938 


decision, Assumptions will now be made to 
reflect changes in the subjective frequency of 
each of the three events, hence changes in the 
likelihood of a correct decision. 

It will be assumed that the stronger the 
association between two words prior to the 
study trial, the greater the likelihood that the 
subjective frequency of the pair as a unit will 
approach one, and the less the likelihood that 
the subject frequency of each word will ap- 
proach one. The converse may be stated. 
If there is no associative relationship existing 
between two words prior to the study trial, 
the greater is the likelihood that the subjective 
frequency for the individual words will 
approach one, and the less the likelihood that 
the subjective frequency of the pair as a unit 
will approach one. In the extreme case, if no 
associative link is developed between the words 
in a pair during the study trial, the subjective 
frequency of the pair as a unit would be zero. 
The application of these notions to the data 
may now be examined. 


1, In Experiment III, re-pairing old words 
on the test resulted in an increase in the errors. 
This would be accounted for on the grounds 
that a re-paired item has no subjective fre- 
quency for the pair as a pair. The decision 
must be made on the basis of the subjective 
frequency for the individual words. In Experi- 
ment IV, errors on the recognition of a single 
word from a pair presented for study increased 
(as compared with the presence of the intact 
pair), and the amount of increase was inde- 
pendent of the newness or oldness of the 
context word. The lack of pair-frequency 
information is believed responsible. In a 
further experiment, not reported in this series, 
individual words were presented for study. 
On the test, pairs of individual words were 
presented just as in Experiment IV. and with S 
always making a recognition decision on the 
second word. Just as found by: Thomson 
(1972), the characteristics (old or new) of the 
first word in the pair had no effect on the 
recognition of the second. Since there was no 
pair-frequency information induced during 
study, no loss of accuracy should have been 
found by prefixing either old or new words to 
the test word, 

2. Pairs on 


which few recognition errors 
were made were more easily associated (in a 
paired-associate test) than were pairs on which 
many errors were made. The easy pairs were 
also judged to have higher frequency than were 
the difficult pairs. The easy pairs represent 
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pairs which approximate as ci 
probably possible the maxin 
case. The pair was not assoc 
the study trial. But, for whate 
association could be establishe 
this process, each word will r: 
maximum frequency (one), an 
the pair will approach this max 
of the rapid formation of the as 

3. Why were not cultura! 
words better recognized than 
words? It is now presumed th 
is a coincidence of the distributi 
tive frequency information. For 
pairs, the decisions are primarily 
frequency, for the nonassociat 
element or single-word frequency 
be suspected that the small d: 
errors among the four classes 
resulted from differences in the m: 
the subjective frequency for the + 


It should be repeated that the f: 
post hoc. However, it does have 
testable implications. Although 
not be examined here, it can be se 
basic idea would be to evolve 
which would change the subjective 
of one or more of the sources assum 
theory to be involved in the 
decisions, 

The theory does not seem capabie of ham 
dling cleanly the false-alarm data for N-O and 
O-N pairs in Experiments III and IV. Al 
that can be said is that since subjective freq 
quency should be present for one of the thr 
events held important by the theory, the 
is led toa Yes response. This is not considered 
a satisfactory account for such false alarms 

Finally, the result for the homonym pali 
provided a special case for frequency theory; 
a case that touches on several issues. Fre 
quency theory makes no assumption concerns 
ing the awareness of the S that he is usin 
frequency information in recognition decisions 
In response to questioning, Ss will sometimes 
respond that the correct item looked familiar 
but more frequently given is the responses 
“Tt looked right." It will be remembered that 
the only distinctive difference found between 
i ns and frequency judgments: 
and II was that the Ss set 


ognition 


pairs he may, with some level of intentio, 
make his decisions on the basis of the frequency 
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of the acousti ‘iculatory responses he pro- 
duced to a ; iuring the study trial. If this 
is the casc whatever leads the S to set a 
lenient c: for frequency judgments 
may likes ead him to do the same in 
recognitio pairs of homonyms. That fre- 
quency icoustic-articulatory response 
is impli eems clear by the results of 
Experi: II. This study showed that 
relatis misses were made on a single 
word 1 homonym pair, a result that was 
not 2 lenient criterion. The S, in 
effect a subjective frequency of two for 
the acoustic-articulatory response; for 
the classes of items, the frequency 
wou > appreciably less. 


series of experiments was initiated 
ihe expectation that associative informa- 


wit 
tion in memory would cause a serious break- 
dewn in the usefulness of frequency informa- 
ton as an explanatory mechanism. The 


cvidence as viewed has led to the conclusion 
that it would be premature to abandon fre- 
quency theory in attempts to account for 
recognition performance for associatively re- 
lated verbal units. 
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SIMULTANEOUS DISCRIMINATIONS: 
TWO SEPARATE TYPES OF CONTROL ' 


MICHAEL D. ZEILER ? 


Emory University 


Children and retarded adults were given four different learning conditions and 
a series of transfer tests to determine which stimulus aspect controlled perform- 
ance in a simultaneous discrimination. An adequate description of the dis- 
imination performance entailed the recognition that different individuals 
isplayed equal accuracy but were not controlled by the same stimulus aspect. 
yht of the Ss responded on the basis of stimulus relationships, but the other 
Js were not sensitive to those aspects of the stimuli. One S was controlled 


14 
by irrelevant stimulus properties; the others demonstrated that choices de- 
pended on the size differences among the stimuli of each three-member set and 
the differences between the learning and the transfer sets. A model proposed 
to explain these performances could describe the probability of responding in 


each individual. Thus, individuals showed two types of control, one charac- 
terized by sensitivity to stimulus relations and the other described by the 
model, 


‘Two problems confronted by theoretical 
counts of discrimination learning are 
pecifying the controlling aspect of stimuli 
aud describing the way the discriminations 
learned. Stimuli can differ along a 
number of dimensions, e.g., two visual stim- 
uli can have different colors, forms, sizes, 
spatial locations, etc. An individual whose 
responding varied depending on which of 
the two stimuli was presented might be 
under the control of any or all of these 
aspects, but which one was relevant could 
not be determined from the sheer occur- 
rence of differential responding. The 
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achievement of differential responding also 
does not indicate the way the problem was 
learned (e.g, incrementally, all-or-none 
learning, etc.). In fact, it has been demon- 
strated that similar criterion performances 
can be attained in a variety of ways in the 
same task (Gholson, Levine, & Phillips, 
1972), thereby indicating that no single- 
process model can provide a comprehensive 
account. The same conclusion can be 
reached about models attempting to explain 
which aspect of the stimuli controls per- 
formance in even a simple situation 
volving stimuli differing on only 
dimension. ; 

One reason that research on controlling 
stimulus aspects has been insensitive to the 
possibility of multiple sources of control 
has been that data were averaged over 
groups of Ss (cf. Reese, 1968). Theories 
were devised to explain these averaged 


in- 
a single 


CO RT 3 
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data on the implicit assumption that the 
means estimated the performance of indivi- 
dual Ss (e.g., Zeiler, 1963). However, the 
detailed evaluation of individuals revealed 
that the means were not an adequate ap- 
proximation and consequently the theories 
were not explaining individual performance 
corrected for sources of uncontrolled error 
by averaging across Ss. For example, 
some Ss were always controlled by the 
relational characteristics of two simultane- 
ously presented stimuli, whereas others 
were sensitive to absolute stimulus sizes 
rather than to relationships (Kuenne, 1946; 
Porter, 1969). With situations involving 
three stimuli with the middle-size member 
of an original training condition as the cor- 
rect set member, some Ss were always con- 
trolld by the middle-size relation, but 
others responded to the middle-size stimu- 
lus in transfer tests only if the new stimuli 
were very similar to the originals (Zeiler & 
Friedrichs, 1969; Zeiler & Salten, 1967). 
In fact, the same individuals have even 
revealed both types of control simultane- 
ously (Zeiler, 1964). 

The existence of two controlling stimulus 
aspects means that the averaging of the 
performances of different individuals must 
yield misleading data. A more profitable 
approach might be to recognize the hetero- 
geneity and attempt to formulate a theo- 
retical description of each type. Subsequent 
Steps would be to try to understand what 
variables are responsible for determining 
which aspect will predominate, as well as 
lo analyze how both are combined in the 
behavior of adult humans. The present 
experiment was intended to provide the 
first step, i.e., the description of the two 
types of control emerging with three stimuli 
varying in size. 

One type of control seems straightfor- 
ward. Some Ss are controlled by the rel 
tionships of one stimulus to h 
ever, the second form of control cannot be 
analyzed so simply. In this case, individ- 
uals sometimes choose the stimulus having 
a certain relationship, but at other times 
do not. This sort of profile is predicted 
by the ratio theory (Zeiler, 1963). Accord- 

ing to the ratio theory model, each stimulus 


a- 


another. How- 


A 
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is defined by the ratio oi 
vailing stimulus contex 
level). The training a in level is 
determined by the train nuli alone 
(the geometric mean of reas), and 


) the pre- 
daptation 


the test adaptation lev: e weighted 
sum of the geometric : £ the test 
stimuli and the geomet: an of the 


training stimuli. In the : 
the set member having th 


will prefer 
closest to. 


that of the positive sti (S+) in 
training. 

With three stimuli difi size by 
equal units and the middi stimulus 
as S+ during the trainin e, which 
test stimulus will be prefe: pends on 
the similarity of the stin hin each 
set and the similarity of th« stimuli 
to the test stimuli. As th nce be- 
tween the training and the t« nditions 
increases from zero, the t ptation 


level changes in such a way t! model 
predicts that the preferred sti; : should 
shift from the middle-sized tes ember 
to a different stimulus. The st us dif- 
ferences within the test sets arc ritical. 


If a particular stimulus is tl le sized 
member of a test set and is pi ed, in- 
creases in the differences between it and the 
accompanying larger and smaller set mem- 
bers will not affect the preferences. How- 


ever, if either the large or the small st imulus 
is preferred, increasing within-set differ- 
ences will eventuate in a shift to preference 
of the middle-sized member. Thus, the 
model predicts that between- and within- 
Set differences operate in conjunction to 
determine choices in the tests. 

None of the research up to this point. that 
has studied the performances of individual 
Ss can evaluate the accuracy of this predic- 
tion, because no experiment has manipu- 
lated within- and between-set stimulus 
differences within the same individuals. 
The purpose of the present experiment was 
to do this. Because normal adults have 
been shown to switch the basis of the 
choices as a function of between-set simi- 
larity (Zeiler, 1964), young normal children 
and retarded adults were used as Ss an 
order to assure the observation of inde- 
pendent patterns of transfer performance. 


expected to reveal one or the 
two components that interact 


determi +e adult behavior. 


METHOD 


Subje: 
(8 boys «i 
- for nor: 
adult v 


(he Ss were 16 5- and 6-yr. old children 
8 girls) attending a summer day camp 
ildren and 6 institutionalized retarded 
en ranging in age from 16-30 yr. There 
were no current intelligence test scores available for 
any of Ss, but a variety of tests conducted on 
admission reported that the IQs for the retarded 
women ranged 40-68. 

Apparatus, The stimuli were two identical sets 
of 3-mm.-wide black lines, one set used for training 
and the other for the transfer tests. The lines are 
designated in order of length; the smallest (1) was 
18-mm. long, and each successive line increased in 
length by a factor of 1.2:1. Each line was cut from 
glossy drafting tape and was mounted on a 5 X 25 
cm. block of 6-mm.-thick white Masonite. A piece 
of 6-mm.-thick clear plastic (Plexiglas) was cut to 
the same size as the Masonite block and glued over 
it. The lower edge of each line was 1 cm. from the 
bottom of the block. 

Stimuli were displayed on a turntable that could 
be rotated to interpose a screen between S and the 
blocks. The three blocks used on each trial were 
arranged so that the lines were vertical with respect 
to S and bottoms of the lines were aligned. The 
long edges of the blocks touched, and the bottoms 
faced S. 

The stimuli comprising the training and test sets 
are shown in Table 1. ‘The combinations of sets 
differed from each other in the similarity of set 
members to each other. (within-set difference), and 
the four test sets varied in their similarity to the 
training set (between-set difference). The same 
stimulus (Line 5, length = 36 mm.) was the middle- 
sized and positive member of the four different 
training sets. 

Procedure. A noncorrection procedure was used 

throughout. On each trial three blocks were exposed 
until S made a choice. Then the screen was inter- 
posed between 5 and the blocks, and E made any 
/ changes in position and/or blocks, Stimu- 
ons varied according to a predetermined 
irregular sequence. 
In Phase 1 the child found a plastic chip under the 
middle-sized stimulus of the training set. The three 
blocks were in view when S came into the experi- 
mental room. Instructions were as follows. 


See this chip? See these blocks? I am goir 
to hide this chip under a block. You find it. 
Try to find as many as you can. Choose only 
one block on each turn. 


Phase 1 ended when S made five successive choices 
of the middle-sized block. 

In Phase 2, the chips were discontinued for the 
normal children (they were continued for the re- 
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TABLE 1 
STIMULUS SETS 


] Training-test difference 


Within-set || Training 

difference || 
ji 12:3 | 14st} ata | 24 
| 4-5-6 | 5-6-7 | 6-7-8 | 7-8-9 | 8-9-10 
[3-5-7 | 4-6-8 | 5-7-9 | 68-10 | 7-9-11 
2-5-8 | 3-6-9 | 4-7-10| 5-8-11 | 6-9-12 
|159 | 2-6-10 | 2-7-11 | 4-8-12 | 5-9-13 


\ 


tarded Ss), but a penny was given following every 
fifth correct response. The .S was told that a penny 
would be hidden instead of the chip and could be 
earned by choosing the correct block a few times 
in a row. The S was also told: 


Only one block is right. The same block is 
always right. I always hide the penny under 
the same block. 


Phase 2 concluded with 20 successive correct choices 
of the middle-sized stimulus, 

Phase 3 was the testing phase. Each of the four 
test sets associated with each training set was 
presented for four trials. The order of test set 
presentation was random with the limitation that 
each set appear in each block of four test trials. A 
test trial appeared after six or seven successive 
correct responses with the training set, Pennies 
continued to be given for every fifth correct response 
with the training set, but choices had no conse- 
quences on test trials. 

After the three phases were completed with the 
first training set and its associated test sets, a dif- 
ferent training set was introduced and the procedure 
was repeated. This sequence of conditions was 
continued until each S had been given the four 
different training sets and the four appropriate 
test sets. Thus, after the first four conditions, S 
had been trained with the four training sets and 
had been given four trials with each of the test sets. 
There were then four more conditions which re- 
peated the first four, thereby yielding a total of 
eight trials with each test set, The order of condi- 
tions was randomized with the limitation that each of 
the four training sets be used once before any was 
used twice—each S had a different order. Within 
cach condition, the order of the test sets was also 
randomized with the same limitation. 

In the first condition Phases Land 2 were conducted 
in a single session, and all of Phase 3 was conducted 
in a single session. For the retarded Ss Phases 1 
and 2 sometimes required as many as 3 sessions to 
attain criterion performance in each. In the seven 
subsequent conditions, Phase 1 was omitted, and 
Phases 2 and 3 were conducted in a single session. 
The experiment, therefore, involved 9 sessions for 
the normal children and between 9 and 11 sessions 
for the retarded adults. 

A display of toys and candy was located on shelves 
in the experimental room. Also, there was a box 
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containing trinkets located on the table holding the 
turntable. The trinkets were valued at 1 cent 
(normal Ss) or 1 chip (retarded Ss) each, whereas 
the toys and candy ranged from 5 to 20 cents or 
chips. On any given day, S could buy toys and/or 
candy if he or she had sufficient money or chips, 
buy trinkets, keep the pennies or chips, and/or save 
as many as desired to purchase more expensive toys. 


RESULTS AND DISCUSSION 


Performance with training sets. The mean 
number of errors made in attaining criterion 
in Phase 1 with the first training condition 
was 15.6 for the normal children and 38.3 
for the retarded adults. This difference 
was significant (p < .01, Mann-Whitney 
U test). No errors were made in Phase 2 
or in the training trials of Phase 3. Errors 
in Phase 1 did not differ significantly 
(b > .10, Kruskal-Wallis test) depending 
on the particular training set either for all 
of the Ss combined or for the normal 
children aad retarded adults separately, 
The means and ranges of errors were as 
follows: Set 4-5-6, M = 25.3, range = 3-65; 
Set 3-5-7, M = 17.0, range = 2-36; Set 2-5- 
8, M = 20.0, range = 2-45; Set 1-5-9, M 
= 26.3, range = 6-50, 

In training subsequent to the first condi- 
tion, errors occurred only with Set 4-5-6, 
with a mean of 2.3 (range 0-12) for the 
normal children and a mean of 13,3 (range 
3-27) for the retarded adults. This differ- 
ence did not attain statistical significance 
(b > .05, Mann-Whitney U test). No 
errors occurred with any other training set. 
As in the first condition, there were never 
any errors with the training trials of Phase 
3. Interpolated test trials in which re- 
Sponses were not reinforced had no ap- 
parent effect on the accuracy of the training 
performance, 

Performance with test sets. The Ss had 8 
trials with each test Set under each of the 
four training conditions, Eight normal 
children (five girls and three boys) chose 
the middle-sized test set member in all 32 
test trials. Consequently, their behavior 
fits the hypothesis that they were con- 
trolled by the middle-size relation. None 

of the remaining Ss demonstrated equiva- 
lent performance. 
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One normal child n 
choice on each exposure t ) test se 
and was able to choose S when the 
lines were covered. Verba and selec- 


tive covering of various pa 
blocks revealed that she w 
a discoloration on the bloc! 
the line. Since this disco! 
present on any test 
responses during the tests 
at random. 

The remaining seven norn 
all retarded adults were sens 


he training 
ponding to 
er than to 
n was not 
lock, the 
listributed 


stin 


Idren and 
^ changes 


in the sizes of the lines was no 
relation between the perfi in ori- 
ginal learning and the type patterns 


Those 
11 choice 
id not be 
on the 
original 
the 


revealed by the normal c! 
children who did not revea! 
of the middle-sized test stimu 
differentiated from those w 
basis of the number of errors 
learning phases; they ran; m 
fastest to the slowest learne: 

Figure 1 indicates the íi 
choices of the middle-sized t 
for each of these 13 Ss. All of | 
not occurring to the middle-six: 
were to the smallest stimulus 
Set, i.e., there were no choices ol 
test set member. In every case, the fred 
quency of choices of the middle-sized test 
Set member declined and the frequency of 
choices of the smallest stimulus increased as 
the differences between the training and the 
test sets increased. The gradients of choice 
of the middle-sized stimuli typically were 
higher with each progressive increase in 
within-set stimulus difference, although 
there were occasional reversals. 

Although there was only one instance of 
2 Ss revealing the same choice gradients in 
all of the conditions (Cy, and Czo), there 
Were several in which 2 or more Ss revealed 
the same gradient in a given condition of 
within-set difference. In fact, 8 Ss revealed 
the identical gradient with the within-set 
difference of 1:1. The heterogeneity 
among the 13 Ss was larger with increase 
within-set stimulus differences. 

Quantitative analysis. The dependence 
of choices on the combined effects of with- 
in- and between-set stimulus differences 


y of 
imulus 
sponses 
timulus 
he test 
largest 
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BETWEEN-SET DIFFERENCES 


FIGURE 1, 
sho! 
diffe 
difference = 


was predicted by the ratio theory model 
(Zeiler, 1963). The two general predictions 
were confirmed: (a) at any one level of 
between-set difference, choices of the mid- 
dle-sized stimulus increased and choices of 
the smallest set member decreased with 


Percentage of choices of the middle-sized stimulus in each test set 
; for each S. Open circles; within-set difference = 1.2:1; filled circles: within-set 
; open triangles: within-set difference = 1.7:1; 


(The data are 


filled triangles: within-set 


increases in within-set difference; and (5) at 
any one level of within-set difference, 
choices of the middle-sized set member in- 
creased and choices of the smallest stimulus 
decreased with increases in between-set dif- 
ferences. It also appeared that a modified 


f TABLE 2 
- Best FITTING CONSTANTS 
Within-set difference 
Subject 13:1 nas | aga 24: 

bly] ble) ob] y | oly 

^ 60 | 10.0 | .40 | 8.0 | .30 | 12.0 | 20 

«70 | 10.0 | .50 | 120 | -30 | 6.0 | “40 

60 |12.0| .30 | 120 | .40 | &0| 330 

60 | &0| .50 | 60} 40 | 60| 20 

50 | 8.0) .50 | 10.0 | ‘so | 6.0 | ‘60 

60 |10.0| .40 | 4.0} 230 | 40 | .30 

.60 | 10.0) 40 | 6.0 | <40 | 40| 230 

60 | 10.0) 40 | 6.0 | 40 | 60| .40 

«60 | 10.0] -50 | 6.0 | :50 | 12.0 | ‘60 

.60 | 10.0 | -50 | 6.0 | 40 | &0| .30 

60 | 10.0 | 40 | 6.0| 40 | 4.0 | 130 

60 | 40| 60 | 4.0| 50 | 40 | ‘40 

30 | &0|.50| 80} .40 | 60| 40 

1| 92| 45 | 72| 41 | 66| .36 


form of the model could describe the proba- 
bilities of response in the 13 Ss sensitive to 
within- and between-set differences. The 
revised model consists of seven assumptions. 
Let Z, = area of training stimulus i 
@=1,.. . a m); Zee = area of positive 
stimulus in training; Z,; = area of test 
stimulus? (i= 1,.. n); Pu = probability 
of response to Zu; and y and b = constants. 
The seven assumptions are as follows: 


LV AL ae 


Training adaptation level (A,): 


1 m 
log A, = = (X log Z,;). 


2 1] 
Test adaptation level (A): 
Pd 155 
log A, = y (i (35 lox Zu) 
d-1 
+ (1— y) log A.. [2] 


Positive stimulus ratio (X4): 


Zoe 


Ke= F 


[3] 


Test stimulus ratios (X,,) : 


AS [4] 
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Probability of response : 


Dy = |Xu — X [5] 
Vii = ect, [6] 
Vi 

Hue * [7] 

2 V 

i=l 

Equations 6 and 7 were no ned in 
the original ratio theory : the: it two 
new assumptions and the addi: Luce's 
(1959) choice axiom to the : The 
adoption of this axiom is no: ‘d to 
imply that it is uniquely appli« the 


present data (cf. Friedman, s, & 


Padilla, 1968); it was chosen is a 
convenient technique for demon the 
possibility of describing respons: bili- 


ties within the context of the rat ry 


For each S values of y and b v und 
that could describe the obtained ; bili- 
ties of responding for the fou: sets 
used under each level of within-sci ilus 
difference. Since p = 0.0 for four 
largest stimuli of the test sets, these roba- 
bilities were excluded. The probaoilities 
predicted by each pair of y and / ues 
were compared with those obtained o de- 
termined which pair produced the ilest 


deviation. The search for the best (i 
conducted by incrementing y in step: 10 
and b in steps of 2.0 until the average of the 
squared deviations for the four test sets was 
less than .01. The smallest values of the 
constants that produced deviations of this 
magnitude are shown in Table 2. (Note: 
once the 6 value that produced this small a 
deviation was found, y was set at the value 
that produced the least deviation.) Be- 
cause there were 13 Ss X 16 Test Sets X 2 
Stimuli in each, it is most awkward to try 
to present all of the expected probabilities. 
It is perhaps sufficient to indicate that no 
expected and obtained response frequency 
to a given stimulus deviated by as much as 
one response, i 

The constants varied among and within 
Ss. Since both tended to decrease as 
within-set differences increased, the con- 
stants seemed not to be arbitrary but rather 
to be a function of the particular stimuli 


was 


dibus acad 


constants may also be a func- 
dividual differences; however, 
ricted range implies that such 
es may be less important than the 
variables in determining the 


used. 
tion 
their 
diffe 
stin s 
Values. 
\ conclusion, these data indicate that 
the behavior of individuals in a three- 
imulus simultaneous discrimination can 
be classified into two general types. One 
involves control by stimulus relationships 
(although it may be worth noting that uni- 
rm choice of the middle-sized stimulus is 
predicted if y is 1.00) ; the other is described 
b» the ratio theory model. Furthermore, 
i^ appears possible to provide a quantitative 
count of performance over a range of 
ining and testing conditions. Less clear 
this time is what variables are respon- 
ible for establishing one or the other type 


of control. 
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AN ANALYSIS OF INTERSENSORY TRANSFER OI ME 
RONALD W. SHAFFER ? ann HENRY C. ELLIS 
University of New Mexico 

The relative contributions of four sources of transfer in intersensory tran 

between vision and touch were examined. Following a training task in i 

tifying visually presented polygons or histoforms, six groups learned to m 

a motor switching response to tactually presented polygons in a tran 

task. Transfer performance showed reliable evidence for the transfe 

forward associations, minimal evidence for the transfer of stimulus lear 

and no evidence for two types of nonspecific transfer. 

Learning to make specific identifying re- "recognize" the same stimuli ted via 
Sponses to different stimuli in a training a second sensory system. By stimuli 
task is known to affect performance in a for which Ss could have had 'revious 
subsequent transfer task where the "same" opportunities to learn mediati ponses 
stimuli are presented via different Sensory to their intersensory equivale: van be 
modalities | (e.g., Bjorkman, Garvill, & determined whether this kind coding 
Molander, 1965; Gaydos, 1965; Holmgren, system need be advanced. De 
Armoult, & Manning, 1966; Sinha & Sinha, If, after learning to associate tifying 
1960). Such transfer has been termed inter- responses with stimuli, Ss shox 10 per- 
Sensory transfer and several hypotheses formance decrement when trans ! toa 
have been advanced to account for its new modality, then the function: uival- 
occurrence (e.g, Gibson, 1969, chap. ll; ence of the intersensory stimui id be 
Wright, 1970). While intersensory transfer evident. However, less than per rans- 
has been repeatedly demonstrated, very fer performance requires controls tc partial 
few studies have provided the kinds of out sources of transfer other than ‘hat of 
Operations that allow the Partitioning of forward associations. The six trol 
contributing sources of transfer. The Pre- groups included in the present study permit 
sent experiment provides controls not pre- the separation of forward associations and 
viously contained in intersensory transfer the second source of specific transfer, stimu- 
studies (e.g., Gaydos, 1956; Holmgren lus learning, while additional controls allow 


et al., 1966; Sinha & Sinha, 1960), thereby 
allowing isolation of Sources of transfer 
specifically related to stimulus modality. 
Wo potential sources of Specific transfer 
are of immediate interest in the intersensory 
transfer paradigm: the transfer of forward 
associations and of stimulus learning, ie, 
learning how to discriminate among the 
stimuli. Transfer of forward associations 
with unfamiliar form stimuli can imply the 
operation of a stimulus encoding system 
superordinate to the Separate sensory sys- 
tems, since it would suggest that Ss could 
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the separation of two nonspecific sources of 
transfer, 

The transfer of stimulus learning would 
Suggest that the changes in the way stimuli 
are encoded are not specific to the particular 
Sensory system under which they were 
learned. This finding would follow from the 
assumption that there are amodal sources 
of information (Gibson, 1969, chap. 11), 
and that learning to respond to these 
amodal stimulus attributes should modify 
how the same stimuli are encoded when 
presented to another sensory system. —— 

In addition to converging on two specific 
Sources of transfer, the present study pro- 
vides the conditions for assessing nonspeci- 
fic transfer in an intersensory transfer para- 
digm. It is common practice in studies of 
Verbal transfer to use items for the non- 
Specific control that are independent of 
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specific | content, but are nevertheless 
sampled {om the same class of stimuli as 


"the other training conditions. For example, 


if adjectives are used as first-list items, then 
the baseline nonspecific control would also 
employ adjectives. While the dependency 
of the class of stimulus items on the amount 
of nonspecific transfer with verbal items 
has been demonstrated by Postman and 
Schwartz (1964), this variable has not been 
directly investigated with “nonverbal” 
form stimuli. It is not apparent, therefore, 
whether the degree of nonspecific transfer 
will remain invariant across different classes 
of nonverbal first-list stimuli. Moreover, 
the potential role of the mature of the form 
stimuli to be used for nonspecific controls 
becomes magnified when concern is focused 
on stimulus encoding changes relevant to 
the specific form stimuli used in the train- 
ine—transfer task (i.e., stimulus learning), 
as opposed to general encoding changes 
particular to the class of form stimuli used 
but not to specific members within that 
class. Accordingly, several nonspecific con- 
trols are contrasted to the conditions given 
exactly the same forms in the visual train- 
ing and tactual transfer task: Two groups 
receiving visual training forms from the 
same class (WC), and two groups receiving 
visual training forms from a completely dif- 
ferent class (BC), which are of comparable 
ease with regard to associative learning. 


METHOD 


Stimuli and apparatus. ‘Three sets of six forms 
were cut from 3.2-mm.-thick black rubber tile and 
mounted on white translucent plastic squares. 
Twelve of the stimuli were 12-point constrained 
metric polygons (CM Ps) sampled from the popula- 
tion of shapes described by Thurmond and Alluisi 
(1967). The CMPs were constructed from a radiat- 
ing matrix of 12 “lines” of points with a 30° angle 
between each successive radii of lines and with 12 
points along each line. Sampling by constrained 
rules simply means that a particular point on any 
one line of points can appear once and only once in 
each form, The difference between successive points 
was arbitrarily set at 3.175 mm. 

Another set of six stimuli (used for the BC non- 
specific control) were six-column constrained metric 
histoforms (CMHs), also described by Thurmond 
and Alluisi (1967). The sides of the matrix from 
which the CMHs were generated were set at 7.62 
cm. so that the CMH would approximate the overall 
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size of the CMP. This particular subset of CMHs 
was chosen because pilot data had indicated that 
visual learning rates (training) would be comparable 
to learning rates with CMP stimuli. 

The apparatus was constructed so that any 
particular form could be either viewed or tactually 
manipulated. The Ss were seated in front of a 
55.8 X 55.8 cm. plywood display panel painted flat 
black. Tactual manipulation was achieved by 5 
reaching under a hood located at the bottom of the 
panel and tracing the perimeter of a particular form. 
with his forefinger. The plastic square mounts were 
inserted individually by E into a holder located 
horizontally under the hood. The raised forms were 
used for visual presentation by inserting them behind 
a piece of 10.1 X 12.7 cm. one-way glass located 
directly above the hood. A rectangular tunnel was 
constructed directly behind the one-way glass. The 
pieces of plastic with the mounted forms were 
inserted into a tray slid through an opening in the 
tunnel by Æ. Hunter decade timers were used to 
control the duration of a 100-w. bulb located behind 
the slide opening. When the bulb was activated S 
was able to view the form through the one-way glass. 
Directly above the one-way glass was a window 
with a white paper insert which served as a rear- 
projection screen for the letter responses used in the 
training task. 

The six letters (A H, N, O, and R) used as the 
verbal response terms in the training t for three 
of the groups were projected on the rear of the paper 
via a Kodak Carousel 700 projector. Located di- 
rectly above the verbal response “window” were 
six horizontally arranged spring-loaded toggle 
switches to which S responded in the training task 
for three groups and in the transfer task for all 
groups. Over each switch was a circular green 
confirmation light which, at the end of the anticipa- 
tion interval, informed S as to the correct response 
The E's control panel contained switches to program 
the confirmation lights, a display panel to indicate 
which switch was chosen by S, and a master switch 
to initiate the presentation of each paired associate, 

Experimental design. Ninety-eight undergraduate 
Ss were assigned to each of seven treatment condi- 
tions. The Ss were randomly assigned to conditions 
in blocks of 7, with 1.5 from each condition per block. 

All groups except one with no training received 
two pa associate learning tasks. The stimuli in 
the training tasks were presented visually, while in 
the transfer task either the same or different stimuli 
depending on the treatment condition, were pre: 
sented via the tactual modality. This sequence 
was used exclusively, since there was no interest in 
the well-known asymmetry in visual-tactual vs 
tactual-visual transfer (e.g., Gaydos, 1956) In 
addition, Lobb's (1970) finding of no transfer fror 
touch to vision but reliable transfer from vision jJ 
touch in a discrimination-recognition task eS 
that a visual-to-touch training-transfer SO ente 
would optimize the transfer of stimulus cman 
The treatment conditions were differentiated E 
by the conditions of training. The seven training 


conditions are described below, after which the 
rationale is given for the inclusion of the various 
conditions. 


Group 1; Six CMPs were paired with six motor 
switching responses in the first training task. The 
transfer task was essentially a continuation of train- 
ing with the exception of the change in modality of 
the stimulus presentation. Thus, S first learned to 
associate a particular motor switching response to a 
particular visual CMP, and then learned to associate 
the analogue tactual form to the same motor 
response. 

Group 2; Same as Group 1, except that the re- 
sponse terms in the training task consisted of six 
letters of the alphabet (A, E, H, N,O, and R). The 
six letters ranked within the top nine of the alphabet 
in absolute frequency of occurrence (Underwood & 
Schulz, 1960), 

Group 3: Same as Group 1, except that a different 
set of six CMPs were used in training. 

Group 4: Same as Group 3, except that the 
response terms in training were the same letters used 
for Group 2. This is the WC nonspecific control. 

Group 5: Same as Group 1, except that the stimuli 
used in the training task were six CMHs, thus 
making the stimuli different from those in the 
transfer task. 

Group 6: Same as Group 5, except that the 
response terms in training were the same letters used 
for Group 2. This is the BC nonspecific control, 

Group 7: Same as Group 1, except that the train- 
ing task was not administered. 


The transfer of forward associations is given by 
contrasting the transfer performance of Groups 1 
and 2 under the assumption that superior transfer 
for Group 1 is not due to the transfer of response 
learning or, alternatively, is not spuriously given 
by the depressed transfer performance of Group 2 
due to the transfer of inappropriate response terms. 
Violation of either assumption would give an inap- 
propriate estimate of the transfer of forward associa- 
tion. The first assumption is readily met by 
trivializing response learning in the training task 
by using single-letter Tesponse terms and making 
them available for continual inspection. The 
second assumption is not as easil 


this source of forward transfer car 
using training- and transfer- 
from grossly different categories (Ellis, 1969, p. 408). 

The transfer of stimulus learning is provided by 
a direct comparison between Groups 2 and 4. Both 


ential sources 
since Group 2 
‘asks, the condi- 
stimulus learn- 
S some specific 
with the visual 
transfer to the 
nominal stimuli 
vidence superior 
the nonspecific 


ing. Accordingly, if Group 2 learn 
stimulus encoding skills associated 
training stimuli and if these skills 
tactual transfer task when the same 
are presented, then Group 2 will e 
transfer performance relative to 
control (Group 4). 
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Reliably more transfer for C 
Group 6 would indicate that n« 
an intersensory transfer paradi 
the class of form stimuli that ar« d. Acom- 
parison of Groups 6 and 7 would idence as to: 
whether any nonspecific effects ent. 

The inclusion of Groups 3 anc 
Groups 4 and 6, respectively, w 
dence as to whether any reliable ti 
learning was present, since t! 


ompared to 
transfer in 
pendent on 


plements of 
rovide evi- 
f response 
er groups' 


response terms were used in | ks; hence, 
equivalent transfer performance th “pairs! 
would further substantiate the oi "ssumption 
that sources of transfer due to e learning 
were effectively eliminated. f 
Visual training. A standard |-associate 
anticipation learning situation w ited to all 


The 
imu- 
response 


Ss except the no-training contr: tion, 
6-sec. intrastimulus interval included 
lus exposure and a 3-: 
exposure, 


stimuli 


Each of the six forms was presen 10 anticis 
pation trials for all training condit Evidence 
obtained from pilot data indicated | v mptotie 
performance would be achieved wit! imber of 
trials. An 8-sec. interstimulus (I5 intertrial 
interval (ITI) were employed. Th ind ITE 
were kept the same so as to minimi sibility 
that S could exclude certain respon rnatives 
(switches or letters) as having alr »ccurred 
within any one trial block. The stim. esponse 
pairs were randomized within each tria’ with two 
restrictions: (a) no more than two : isecutive 
switching responses in either ascending lescending 
series were allowed, e.g., Switches 1, 2 3 could 
not occur as response terms for the firs e forms; 
and (b) the same stimulus-response pair could not 


occur as the last pair of one trial and the first pair of 
a following trial. 

Standard paired-associate instructions were read 
to all Ss receiving training. The six lci were 
printed on a card and given to Ss using the Jette 
as response terms. The instructions were that 58 
could refer to the card as often as needed, but that 
they were to rely on the card as little as possible. 

Tactual transfer. All Ss received the same instruc 
tions and the same tactual paired-associate learning 
task. For each stimulus-respons presentation, 58 
were instructed, following a ready signal from Æ, to 
locate with their index finger a small notch centered 
on the front edge of the plastic mount. When this 

vas done, Ss then located the attached form by 
pushing the index finger forward. The forms re- 
tained the same orientation trial-to-trial within the 
holders. Moreover, for Groups 1 and 2, the place 
ment of the forms for tactual manipulation was 
Such that the top of the forms when presente 
Visually coincided with the most distant aspects 0 
the forms when presented tactually, The rr 
ISI, including a. 7-sec. anticipation interval and 4 
T-sec. stimulus and response interval, was starte 
when Ss began tracing the perimeter of the forms 
With the index finger, All tracing was done in 8 
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clockw tion, and Ss were instructed to pace 
thems¢ ; that at least one complete tracing of 
the fo uld be achieved before the correct re- 
spons displayed. Upon presentation of the 
corr sponse, as indicated by a light located 
abov c correct switch, Ss were instructed to con- 
tint ng the form until the “correct green light 


gore out," Again, as was the case with the visual 
tr , the ISI and ITI were kept constant at 
roximately 8 sec. 

Ihe six forms were presented for 10 anticipation 
irials following 1 study trial. The same “randomiza- 
rocedure used in the visual training task was 

T Ss were not informed as to 
hether the same or different forms would be pre- 
in the transfer t If S specifically asked 
information, he was told that a study trial 
be given first, and he could determine this 


imself, 


RESULTS 


Training. An analysis of the number of 
correct responses for Groups 1 through 6 
across five blocks of two trials each did not 
vield a reliable difference, F (5, 78) = .40, 
nor was the Groups X Trials effect reliable, 
F (20,312) — .73. Both findings are com- 
mensurate with the assumption that the 
six experimental groups were equivalent 
with respect to their learning rates during 
acquisition, hence differences in transfer 

»erformance must reflect paradigmatic 
erations and not differences in degree of 
riginal learning. 

Transfer. "Table 1 presents the transfer 
resultsin the form of mean correct responses 
averaged across all 10 transfer trials for 
cach of the seven groups. 

The comparison of initial interest, al- 
though itself nonanalytical in character, 
was that of Groups 1 and 7. A difference in 
favor of Group 1 has typically been taken 
as evidence for the transfer of specific asso- 
ciative habits. It was necessary that this 
difference be shown to be reliable, since all 
of the separate component sources of trans- 
fer were embodied in Group 1. An applica- 
tion of Dunnett’s test to this comparison 
revealed a reliable difference, t (91) = 3.15, 
b « .0t. 

Evidence for the transfer of forward asso- 
ciations is given by comparing the perform- 
ance of Groups 1 and 2. An analysis of 
the transfer performance for these two 
groups yielded a significant difference, / (26) 
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TABLE 1 


MEAN Correct RESPONSES AVERAGED ACROSS ALL 
10 TRANSFER TRIALS FOR EACH PARADIGM 


Group MEE SD 
1 | 8.58 
2 9.14 
3 | 9.77 
4 | 5.54 
5 | 9.38 
6 7.05 
7 10.37 


Note, See text for description of groups, 


= 2.22, p < .05. That the superior trans- 
fer performance obtained by Group 1 is 
not in any way attributable to the transfer 
of response learning is indicated by identi- 
cal performances of Groups 5 and 6, and 
nearly equal performances of Groups 3 and 
4, (26) < 1. 

Although Group 2 did evidence more cor- 
rect transfer responses than any other con- 
trol group (3 through 7), as indicated in 
Table 1, the Dunnett test contrasting 
Group 2 with 7 did not yield a reliable 
difference, £ (91) < 1. Accordingly, none 
of the other control groups (3 through 6) 
were reliably superior to the no-training 
control; and the combined Groups 3 and 4 
when compared to the combined Groups 5 
and 6 did not yield a significant difference, 
t (54) = 1.12. No reliable evidence was 
therefore obtained in the present study for 
the transfer of any nonspecific sources. 


DISCUSSION 


The principal finding of the present study 
was the positive transfer of forward associa- 
tions evidenced by Group 1, which received 
the same stimuli and responses in both the 
training and transfer task. This finding is 
taken as an unequivocal demonstration of the 
transfer of forward associations under condi- 
tions where the mode of stimulus input is 
changed. 

Two contrasting interpretations of this find- 
ingare possible. First, it could be assumed that 
Ss had, in advance of the present study, estab- 
lished common mediating responses to the 
intersensory equivalents of the six stimuli, or 
fractions thereof, and that the transfer of newly 
learned associations was therefore made possi- 
ble via these mediators. A four-stage stimulus 


equivalence paradigm, A-B, C-B, A-D, C-D, 
would provide an account consistent with the 
observed transfer of forward associations. It 
should be noted that this interpretation makes 
no necessary assumptions about whether the 
A and C terms possess any common attributes. 
The A and C terms could become functionally 
equivalent due simply to the addition of com- 
mon mediating responses. 

An alternative interpretation (Gibson, 1966, 
p. 55) is that although the stimulus objects 
are presented to different sensory systems, Ss 
can learn to encode on features of the input 
that are invariant with the sensory system to 
which they are presented. The degree of 
intersensory transfer would be directly related 
to S's ability to respond to features of informa- 
tion that are common to the various input 
systems. 

The procedure for generating form stimuli 
in the present study was chosen because the 
forms were not highly representative of stimuli 
with which Ss would have had previous experi- 
ence. Accordingly, the likelihood of the trans- 
fer of forward associations being attributable 
to previously learned mediators would be 
minimal. On the other hand, acceptance of 
the second explanation given above as a basis 
for the transfer of forward associations mini- 
mally requires that it be shown that Ss can in 
fact encode stimulus attributes that are in- 
variant to the mode of input. Group 2 provides 
the conditions for satisfying this assumption. 
Recall that the only source of positive transfer 
other than nonspecific sources for Group 2 was 
the transfer of stimulus learning. In short, 
if Ss in the training task learned specific stimu- 
lus encoding skills for the training stimuli, and 
if these skills transferred to the situation where 
the same stimuli were presented via another 
Sensory system, then it must perforce follow 
that Ss were responding to some common 
stimulus attributes in both tasks. Although 
Group 2 showed more transfer than the non- 
specific controls, this effect was not reliable. 
"Therefore, the results of the present study, 
although suggesting support for a “higher 
level encoding system insensitive to mode of 
Sensory Input, do not converge on either alter- 
native explanation, Ellis (1973) has, however, 
shown the transfer of stimulus learning in an 
intramodal transfer paradigm with 24-point 
shapes as stimuli, and has proposed that the 
transfer of stimulus learning depends on com- 
plex stimuli containing multiple redundant 
cues. Perhaps support for the invariant-fea- 
tures hypothesis could be obtained by employ- 
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ing more complex stimuli i: tersensory 
transfer paradigm, thereby ; : the basis 
for possible stimulus learning r 
The reasons for the failu: 
transfer attributable to non 
facilitation, as evidenced by 
transfer performance of Grou 


tain any 
sources of 
t that the 
through 6 


was comparable to Group 7. 't immedi- 
ately apparent. This finding er, is not 
unique, as Clark, Warm and ky (1972) 
also recently reported no e for non- 
specific transfer in an inte: transfer. 
paradigm. 

Several possiblities exist that in part, 
explain the lack of nonspecific fer in the 
present study. First, the tras criterion 
was fixed at 10 trials for all gro. ince pre- 
liminary evidence from pilot in. ions indi- 
cated that near asymptotic per! e would 
be obtained. Schwenn and P: (1967) 
have found a direct relationshi; een the 
level of training and the amount ispecific 
transfer. The possibility « erefore, 
that nonspecific sources of transi ld have 
been operative in the present ad the 
level of training been extended nd the 
criterion of 10 trials, Second, re vidence 
(Rogers, 1968) suggests that a pa what is 
learned as a nonspecific componenti | hythm 
of observing stimuli and a laten Í giving 
overt responses. Rogers found decre- 
ments were observed when the ipation 


intervals were shortened or length»: od. He 
argued that a reduction in performance corre- 


lated with a change in the intervals in ihe direc- 
tion of making them longer might ! due, in 4 
part, to a disruption of stable pacing behavior 
which in turn would affect specific techniques 


in learning to make associations, Therefore, 
the change in intervals from the trainiag to the 
transfer task in the present study might have 
contributed to the failure to obtain any non- 
specific transfer. The fact that Holmgren et al. 
(1966) found evidence for BC nonspecific 
transfer in an intersensory transfer paradigm 
with the use of consistent training transfer 
anticipation intervals would support this 
interpretation. 

Both of these interpretations of the lack of 
nonspecific transfer address themselves tO 
training-transfer parameters, variables unre- 
lated to changes in stimulus modality. Alterna 
tively, it could be argued that part of the non- 
specific skills that are learned is how to encode 
the information for particular modes on input: 
implying, therefore, no transfer of those skills 
when the stimulus modality is changed. 
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PARTIAL REWARD EITHER FOLLOWING Ok 
PRECEDING CONSISTENT REWARD: 


A CASE OF REINFORCEMENT LEVEL ! 


E. J. CAPALDI? 


Purdue University 


Previous investigations in which partial reward either preceded consisten 
reward (P-C) or followed it (C-P) employed the same reward magnitude 
connection with both schedules. Findings similar to those previously obtain: 
were obtained here when the reward magnitude associated with the C sched: 
was larger than that associated with the P schedule. 
smaller reward with the C than with the P schedule reduced re: 
extinction under P-C conditions and increased resistance to extinction unde: 
C-P conditions. These and earlier separate-phase extinction findings wer 

found to be compatible with the principle of reinforcement level, which also ha 

implications for positive and negative contrast effects, as well as a variety of 
other phenomena such as the frustration suppression effect. 


In separate-phase investigations, a par- 
ticular level of some variable occurs for 
a block of trials, e.g., large reward, partial 
reward, etc., followed in a second block 
of trials by a different level of the variable, 
eg., small reward, consistent reward, etc. 
Separate-phase investigations have been 
concerned primarily either with early post- 
shift behaviors, i.e., positive and negative 
contrast effects, or with extinction be- 
havior, or with both. Of these findings, 
only negative contrast is explained reason- 
ably well by available hypotheses (see 
Capaldi, 1972), Most hypotheses do not 
attempt to explain positive contrast (see 
Weinstock, 1971), and a variety of separate- 
phase extinction findings either contradict 
or are not explained by available hy- 
potheses (e.g., Robbins, Chait, & Wein- 
stock, 1968; Spear, Hill, & O'Sullivan, 
1965; Sutherland & Mackintosh, 1971). 
The purpose of the present investigation 
was to test a hypothesis, to be considered 
later, which is consistent with findings 
reported in available varieties of separate- 
phase investigations. The basic idea con- 
tained in this hypothesis is that any 
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However, associati: 
stance t 


change in reward schedule af rein- 


forcement level, which determ stimu- 
lus control over behavior thus 
performance. 

In previous investigations pa: eward 
has both preceded consistent rey (P-C) 
and followed it (C-P). C-P training con- 
sisting of 100 C and 60 P trials «duced 
resistance to extinction very subsi ntially 
relative to 60 P trials (Sutherland ickin- 


tosh, & Wolf, 1965; see Sutherlai 
Mackintosh, 1971, for other sin 
ings). Under P-C conditions a : 
crease in resistance to extinction was asso- 
ciated with a substantial number of 
interpolated C trials (70 P and 70 C trials) 
but not with a moderate number (70P 
and 25 C trials) in an investigation by 
Theios (1962; see also Sutherland et al., 
1965). Sutherland and Mackintosh have 
reviewed this literature and have charac- 
terized its findings as follows: The frok 
of the two schedules given has had a major 
or disproportionate effect on resistance to 
extinction. This means that if enough 
first-schedule training is provided, extinc- 
tion will be rapid if the first schedule is € 
and slow if the first schedule is P. Suther- 
land and Mackintosh have suggested that 
this disproportionate first-schedule effect, 
especially for C-P, while consistent with 
their attention model, is not consistent 
with hypotheses more commonly applie 
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schedule effects such as the 
(Amsel, 1958) and sequential 
, 1966) hypotheses. 
sequential hypothesis explains re- 


ward schedule and other effects by em- 


phasizing the control over behavior exer- 
cised by both external. cues, e.g., alley 
cues, and internal cues, e.g., those produced 
by nonreward, smaller reward, and so on. 
This sort of analysis might be successfully 
applied to P-C and C-P training, it was 
hypothesized, by assuming that changes 
in reward percentage involve the same 
sorts of principles as changes in reward 
magnitude. Within the sequential hy- 
pothesis, it has been assumed that the 
capacity of available stimuli to evoke the 
reaction is an increasing function of reward 
magnitude (e.g., Capaldi & Capaldi, 1970; 
Leonard, 1969). 

In available P-C and C-P investiga- 
tions, the same reward magnitude has 
been associated with the P and the C 
schedules. In the present investigation, 
the reward magnitude associated with the 
C schedule (either 1 or 12 .045-gm., pellets) 
s either less than or greater than that 
associated with the P schedule (always 
6 pellets), The five groups employed were 
designated C1-P6, C12-P6, P6-C1, P6-C12, 
and P6. 


METHOD 


Subjects. The Ss, about 120 days old at the 
start of the experiment, were 50 naive male rats 
purchased from the Holtzman Co., Madison, 
Wisconsin. 

Apparatus. The apparatus was a straight alley, 
208.3-cm. long, 10.2-cm. wide, and enclosed by 
22.9-cm,-high sides, covered with hinged 1.3-cm. 
hardware cloth. It was constructed of wood and 
painted a medium gray. It had three basic sec- 
tions: a 35.6-cm. start section, a 117.1-cm. run 
section, and a 40.6-cm. goal section. On all trials, 
S was placed at the rear of a 25,4-cm. start treadle 
suspended over a microswitch, starting a .01-sec. 
clock. This clock stopped and a second started 
when S broke a photobeam 10.2 cm. from the 
treadle's tip (start time). The interruption of a 
second beam 117.1 cm. from the first stopped the 
second clock (run time) and started a third clock. 
The third photobeam was 30.5 cm. from the second 
beam and 5.1 cm. from the front edge of a brass 
5.1 X 10.2 x 3.8 em. food cup. Interrupting it 
stopped the third clock (goal time). When S broke 
the third beam a brass guillotine door, 30.5 cm. 
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from the alley's distal end, was lowered manually, 
thereby confining S to the goal area. Individual 
times for all three sections were recorded, as well 
as “total” time, which was the sum of the elapsed 
times on the clocks. 

Procedure. The Ss were individually caged, fed 
12 gm. of Lab Blox each day, water always being 
available, and on Days 7-10 handled in groups of 
three for about 3 min. On Days 8-10 each S 
received six .045-gm. Noyes pellets in the home cage. 

On Day 11, which began experimental training, 
Ss were randomly divided into five groups of 10 
Ss each. In each of the three phases of experi- 
mental training there were five trials each day. 
Phases 1 and 2 lasted for nine days each and Phase 3 
(extinction) for eight days. Reward consisted of 
.045-gm. pellets, nonreward in all phases of 20-sec. 
confinement in the unbaited goalbox, Group C1-P6 
received one pellet consistent reward in Phase 1 
and six pellets partial reward in Phase 2. The 
remaining groups were designated in identical 
fashion, Group C1-P6 along with Groups P6-C1, 
C12-P6, and P6-C12 comprised the factorial portion 
of the experimental design. Group P6 began 
training in Phase 2. Three schedules of partial 
reward were used in rotation over the nine days 
of each acquisition phase, RNRNR, RNNRR, and 
RNNNR, where R denotes six-pellet reward and N, 
nonreward. 

The Ss were brought into the room in squads 
of five in holding cages, a squad consisting of 
one S from each group. The Ss were run in rota- 
tion, running order within a squad being varied 
randomly each day. In Phase 1 Group P6 Ss were 
handled briefly during their order in the rotation, 
"The intertrial interval was about 5-8 min. in each 
phase. Following the last daily trial, 5s were 
returned to the home cage, where the 12-gm. daily 
ration was provided about 15 min. later, 


RESULTS 


Time scores in each alley section were 
converted to speeds. The measure em- 
ployed was the daily median speed for 
each S in each alley section. 

The performance of each of the groups 
in each of the alley sections on the last 
two days of Phase 1 training (P1) and 
on each of the training days in Phase 2 
is shown in Figure 1. An analysis on the 
last two days of Phase 1 training indicated 
that differences were significant in each of 
the alley sections, start; F (4, 45) = 4.31, 
p < .01; run, F (4, 45) = 4.06, p < 01; 
and goal, F (4, 45) = 3.98, p < .01. New- 
man-Keuls tests indicated that in start 
and run Group C1-P6 ran more slowly 
than each of the remaining groups (P < 
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Figure 1, Median daily speeds for each of the 
groups in each of the alley sections of the last two 
days of Phase 1 training and on each of the training 
days in Phase 2, 


.01), and in goal Group C12-P6 ran more 
rapidly than each of the remaining groups 
(p € .01), no other differences being sig- 
nificant. It is sometimes reported (e.g. 
Goodrich, 1958) that when partial and 
consistent groups receive the same reward 
magnitude, the partial group is faster than 
the consistent group in the start and run 
sections of the alley but slower than the 
consistent group in the goal section of the 
alley. Thus, it is unclear whether the 
superiority of Groups P6-C1 and P6-C12 
relative to Group C1-P6 in start and run 
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was a reward magnitude efi reward 
percentage effect, or a combi of both, 

On the last two days of Ph training 
differences were significant i 1 of the 


alley sections; start, F (4 
p< .05; run, F (4, 45) = 2 
and goal, F (4, 45) = 2.81, p 
man-Keuls tests indicated 
and run Group P6-C1 ran 
than each of the remaining 
.05), and in goal Group P6-t 
rapidly than the remaining : 
05), no other differences bein iificant, 
Thus, terminal Phase 2 diff. s asa 


n more 
| (PS 


function of reward condition much 
like terminal Phase 1 differen: ı func- 
tion of reward condition, c that 
terminal differences were smalle hase 2 
than in Phase 1. 

Extinction. In view of th rminal 
acquisition differences obtain: re, in 
Some cases it is best to en se in 
extinction the interaction of icular 
variables with extinction days ` ex- 
tinction data were analyzed in ways. 
Following a one-way analysis of iance, 
a 2 X 2 factorial analysis was p- ‘ormed 
(Group P6 omitted) employing error 
term from the one-way anal) The 
factorial analysis evaluates the «cts of 


reward magnitude (C12 vs. C1) ai order 
of presentation of P and C (P-C vs. C-P). 

The performance of the five groups in 
each of the alley sections of each of the 
extinction days is shown in Figure 2. 
Figure 2 shows that Group P6-C1 tended 
to extinguish rapidly in early alley sections 
and that Groups C1-P6 and P6 tended to 
extinguish slowly in later alley sections. 
The one-way analysis indicated that the 
groups differed significantly in each of the 
alley sections; start, F (4, 45) = 6.71 
b < .001; run, F (4, 45) = 9.48, p < .001; 
and goal, F (4, 45) = 6.81, p < .001, and 
that the Groups X Days interaction was 
Significant in each of the alley sections; 
Start, F (28, 315) = 1.75, p < .01; rum 
F (28, 315) = 2.08, p < .01; goal, F (28: 
315) = 1.71, p <.01. Newman-Keuls 
tests indicated that Group P6-C1 ra" 
slower than all other groups in start a" 
in run (all ps < .01), that Group C1-P6 
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diffe »m all other groups in run 
(ps or better) and in goal (p < .01), 
anc Group P6 differed from all other 
gr in goal (ps < .01), no other dif- 
fe 's being significant. Except for 
C- ap P6-C1, these significant differences 


meaningful, for in no other case could 
ey be due to inferior terminal acquisition 
performance. 


In start, the effects of reward magnitude, ` 


F (1, 45) = 4.13, p < .05; of presentation 
order, F (1, 45) = 7.09, p < .05; and the 
Reward Magnitude X Presentation Order 
ction, F (1, 45) = 10.91, p < .01, 
e significant. In addition, each of these 
eracted significantly with days, Reward 
nitude X Days, F (7, 315) = 2.11, 
..05; Presentation Order X Days, 
(7, 315) = 2.16, p < .05; and Reward 
Magnitude X Presentation Order X Days, 
F (7, 315) = 2.31, p < .05. The major 
basis of these significant effects in start 
was the rapid extinction over days of 
Group P6-C1. In start, then, C1 produced 
less resistance to extinction than C12 when 
combined with P6, and P-C produced less 
resistance to extinction than C-P. In goal, 
all these effects were, again, significant, 
reward magnitude, F (1,45) = 4.27, p < 
05; presentation order, F (1, 45) = 4-13, 
» <.05; Reward Magnitude X Presenta- 
ion Order, F (1, 45) = 5.27, p < .05; 
Reward Magnitude X Days, F (7, 315) 
2.26, p < .05; Presentation Order X 
days, F (7, 315) = 2.09, p < .05; and 
vard Magnitude X Presentation Order 
x Days, F (7, 315) = 245, p < .05. Un- 
like in start, the major basis of the goal 
effects was the great resistance to extinc- 
tion of Group C1-P6. In goal, then, C1 
produced greater resistance to extinction 
than C12 when combined with P6, and 
C-P produced greater resistance to ex- 
tinction than P-C. In run, the only sig- 
nificant effects were those associated with 
Reward Magnitude X Presentation Order, 
F (1, 45) = 14.82, p< 01, and with 
Reward Magnitude X Presentation Order 
X Days, F (7, 315) = 2.82, p < -01. The 
trend in run, then, was intermediate to 
the extremes shown for start and goal, 
thus producing significant differences only 
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Ficure 2. Median daily speeds for each of the 
five groups in each of the alley sections on each 
of the extinction days. 


when reward magnitude and presentation 
order interacted. The major findings that 
emerged in extinction were as follows 
Group C1-P6 showed very great resistance 
to extinction relative to all other groups 
in the goal section, a tendency which 
while somewhat evident in the run section, 
was evident only in relation to Grou : 
P6-C1 in the start section. In each alles 
section, Group P6 performed much as did 


- 
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Group C1-P6 except, of course, that 
Group P6 ran more slowly than Group 
C1-P6 in the run and goal sections. Group 
P6-C1 extinguished rapidly relative to all 
other groups in the start section, a tend- 
ency which, while somewhat evident in 
the run section, was evident only in rela- 
tion to Groups C1-P6 and P6 in the goal 
Section. Finally, Groups P6-C12 and 
CI2-P6 failed to differ from each other 
in any alley section. 


DISCUSSION 


In the runway, small consistent reward has 
produced greater resistance to extinction than 
large consistent reward and less resistance to 
extinction than a larger magnitude of partial 
reward (e.g., Hulse, 1958). However, what- 
ever might have been the effects on extinction 
of C1 in isolation, these cannot have been its 
effects when employed in combination with P6, 
for Group C1-P6 was very resistant to ex- 
tinction, while Group P6-C1 extinguished very 
rapidly. Such interactive effects between 
schedules, while new to C-P and P.C in- 
vestigations, are not new to separate-phase 
investigations. The major examples of such 
interactive effects, of course, are the positive 
contract effect, or better performance on the 
part of a group shifted from small to large 
reward (Group C1-P6 was shifted from small 
to large reward) than on the part of a large- 
reward control, and the negative contrast 
effect, or poorer performance on the part of 
a group shifted from large to small reward 
(Group P6-C1 was shifted from large to small 
reward) than on the part of a small-reward 
control group. The problem for theory is to 
identify the processes responsible for these 
Interactive effects, The attention theory of 
Sutherland and Mackintosh (1971) is so 
stated that it expects that C-P training will 
either reduce resistance to extinction or at 
least produce no greater resistance to ex. 
tinction than P training (the precise form of 
the deduction depends upon number of p 
training trials), Also, given that Group 
C1-P6 showed Very great resistance to ex. 
tinction, the Sutherland and Mackintosh 
model would not expect Group P6-C1 to 
extinguish rapidly (see Sutherland & Mackin- 
tosh, 1971). However, inability to deal with 
the Reward Magnitude X Presentation Order 
interaction obtained here is not confined to 
attention theory. It is suggested that the 
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present Reward Magnitude 
Order interaction is simply 
interactive effects between rew 
which at present are not we! 


entation 


A model is considered below t pears to 
adequately explain known inter: schedule 
effects, whether manifested tinction, 


negative contrast, or positive co: 


Traditionally, the conditioning timuli to 


responses has been considered : accom- 
plished through either contigu or rein- 
forcement. The sequential view apaldi, 
1966), for example, emphasizes r« ement, 
The reinforcement assumption oí equen- 
tial model is replaced by that of r ement 
level, all other assumptions of that being 
retained. It is suggested that zth of 
conditioning is always determin: y the 
relationship between the reward is ex- 
pected and the reward that is obta This 
relationship determines reinforcem: level. 
More specifically, the view that th of 
conditioning is determined by re mag- 
nitude defined physically in term eight 
of incentive, etc. (e.g., Capaldi, i506), is 
replaced by the view that streng: con- 
ditioning is determined by reinforce:ioc:. le vel 
(obtained reward is, of course, defin: shysi- 
cally). Accordingly, if obtained rewird is 
greater than expected reward, then available 
stimuli will acquire a greater capa: than 


otherwise to elicit the reaction. However, 
if obtained reward is smaller than expected 
reward, then available stimuli will suffer à 
decrease in their capacity to elicit the r ction 
(unconditioning). Stimuli that are uncondi- 
tioned acquire an increased capacity to elicit 
competing reactions. Variables that are as- 
sumed to determine expected reward are 
considered below, along with selected as- 
Sumptions of the sequential view that are 
relevant to understanding reward schedule 
effects. 

Expected reward is assumed to increase as 
reward magnitude increases and to increase 
as reward percentage increases. Accordingly, 
with reward magnitude held constant, ex- 
pected reward will be higher the greater the 
reward percentage. Expected reward is as- 
sumed to increase along with number of 
training trials. Under consistent reward, con- 
ditioning will eventually be weak, because 
under consistent reward the value of expected 
reward eventually becomes identical to that 
of obtained reward, Under partial reward, 
the value of expected reward eventually 
stabilizes between two values of obtained 


reward, that available on rewarded trials and 
j 'that available on nonrewarded trials. Under 
partial reward, then, there is both strong 
conditioning (rewarded trials) and uncondi- 
tioning (nonrewarded trials). Adjustment of 
expected reward to prevailing reward condi- 
tion will be fastest where no specific former 
expected reward level has been established 
(as in Group P6), and it will be slower the 
stronger is the former expected reward (as in 
Group C1-P6). In considering groups such 
as P6 vs. Ci-P6, or groups that may be 
currently receiving the same reward mag- 
nitude but that have different expected reward 
values, it should be borne in mind that not 
only are rewarded-trial conditioning effects 
stronger where expected reward is lower, but 
nonrewarded-trial unconditioning effects are 
weaker. Accordingly, in comparing groups 
of this sort below, only rewarded-trial con- 
ditioning effects will be considered in the 
interest of simplicity. It has been suggested 
X (e.g., Capaldi, 1972) that nonreward or smaller 
F reward produces an internal stimulus (a mem- 
ory) which has two components, an emotional 
or avc component, and a component that 
corresponds to such characteristics of the 
reward as its size and so on (perceptual or 
cognitive component). Call this stimulus SN, 
The frustrative or aversive components of SN 
occur only when obtained reward is below 
expected reward, the perceptual components 
of SN being independent of expected reward. 
Since SN is a memory, the frustrative com- 
ponents of SN are available to be conditioned 
or unconditioned to the reaction only on 
trials which follow trials on which obtained 
reward was below expected reward. These 
assumptions are now applied to present and 
previous separate-phase findings. 

When the same reward magnitude is asso- 
ciated with the P and C schedules and P 
training follows C training, the higher ex- 
pected reward level appropriate to the C 
schedule persists for some number of P trials, 
and thus, on those P trials SY is less strongly 
conditioned to the instrumental reaction than 
where P-alone training is provided, and thus, 
C-P is less resistant to extinction than P 
(see Sutherland & Mackintosh, 1971). Under 
same-magnitude P-C conditions, expected re- 
ward level rises over the C trials, which would 
tend to produce less resistance to extinction 
in P-C than in P, but at the same time, 
a variety of stimuli gain in their capacity to 
evoke the reaction over the C trials (expected 
reward is below obtained reward on initial 
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C trials). This would tend to oppose the 
greater eventual unconditioning in P-C rela- 
tive to P, and perhaps accounts for the fact 
that, even following extensive interpolated C 
training, P-C is not much less resistant to 
extinction than P (Theios, 1962). Employing 
the reasoning used above, it can be under- 
stood why in the present investigation Groups 
C12-P6 and P6-C12 were less resistant to 
extinction than Group P6,* 

In dealing with the case in which the C 
schedule reward magnitude is smaller than 
the P schedule reward magnitude, the specific 
assumption is entertained that the expected 
reward level associated with P6 was higher 
than that associated with Cl. For Group 
P6-C1, then, on some number of C1 trials 
expected reward was above obtained reward 
(unconditioning), and therefore SN lost some 
of the capacity to evoke the reaction that it 
had acquired under P6 training. For Group 
P6-CI, then, at least some C1 trials func- 
tioned as nonrewarded extinction trials (al- 
though less s0), and thus, Group P6-C1 was 
less resistant to extinction than Group P6 
and Group P6-C12. For Group C1-P6, on 
the other hand, expected reward level was 
below obtained reward for some number of P6 
trials (strong conditioning of SN), and thus, 
Group C1-P6 was very resistant to extinction, 
Of course, expected reward was below ob- 
tained reward in Group P6 as well, but in 
Group P6 expected reward was not below 
obtained reward for as many trials as in 
Group C1-P6 (previously established expected 
reward levels are resistant to change). 

The considerable similarity between the 
frustration suppression effect, or FSE (e.g. 
Dunham & Marx, 1967), and particularly 
Groups CI2-P6 and P6-C1 here is worth 
noting. A group that receives a larger reward 
each day in magazine training than in lever 


? After this report was prepared, an investigation 
by Mellgren, Lombardo, Wrathe , and Wi (1973) 
appeared which showed that C-P (20 trials in each 
phase) produced greater resistance to extinction 
than P (20 trials) when the same reward magnitude 
was associated with each schedule. Following only 
20 C trials, expected reward would not be firmly 
established, and thus, SN was conditioned to the 
reaction in the P phase about as strongly in Group 
C.P as in Group P. Too, where only 20 P trials 
are employed, alley and other stimuli would not 
be strongly conditioned to the reaction in Group P. 
Such stimuli would be more strongly conditioned 
to the reaction in Group C-P due to the additional 
20 C trials, and thus, Group C-P was more resistant 
to extinction than Group P. 
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training (downshifted group) is less resistant 
to extinction than a control that receives the 
same smaller reward magnitude under both 
magazine and lever conditions. It is reason- 
able to assume that the downshifted group is 
frustrated during lever training. As Dunham 
and Marx point out, a contiguity emphasis 
(e.g., Amsel, 1958) would suggest that the 
frustrative stimuli would acquire control over 
the instrumental reaction in the downshifted 
group and, therefore, elevate resistance to 
extinction in that group. The same, of course, 
would be expected on the basis of a traditional 
reinforcement view (e.g., Capaldi, 1966). On 
the other hand, the FSE is clearly consistent 
with the reinforcement level view. That is, 
the larger reward associated with magazine 
training would elevate expected reward in 
lever training, thereby allowing frustrative 
and other stimuli to acquire but little control 
over the instrumental reaction, and thus, the 
downshifted group would extinguish rapidly. 
Other varieties of separate-phase extinction 
findings are consistent with the reinforcement 
level view, although these findings, like the 
FSE and C-P and P-C extinction findings, 
create considerable difficulty for available 
theories. It has been reported (e.g., Robbins 
et al, 1968; Spear et al., 1965) that groups 
given initial nonrewarded trials (no prior 
reward trials) prior to rewarded trials showed 
greater resistance to extinction than controls 
given rewarded trials only. These findings 
create difficulty for the sequential view, be- 
cause only one transition from nonrewarded 
to rewarded trials occurs (see Capaldi, 1966), 
and they create difficulty for Amsel's (1958) 
Írustration hypothesis, because initial non- 
rewarded trials are presumably not frustrative 
(see Robbins et al., 1964; Spear et al., 1965). 
According to the reinforcement level view, 
however, stimuli occurring in the rewarded 
phase would be more strongly conditioned to 
the instrumental reaction in the initially non- 
rewarded group (expected reward is below 
obtained reward) than in the rewarded-trial 
control (expected reward and obtained reward 
more quickly become the same). The similar 
finding reported by Capaldi (1970), that a 
small magnitude of consistent reward followed 
by a large magnitude of consistent reward 
produced greater resistance to extinction than 
large consistent reward alone, is also con- 
sistent with the reinforcement level view. 
The similarity of the above finding to those 
obtained here for Group C1-P6 is perhaps 
obvious; in all cases, postshift expected reward 
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was below obtained reward, 
resistance to extinction was in 


ll cases, 
by post- 


shift training. A difference bx the case 
being considered and the :gnitudes 
P-C vs. P case is worth nm to avoid 
possible confusion. Expected ird level 
and thus unconditioning is gr for P-C. 
than for P in extinction. B 'xpected 
reward level of, e.g., an initiall ewarded 
group, is either below that o: to that 
of its rewarded control at th: » of exe 


tinction, depending on rewarde 
level. 


training 


When expected reward is al btained 
reward, both unconditioning a: occur, 
In most available negative conti estiga- 
tions, SN has not been conditi to the 
instrumental reaction in preshil: , most 
negative contrast findings can bx ictorily 
explained in terms of the gener on de- 
crement and other effects of S" ippeal 
to unconditioning being unnec (see 
Capaldi, 1972). There appears t t least 


one case of negative contrast, | er, In 


which appeal to unconditioning a as.SN 
is required. Ison, Glass, and I 1969) 
reported that a shift from a larg« small 
magnitude of partial reward produc nega- 
tive contrast effect, however, not ly in 
postshift, as is usual, but only foll g con- 
siderable training. Early in pos: nift SN 
would elicit responding strongly use it 
was previously conditioned to the reaction by 
means of a large magnitude of partia! reward. 
However, because in postshift expected reward 
was above obtained reward on both rewarded 
and nonrewarded trials, unconditioning would 


eventually be considerable, and presumably 
it was this rather substantial unconditioning 
effect that was responsible for the late-trial 
negative contrast effect reported by Ison et al. 
If the postshift condition following large 
partial reward is small consistent reward 
rather than small partial reward, the un- 
conditioning effect would be less severe (no 
nonrewarded trials) and unconditioning would 
not occur over so many trials (the preshift 
and postshift expected reward level discrep- 
ancy is smaller) These are, perhaps, among 
the reasons why a shift from large partial 
reward to small consistent reward did not 
produce a negative contrast effect in an Ans 
vestigation reported by Capaldi and Zi 
(1969), 

Whether positive contrast has been obtained 
seldom or frequently seems, in part, defini- 
tional. If the meaning of positive contrast 


L REWARD FOLLOWING OR 


to cases in which reward mag- 


is rest 

nitude ‘her in postshift than in preshift, 
then e contrast has been obtained less 
frequ than otherwise. According to the 
rein! nent level view, however, the neces- 
sar) not sufficient conditions for positive 
con’ ast exist whenever expected reward lies 


be obtained reward, which widens con- 
si -rably the permissible instances of positive 

itrast. Thus, according to reinforcement 
ievel, positive contrast can occur on a shift 
irom partial to consistent reward as well as 
n a smaller to larger reward. 


shift from 


eung and (1969) reported positive 
contrast following shifts from various per- 
cent { partial to consistent reward. In 


and run sections of the runway, 
ial reward sometimes produces faster 
ining than consistent reward (e.g., Good- 
1959), which is, according to reinforce- 

nt level, a positive contrast effect. In the 
oal section, partial reward produces slower 
unning than consistent reward, which is con- 
sistent rather than inconsistent with the 
present view. In the goal section, non- 
rewarded-trial unconditioning effects under 
partial reward are potent, and as Miller 
(1959) suggests (see also Capaldi & Levy, 
1972), approach tendencies generalize more 
widely than inhibitory tendencies, which 
means that in earlier alley sections rewarded- 
trial conditioning effects under partial reward 
ill be potent relative to nonrewarded-trial 
»conditioning effects. Positive contrast was 
bserved by Robbins et al. (1968) on a shift 

m initial nonrewarded trials to rewarded 
although not classified as such by those 
authors. More orthodox instances of positive 
contrast were reported recently by Weinstock 
(1971) on a shift from small to large con- 
sistent reward, and by Mellgren (1971, 1972) 
on shifts from a small to a large magnitude 
of delayed reward. 

Mellgren (1971) observed a positive rela- 
tion between number of preshift trials and 
absolute speed on postshift trials, and this 
effect extended over 32 trials, at which point 
training was discontinued. According to the 
reinforcement level view, as the number of 
preshift trials increases, the number of post- 
shift trials over which obtained reward will 
remain above expected reward increases. 
Thus, as number of preshift trials increases, 
the stronger should be postshift conditioning, 
producing, accordingly, a positive relation 
between number of preshift trials and absolute 


speed in postshift. 
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D RECOGNITION AND MORPHEMIC STRUCTURE! 
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Ss learned a we 
ed tachistos 


pre opic 


i list and were then tested on recognition of words 
When a test word was identical to one of the 


learned words, recognition was facilitated in the usual way. Pretraining with 


| word 


comparat 


l 


t was a different derivative of the same root morpheme as a test word 
t facilitation of recognition of that test word. However, learning 
» visual-acoustic similarity but no morphemic relation 


word gave only slight, nonsignificant facilitation of recognition. It 
icluded that the process of recognizing a word involves assigning it to a 
istic unit with specific semantic associations, i.e, a morpheme. Analysis 


or responses suggested that, to some extent, root and suffix morphemes 


ire recognized independently, and that suffixes themselves may be subject to 
irequency effects, The principles underlying the phenomena studied may be 
most appropriately characterized as “the morpheme-frequency effect.” 


Words of higher frequency of occurrence 
n the language are more easily recognized 
than low-frequency words. The nature of 
the mechanism leading to this phenomenon 
is a subject of current debate (Broadbent, 
1967, 1971; Catlin, 1969; Morton, 1968; 
Nakatani, 1970; Treisman, 1971). All 
current theories, howev are agreed 
that the recognition system is biased 
toward high-frequency words, although 
that bias is seen as operating in different 
ays. In the past, it was debated whether 
iis bias was a function of the differential 
quency of the perceptual experience of 
weds (Howes, 1954; Neisser, 1954) or 
frequency of production of words 
(Daston, 1957), with corresponding defini- 
tion of the unit of recognition as perception 
or response based. 
^ way of reconciling these positions was 
put forward by Morton (1964, 1969), who 
proposed the logogen model, in which the 
central units, logogens, were affected not 
only by visual and acoustic information, 
but also by semantic information. The 
latter is implicated in speech production 


! This work was carried out at the Psychological 
Laboratory, University of Cambridge, while the 
first author was a final year undergraduate, The 
assistance of those who acted as Ss is gratefully 
acknowledged. 

? Requests for reprints should be sent to Graham 
A. Murrell, Medical Research Council, Applied 
Psychology Unit, 15 Chaucer Road, Cambridge 
CB2 2EF, England. 


and also in the recognition of connected 
material. Any logogen receiving more than 
a threshold amount of information "fires," 
making the corresponding word available 
as a response. As a result of firing, the 
threshold of the logogen is lowered and 
very gradually rises to a level slightly 
below its previous value. In this way, 
experience with words, both recognition 
and production, leads to permanent changes 
in threshold, biasing future responses. 

The short-term change in threshold was 
included in the model to account for the 
finding that when a word is recognized, the 
same word will be recognized more easily 
if presented again within a short time. 

The latter effect was utilized in the 
present experiment to create a short-term, 
experimentally controlled word-frequency 
effect. The technique has been used 
previously by Solomon and Postman (1952), 
who used nonsense words so there would be 
no "a priori" word-frequency effect. As 
will become clear, it was necessary to use 
familiar real words in the present experi- 
ment, but the uncontrolled a priori fre- 
quency effect could only mask the results 
and will not affect the conclusions, 

Neisser (1954) found that pretraining 
with a word facilitated recognition of the 
same word but not of a homonym, eg. 
after priming with "phrase," recognition 
of this word was easier than normal, but 
recognition of “frays” was not affected. 
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TABLE 1 
EXPERIMENTAL WORDS 

a) (2) (3) 
Car Cars Card 
Bored Boring Born 
Sings Singer Singe 
Pains Pained Paint 
Lives Lively Livery 
Sees Seen Seed 
Man Manned Manner 
Hanging Hang Hangar 
Reader Reading Ready 
Placed Places Placate 
Walled Wall Wallet 
Topped Topping Topple 
Basis Basic Basin 
New Newer Newt 
Mile Miles Mild 
Numbing Numb Numbers 


His interpretation was that pretraining 
"does not merely raise the probabilities of 
certain verbal responses, but rather facil- 
itates the perception of specific visual 
patterns [Neisser, 1954, p. 402]." Follow- 
ing this, Ross, Yarczower, and Williams 
(1956) attempted to demonstrate a mono- 
tonic relation between degree of facilitation 
and level of visual similarity between pre- 
training word and test word. They found 
no such relation, although they confirmed 
Neisser's main result. This, and other 
considerations, casts doubt upon Neisser's 
interpretation, and particularly his assump- 
tion that "phrase" and "frays" may be 
regarded as the same response (even 
assuming they are phonetically identical). 

The interaction of semantic and sensory 
information necessary to account for the 
facilitative effects of a context on word 
perception (Morton, 1969; Rubenstein 
& Pollack, 1963; Tulving, Mandler, & 
Baumal, 1964) also necessitates a refine- 
ment of the definition of the unit of 
recognition. Thus, since the contexts 
appropriate to the two homographs “chop” 
are totally different, they might be expected 
to have functionally Separate representa- 
tions in the recognition system. 

The question of interest in the present 
study is whether the unit concerned in 
recognition facilitation effects is 4 word 
as a visual-phonetic pattern, or a form 
bound to a specific semantic unit, ie, a 
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morpheme. The concl ll bear on 
the nature of the respo its made 
available by logogens. 

The experiment investi; the effect 
on word recognition of pr« ng with a 
word containing the same : :orpheme, 
compared with pretrainin; 1 a word 
containing the same visual ietic pat- 
tern as its “root” when this t actually 
the same morpheme. T! ffects of 
pretraining with the ident ord and 
zero pretraining, as contro nditions, 
were also tested. 

METHOD 

Subjects. Sixteen male undergr volun- 
teered as Ss, and were divided into t ups of 8. 
None were psychology students. | mpleted 
the procedure once only, with no pric »rmation 
about the experiment apart from specific 
instructions (see below). 

Materials. The apparatus was nbridge 
tachistoscope with an auxiliary timi evice to 
permit control of flash duration in . Steps 
between 10 msec. and 140 msec. A n density 
filter (X16) was fitted over the view ,perture 
to reduce stimulus contrast. The stim is words 
were typed in lowercase on plain cards, 


aligned so as to appear in the space between two 
short horizontal lines drawn on the kgroun 
card upon which a word was superimposed in the 


stimulus flash. The S was provided w ith the 
tachistoscope trigger button on a cable, to initiate 
the flashes, 

The experimental words are shown in Table 
The three words in each row of the table have an 
initial sequence of letters in common. Group 1 and 
2 words differ only in the inflection of their common 
root morpheme, but the same letter sequence in 
the Group 3 word is not the same morpheme, and 
the word bears no semantic relation to the other two. 
However, viewed purely as visual-acoustic patterns, 
the degrees of similarity between the words in 
different columns are comparable (e.g., the mean 
suffix length of Group 3 is intermediate between 
Groups 1 and 2, the difference between the means of 
Groups 1 and 2 being less than half of one letter). 

"Twenty other words were used in initial practice 
on tachistoscopic recognition. These were selected 
for low similarity with experimental words. — 

There were four sets of 12 words for pretraining. 
Each set comprised 4 words from each of Groups 1, 
2, and 3, typed on a card in random order. Each 5 
received a recognition test on the words of either 
Group 1 or Group 2 of Table 1. Thus, of the 16 
words each S was tested on, 4 were subject to each 
of the four pretraining conditions, which were as 
follows: (a) pretraining with the same word (Condi- 
tion S), (b) pretraining with a different inflection 


of the sa: t morpheme (M), (e) pretraining 
with an ed word containing the same visual- 
acoustic jut different terminal letters (V), and 
(d) zer raining, no similar word having been 
presen ). 

Ov eight combinations of pretraining and 
testin: word sets, every test word was subjected to 
ever straining condition. Thus from each group 


, recognition scores for each word under 


of eht 
each condition were obtained. 

^cedure. The outline of the procedure is best 
conveyed by quoting the instructions read to Ss. 


The experiment is in three phases. First, I 
ability to recognize words 
ted to you in a very brief flash. 
ill give you a short list of words to 
. [ will test your recognition of 
of words on brief exposure, some 
words being among those you have 
. Please place your face against the 

bber visor on this apparatus, and view the 
white card. When I say ‘now,’ focus between 
the lines and when you're ready press the button 
once to flash up the word. Then, immediately 
say aloud and distinctly what you think the 


am going to t 


that 


condly 


word was. Guess if you're not sure, and if you 
have no idea what it was, say ‘no.’ [practice 
phase] 


For the next phase, there are 12 words printed 
on this card in a random order. You have three 
minutes to learn them and go over them in your 
head. You will need to keep them in your mind 
through the third phase. Begin now. [pre- 
training] 

Time is up, and it is time for the third phase, 
but try to remember those words. Some of the 
following series of words will be among the 
words you have learned, some will be similar 
to those you have learned, and others will be 

juite different words. The procedure is the 
me as before. When I say 'now, focus 
between the lines... + [testing] 


15 the practice period, about 100 word exposures 
were given, of varying durations, to familiarize S 
with the task. The shortest duration at which 
words were correctly recognized at the end of the 
practice was taken as the individual S’s baseline 
for testing. 

"The set of words for testing was presented in a 
random order at baseline flash duration, then in 
the same order again at 5 msec. longer duration, 
and so on for nine serial presentations. Baselines 
varied from 20 msec. to 60 msec., and each S 
received exposures increasing in duration to 40 
msec. above his baseline. 

All responses in the test were recorded, but no 
knowledge of results of any kind was given. 


RESULTS AND DISCUSSION 


There were 39:3% correct responses, 
48.1% error responses and 12.1% “no 
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TABLE 2 
RECOGNITION PERFORMANCE 


Pretraining condition 
Scoring measure 
s M v ô 
Mean recognition 86.8 |63.3 |51.1 |46.8 
score for words 
Mean recognitions 435| 3.72| 3.225 | 3.02 
per 9 presentations 
Percent recognition 49.9 |464 |368 |36.6 
of root morphemes 
Percent recognition 62.2 |54.2 |51.6 |51.0 
of suffix morphemes 


Note. See text for explanation of pretraining conditions. 


responses. Only 7.0% of the error responses 
were not real words. Of these, nearly 
three-quarters were from four of the Ss, 
and five Ss gave no such responses. In 
general, error responses were similar in 
word length and gross shape to their 
eliciting test words. 

It was decided not to use a "threshold" 
measure for comparison of scores, since it 
would depend heavily upon the arbitrary 
criterion set by E and would neglect much 
of the available data. Instead, a measure 
of performance was devised which gave a 
value to every correct response according 
to exposure difficulty, normalized across Ss. 

First, each correct response was given a 
value equal to the reciprocal of the propor- 
tion of correct responses at that exposure 
over all Ss. This weighting gives higher 
values to more difficult discriminations. 
Thus weighted, correct responses were 
summed over the nine exposures to give 
a score for each word by each S. The 
scores of each S were further weighted by 
the multiplier necessary to correct his 
total score to 100. This procedure com- 
pensated for individual differences. 

The mean recognition scores under the 
four conditions are shown in the first row of 
Table 2. The untreated data are summa- 
rized in the second row of Table 2 as the 
mean number of wholly correct recognitions 
per nine presentations under each condi- 
tion. No further reference will be made to 
the latter figures. 

A three-factor analysis of variance was 
carried out on the recognition scores 
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(model from Winer, 1971, p. 380). The 
factors were conditions, words, and replica- 
tions (i.e., the two sets of eight Ss duplicat- 
ing each result). The pretraining conditions 
differentially affected word recognition, 
F (3, 93) = 12.65, p < .001? The result 
of principal interest is that although the 
difference between scores under Conditions 
V and O fell well short of reliability 
(F « 1), Condition M did facilitate word 
recognition compared with O, F (1, 93) 
= 5.38, p < .025. However, the effect of 
Condition M was rather less than that of 
S, F (1, 93)- 1079, 5 «.01. The 
Replications X Conditions interaction was 
not significant (F « 1), indicating that the 
differences between conditions were con- 
sistent between the two groups of Ss, 
Similarly, the Words X Conditions interac- 
tion was not significant, F (93, 93) = 1.32, 
indicating that the conditions had the same 
effects over all the test words used. 

There is some indication of effects of 
phonological similarity, which may be 
worthy of study in their own right. In six 
of the rows of words in Table 1, the “root” 
of Word 3 differed phonologically from 
those of Words 1 and 2, eg., reader, 
reading, ready. A separate analysis of the 
recognition of the 12 test words from these 
tows indicated that under Condition V the 
phonological dissimilarity tended to coun- 
terfacilitate recognition. However, the 
analysis of variance of recognition scores 
was repeated with these 12 words excluded, 
and even when the pretraining and test 
word roots had the same phonology under 
Condition V, there was no significant 
faenitatiop compared with O, F (1, 57) 
= 1.20. 

The results confirm the effectiveness of 
the pretraining technique for creating a 
short-term, experimentally controlled word- 
frequency effect. This effect does not 
occur when the pretraining word resembles 
the test word in visual-acoustic features 


* [n order to avoid the "language-as-fixed-effect 
fallacy” pointed out by Clark (1973), the appro- 
priate quasi-F ratio for the conditions main effect 
was also calculated: F’ (3, 17) = 8.03, p < .01 
This provides statistical support for generalizing 
the effect to any comparable sets of words. 
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only. There is reliable fa tion only 
when there is morphem: entity in 
addition to visual-acoustic : rity. 

Considered in terms of the | ;en model, 
this means that the logogen v threshold 
is lowered by pretraining a word 
corresponds not to that wor r even to 
the visual-acoustic pattern 'ntains as 
root, but strictly to the nx ne from 
which the word is derived hus, the 
word recognition process, in the 
absence of contextual or sema stimulus 
information, involves assig: of the 
stimulus to a particular m« ie with 
its specific semantic associ The 
morpheme, and only that on: rpheme, 
to which the decision proces igns the 
word, has its logogen thresh owered, 
Thus, when a pretraining word ngs toa 
different morpheme from a ar test 
word (as in Condition V), no itations 
would be expected. This in etation 
would also account for the nega results 
of Ross et al. (1956). 

The conclusion that the unit ived in 
facilitation effects is the morph me was 
further illustrated by the perc: ‘age of 
responses containing the same :cot mor- 
pheme as the eliciting test wo ‘These 
results are shown in the third row of Table 
2. Bearing in mind that the recognition of 
whole words was considerably better under 


Condition S than under M, it is significant 
that the recognition of root morphemes | 
under these two conditions did not differ 
reliably. It may be concluded that the 
latter result was due to the equal facilita- 
tion of root morphemes in these two 
pretraining conditions. Furthermore, rec- 
ognition of root morphemes was not 
reliably different under Conditions V and 
O. This would be predicted, since under 
Condition V the root morpheme of the 
test word did not occur in pretraining an 
thus could not have been facilitated. > 
Even though the root letter sequences 1 
pretraining were the same in Conditions M 
and V, with suffixes differing from the test 
words in both cases, the recognition of root 
morphemes was better under M (t = 2.38 
P «.025). The facilitative effects of \ 
pretraining were apparent in the intrusion 
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ng the root morpheme of 


errors ci 
- the pret: g word. Such errors amounted 
to 8.7 €. he responses under V, a figure 
«compa with the difference in root 
morp! » recognition between M and V. 
This evidence that response biases of 


magnitude were created under 
se two conditions. 

iere appears to be some validity in 
considering experimental words in terms of 
root fix, The results suggest that, 


ind suf 
to some nt, free morphemes and 


Su/fixes are recognized independently. This 
is consistent with the linguistic analysis of 
words into morphemes, and the view that, 
the response units made available by 
loge, eas are morphemes. 
; i: appears that the difference in word 


recognition scores between Conditions S 
and M was due to the pretraining with the 
test word suffixes under S, which did not 
occur in the other conditions. The scores 
for recognition of suffix morphemes are 
shown in Table 2. The difference in suffix 
recognition between S and O (t = 2.56, 
p < .025) demonstrates suffix facilitation. 
The differences in suffix recognition be- 
tween M, V, and O were not reliable. 
The difference between S and M was 
only poorly reliable (t = 1.59, p = .07), 
probably because suthx recognition under 

| M was inflated by the constraint imposed 
on response suffixes if the pretrained root 
wes correctly produced. Thus, if the test 
word were “newer,” then the likelihood of 
producing the correct suffix "cer" is zero 

if the root has been reported as “newt.” 
l If "new" has been seen correctly, the 
: likelihood of guessing “‘-er” is appreciable. 
Of the errors which were real words given 

in response to test words having the zero 
213 (93.0%) 
These responses were 
not counted in the suffix recognition 
| scores of Table 2, which are therefore an 
. 


morpheme (i.e, no suffix), 


had no suffix. 


underestimate. 
It is clear that there was substantially 
better recognition of suffixes than of roots. 
It may simply be that the suffixes were 
more distinctive visual patterns. The suffix 
“ing” was correctly recognized on 81.5% 
of the trials in which it occurred in the 
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test word, including 35.0% of the trials on 
which the first part of the response was 
wrong. For the suffix “ed,” there was 
recognition on 65.1% of its occurrences, 
including 28.3% error responses. Correct 
recognition of "-er" was 53.7%, including 
15.3% errors. 

It may be that '-ing" is more distinctive 
than “ed.” However, it is an attractive 
proposition that the suffixes themselves are 
subject to a frequency effect. If this were 
so, the relative proportions of trials on 
which the root and suffix of a test word were 
recognized would reflect the relative famil- 
iarity of the two morphemes to the subject. 

This experiment gives grounds for con- 
sidering a well-known phenomenon in new 
terms, as “the morpheme-frequency effect." 
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NT-VOWEL-CONSONANT RECOGNITION AS A FUNCTION 


ECONS 
’ OF GRAPHIC FAMILIARITY AND MEANING! 


SETH N. GREENBERG °? 


Ohio University-Lancaster 


The present research determined whether perception of letter strings is in- 
fluenced by meaning associated with the string and/or the familiarity of the 
string’s graphic structure. Combining a familiarization procedure which 
raphie familiarity and meaning to be independently associated with 
unfamiliar consonant-vowel-consonants (e.g., Taylor) and a forced-choice 
word perception procedure (e.g. Reicher) which eliminates nonperceptual 
influences, it was demonstrated that only graphic familiarity improves letter 
sequence recognition. However, it was noted that meaning pretaining in- 
‘luenced the choice of a unitization strategy used in the perceptual task. The 
results indicate further information about the scope of variables influencing 
unitization strategy, as well as the importance of unitization strategy in the 
perception of certain fixed orthographic structures. 


allow 


Ising visual thresholds as measures of 
perceptibility, Cattell (1885) determined 
that man perceives letters more easily when 
the letters form words. A basic question 
that arises from Cattell’s observation is 
which feature or features associated with 
words and not with random letter sequences 
contribute to the superior visual recogni- 
tion of words? Gibson (1972) c ified the 
distinguishing features of words into four 
types of information : graphological, phono- 
gical, semantic, and syntactic. The two 
fatures of interest in the present research 
include graphological-orthographic features 
which refer to the physical character and 
arrangement of letters in a sequence and 
semantic features. Specifically, this in- 
vestigation is concerned with whether 
meaning and/or familiarity of a graphic 
letter sequence increases a sequence of 
letters’ perceptibility. 


Attempts to determine whether graphic 
familiarity and meaning influence letter 
sequence perceptibility face two major 
problems: 


1. Employment of an experimental pro- 
cedure that successfully eliminates non- 
perceptual factors from biasing results, 
For example, if the "perceptual task" 
requires Ss to report a string of letters, 
after a brief visual presentation perform- 
ance may reflect both what Ss saw and 
what Ss were able to recall correctly 
(Baddeley, 1961; Hochberg, 1968). Thus, 
a significant effect attributed to a feature 
might be due to that feature facilitating 
recall rather than perception. The problem 
of nonperceptual factors can be circum- 
vented by the use of à forced-choice 
recognition technique introduced into word 
perception by Reicher (1969). Reicher's 
technique calls on Ss to select from two 


‘ This study is based on a dissertation submitted s : j 24 
to the Ohio State University in partial fulfillment reponse alternatives the letter that occu- 
of the requirements for the PhD degree. The pied a randomly designated position in the 

© research was carried out at the Human Performance — tachistoscopically presented letter sequence 
Center and was sponsored in part by National which immediately preceded the response 
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to reading earlier drafts and the many helpful ideas 
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? Requests for reprints should be sent to Seth N. 
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University-Lancaster, 1570 Granville Pike, Lan- 
caster, Ohio 43130. 


choices. Both response alternatives are 
equally likely to appear in the tested 
position. Reicher’s procedure successfully 
eliminates factors of storage load, inter- 
ference during retrieval, and biased re- 
sponding which hindered perceptual inter- 
pretations of previous experiments. 
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TABLE 1 


Pexcerruat STIMULI. GENKKATED. FROM THE 
VAMILIARIZATION. CONSONANT-VOWFEL-- 
" COoNSONANT /zitt. 


Weainning added vowrts | End added vowels 
imac Nonintaet intact | Nonintact 
Aa. | A VEL WL A | VEL A 
1n mi | ZAL t 
omt | oza m0 

| vm. | ump Tr.U | moo 

Note, The actual wet of simuli mm 
L Sete mirisi pendere 


(Gibson, Bishop, Schiff, 
However, the increased 
the letters I, B, and M when 
form IBM could have resulted from Ss 
es graphic familiarity 


perception of 
Taylor's results supported a 


te cet eee. oe a 
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iarity effect, but failed to : 
meaning. However, use 
threshold technique whic! 
nonperceptual influences, « 
during recall and response 
interpretation. of Taylor 
addition, CVCs were pa 
definitions for only 15 trial: ] 
employed in the perceptu: sk, 
failure of the meaning varia! 
tributed to insufficient asso 

The present research fo! la 
extensive version of Taylor’ 
larization procedure with : 
recognition task in order 
whether graphic familiarity an 
are features that enhance a |: 
perceptibility, 


METHOD 


Subjects. Twelve Ohio State Un: 
served for 13 45-min, sessions at $1 
The Ss were assigned to three famili: 
(H = high meaning associate, I. 
associate, and P = no associate) in o: 
ance, thereby providing four Sa per vr 

Stimulus material and design, Tu 
meaningful (m) CVCs (Archer, 1960 
stimuli in the familiarization task s: 


of the perceptua 
were treated identically. 
by a given $ during (aiiliarizatiote 


all 
Tam, 
The list used 


would be the 

stimuli, while 

in the experi QM 
of w set is exhibited 

bol i A perceptual stimulus set js e 


H 


and third columns in Table 1 contait 
which were constructed by 

four that differs from the cevos 

g or end of the facilia 

» each CVC generated eight intact 

py were randomly rere. from 
the purpose of testing tbe C 

One stimulus from each set of six st 

during a session, providing 24 tests of 

of intact stimuli. Counterbalancing 

24 tests each session produced an equal n 


H 


H 


] 


"n 
t 


lol each CVC position. A different member 
wet of «ix stimuli was probed in each of six 
J wesslons, Test» of the CVC positions 
‘counterbalanced within each set so that each 
was probed twice. 

order to meet the requirement of equally likely 
alternatives for the forced-choice probes, 
intact stimulus used for a CVC position probe 
a nonintact partner which was derived by 


the probed letter of the intact stimulus 
another letter (second and fourth columns) 
The replacement letter. in the nonintact. stimulus 


ed from the intact stimulus 
vrnatives for that particular 
he intact stimulus and ite 


and the letter rpi» 


sz] iL? 
inii 


simule 
deese! that both vowels serving as reponse 
alters toos would be used as the correct alternative, 
Ie. serve as the added vowels with a given CVC, 
amqually often 

The rationale behind added vowel testing was 
that ji permitted comparisons between. familiar 
j walamiliar mimulus condition. The CVC 
data could not be used for this purpose, since 
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Lor ciae and upon scanning strategy (uniti 
zation), 

The order of perceptual stimulus presentation 
and testing was random within each session (96 
trials) except for the following constraints: (a) no 
successive repetition of stimuli containing the same 
CVC base was allowed, (b) no more than four 
successive tosts of CVC positions or added vowel 
positions was allowed, (c) no more than four 
successive presentations of stimuli generated from 
CVCs belonging to one of the two lista was per 
mitted, (d) over 32-trial blocks, half of the stimuli 
were List 1 and half were List 2 generated stimuli, 
and (e) no CVC appeared as the base of a perceptual 
stimulus more than twice in a 32-trial block. 


Se in the H and L p 
n belongi 


i 
ji 


y 
i 
i 
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TABLE 2 
PosrrioNAL ERROR RATES (iN PERCENTAGES) 
$ C Ric Right-added 
Condition esr ue sn vowel. cons “vowed 
Group P 
Familiar-intaet 28 11 17 2 36 
Familiar-nonintact 18 33 ds 39 
Unfamiliar-intact 31 i e D 
M reme 29.5 15 27.15 37.2 325 
Group L 
Familiar-intact 17 19 15 18 46 
Familiar-nonintact 24 32 39 aa 
Unfamiliar-intact 22 it 25 2 
crea 19.5 21.25 35.5 27.5 47.5 
Group H 
Familiar-intact. 13 14 8 20 37 
Familiar-nonintact 32 25 32 d 
Unfamiliar-intact 17 i 2 H 
Unfamiliar-nonintact 
x 15 17.5 17.5 24.75 40.5 
e lected I si ly. ` Analyses of performed. 
on adta wowed data ei d ed vowel columns were cole sessions produced the tame mpl tin sas o ta collected 


forward association trial, while an S in the P. group During the appearance 
pronounced the list of CVCs three extra times, masking field 
These familiarization trials preceded 18 final the letter he thou 
Pronunciation trials (P group) or 18 forward Response alte 
association trials (L and H groups), as well as a 
ares of duration threshold trials, 

iJ 


# perceptual recognition session, an S sat 76 cm. in — 15-20 see, ; 
The basic projection equipment included two 


tm. white Lucite area serving as the projection Kodak 
e projection area was long enough to hold blade shutters which were controlled by a Hunter 


seven projected letters with a :64-cm. space between 1516 si; 
the end 


of the field, Immediately after the button press 


of the stimulus coincided with the center of the 
fixation field. The added vowel 


the leftmost or rightmost position, Thus, there was 


a space between the CVC base 
vowel, 
display, a masking field was presented 
series of overlapping Xs and Os, which covered the 


b 7 and the added 
Following termination of the stimulus 


containing a 


ely counterbalanced within sessio) 


of the respons 
» S chose, from the two « 
ight occupied the prob 
rnatives remained available for 3 sec. 
The S's choice was written on an answer 
was covered following the responsc. 
final six sessions were given to the experi- — between successive responses. depende 
mental task, CVC plus vowel recognition, During response time, 


ernativee 
rnatives, 
positi 


iset, which 
Fhe time 
upon S's 


with a normal interval of about 


cte x-channel timer, 
position letter spaces and the black screen, Eleven of th 


The field was high enough to hold three rows of — task were cont: 
letters with a -32-cm. space between rows and a 
-32-cm. space between the top or bottom row andthe moved, was in the 
lack were administered 
recognition test, 
to recall as m; 
could, while 
12 familiar CVCs from 
Portion — In both conditions, Ss 
às they wanted to com, 
l appeared in either $ was paid $1.25 for the 


the perceptual experiment, T 


in the reco; 


plete 


RrsuLTS 


Error rates for all five ti 


Carousel RA950 projectors located behind 


€ 12 participants in tlic perceptual 
acted 10 wk. after the completion of 
he twelfth 5, who had 
Pronunciation group. Two tests 
; à recall test first and then à 
In the recall test Ss were asked 
ny of the 12 familiar CVCs as they 
nition task Ss chose the 
the total set of 24 CVCs. 
were allowed as much time 

their resp bi 4 
recall-recognition session. 


ests positions are 


i i ized i xamining the 
middle row where the stimulus Had just appear] Summarized in Table 2. In exar g 


Occurring with the masking field were two response influence of graphic famili 
alternatives which occupied positions directly above perceptibility, 


(top row) and below tom row) the position ance was considered. 
formerly filled with the stimulus’ to-be-tested letter, 


arity on sequence 
only added vowel perform- 


The added vowel 


results provide evidence that both left and "v 


right is were recognized more often 
whe: layed with a familiar CVC than 
wit unfamiliar CVC. An analysis of 
varce performed on the error data 
co ned this observation revealing a 


si nificant effect of graphic familiarity, 

i, 9) = 8.40, p < .05. 

Had meaning influenced stimulus per- 
eptibility, Ss in the H and L groups 
should have performed significantly better 
than Ss in the P group. Examination of 
the CVC error data in Table 2 appears to 
indicate that meaning affected CVC per- 
, However, an analysis of variance 


ion. 


rformed on CVC position data, as well 
(n analysis of variance on added vowel 
ta indicated that the between-Ss variable, 
caning, failed to produce a significant 
jiect, F (2,9) < 1 for both analyses. The 
failure of the main effect of meaning could 
be due to inclusion of intact unfamiliar 
CVC tests and nonintact familiar and 
unfamiliar CVC tests with the intact 
familiar tests. Since only intact familiar 
CVC bases were treated differently in the 
various vroups, Le, used during familiari- 
zation, the effect of meaning should be 
confined to the intact familiar condition 
within each group, and would be identified 
is a significant Meaning X Familiarity 
X Intactness interaction. Analyses of 
ariance performed on CVC position and 

Ided vowel position data failed to produce 
s interaction, F (2, 9) « 1 for both 
omparisons. The lack of an effect of 
meaning on the total perceptibility of a 
-quence could have also resulted from a 
iailure of the familiarization procedure to 
produce a difference in the CVCs for the 
various groups, Results from the long- 
term recall task place doubt on this 
possibility. The Ss in the H and L groups 
recalled an average of 11 and 9 of 12 CVCs, 
respectively, while those in the P group 
recalled 5 CVCs. Recognition performance 
did not differ betweeen groups. A set of 
tests performed on the recall results demon- 
strated that both association groups recalled 
more items than the P group (p < .05 for 
both comparisons). This information indi- 
cates that differential familiarization effec- 
tively established differences in the CVCs 
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HIGH M GROUP HIGH M GROUP 
LOW M GROUP = LOW M GROUP 
P GROUP P GROUP — 


$ 8 8 


PERCENTAGE OF ERRORS 
5 8 


LEFT CENTER RIGHT LEFT CENTER RIGHT 
CVC POSITIONS 


FIGURE 1. saning X Consonant-Vowel-Con- 
sonant (CVC) Position X Familiarity interaction. 
(Abbreviations: M = meaningful and P = no as- 
sociate. ) 


for the various groups and thus eliminates — 


the failure of familiarization as an explana- 
tion of the null effect of meaning. 

While meaning failed to effect the level 
of CVC recognition, a significant Meaning 
X Position interaction, F (4, 18) = 14.59, 
p < .05, indicated an influence of meaning 
on the way CVCs were unitized. The 
interaction displayed in Figure 1 appears 
to result from the marked increase in errors 
from the left to center to right CVC 
position associated with the P group, con- 
trasted with a considerably more moderate 
left to right increase in errors for the H 
and L groups. The slopes of the three 
curves in Figure 1 were compared with a 
slope of zero (see Draper & Smith, 1966, 
pp. 15-24), Only the slope describing the 
increasing errors across positions for the . 
P group differed significantly from a zero 
slope, ¢ (D = 62, p < .05. In addition 
it should be noted that the positional error 


trends within each group were similar for — 


familiar and unfamiliar CVC bases. 


DiscusstoN 


The present results confirmed Taylor's 
(1958) finding of an effect of graphic familiarity 
upon letter sequence perceptibility. While — 
meaning pretraining did not produce a similar 
effect, it did influence the strategies for unit- 
izing the letter sequences. The Ss in the P 
group appeared to treat each letter of a CVG 
as a unit, and processed the letters in a left to | 
right direction. In contrast, the requirement - 
of making a meaningful association (H and L 
groups) encouraged 5s to treat the whole CVC 


t 
2 


i 
l 
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as a perceptual unit. Both strategies occurred 
with unfamiliar CVC-based stimuli, as well as 
with familiar stimuli. The effect of meaning 
pretraining, particularly with regard to un- 
familiar perceptual stimuli, supported and 
broadened an earlier finding of Aderman and 
Smith (1971). Aderman and Smith demon- 
strated that pretraining involving the con- 
sistent presentation of letter strings with a 
certain orthographic structure could influence 
the unitization strategy used in processing a 
succeeding letter string. Two orthographic 
structures were used during pretraining with 
half of the Ss receiving a list of stimuli having 
one structure, while remaining Ss received a 
list with a second structure. In the present 
study, orthographic structure of both the 
pretraining stimuli and the base of the per- 
ceptual stimuli was held constant, i.e., CVC, 
while the level of meaningfulness was varied 
during pretraining. The fact that variations 
in meaning pretraining led to different unitiza- 
tion strategies is evidence that types of pre- 
training less closely associated with the physical 
structure of a letter sequence than orthographic 
variations can produce changes in strategies 
for unitizing strings of a fixed structure. 
Aderman and Smith's (1971) results also 
indicated that if the pretraining orthographic 
structure conformed to the rules of English 
spelling patterns, S would attempt to unitize 
the succeeding test string into groups of 
letters identified as spelling units. If the 
Succeeding string also matched an ortho- 
graphic structure of English, the spelling unit 
strategy would be compatible with the succeed- 
ing string, and § performed better in this 
condition than if pretraining led S to use a 


"strategy in which single letters served as units. 


Aderman and Smith found no advantage, 
however, to unitization Strategies resulting 
from pretaining sequences either conforming 
or not conforming to Englisb structure when 
the succeeding test string did not conform 
to English structure. Perception of CVCs 
also appears to be unaffected by unitization 
strategies resulting from pretraining. Whether 
CVCs were unitized as wholes or by single 
letters, the level of recognition was not 
significantly altered, Thus, various types of 
pretraining produce different unitization strate- 
gies, but whether those strategies influence the 
perceptibility of a letter sequence depends 


SETH N. GREENBERG 


upon the orthographic str: of the test 
sequence. It should also be d that some 
pretraining variables, such niliarity in 
the present experiment, ini sequence 


perceptibility without affectin 
interaction between fam 
was found). Henc 


tization (no 
ad position 


variatio unitization 
strategies are not necessary to | ce changes 
in the level of perceptibility ter strings 
of a fixed structure. 
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Four expt 


presentati 


suits showed that (a) reactior 
y no differenc 


eness 


JOCATIONS ! 
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ments examined manipulations of retinal location, visual field of 
ponse variables in a visual matching task where the re- 
s one of speed of judging whether the stimuli were the same or dif- 


a time increased with retinal eccentricity, 
es between the two visual fields, (c) a 


nal Location interaction emerged, and (d) both judg- 
and of “difference” were responded to ina similar manner, 


rdless of the visual field of presentation. 


xperiments examined an aspect 

- effects of retinal location on in- 
nation processing of tachistoscopically 
vvesented letters. In most of the literature, 
stimuli have been a horizontal row of 
letters spread across the fixation point, 
subtending generally less than 3? or 4? of 
visual angle. With the assumption that 
acuity would be sufficiently homogeneous 
over this region (the evidence summarized 
in Riggs, 1965, notwithstanding), retinal 
location has not been considered a par- 
ticularly significant variable. This has 
been held even with the general findings 
in the same literature that accuracy of 
report, speed of reaction, clarity, apparent 
contrast, and other variables show vari- 
tion over this range of locations. Eriksen, 
ia a number of studies from his laboratory 


‘The project presented herein was performed 
lant to a grant from the National Institute 
of Education, Department of Health, Education, 
and Welfare. However, the opinions expressed do 
not necessarily reflect the position or policy of the 
National Institute of Education, and no official 
endorsement by the National Institute of Educa- 
tion should be inferred. 

Partial support for this research comes from the 
National Institute of Education (NE-G-00-3-0017) 
to the first author and from U.S. Public Health 
Service (MH 10753), Office of Education (OEG- 
2-0671), and from the Behavioral Research 
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(e.g., Eriksen & Hoffman, 1972; Eriksen 
& Rohrbaugh, 1970) has worried about 
this sufficiently to avoid horizontal arrays 
altogether, preferring to place his stimuli 
in a circular arrangement around fixation. 
This is not always a feasible procedure, 
especially when recognition among groups 
of letters (e.g., words) is of interest. 
Harcum (1964) has claimed that spatial 
organization among items in the array is 
a far more important variable than ab- 
solute retinal location. Hershenson (1969) 
has challenged this argument, suggesting 
that under circumstances in which the 
perceiver is thoroughly familiar with the 
organization of the material, the distance 
from fixation determines perceptual clarity 
and accuracy of report. Harcum (1970), 
after reanalyzing Hershenson’s data, has 
not been convinced, and the issue is as 
yet unresolved. 
In another line of work (e.g., Bryden, 
1967; Heron, 1957), it has been shown 
that there is an asymmetry of accuracy 
of report across the fixation point. When 
a single letter appears either to the right 
or left of fixation, it is reported more 
accurately when on the right side, pre- 
sumably because it projects to the left, 
language processing cerebral hemisphere. 
The recent findings of Egeth and Epstein 
(1972) complicate the issue. They pre- 
sented a vertically arranged pair of capital 
letters either 4° to the right or 4° to the 
left of fixation. The S’s task was to 
respond with a same or different judgment 
by pressing a button which recorded re- 
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action time. When the letters were on 
the right, the same responses were faster 
than the different responses, and the con- 
verse was found for letters presented on 
the left. They attributed this difference 
to different processing occurring in the two 
hemispheres. Geffen, Bradshaw, and Net- 
tleton (1972) required Ss to make name 
matches as well as visual matches (Egeth 
and Epstein required only the latter). 
They found that name matches were faster 
when the letters were presented on the 
right visual field as might be expected 
from the language specialization hypothesis 
about brain function. However, they did 
not replicate the same-different interaction 
with visual field side. 

This brief review is not intended to be 
exhaustive but only to show the range of 
variables relevant to problems of retinal 
location in information extraction tasks. 
There are many differences between the 
procedures of these and other related ex- 
periments, some of which are examined in 
the following studies. In each of the ex- 
periments reported, a pair of vertically 
arranged letters were briefly presented. 
The S had to classify them as rapidly as 
he could as same or differenl. "The pair 
could appear in 1 of 11 locations varying 
up to 4? on either side of the fixation 
point. This arrangement permits an ex- 
amination of the main effect of retinal 
location and the possible interactions be- 
tween eccentricity and type of response 
(same or different) or visual field (left or 
right). In one of the experiments, an 
uppercase and a lowercase letter were 
always paired; S was required to make 
a name match rather than a physical 
match, thereby permitting assessment of 
this variable, Finally, one experiment 
compared the traditional reaction time 
procedures of using two response keys 
with that used by Egeth and Epstein 
(1972) in which S used only one button. 
There S was instructed to press a button 
for sume and do nothing to indicate dif- 

ferent, or vice versa. 
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GENERAL Metno 


Subjects. In each of the expe: s, Ss y 
undergraduate students at the Unis 

Carolina who had normal or correc 
were right-handed. Experiment 
dents, Experiment IB used 60 sti 
ments ITA and IIB used 34 an 
respectively, and Experiments 111 

and 24 5s, respectively. 

Stimuli and procedure. The stim 
of letters presented directly abov 
They were black Paratype uppercase 
At a distance of 86 cm., they c 
about .35? vertically and horizont: 
tical distance between the two lei 
(except in Experiment ITA; see below 
chosen were from zero-order approx 
of letters. On half of the trials, th: 
the same two letters; the remainin: 
two different letters. A background : 
always on except during the presen 
letter pair. The luminance level in b 
was equated at 2 ftL. 

Upon introduction to the laborato 
informed about the nature of the tasi 
and were instructed to fixate on th 
located fixation spot. The stimuli 
were presented in a Gerbrands | 
(Model T-4B). Upon initiating a trial, 
stimulus would appear and a clock counte d 
tronic Research Mfg., Model 2600) w d -starti 
The 5s responded by pushing one of two buttons 
each 3.12 cm. from the start button and 3 cms 
from each other. The Ss used their index finger 
of the preferred hand for starting the tachistoseope 
and stopping the reaction time clock. ‘Ihe Ss were 
counterbalanced across response buttons and each: 
received one block of trials which were considered 
practice and not included in data analysis 


EXPERIMENT I 


The main variables of study were the” 
11 different retinal locations and same or 
different judgment, These 22 experimental 
trials were arranged in blocks in a random - 
order and repeated in a different order 
10 times per S. The Ss were first pre- 
sented with one block of 22 trials which 
were considered practice, and then tested — 
on the 220 experimental trials. Each $ 
participated for seven sessions. The study 
was then replicated (Experiment IB) using 
many more Ss with eight observations 
per S per condition, collected in only one 
Session. In Experiment IA, all letters 
were used with random assignment 10 
retinal locations. [n IB, only the letters 


I -CTING INFORMATION FROM 
Qr A, > i, C, T, R, and S were used, 
and t! ere counterbalanced over loca- 
tions Experiment IA, the letters were 
plac t fixation or at .5°, 1,°, 2°, 3°, 
or 4” either to the left or right of fixation. 
Exr-riment IB used the same locations 
execpt that the 4° location was replaced 


with 3.5% 


Results 


The 


wo 


; made 7% and 5% errors in the 
the experiment, respec- 
irials were excluded from the 
time analyses. Reaction times 

cored and summed for each S across 

i.c conditions and entered into an analy- 

of variance, a different one for both 
ersions of the experiment. In neither 
experiment was any main effect or inter- 
action found with visual field side, nor 
were practice effects significant. The left 
side of Figure 1 plots these results; the 
right side collapses over visual field since 
that variable produced no effects or inter- 
actions in either version. 

Judgments of same or different were 
equally fast—only a 2-msec. difference in 
both experiments. The different retinal 
»ositions, however, produced quite dif- 
‘rent reaction times, F (4, 12) = 29.2, 

id F (4, 236) = 36.7, respectively, both 
< 001, In both experiments, retinal 
‘oration and same-different response inter- 
'cted, and in virtually the same way, as 
cam be seen on the right-hand side of 
Figure 1, F (4, 12) = 3.3, p = .03, and 
F (4, 236) = 27.5, p < .001, respectively. 
Three of the reaction times near fixation 
for the different responses were slower than 
those of the same responses. In the eight 
other cases the same responses were slightly 
faster than the different responses. 

Trend analysis applied to these data 
showed significant linear effects in both 
experiments for the position of the stimu- 
lus on the retina, F (1, 3) = 42.6, p < 
001, and F (1, 59) = 101.6, p < .001, 
respectively. In Experiment IA, retinal 
location produced a quadratic component, 
F (1, 3) = 143, p < .001, and in Experi- 
ment IB, a cubic component was evidenced, 


versions ol 


t 
; these 


tion 
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EXPERIMENT IA 


10 E 3134 
RETINAL LOCATION (IN DEGREES ) 


Ficure 1. The mean reaction time (in milli- 
seconds) is plotted as a function of the retinal loca- 
tion of the stimulus pairs in Experiment I. (The left 
panels show the data for all 11 locations, the right 
panels collapse the data over visual field of 
presentation. ) 


F (1, 59) = 82, p< .006. A critical 
trend is the interaction of the linear com- 
ponents of same-different judgments and 
retinal location—in both experiments these 
interactions were significant, F (1, 3) = 7.8, 
p < .006, and F (1, 59) = 27.3, p < .001. 
There were other significant and highly 
complex trends evident—in Experiment IB, 
the same-different linear component inter- 
acted with the retinal location quadratic 
component, F (1, 59) — 25.0, p < .001, 
and the left-right visual field linear com- 
ponent and the retinal location quadratic 
component also interacted, F (1, 59) = 
10.5, p < .001. The triple interaction of 
the same-different and left-right linear 
components interacted with the cubic com- 
ponent of retinal location, F (1, 59) = 4.5, 
p = .03. These two interactions are highly 
complex, difficult to interpret, and account 
for a small portion of the variance (3.8%) 
relative to the linear trend of retinal loca- 
tion (51.8%) and the two-way interaction 
of Same-Different X Retinal Location 
(14%). 

An analysis of the reaction times for 
erroneous responses was also undertaken. 
In general, error responses mirrored cor- 
rect responses in most respects in that 
longer reaction times were more likely to 
be associated with errors. Thus, more 
errors were made for more peripheral loca- 
tions. In the second experiment, more 
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errors were made for the same responses— 
8% vs. 2%, F (1, 59) = 62.3, p < .001. 
In both experiments, the frequency of 
erroneous reaction times produced an inter- 
action between the same-different responses 
and retinal position, just as did the correct 
responses, F (4, 12) = 5.4, and F (4, 236) 
724.5, ps < .001 in both versions. There 
was a small interaction of Visual Field X 
Retinal Position in Experiment IB, F (4, 
236) = 6.5, p < .001. The sum of squares 
for this interaction accounted for only 1.3% 
of the total variance. 


Discussion 


The results from both versions of experi- 
ments are consistent. Reaction time in- 
creased with peripheral eccentricity. Further, 
the pattern of errors generally reflected the 
reaction time data. Thus, where there were 
increases in reaction time there were cor- 
responding increases in the number of errors. 
Since the error rate was generally low, Ss were 
not making a speed-accuracy trade-off. When 
the letters were presented 3.5? or 4? in the 
periphery the processing task was more dif- 
ficult, Ss therefore made more errors and, in 
addition, took longer to process the items. 

There was no evidence of an interaction 
of Visual Field X Same- Different responses, 
the results reported by Egeth and Epstein 
(1972). However, in Experiment I the letter 
pairs were separated only by a minimal 
amount of space. In the study reported by 
Egeth and Epstein, the letters were separated 
vertically by 4°, Experiment II was therefore 
constructed to examine this difference by 
placing pairs of letters either close together 
ag apart in either the left or right visual 
field, 


EXPERIMENT II 


Two vertically arranged letters were 
presented 3.5? either to the left or right 
of fixation. They were chosen from the 
symmetrical letters A, H, I, O, Ds rV. 
X, and Y. In one condition, the letters 
were separated vertically by 4°, similar to 
that procedure used by Egeth and Epstein 
(1972). In the other, only .15° separated 
the two letters. In both conditions there 
were two experimental conditions each 
with two levels: visual field and the re- 
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sponse same or different. Sin 
nine stimulus letters used, there 
possible same combinations.. ^ 
combinations were then arrange 
nine were repeated in the re 
that is, with whatever was th 
being the bottom letter. The: 
visual fields, nine different s 
possible responses (same com; 
different), and four replications of each 
the possible combinations making a toi 
of 144 experimental condition The 
were arranged in blocks of 36 trials, ea 
block being one replication of 
perimental trials. These four b 
each arranged in a random orde: 
end of an experimental sessio: 
would have provided 36 measur: 
each of the four possible conditio 


each 
nts for 


Results 


There was a great deal of viability: 
with respect to the number of errors made. 
Some Ss made many errors, while othe) 
made very few. The Ss were therefore 
divided into two groups—those 5s who 
made 10% errors or less and those who 
made more than 10% errors. In data 
analysis, these two groups have been 
treated separately. However, except im 
the one instance to be noted, the pattern 
of results was so similar across error rates, 
that Ss have been combined with respect 
to the presentation of results. 

In general, no significant effects or inter 
actions were found with either the wide 
(3.5°) or close (.15°) vertical separations. | 
Specifically, no hint of a Visual Field 
X Same-Different interaction was apparent - 
in either version, regardless of the error 
rate. These data are presented in Table 1. 
There were significantly more errors made 
on the same response for both error groups ' 
for both separations. 

The results of this experiment confirm 
those of the first experiment. In this 
study, when the letters were presented 
close together so that Ss could make @ 
Physical match, or far apart such that à 
name match might be made, there was 10 
Visual Field x Same-Different interaction. 


\CTING INFORMATION FROM DIFFERENT RETINAL LOCATIONS 


There 'een no clear evidence in Ex- 
perim _ IL or in the Egeth and Epstein 
(1972 idy that Ss were indeed naming 
the ili, Experiment III examined the 
nai process by using capital and small 
leti -s as paired stimuli. 


EXPERIMENT III 


The stimuli for this experiment (A, B, 
C D E.G T, Ne Bee R a) were 
Y different set of letters chosen to reduce 
he visual similarity between the upper- 

d lowercases. The S was now to indicate 

he letters had the same name or not. 

ill other respects this study was iden- 
|| to that of Experiment II. The 
rtical separation was .15°. 


Results 

All Ss tested in this experiment produced 
high error rates, averaging 27%. As in 
the other studies, the error rate was sig- 
nificantly greater for the same responses 
than for the different responses, F (1, 19) 
= 60.5, p < .001. 

The reaction time data were treated as 
in the other experiments. There was a 
ignificant left visual field superiority, 

(1, 19) = 5.8, p <.05. Overall, the 

t visual field was responded to 29 msec, 

ister than the right visual field. There 
vas no difference between the same and 
“rent response, nor any Visual Field 

Same-Different interaction. These data 
are presented in Table 1. 

he critical interaction of Egeth and 
Epstein (1972) remains elusive. When Ss 
were presented with stimuli that were 
named (Experiment III) or not named 
(Experiments I and II), or when stimuli 
were close together or far apart (Experi- 
ment II), this interaction did not emerge. 
There is one further methodological point 
that may be relevant. Egeth and Epstein 
used a go-no-go technique, in which half 
of their Ss were asked to respond by 
pressing a single button as fast as they 
could if the letters were the same and do 
nothing if the letters were different. The 
other Ss pressed if the two letters were 
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TABLE 1 
REACTION TIME (IN MSEC.) 
Visual field 
Experiment and condition | —— 
Left Right. 
Experiment IA 
Same 792 788 
Different 774 768 
Experiment IIB 
Same 183 | 801 
Different * 769 786 
Experiment III 
Same 988 1,026 
Different 967 987 
Experiment IV 
Same 567 574 
Different 603 600. 


differen! and did nothing if they were 
the same. 


EXPERIMENT IV 


Experiment IV is a replication of Ex- 
periment II, differing only in that half of 
the Ss were told to press the same button 
if the letters were the same and do nothing 
if they were different. The other half of 
the Ss were told to press the diferent 
button if they were different and do 
nothing if the letters were the same. The 
letters were separated by .15°. 


Results 


The data were treated in the same 
manner as in the other experiments. 
Overall, there was a 10% error rate. The 
analysis of variance showed that there 
was neither a main effect of same com- 
pared with different nor of the visual field 
of presentation. The Visual Field X Same 


Different interaction was not evident 
(F < 1). The error rates showed no sig- 


nificant differences or interactions. Thus, 
except for the overall reduction in reaction 
time between this experiment and the two- 
button ones, this methodological change 
cannot be responsible for the overall pat- 
tern of results in these experiments. 


GENERAL DISCUSSION 


One of the main findings of the present 
experiments was the change in reaction time 
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that was associated with changes in the retinal 
location of the stimuli. Experiment I showed 
that with increases in retinal eccentricity 
from fixation, judgments of sameness and of 
difference required more processing time. 
Both Harcum (1964) and Hershenson (1969) 
would predict this to be the case—both ac- 
curacy and reaction time data should reflect 
one another. These data do not directly 
converge on the notion of sensitivity com- 
pared with saliency as put forth by Harcum 
(1970). They do provide data that can be 
used to estimate the effects of manipulating 
fixation in other reaction time tasks that have 
spread letters across a fixation point. For 
example, a certain number of milliseconds 
might be subtracted from different retinal 
locations to examine visual processing uncon- 
taminated by varying retinal locations. 

The data provide evidence for a change in 
processing strategy depending on the locus 
of stimulation. The Same-Different X Retinal 
Position interaction found in Experiment I 
shows that at near foveal positions same 
responses are faster than different responses; 
this is a result that has been found many 
times (e.g, Posner, 1969). As the retinal 
eccentricity increases, however, the different 
responses come to be faster than the same 
responses, These results, the obtained inter- 
actions, and trend analyses support the con- 
clusion that S's strategies may shift to a 
criterion of sameness or difference as a func- 
tion of the location of stimulation. 

None of the other variables examined by 
the experiments seemed to be critical. The 
previously reported interaction of visual field 
with same-different responses was never ob- 
served. The only visual field effect found 
occurred when Ss had to make a name match, 
and this was equally strong for both responses. 
The amount of the vertical separation of the 
letters did not seem important, nor whether 
Ss used two response buttons, or only one 
with a go-no-go criterion, 

Overall the results of the present studies 
present data to support five main conclusions: 
(a) When Ss are engaged in a visual matching 
task, performance is slower the further the 
stimuli are from fixation ; (b) there is a change 
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- compared with different respon 


in the speed of responses for 


fovea; (c) there is no differ 
judgments of sameness or di! 
respect to visual field of present 
only hint of a visual field effect 
Ss are forced to name the stim 
neither spacing arrangements 
key arrangements seem to ex: 
influence on the results. 
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Spanish 
or English words. 
of words that were the same 
familiarization list. Familiarization 
in that the amount of 


within languages 
for all g 
interdependence hypothesis. 


oups 


The unique ability of bilinguals to 
cess information presented in two dif- 
nt linguistic systems has given rise to 
ering ideas as to the nature of bilingual 
nization of memory. One of these, 
independence hypothesis, assumes that 

he bilingual has two distinct memory 
tores, one for each language, with informa- 
tion presented in one language not being 
readily available to the other system. 
A contrasting position, the interdependence 
hypothesis, holds that bilinguals store in- 
formation in one memory store only 
(Kolers, 1966). An interdependence hy- 
poth vould assume that bilinguals store 
words in terms of their semantic features 
only, with some means of "tagging" the 
ems with the proper language at the time 
output, 

Although several studies of bilingualism 
Lave supported the interdependence hy- 
pothesis, support for this position has not 
been unanimous. Most studies supporting 
the interdependence viewpoint have de- 
monstrated a bilingual equivalence effect, 
wherein the presentation of an item in one 
language produces the same effect on 
recall of that item in the other language 
as if it had been originally presented in 
the other language (e.g., Glanzer & Duarte, 
1971; Kolers, 1966). Tulving and Colotla 
(1970), supporting the independence hy- 
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glish bilinguals were first familiarized on a list composed of Spanish 
Next they learned a Spanish or an English list consisting 
s (or translated) or different from those in the 


effects were uniform both between and 
f positive transfer obtained was the same 


The data were interpreted as being supportive of the language 


pothesis, maintain that this equivalence 
effect is probably due to a well-established 
habit of translating. Facilitation in the 
recall of translated items would not be 
evident were it not for an automatic 
translation of these items by S at the time 
of presentation. Presumably, this trans- 
lation habit would be strongest in bi- 
linguals who are not equally proficient in 
each language, translating from the weaker 
to the dominant language (Goggin & 
Wickens, 1971). 

'The present study was designed to test 
this translation hypothesis by giving Ss 
familiarization trials on items in one lan- 
guage and then having them learn, by free 
recall, the same items in translation, If 
the bilingual's two languages represent 
two independent systems, there should. be 
no familiarization effect at all, unless Ss 
actively translate the items. However, 
this translation should result in facilitation 
of learning of items in the stronger lan- 
guage only, after they have been famil- 
iarized in the weaker language. There 
should be no facilitation effect for items 
familiarized in the strong language and 
learned in the weak language, since Ss 
would have no reason to translate in this 
direction during familiarization. 

The interdependence hypothesis would 
predict facilitation of learning in all cases. 
If the familiarization of a word or its 
translation results in the activation of the 
same semantic representation, then. this 
familiarization should result in an equal 
facilitation of learning, regardless of 
whether or not learning is in the same 
language as familiarization, and regardless 
of which language is dominant. 
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TABLE 1 
Mean NuwnkRs or ITEMS RECALLED Per TRIAL 


Familiarization language 
List language 


English Spanish 
a Ex rimental 
English 12.38 11.91 
Spanish 10.25 11.39 
Control 
"ER lish 11.15 11.35 
Spanish 9.16 8.79 
METHOD 


Subjects. The Ss were 64 Mexican-American 
high school juniors and seniors, bilingual in English 
and Spanish, who volunteered to serve in the 
experiment. These Ss were asked to rate them- 
selves on a number of variables related to language 
learning and fluency, Assignment of 5s to condi- 
tions was by a predetermined random sequence 
according to order of appearance in the experi- 
mental room, 

Design, The Ss first read a 16-word list in 
English or Spanish and then learned a 16-word 
list in the same or the other language. In the 
experimental conditions, the second list consisted 
of the same words that were in the familiarized 
list, in the same language or translated to the 
other language. In the control conditions, the 
first-list words were unrelated to the words to be 
learned. Two Es tested the Ss. The design can 
thus be summarized as a 2 X 2 X 2 X 2 factorial 
design with a within-Ss variable of 10 free-recall 
trials (n 4/cell). 

Materials. The verbal materials used in the 
experiment were selected by asking 51 students 
from the same school as the experimental Ss to 
translate 100 common adjectives from English to 
Spanish or vice versa, Two different forms were 
used to counterbalance for direction of translation 
of the adjectives. The 32 adjectives most often 
translated correctly composed the four lists used 
in the study. None of the students asked to 
provide these translations served in this experiment, 

Procedure, Each of the four 16-item lists was 
typed on a plain sheet of typing paper. As each 5 
entered the experimental room, he was told by E 
that before the experiment started, Æ would like 
to hear S read a list of words. The Ss were then 
required to read the list aloud for six repetitions, 
Following the sixth repetition, S was told that 
the experiment could now begin and was read 
standard free-recall instructions. All Ss were then 
given 10 trials on the list, Stimuli were Presented 
auditorily on tape at à 2-sec, presentation rate. 
Recall was oral with a 90-sec, recall interval. 
Order of presentation of the words was randomized 
across the 10 trials, and none of these orders cor- 
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responded to the order of the « ial list read 


by Ss in the experimental conditio 


RESULTS 


The Ss rated their fluency 
guage on a 1-7 scale. The 
rating was 4.97, and the m: 
Spanish fluency was 3.77. 
Ss indicated that they had sp 
and Spanish for 13.27 and 


each lan- 
n English - 
rating of 
addition, 
1 English 
53 years, 


respectively. Fifty-eight of i! 64 Ss in- 
dicated that they thought English, 
X (1) = 3451, p <.01. Th: `s appear 
to be compound bilinguals (^ Ervin & 
Osgood, 1965) with English as ir domi- 
nant language. That is, thesc learned 
both languages at the same : and im 
the same context, yet (proba! due to 
a deemphasis of Spanish and a c... spond: | 
ing emphasis on the use of glish in 
school) they presently consider + mselves 
to be more fluent in English. 

An analysis of variance was o rformed 
on the number of items recalled o or trials. 
The experimental Ss recalled si icantly 
more items than did the controls, . (1, 48) 
= 10.20. English lists were easier to learn 


than Spanish lists, F (1, 48) = 1 
addition, the trials main effect 
significance, F (9, 432) — 124 
these Fs were significant at the .01 level. 
The Learning Language X ‘Triais inter- 
action was significant, F (9, 432) = 2.54, 
b <.05, as English lists were learned 
faster than Spanish lists. This is not 
surprising in view of the fact that for this 
group of bilinguals English is highly 
dominant. 

A facilitation effect of familiarization 
was found for all language conditions, and 
this enabled experimental Ss to recall sig- 
nificantly more words than control Ss, aS 
previously noted. Inspection of Table ! 
reveals that there was not the expected 
differential facilitation that would have 
been predicted by a translation hypothesis. 
That is, familiarization of a Spanish list 
followed by learning the English equiva- 
lents does not result in a greater facilita- 
tion, relative to control items, than does 
familiarization of an English list follow 
by learning of the Spanish equivalents. 


(Sc OmU. 


ifferential facilitation would have 
n manifested in significant Familiariza- 
Language X Learning Language and 
iliarization Language X Learning Lan- 
uage X Condition interactions. These 

eractions were not significant, Fs (1, 48) 
= .36 and 1.62, respectivels 


Discussion 


present study support 
an interdependence hypothesis of bilingual 
"organization of monory. A positive transfer 
effect of familiarization was produced in all 
combinations of familiarization list language 
and learning list language, both intralingual 
and interlingual. The positive transfer in the 
interlingual combinations is especially reveal- 
ing in view of the fact that there was no 
significant Familiarization Language X Learn- 
ing Language interaction. That is, these 
English-dominant bilinguals did not show a 
- significantly greater facilitation in the Spanish 
to English conditions ys. the English to 
Spanish conditions, as they should have if 
a translation hypothesis of the bilingual 
equivalence effect is to be accepted, 

The greater difficulty Ss have in learning 
Spanish lists relative to English lists can be 
attributed in part to the learning history of 
‘these Ss. In school they are expected to 
— learn in English, not Spanish. Consequently, 
they have not developed the elaborate coding 


The results of ih 
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strategies in Spanish that are necessary for 
rapid acquisition of verbal material. 

Finally, it should be noted that a single 
semantic store for bilingual memory that is 
supported by this experiment does not exclude 
the possibility that other components, notably 
the syntactic and phonological components, 
of a bilingual's linguistic systems are inde- 


pendent. This aspect of bilingualism has 
been closely investigated by Macnamara 
(1967). 
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Why do some problem solvers succeed on problems for which they have «|! 
the necessary component skills, whereas others do not? It was assumed (| 
a major portion of problem-solving ability can be traced to the presenci 
absence of higher order rules which make it possible to generate solution p) 
cedures. Experiments I and II were designed to test a simple problc: 
solving mechanism by which lower and higher order rules interact. Exp 
ment I involved a composition higher order rule and Experiment I, a higi 
order generalization rule. Results were over 98% consistent with theoretic 
predictions concerning the behavior of individual Ss on specific problen 
In Experiments III and IV the presence or absence of higher order rules wo 
determined indirectly through testing. The overall percent correct prediction 


were 85 and 92 respectively. 


One of the most crucial questions in 
problem-solving research is why some 
problem solvers succeed on problems for 
which they have all of the necessary com- 
ponent skills, whereas others do not. In 
dealing with this question, most studies 
of problem solving have attempted to deal 
with the process in its full complexity (e.g., 
Duncker, 1945; Newell & Simon, 1972). 
That is, most studies have employed prob- 
lem situations that involve problem defini- 
tion (the formation of subgoals), memory, 
the derivation of solution procedures, and 
the use of such procedures. The situation 
is further complicated because existing 
theories of problem solving are either ex- 
tremely limited in scope (i.e., to specific 
kinds of problems) or, if general, are more 
like overall schemas than strong theories 
(cf. Greeno, 1973). 

The present research has adopted a some- 
what different strategy. It seeks to deal 
separately with the various aspects of 
problem solving (e.g, the derivation of 


1 s research was supported in part by National 
Sci Foundation Grant GW-6796 and U.S. Office 
of Education Grant OEG-3-71-0136. Experiment I 
was conducted with Louis Ackler, Experiment II 
with Francine Endicott and John Durnin, and 
Experiments III and IV with Durnin and Wallace 
Wulfeck. The author would also like to thank 
Joe Karabinos and Diane Triman, who helped proc- 
tor the experimental sessions, 

2 Requests for reprints should be sent to Joseph 
M. Scandura, University of Pennsylvania, 3700 
Walnut Street, Philadelphia, Pennsylvania 19174. 
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solution procedures) but in a 
generalizes over tasks. Spe 
research is not concerned with <e 
memory or with problem defi: 
cautions were taken to insure t 


presented and that they wer 
in the way E intended. Furt 
going research is concerned wit! 
falsifiable theory (Scandura, 197 
ing the behavior of individua! 
ticular problem situations, an: 
either the performance of groups or 0 
individuals averaged over tasks | 
The general point of view adopted 
that rules underlie all behavior (eg 
Scandura, 1970). More specifically; 
research is based on the assumption tha 
when the goal adopted by S is known, 
major portion of problem-solving abi 
can be traced to the presence or absene 
higher order (HO) rules. These rules mi 
be used to combine constituent parts 0| 
problem solution into a coherent whol 
adequate for solving the problem ( 
Saugstad, 1955) and/or to otherwise get 
erate solution procedures from know! 
rules. 
In this context, the question arises 
to how known rules interact in prob 
solving. It is hypothesized that in proble 
solving (as in all learning) rules intera 
according to a fixed mechanism, whic 
presumed to be innate rather than learn 
(Scandura, 1973), This mechanism 


C 


HIGHER ORDER RULES 


operate as follows: Given a 


assume: 

goal s n (S, G), S tests to see if the 
probl: olution is immediately available 
(i.e see if he knows the solution). If 
not, trol shifts to the solution goal (SG), 
consisting of the set of potential solution 


ru';s, and S tests the rules available to him 
to see if any includes S in its domain and 
he goal in its range? If such a rule is 
available, control reverts to the original 
roal is applied to S. If not, 
o the higher level goal (HG), 
g of HO rules which apply in the 
u goal situation (S, G) and whose 
es contain a potential solution rule. 
ih this higher level goal in force, S is 
sumed to test his available rules as be- 
‘ore. If one is found, control reverts to the 
aext lower level goal and the HO rule is 
applied to other available rules, thereby 
generating a new (potential) solution rule. 
Tf the lower level goal is satisfied by the 
new solution rule, control reverts to the 
original goal and the solution rule is 
applied to S. The problem is solved if the 
(potential) solution so generated satisfies 
the goal situation. 
It should be noted that testing, goal 
hifting, and rule application are essentially 
ie same at all goal levels. Generalization 
id formalization of the mechanism is given 
i» Scandura (1973). Furthermore, HO 
rules are formally identical to other rules 
and obey the same laws of behavior. The 
descriptor "higher order" refers to the role 
of a rule in a particular instance of problem 
solving, not to its basic nature. This 
differs from common usage. The term 


ind rul 


ntrol shifts 


‘This condition is slightly different from that in 
an earlier formulation. (Scandura, 1973). Instead 
of requiring that the range of the rule contain the 
goal, | had initially proposed that the range be con- 
tained in the goal. While the latter condition works 
equally as well with initial goals, it is not fully 
adequate. For example, in the trading game experi- 
ment requiring the range of a composition rule to be 
contained in the higher level goal is overly restric- 
tive because this goal only contains A >Y > C 
rules (Y variable and A and C constants), whereas 
the range of the composition rule more naturally 
includes all composite rules of the form X — Y>Z 
(where X and Z are also variables). 
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“higher order rule" as used by Gagné 
(1970), for example, corresponds to the 
output of what is referred to here by this 
term. 

Suppose, for example, that S adopts goal 
G in situation S and has only the following 
rules available: ri, rə (neither of which 
satisfy SG), and h, where h (rı, re) = p and 
p satisfies SG. What can we say about 
what S will or will not do? Given that S 
has adopted goal G in situation S, and only 
has rules rj, rs, and h available, Table 1 
shows that the above mechanism provides 
a sufficient basis for predicting S’s behavior. 
If we changed the situation by eliminating 
any one of r, rs, or h, we could conclude 
that .S would fail. In order to make pre- 
dictions where training on rules is not a 
central variable, it is essential to determine 
what rules S knows. Indeed, the general 
importance of prior knowledge in research 
on complex human behavior is becoming 
increasingly clear (e.g, Bourne, Ekstrand, 
& Dominowski, 1971; Scandura, 1973). 

The major purpose of this research was 
to test this mechanism empirically, both 
by manipulating HO rules and by testing 
to determine their availability prior to 
problem solving. 


TABLE 1 
SEQUENCE OF EVENTS IN IDEALIZED 
PROBLEM SOLVING 


Event Theoretical justification 


SG = {ri|Se Domain ri, 
G C Range ri) = $ r 
(is empty) Assumption 

^. Control shifts to HG Higher level 
hypothesis 


HG contains h Assumption 
7. Control reverts back 
to SG and S applies 
h tory, rs (to get p) 


Reversion and per- 
formance hypotheses 
p satisfies SG Assumption 
, Control reverts to G 
and S applies p to S 
(toget R) R may 
satisfy G 


Reversion and per- 
formance hypotheses 


Note, See text for explanation of abbreviations used. 
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EXPERIMENT I 


Experiment I provided a test of the 
mechanism involving an HO composition 
rule. 


Method 


Materials and tasks. The experimental material 
consisted of tasks involving trading stimulus objects 
of one kind (e.g., red chips) for response objects of 
another (e.g., pencils). Each task can be charac- 
terized as a set of stimulus-response pairs in which 
n stimulus objects are mapped into (m + m) re- 
sponse objects (e.g., » red chips into n + 3 pencils). 
"Throughout the experiment, n +m < 10, with 
n € 1 and m <4, 

There were two kinds of rules, simple and com- 
posite, for solving such tasks, Simple rules were 
represented on 5 X 8 in. cards. To represent the 
simple rule which maps » paper clips into +1 
blue chips, for example, a paper clip was glued on 
the left and a. blue chip on the right of the card. 
Composite rules generate trades in two steps. One 
composite rule, for example, first changes pencils 
into paper clips and then paper clips into white 
chips, Composite rules were represented by taping 
together two simple rule cards. 

A pair of simple rules was said to be com patible 
if the outputs of one of the rules were of the same 
type as the inputs of the other, Compatible rules 
can be combined to form composite rules (e.g., the 
rules denoted “n caramels — n + 1 toy soldiers" 
and “n toy soldiers —» n + 2 pencils" can be com- 
bined to form a composite rule which maps n 
caramels into # + 3 pencils). The set of compatible 
pairs of simple rules comprises the domain of an HO 
rule which maps such pairs into corresponding 
composite rules, This HO rule was used to define 
a second, higher level task in which, given a com- 
patible pair (eg, "m caramels — n-+2 white 
chips," “n white chips + » + 1 pencils"), the goal 
was to devise the corresponding composite rule (i.e., 
for converting caramels into pencils). 

Subjects, design, and procedures. "The Ss were 31 
boys and girls between the ages of 5 and 9. They 
were run individually and given 25¢ for participat- 
ing. Seven Ss were excluded from the experimental 
comparison, 6 because they passed the pretest and 
1 because she failed to learn how to interpret the 
composite rule cards. At the start of the experi- 
ment, each .5 was told that he was going to play a 
trading game with E and was given several sets of 
objects totrade, The S was then taught to interpret 
simple rule cards and to make trades using the rules 
represented by the cards. For example, S was 
shown a rule card for trading paper clips for blue 
chips and E explained “no matter how many paper 
clips I give you, you must give me the same number 
of blue chips and then add one more," Practice was 

provided and, if necessary, E showed S how to make 
trades of this sort and asked him to repeat what he 
had been shown. The criterion was three consecu- 
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tive successful trades. The E the; e S a set of 
objects not in the domain of th. > (e.g, two 
pencils) and asked S if he could the rule to 
trade pencils for blue chips. R less of S's 
response, E emphasized that the ru uld be used 
only to trade paper clips for blue ch 

The E then showed S a differen! le card and 
asked him to interpret the rule, provi 7 assistance 
if necessary. Practice continued ! S reached 
criterion. Again, E emphasized th e rule was 
restricted to objects on the card. procedure. 
was repeated using different rules u 5 was able 
to interpret three consecutive rul is and use 


them without assistance. At thi t, it was 
assumed that for S simply seeing a rule card 
was equivalent to knowing and bein; e to apply 
the rule. 

During Phase 2, .S was taught to 
use the composite rules, For examp’ 
sented with a composite rule for tradi 


rpret and 
was pre- 
encils for 


white chips together with two pen He was 
told to use the first part (E pointed at first card) 
to trade the pencils for paper clips. e S was 
required to place the correct number per clips 
on the table. The E continued, “N use the 
second part to trade these four paper « ‘or white 
chips." This procedure was repeated \ different 
stimuli until S performed three conse: e trades 
correctly, Practice continued with ar rules 
until § interpreted and correctly appli ;ree con- 
secutive composite rules, 

The 5 was then given a pretest consis: x of two 
parts. First, S was presented with new ©. ds repre- 
senting a pair of compatible rules. ‘Thies, S was 
asked to make three trades requiring use of the cor- 
responding composite rule. (The S was never shown 


this rule directly, either before or after testing.) 
For example, an S who was presented with the 
rules “n pencils — n + 2 pieces of bubble gum 
and "n pieces of bubble gum — n + 1 paper clips? 
would be presented in turn with various numbers 
of pencils (e.g., two, four, and one) and asked to 
trade the appropriate numbers of paper clips. No 
reinforcement was provided on the pretest. 

Those Ss who consistently failed on the pretest 
were randomly assigned, in matched pairs, to one 
of two treatment groups, Group HR, which received 
training on the HO rule, and Group C, which was 
the control group. Those Ss who succeeded on two 
or more instances of the pretest were given 5 min. 
of irrelevant instruction (reading a comic book) and 
were given a posttest, which involved new rules but 
paralleled the pretest in every other respect. 

Each S in Group HR was taught the HO rule. 
The Ss were first shown two compatible rule cards 
and asked to interpret each. The E then demon- 
strated how to combine the rules by sliding the te 
cards together in the appropriate manner. The ^ 
was then asked to interpret this newly formed rule, 
and E emphasized that the rules could be combined 
only because the output of one was the same as the 
input of the other. Next, S was presented with 
several pairs of rules, some of which were not com 
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patible ach pair, S was required to form the Mong 1 
comp: ‘ic, if possible, and to interpret the | 
newly i rule but S did not actually make en| + + + ^ 
trade the randomly interspersed incompatible A 
pairs . ad to indicate that the rules could not be Be | ecu ^ 
80 C ied. After performing successfully on five E 
con^. itive (compatible and incompatible) pairs of a m tos sd 
rul- S was given the posttest. The time required E 
to ain S on the HO rule was recorded. Each a Ser TOM zi 
( ed to read a comic book for the e. RENS ^ 
n his matched partner in 
iroup HR required to learn the HO rule. Finally, wo| cH Xd L2 
the posttest was administered in the same manner as 
the pretest to all Ss, Xp] tee = | 
) eo lob oF bo wet 
'esults ; 
i E wom} + + 1 sd NG. 
| ihe experimental results were relatively HEICUETI "a 
ar-cut. Eleven of the 12 HR Ss solved i al Y r 
three transfer problems. The one S who em| + + 1 -d i 
| led after reaching criterion on the HO eer ee PIER yr lis 
ule was put through the experimental 3-1 id 
orocedure 1 wk. later. This time he per- 2 Bolrew I sg Sek nd -4[8 
E | Lm x 
l formed perfectly on the posttest (only). 3 T e 
All 12 control Ss failed uniformly on the z np NE " 
posttest. The individual results are sum- 2 KO eto ser SOMNI EY 
marized in Table 2. A ey ey IY H: 
Although statistical comparisons between "^B ——————— ——,| #5 
i s seem ind iate, reliability of tl ay I E ME S NI: wd 
groups seem inappropriate, reliability of the Sx ET  R es 
percent correct predictions may be ex- a2 wm) + + | e! 
pressed in terms of confidence intervals | TU ne a CP i 
(for the HR group). Based on the assump- = QC uM ron [ 
‘ion that correct and incorrect predictions s wool + + | vo 3 
re bi inally distri 2 $ N A ——— m a 
re binominally distabived and using the E EO EE ++ jai 
‘tained mean of 91.7% to estimate the q 3. eliZide ut e fs 
-pected percentage of correct predictions à. door teo quete etd ++] Fo 
' Group HR, the 68% confidence interval d peo mes ane | Ae 
wos between 83.9% and 99.5%. É - ——— 33 
3 wm| + + 1 wot | gs 
EXPERIMENT II H em| + + 1 ++ af 
E Pe aee pg 
(he purpose of this experiment was to E om| + +! n+ | 38 
| test the mechanism proposed above with ws er cep meats | H 
l two different and more complex HO rules cum DL 58 
— of varying generality. Both involved gen- (Soe Nae aa pgs 
eralization from a restricted rule to one EN ea TE FE 
that was more general. To help make it L^ ere e ad 
easier to identify common and disparate mx EE 
features, the method description parallels i B a 
that of Experiment I. 1 1 fo 1 EE 
- tis k 
E F4 E 
ei B ig 
Method. E n: i FE. 
Tasks and materials, The experimental tasks in- £8 2 8 , 298 5 E A 
volved responding with appropriate numerals to 54 es E E 1 35 
given stimulus numerals. Each task can be charac- B" Be B E i^ 
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Look at the two numb 
pairs whose input nos. 
are successive nos. 


Subtract their outputnos. 
|Call this difference a. 


15 a the same as the 
output no. for 1? 


Subtract a trom the 
output no, for 1, Call the 
difference d. Write nsan+d. 


Ficure 1. Flow diagram for the higher order 
finite differences rule which acts on restricted rules 
and generates rules of the form n — an +d. 


terized as a set of stimulus-response pairs in which 
each number n is mapped into a number of the form 
an + d, where a and d are whole number constants 
(e.g., n into 4n +13). In addition to the above, 
the domains of the restricted tasks contained ex. 
actly three numbers, 1 together with (any) two con- 
secutive numbers (e.g., 8 and 9). Restricted tasks 
were represented as triples which also designated 
the underlying rules, Triples were always presented 
to Ss together with the three input numbers; the 
pairs were printed on the left of a. sheet of paper 
and the inputs on the right. The rules underlying 
the unrestricted tasks were of two types. The an 
rules simply involved multiplying the input numbers 
by a (i.e, d = 0), The an + d rules involved both 
multiplication and addition (of d). 

Two HO generalization rules were identified, 
Both HO rules act on restricted rules (triples) and 
generate output rules of types an and an +- d, re- 
spectively. A flow diagram for the more general 
finite differences (F) rule is given in Figure 1. The 
division (D) rule is similar, Both HO rules partition 
the class of restricted rules into two equivalence 
classes consisting of an rules and an +d rules, re- 
spectively. The D rule only generates an rules, 
whereas the F rule generates an +d rules, (The 
D rule may also be viewed as having a limited 
domain consisting of only those restricted rules in 
which the output of 1 equals the quotient of the 
output in one of the other pairs divided by its 
input.) 

Subjects, design, and Procedure, The 80 Ss were 
students in Grades 5-12 of Catholic and public 
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schools of Philadelphia. They were s: 
Eight Ss were eliminated because t! 
to do simple arithmetic computatic 
ment was also terminated early (t 


"idividually, 
were unable 
The experi- 
ure equal y 


in experimental groups and sav ne) with 12 
more Ss who were able to solve ax »roblems but 
not an 4- d problems on the pretest ( sed below), 

Experiment II was also run in following 


five phases: 


1. S was presented with triples oí inber pairs 


and shown how to interpret then le was in- 
structed “to write the output that with each 
input according to the triple." Th terion was 
two correct solutions in suc n. 

2. The E introduced rules of the form » — an +d, 
gave one example of a general rule ere d 5 
and one where d > 0, and stated tho. for every 
triple there is a rule of the form n — « d, which 
"fits" the triple. The S was told: “^ neral rule 
fits a triple if, when you use the rul« the. input 
numbers...you get the same outpu s in the 
triple." The S was taught how to ch: 'hether a 
rule fits a triple and practiced checki: vith four 
different rules, two an + d and two Then S 
was presented in turn with 6 pages, cach .- ntaining 
a rule, a triple, and two-four input in the 
triple. He was told to “find what outpu! oes with 
each new input number according t e rule. 
This involved first checking to deter: vhether 
the given rule fits the triple and, if 3, to use it to 
compute the output for each new i: The 
criterion was four consecutive correct pr ms. 

Just prior to the pretest E summarized “or 5 what 
he had learned. That is, there are two ways to find 
the output for a given input of a triple: («) look at 


the triple to see what the output for that input is, 
and (b) use a rule, which fits the triple, i compute 
the output for the input. The S also was reminded 
how to check whether a rule fit a triple; and, that, 
given a triple and a rule that fits the triple, the out- 
puts for new inputs can be computed via the rule. 
3. The S was given a pretest consisting of four 
randomly sequenced transfer problems, two an and 
two an +d. Each problem consisted of a triple 
and three new inputs (but no rule). The S was 
instructed to “find out what output goes with each 
new input according to a rule...which fits the 
triple." If S solved one but not both problems of 
Type an (an +d), he was given another problem 
of that type. If he solved it, he was given the E 
solved problem again. Under these conditions, 5 
S solved the original two problems, he was considere 
able to solve that type in general, otherwise not. 
When S encountered difficulty on the pretest, ui 
explained that the task was difficult and provide 
encouragement. Based on pretest results, Ss were 
categorized into three classes: none—unable S 
solve either type of problem; AN—able to solve P 
but not an + d problems; and AN 4- Dale is 
solve both an and an + d problems. The none © a 
was randomly divided into three groups: Cae 
ceiving no HO rule training; D—Ss taught d 
rule; and F—Ss taught the F rule. Class AN W 
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ided into Treatment Groups D and F. 
The ^7: + D Ss effectively knew the identified HO 
rules were unassigned. 
4. "ach S assigned to Groups D or F was told, 
"| wi. teach you a procedure which can be used 
ue kinds of triples to find a general rule... 
which fits the triple.” On each HO task S was 
presented with a triple and required to construct a 
rule that fit the triple, A description of the appro- 
priate HO rule was placed on a stand in front of S, 
and E worked through two HO practice tasks with 
S. Then S was presented in turn with six more HO 
d required to reach a criterion of four correct 

is in a row. The Classes C and AN + D Ss 
(the latter solved all pretest problems) received no 
treatment. Just prior to the posttest E again sum- 
marized what S had learned, adding for Ss in the 
HO treatment groups : "You know a procedure which 
can be used for some kinds of triples to find a general 
rule... which fits the triple. . .refer to this procedure 
whenever you wish." The HO rule remained in 
front of S during the posttest. 

5. The posttest involved new transfer problems 
but paralleled the pretest in every respect. 


Results 

Of 60 Ss admitted, 50 were placed in the 
five experimental groups; 10 passed both 
the an and an + d problems on the pretest 
(and the posttest). The experimental re- 
sults for the 50 Ss were exactly as predicted. 
No S in Group C of the none class was 
successful on either the an or an + d post- 
test problems. All 10 Ss in Groups D and 
F, of Classes AN and AN + D, were 
successful on the AN problems of the post- 
test and all Ss in Group F of both classes 
were successful on the an + d problems. 

In short, Ss were successful on posttest 
problems if and only if they knew how to 
solve the problems before entering the 
experiment (given only the restricted rules) 
and/or they were taught an appropriate 
HO rule. 


Experiments III AND IV 


Experiments II and IV were conducted 
to determine the feasibility of predicting 
the performance of individual Ss on specific 
problems by assessing Ss’ knowledge rela- 
tive to the HO generalization rules identi- 
fied. Specifically, could the methods de- 
veloped in Scandura (1973) be used to 
determine which parts of the HO rules 
used in Experiment II were available to 
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individual Ss? And, could this information 
be used to predict the performance of 
individual Ss on new problems? 


Method 


The tasks and rules were as in Experiment II. 
The materials were six booklets. Booklets 1-3 
covered the same material as Phases 1 and 2 of 
Experiment Il. Material for Phase 3, pretest on 
criterion problems, and Phase 4, training on HO 
rules, were eliminated. Booklets 4 and 5 were 
substituted for this material. These booklets made 
it possible to assess knowledge relative to the HO 
rules. Because outputs of HO rules are themselves 
rules, Booklet 4 was used to provide instruction 
in how to write rules of the form n — an + d. Page 
1 read, “to write a rule... in which we multiply the 
input number by 3 and add 4 to the product, let n 
be the input number. To multiply n by 3 write 
3 xn. Toadd 4 to the product write 3 X n +4." 
Eight practice problems were given. Booklet 5 
tested for the HO rules, “On each page... is a 
triple. Find and write . . . a rule of the form 
n— an + d which fits the triple." Four problems 
followed, Two problems required use of an HO 
rule for deriving n >an rules and two problems 
for deriving n— an +d rules. Booklet 6 cor- 
responded to Phase 6, the posttest. 

Experiment I] was run with 17-first year algebra 
students and Experiment IV, with 9 general math 
and 11 algebra students, all from West Philadelphia. 
In Experiment III, Booklets 1-6 were administered 
during a l-hr. class period. In Experiment IV, 
Booklets 1-4 were administered on one day, and à 
short review booklet and Test Booklets 5 and 6 on 
the next. Both class periods were 40 min. long. 
In contrast to Experiment Il, instructions were 
read to classes instead of individuals, Each 5 had 
to work at t four problems correctly in each 
booklet, From 3-5 proctors were available in each 
classroom. In scoring Test Booklets 5 and 6, prob- 
lems were divided into two categori according to 
whether an an + d or an an rule was involved. If 
S got both problems of one type in a booklet correct, 
he was considered successful; otherwise not, Al- 
though Tests 5 and 6 both involved HO rules, the 
difference between them is critical. Predictions 
were based on the prior availability of appropriate 
HO rules. An S who could derive an, but not an + d, 
rules in Booklet 5, for example, was assumed able 
to solve only transfer problems involving an rules, 


Results 


The results of Experiments III and IV 
are summarized in Table 3. In all cases, 
the number of correct, as opposed to in- 
correct, predictions differed significantly 
from chance (p < .05 for both an and 
an + d problems in Experiment III; p < 
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TABLE 3 
SUMMARY OF GENERALIZATION ASSESSMENT RESULTS E : 
iue eon souci 
s e e e | successtulon e |  ofecerec nt correct 
ae test (5) transfertest | Aest) | transfertest | prediction: edictions 
(6) ^ 
E: iment III (Ns = 17) 
E ae : 4/7 10 1019 a g 
an + d problems 20) 2 2/2 15 3/ & 
Experiment IV (Ns = 2€ zm + ad 
roblems 13 11/13 7 7 9 
i He d pou 3 2/3 17 17/17 19 95 
vieux dida 312 20 15/20 Msc 12/17 32 86 
blems | 5 32:483) 930/32 EN | (82 
an + d problems 5 | 4/5 2 
Noch Temm patentlitees rere CU Bol S RD: 
.001 and p < .05, respectively, for an and memory and to some extent problem m 
an +d problems in Experiment IV; and  isgiven in Scandura (1973). Empiric | testing 
p < .001 and p < .005 respectively, for should be a first order of business. 
PS : JENA ibi Another feature of this research ch de- 
an and an + d problems in the combined  . eau REOS: 4 El 
4 i babilitv, Fi ^, 1948) serves mention is the po sibility of e mati 
Rainy s, crest posi Ree : ' cally identifying the basic rules unc ying a 
Although the difference yaa noc reliable, set of problems (cf. Scandura, Di sin, & 
the additional 20 min. provided in Experi- Wüülfeck, 1972), In general, the (s 
ment IV appeared to increase precision of. cluding HO rules) underlying a set oí i: oblems 
prediction on both an and an + d problems may be determined as follows: (a) “elect à 
(8% and 7%, respectively). Whether or broad (finite) sample of problems in the set 


not level of prediction could be further 
increased is not clear, but this observation 
tends to support the notion that results of 
such experiments may be expected to con- 
form to prediction just to the extent that 
memory-free conditions are realized. 


DISCUSSION 


Overall, the results of Experiments I and II 
provide strong support for the postulated 
mechanism. When the effects of memory are 
minimized, and S's goals are known to E, 
availability of appropriate higher and lower 
order rules appears to be both a necessary and 
sufficient condition for solving transfer prob- 
lems. Although its importance was not 
emphasized, ability to determine whether or 
not a given or derived rule satisfies a higher 
level goal is also crucial (Scandura, 1973). 
The results of Experiments III and IV show 
further how availability of HO rules may be 
determined through testing and used as a basis 
for explanation and prediction in problem 
solving. These results, however, apply only 
in situations where memory and problem 
definition are not likely to be involved. Ex- 
tension of the proposed mechanism to include 


and identify solution rules for each ; blem, 
These rules provide a sufficient basis for solving 
not only the sampled problems but «!! other 
problems “like” them. The use of such rules 
corresponds to reproductive problem solving 
(Wertheimer, 1945) and, aside from relative 
complexity, to performance theories in com- 
puter simulation (e.g., Newell & Simon, 1972). 
(b) Identify parallels among the identified 
rules (e.g., they may involve successive appli- 
cation of simple trading rules, or they may be of 
the form (an + d). These “parallels” indicate 
the presence of higher (and lower) order rules 
from which the solution rules may be cons 
structed. The composition and simple trading 
rules of Experiment I (from which the com- 
posite rules may be derived) and the generali- 
zation and restricted rules of Experiment I 
(from which the an and an + d rules may be 
derived) provide examples. This type of 
analysis (of rule sets) may be repeated as 
many times as desired. In general, the more 
basic rule sets obtained in this way make it 
possible, according to the postulated mecha- 
nism (Scandura, 1973), to solve a set of prob- 
lems larger than originally envisioned an 
correspond to productive problem solving 
(Wertheimer, 1945), 
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MEMORY SEARCH OF CATEGORIZED LISTS: 


A CONSIDERATION OF ALTERNATIVE SELF-TERMINAT 


SEARCH STRATEGIES! 


MARY J. NAUS? 
Wellesley College 


The two three-stage memory search models for categorized lists described bs 
Naus, Glucksberg, and Ornstein were investigated in three experiments. Bot 

models assume that memory search is serial and terminates after the probed 
category has been exhaustively searched. The models differ only in whethe: 
memory entrance is assumed to be directed to the probed category or to occu: 
randomly, The latency data in Experiment I support random entrance for 
two-, three-, and four-category lists. When trained in directed entrance, 5: 
in Experiment II showed that along with the greater reduction in search timc 
they obtained with random entrance, additional intercept time is required to 
locate the probed category in the list. Results from Experiment III indicated 
that Ss only use random entrance when the categories in memory are blocked. 
Sternberg’s exhaustive model for single-category lists is proposed to be a 


special case of this three-stage model. 


Sternberg (1966) proposed that the re- 
trieval of information from short, single- 
category lists in recognition memory is 
serial and exhaustive. Since memory 
search is obviously not always exhaustive, 
what are the characteristics of this exhaus- 
tive search? 

The task in which Sternberg provides 
evidence for an exhaustive memory search 
requires S to memorize a short set of items 
(called the positive set). A test item (probe) 
is then presented and SS is asked to decide 
if the probe was a member of the positive 
set. If the probe is a member of the set, 
a positive or yes response is correct; if not, 
à negative or no response is required. Re- 
action time (RT) is measured for these yes 
and no responses. Memory search was 
considered exhaustive because the slopes 
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for the positive and negativ: ponses 
were equal. Recently, howev« here is 
some evidence that an exhaus: model 
of memory search may not juately 
describe information retrieval in 5 task. 
When the items in the positive set are not 
members of the same category bu! can be 
grouped perceptually (Williams, 1971), syn- 
tactically (Clifton & Gutschera, 1971), se- 
mantically (Naus, Glucksberg, & Ornstein, 
1972), or symbolically (Milles, 1969), Ss 
are able to use the categorization to modify 
the exhaustive nature of their memory 


search. 

In an attempt to precisely describe the 
search of multicategory sets, Naus et al. 
(1972) proposed two nonexhaustive search 
models. Both of these models consider 
memory search to be a three-stage process 
rather than the single-stage one Sternberg 
(1966) implied. Both models assume that 
(a) memory entrance includes both the 
selection of the to-be-searched categories 
from the positive set and the transfer O 
these categories into short-term memory 
for search (b) search is serial, and (c) search 
terminates as soon as the probed category 
has been exhaustively searched. The two 
models difler only in their rule for the 
selection of the to-be-searched categories 
from theset. In the Directed Entry Model, 
the category of the probe is used both 
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to begi to terminate search, such 
that th: iber of items searched equals 
the nu of items in the probed cate- 
gory. Random Entry Model pro- 
poses tl: the categories entered for search 
are int ndent of the category member- 
ship of ‘he probe. If the items searched 
first ar: from the probed category, search 
termin- ‘es after this category has been ex- 
haust:vely searched. If the items searched 
first a:o from a nonprobed category, search 


continues until the probed category has 


been completely searched. When Naus 
et al. (1972) presented Ss with sets of both 
one snd two semantic categories, the two- 


cat:vory sets showed a slope reduction of 
24..%. This is consistent with the Random 
Entry Model's predicted 25.0% slope reduc- 
tion for two-category sets. 

The experiments presented here further 
investigate memory search of multicategory 
sets, Experiment I extends the number of 
semantic categories in the sets to three and 
four to determine whether Ss continue to 
use a random entrance process. Experi- 
ment II is designed to teach Ss to use the 
Directed Entry Model in order to specifi- 
cally compare the results of a directed vs. 

indom search. Finally, whereas in the 

st two experiments the categories in the 

s were presented in blocked order, Ex- 
periment III presents them in random 
order to determine the generality of ran- 
dom entrance. 


Td 


EXPERIME 


The Random and the Directed Entry 
Models make differential predictions for 
the number of items searched as a function 
of the number of categories in the set (K). 
For the Random Entry Model the pre- 
dicted percent reduction in multicategory 
slope equals [ (1.00 — (K + 1)/2K)- (100) ], 
while for the Directed Entry Model the 
predicted percent reduction in slope for 
multicategory sets is [ (1.00 — 1/K)- (100) ] 
(Naus et al, 1972). This experiment was 
designed to determine whether Ss continue 
to use random entrance for three- and four- 
category sets. 
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Method 


Experimental design. A mixed experimental de- 
sign with three groups of Ss was used. Group 
1C/2C was presented sets of 2, 4, 6, and 8 words, 
with half of the sets containing words from one 
category and half containing words from two. 
Group 1C/3C received sets of 3, 6, and 9 words, 
with half of the sets containing words from one 
category and half containing words from three. 
Finally, Group 1C/4C saw sets of 4, 8, and 12 
words, with half of the sets containing words from 
one category and half containing words from four. 
Each S received all combinations of positive set 
sizes, response types, and number of categories for 
his group. 

In each multicategory set there was an equal 
number of words from each category. This restric- 
tion, together with the different number of cate- 
gories in each group, yielded the different set sizes 
across the three groups. The categories in the 
multicategory sets were presented in blocked order. 
Half of the tests were negative and half were posi- 
tive. Negative probes were only selected from 
categories represented in each set. The sequence of 
set sizes, the number and order of categories, the 
positive or negative test trials, and the serial posi- 
tion of the probes were randomized and counter- 
balanced over trials, 

Stimulus materials, The total ensemble of stimu- 
lus words varied across the three groups: Group 
1C/2C had a total ensemble of 32 words, 16 animal 
and 16 girls’ names; Group 1C/3C had a total of 
48 words, adding 16 musical instrument names to 
the ensemble for Group 1C/2C; and Group 1C/AC 
had a total of 64 words, adding 16 vegetable names 
to the ensemble for Group 1C/3C. The instances 
of these four taxonomic categories were obtained 
from the Battig and Montague (1969) category 
norms, equating the categories for “cohesiveness” 
based on the mean rank of the words eliciting the 
categories. The instances of these categories were 
also equated for mean frequency, number of sylla- 
bles, and length on the basis of the Thorndike and 
Lorge (1944) word frequency count, The stimulus 
words were made into slides and were also printed 
on index cards for presentation in the free recall 
learning. 

Apparatus. A Kodak Carousel projector, equip- 
ped with a Lafayette tachistoscopic shutter, dis- 
played the slides on a translucent screen between 
the control room and S's Industrial Acoustic Com- 
pany chamber. Beneath the screen in S's room were 
a "Y" indicating the position for the yes response 
and an “N,” the position for the no response. The 
S sat at a small table facing the screen. On the 
table was an intercom connecting S's room with the 
control room and a response box containing a small 
ready light, two microswitch response keys, and 
two feedback lights. The feedback lights were 
located 5 in. behind each key. The ready light was 
mounted between the two response keys, The re- 
sponse keys were connected to a digital klock- 
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(a) Group 1C/2C: Experiment I, (b) Group 1C/3C: 


Experiment I, 


and (c) Group 1C/AC: Experiment I. 


kounter and two signal lights to i 
key was pressed. The projector, 
and feedback lights were controll: 
Reader. All presentation interva 
tions were controlled by electro: 
Procedure. The procedure wa 
three experimental groups. 
Ss were initially shown the stimu! 
and were instructed to memori: 


* to E which 
r, and ready 
Tally Tape 

1 light dura- 
lical timers, 
same for all 
ur sessions, 

rds on cards 
"call the 


In eax 


words in any order. At least 90 all was re- 
quired, The S then received cognition- 
memory instructions used by Ster (1966), and 
was also told that the categorize rds he had 


Each 
ification of 
set chang- 


just memorized would be used in et 
trial of the RT task followed 
Sternberg's fixed-set procedure, wit 
ing after every eight tests. 

The number of tests per 60-min 
across the groups: Group 1C/2C re« 
per session, Group 1C/3C received 
1C/4C, 224. 

Subjects, Ei 
ton Univei 


ion varied 
1 280 tests 
nd Group 


zhteen right-handed m m Prince- 


sity were paid $10 each t« 


Results and. Discussion 


Since primary interest was in nptotie, 
error-free performance, only ion IV 
data for correct responses ari ported. 
The error rates were between % and 
2.2% for both the one- and mu! category 
sets for all three groups. These or rates 
did not significantly differ from om another. 
There were also no differences in «sor rates 
for positive and negative respons 
E One-category results. The onc-category 
results were consistent with thos reported 
by Sternberg (1966) for all three groups. 


These data are summarized for Groups 
1C/2C, 1C/3C, and 1C/4C in Figure 1a, 
1b, tc, respectively. Mean RT increased 
significantly with positive set size, the posi- 
tive and negative slopes were equal, and 
the positive RTs were reliably faster than 
the negative RTs, 1 

Category effect: Means and variances. To 
assess the effect of taxonomic categories on 
RT, the one- and multicategory sets in each 
group were submitted to separate analyses 
of variance. For all groups, the mean RTs 
for multicategory sets were reliably faster 
than the mean one-category RTs, indi- 
cating that Ss used the categories to modify 
their search strategies. Specifically, these 
one- and multicategory differences in mean 
RT were a decrease in the multicategorY 
slopes, as reflected by significant Number 
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of Cate X Positive Set Size interac- 
tions fo: group. 

To de ane if Ss were specifically using 
the Ranc Entry Strategy, the slopes and 
intercep! for the one- and multicategory 
sets wi lirectly compared. These com- 
parison were applied to data pooled over 
positiv: and negative trials because there 
were n- reliable Positive-Negative X Cate- 
gories «teractions. These data are sum- 
mariz in Figure 2a, 2b, and 2c. When 
the © v- and multicategory data are based 
upou (he size of the set actually presented 


one- and multicategory slopes dif- 
ferc! reliably from one another for all of 
th ups: for Group 1C/2C, £ (5) = 8.86, 
p 5.01; for Group 1C/3C, £ (5) = 12.80, 
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Ficure 2. Mean reaction time (RT) as a func- 


tion of presented positive set size for (a) one- and 
two-category sets and mean RT as a function of 
effective positive set size for two-category sets: 
Experiment I, (b) one- and three-category sets and 
mean RT as a function of effective positive set size 
for three-category sets: Experiment I, and (c) one- 
and four-category sets and mean RT as a function 
of effective positive set size for four-category sets: 
Experiment I. 


15.83, p < .01. The obtained percent slope 
reduction was 26.3 for two-category sets, 
33.8 for three-category sets, and 35.7 for 
four-category sets. These obtained percent 
slope reductions are consistent with the 
predictions of the Random Entry Model. 

While the average data are consistent 
with the Random Entry Model, it is im- 
portant to determine if this model was used 
by the majority of Ss. In Group 1C/2C, 
four of the six Ss had one-category-two- 
category slope differences which were within 
the 95% confidence limits of a 25.0% slope 
difference. In Groups 1C/3C and 1C/4C 
only S; in Group 1C/4C did not show re- 
sults consistent with the Random Entry 
Model. 

Further evidence in support of the Ran- 
dom Entry Model may be obtained from 
the distribution of RTs. Since the Random 
Entry Model asserts that the RTs for 
multicategory sets are an additive com- 
bination of the time required to search 
proportions of the set ranging from one to 
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K categories, the multicategory variances 
can be predicted based upon the obtained 
one-category variances for sets of different 
sizes. The equation for predicting two- 
category variances presented by Naus et al. 
(1972) can be extended to three- and four- 
category sets. When an F test was used to 
compare the obtained and predicted multi- 
category variances, no differences were 
obtained for any of the groups. 

Category effect: Intercepts. Since the num- 
ber of items actually presented in the sets 
does not represent the number of items 
actually searched given random entrance 
(Naus et al., 1972), the multicategory sets 
have been replotted as a function of effec- 
live set size (see Figure 2a, 2b, and 2c). 
Given that these replotted slopes and the 
one-category slopes do not differ, the inter- 
cepts can be directly compared. Although 
the multi- and one-category intercepts did 
not reliably differ for any of the groups, 
the magnitude of the difference seemed to 
be a direct function. of the number of 
categories in the set: on the average 7 msec. 
for two-category sets, 22 msec. for three- 
category sets, and 34 msec. for four-cate- 
gory sets, A linear regression analysis on 
the intercept differences as a function of 
the number of categories in the set yielded 
a slope of 13.5 which significantly differs 
from zero, | (30) = 2.14, p < .05. This 
increase in multicategory intercept as a 
function of the number of categories in the 
sets implies that there is an additional 
constant operation involved in a Random 
Entry Search which is directly related to 
the number of categories in the set. This 
operation could be the en bloc transfer of 
each of the to-be-searched categories to 
short-term memory for search, This pro- 
posed transfer process Suggests a possible 
mechanism for the random selection of at 
least the first category searched: the cate- 
gory in short-term memory when the probe 
18 presented is the category that is searched 


first. Since the category in short-term 
memory when the probe is presented has 
the 


probability of 1/K of being the probe 

category this hypothesis predicts that 
K-1 

the Random Entry Model requires 57 i/K 
i-o 


transfers, each costing 13.5 msec. in inter- 
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use a directed entry search arge cate 
gorized sets (Okada & Burro 1973), why 
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The data from Ss in Group !( ^C sugges 
that a thorough comparison the tw 
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question. This S's mean da 
directed search. In addition, : 
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EXPERIMENT II 
Method. 


Experimental design. The Ss were either in Grout 
1C/3C or 1C/4C. These groups were divided int 
the regular and the cted-entry instruction cong 
ditions. The regular instruction Ss were given th 
same instructions as in Experiment | The directed: 
entry Ss were instructed and taughi to use th 
Directed Entry Model. The instructions stresi 
directed selection of the probed category prior t 
search onset. To aid Ss in using the Directed Entry 
Model, they were also provided with an auditory 
presentation of the probe category prior to the 
presentation of the probe. "This auditory category 
cue occurred simultaneously with the onset of the 
warning signal, about .8 sec. prior to the presentis 
tion of the probe, : 

Procedure. Initially both instruction conditions 
received the free recall task and the recognition 
memory instructions described in Experiment f 
In addition, for the first three sessions, Ss in the 
directed-entry instruction condition received in- 
structions that stressed the direct selection of a - 
probed category prior to search o and that | 
formed them that the probe category would he 
auditorily presented prior to the presentation of E. 
probe to aid in their memory ch of only p 
probed category. Session IV did not differ 
directed-entry and regular instruction Ss. 
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Subje: ‘ive right-handed undergraduate 
men fro: ton University were paid $10 each 
for servi s, 
Resulis od Discussion 
l Bot! -essions II and IV data are con- 
sidere- cause of the differential treatment 
in the irected-entry instruction condition. 
The r rates for these sessions for one- 
y and : vilticategory sets ranged 1.9%-2.7% 
for! h positive and negative trials. These 
err tes did not reliably differ for any of 


the s of the design. 
category results and category effect for 


ri instructions. The one-category re- 
8 for both groups and instruction 
© fitions supported those reported by 


cnberg (1966). The regular instruction 
adition was a control condition and repli- 
ited Experiment I, showing a slope reduc- 
tion of 33.9% for Group 1C/3C and 38.0% 
for Group 1C/4C. 

Category effect: Directed-entry instructions, 
In order to determine if Ss in the directed- 
entry instruction condition used the Di- 
rected Entry Model to search the multi- 
category sets, separate linear regression 
analyses were conducted for Groups 1C/3C 
ind 1C/4C for Sessions I1 and IV. Session 

II data summarize the effect of directed- 
try training while it occurred. The ob- 
ned one- and multicategory slopes dif- 

iced reliably for each group: for Group 
1. /3C, 1 (2) = 36.6, p < .01, and for Group 
1C 4C, t (2) = 39.6, p < .01, with the ob- 
tened slope reductions of 70.8% for Group 
1C 3C and 76.6% for Group 1C/4C con- 
sisient with a Directed Entry Search. 
Individual S's slope reductions were also 
consistent with the Directed Entry Model. 
The variance data, when the predicted 
multicategory variances are based on the 
obtained one-category variances for Set 
Size s/2, were also consistent with directed 
entrance. 

During Session IV, Ss no longer received 
instruction or training in the use of the 
Directed Entry Model. Although the ob- 
tained one- and multicategory slopes dif- 
| fered reliably for both groups—for Group 
i 1C/3C, t (2) = 9.4, p < .01, and for Group 

1C/4C, ¢ (2) = 39.11, 5 < .01—there was 
a difference in obtained slope reductions 


997 


between the groups. The obtained slope 
reduction of 34.0% for Group 1C/3C was 
consistent with the Random Entry Model, 
whereas the obtained slope reduction of 
76.4% for Group 1C/4C supported a 
Directed Entry Search, For both groups, 
individual .S's slope reductions were con- 
sistent with these averaged results and the 
variance analyses supported the findings 
obtained with mean data. Thus, in con- 
trast to Group 1C/3C, where Ss returned 
to the use of the Random Entry Model 
when training and instruction were dis- 
continued, Group 1C/4C Ss continued to 
use the Directed Entry Model during 
Session IV. 

Direct comparison of the one- and multi- 
category intercepts again requires that 
multicategory intercepts be reestimated as 
a function of effective set size based upon 
either directed or random entrance, The 
one-category and reestimated multicate- 
gory data are summarized in Table 1. 
Since there were no reliable differences be- 
tween one-category and reestimated multi- 
category slopes, the intercepts can be di- 
rectly compared, When Ss used the 
Directed Entry Model, there was a large 
increase in multicategory intercept: on the 
average 140 msec. for three- and 185 msec, 
for four-category sets. 

Comparison of the models. An in-depth 
comparison of these models may answer 
two important questions, Why do Ss not 
always use the Directed Entry Model to 
search the multicategory sets since it leads 
to a greater reduction in memory Com- 
parisons, and why is there a differential 
effect for Groups 1C/3C and 1C/4C during 
Session IV of this experiment? 

The Directed Entry Model leads to a 
large increase in multicategory intercepts as 
well as to a large reduction in the number 
of memory comparisons, ‘Two processes 
seem to be included in these multicategory 
intercepts. First, the Directed Entry 
Model requires time to transfer the probed 
category to short-term memory. Assuming 
that this transfer only occurs if the probed 
category is not already in short-term mem- 
ory when the probe is presented, on the 
average 1 — 1/K transfers are required, 
Second, the additional intercept time could 
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TABLE 1 
4 T: i R TIC € PARAM- 
i 5 D E -ENTRY INSTRUCTION CONDITION WITH THE M. ULTI ORY 
pow PERS M A adi or EFFECTIVE SET SIZE AS PREDICTED ; 
P EITHER THE RANDOM OR DIRECTED ENTRY MODELS 
Session III Session IV 
ERE Inte: ert. 
no, of Intercept | Absolute 4 es b 
MEER AE UEM ME AE ME MEN 
I— 
i er 494 24.7 
ic 409 41.5 " x 24, a A 
3C 550 mn et 513 254 
1C/AC "ei d; 
401 41.0 407 i Pars 3 
ic 548 45.6 183r t4 594 45.1 
5$ $05. 
be the time required to select the probed onset over selective entry. To! ier. M 
category from the set prior to search. If the net-savings hypothesis this nc on might 
this category selection is considered to be provide an explanation for th« ferences 
a serial matching operation between the between Groups 1C/3C and 4C in 
category of the probe and the categories in Session IV. In the regular inst: ‘on m 
the set, this operation would be represented dition, this task may favor à ro | n 
by an increase in intercept which isa direct onset for both Groups 1C/3C a: 1C/4C; 
function of the number of categories in the even though the set sizes in the i. er con 
set, approximately 45 msec. per category. dition are large. The training ri ived in 
Although Williams (1971) has found evi- the directed-entry instruction condition 
dence for a serial and exhaustive category modified the task demands suffi ‘ently to 
comparison process for perceptual cate- favor directed entrance for bo: groups. 


gories, the present experiment does not 
indicate whether the category comparison 
process for semantic categories is exhaus- 
tive or self-terminating, T ogether, these 
two operations add (1 — 1/K)-13.5 + (K) 
‘45 msec. to the one-category intercept 
resulting in a predicted three-category in- 
tercept increase of about 145 msec. and a 
four-category intercept increase of about 
190 msec. These results support a net- 
savings hypothesis (Naus et al, 1972) 
which suggests that Ss spontaneously use 
the Random Entry Model because in this 
experiment it is more efficient in overall 
time savings. However, one would suspect 
that, with additional categories and larger 
set sizes, the Directed Entry Model would 
become the more efficient. 

In addition to the Random Entry Model's 
overall time savings in the present experi- 
ments, the demands of this experimental 
procedure, such as temporal sequences and 
S's instructions, may impose a rapid search 


However, after training was disc. atinued, 
the strategy used to search th: _ multi- 
category sets depended upon the size and 


number of categories in the sets 1 

This experiment raises the question of 
the generality of the Random Entry Model, 
suggesting the importance of precise speci 
fication of the conditions under which it is 
used. Experiment III was designed to 
determine if Ss will continue to use the 
Random Entry Model if the category 
exemplars are presented in random order 
rather than in the blocked order. 


EXPERIMENT III 
Method 


Experimental design, All Ss were in Group 1C/' 2 
The stimulus materials, instructions, and nume 
trials were the same as in Experiment I with m 
addition of a postexperimental questionnaire at t 
end of the last session. MS. 

Subjects. Six right-handed undergraduate Pri m 
ton males were paid $10 each for their participati 
in the four sessions of the experiment. 


1 
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TABLE 2 
INDIVIDUAL Ss’ SLOPE ANALYSES FOR GROUP 1C/2C: EXPERIMENT HI 
E. EE Slope difference 95% confidence interval | Optained % 
S po uis dee qaid | -— Slope a 
sone slope | Obtained | Predicted | Lower bound | Upper bound retain 
St 31.7 26.7 11.0 9.45 9.2 12.8 29.2 
S: | 38.2 27.6 10.6 9.58 8.3 10.7 25.6 
Ss | 18.9 15.0 3.9 315 2.9 4.7 20.6 
Sa 31.4 23.8 1.6 1.9* 6.9 8.9 24.2 
Aver ei] 353 23 | 850 78° 70 8.6 25.6 
Ss | 32.7. 35.5 —2.8 8.2 = = = 
Ss | 33.6 30.8 2.8 84 == = = 
Aveng | 312 33.2 2.0 7.8 — — — 


Re: (is and Discussion 


category results. As in the first two 
© riments, the Session IV error rates for 
o. - and two-category sets, which ranged 
1 J%-.8% did not reliably differ, and the 
vae-category latency data were consistent 
with those previously reported by Stern- 
berg (1966). 

Category effect. When an analysis of 
variance was completed to assess the effect 
of taxonomic categories on RT, the mean 
RTs for two-category sets were reliably 
faster than the mean one-category RTs, F 
4, 5) = 17.73, p < .001. To determine 

hether Ss were using the Random Entry 

xlel to search the two-category sets, 
r ression analyses were applied to data 
pooled over positive and negative trials 
use there was not a reliable Categories 
X "ositive-Negative interaction. The one- 
ca’ gory sets had a slope of 31.3 and an 
intercept of 498 msec, The two-category 
sets had a slope of 27.8 and an intercept of 
505;nsec. When the one- and two-category 
slopes were compared, they did not reliably 
differ, showing a slope reduction of 11.2%. 
However, analysis of the postexperimental 
questionnaire indicated that Ss could be 
divided into two groups. Four of the six 
Ss (Ss 1, 2, 3, and 6) reported actively 
operating upon the random two-category 
sets to reorganize them into category blocks. 
Instead of rehearsing the items in the pre- 
sented order, these Ss described reorganiz- 
ing the positive set items into category 
groups and rehearsing the items in these 
blocks. In contrast, the remaining two Ss 


a tent with the Random Entry Model's predicted slope difference. 


wrote that they did not reorganize the 
positive set items into groups by category, 
but memorized the items in their presented 
order. The individual S's Session IV data 
with Ss grouped as "reorganizers" and 
"nonreorganizers" are presented in Table 
2. The reorganizers one- and two-category 
slopes differed reliably, ¢ (3) = 7.23, p < 
.01, yielding a slope reduction of 25.6%. 
All individual S's results were also consis- 
tent with the Random Entry Model's pre- 
dictions. In contrast, neither of the nonre- 
organizers had one- and two-category slopes 
which reliably differed from one another. 
This experiment suggests that memory 
search strategy selection, at least the use 
of the Random Entry Model, is influenced 
by the form of the input. Clifton and 
Gutschera (1971) showed that presenting 
two-digit numbers in blocked order, where 
the category is defined by the tens digit, 
greatly facilitated the use of a hierarchical 
search. However, randomizing the digits 
did not completely eliminate a hierarchical 
search, The Random Entry Model differs 
from a digit hierarchical search in that it 
seems to require that the items be stored 
in memory in blocked categories, Specula- 
tively, this blocked storage might be re- 
quired for the en bloc transfer of the cate- 
gories to short-term memory for search. 
Since all Ss did not block the categories in 
the set in order to use the Random Entry 
Model, this implies that the selection of a 
memory search strategy is an option avail- 
able to S, i.e., an S-controlled process rather 
than a structural memory feature (Atkinson 
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& Shiffrin, 1968). This indicates, as re- 
cently demonstrated by Hunt, Frost, and 
Lunneborg (1973), that individual differ- 
ence variables may be very important in 
investigations of memory search. 


GENERAL DISCUSSION 


These experiments suggest that memory 
search is a three-stage process. When the num- 
ber of items in memory is small and the items 
are all members of the same category, results 
are consistent with Sternberg’s (1966) exhaus- 
tive search model. This model, however, is 
proposed to be a special case of the more gen- 
eral three-stage model presented here. The 
single stage results because with small, one- 
category sets memory entrance is unnecessary 
and search always terminates after the com- 
plete positive set has been searched. In con- 
trast, when the sets are large or can be organized 
into categories, memory entrance becomes 
necessary and search can be nonexhaustive 
through the use of category boundaries. 

The Random Entry Model requires specu- 
lation as to how random entrance might occur. 
It seems reasonable to suppose that the cate- 
gory which is in short-term memory at test 
is the category which is searched first. It is 
more difficult to propose a mechanism for the 
random selection of the remaining categories 
for search. Random entrance requires that S 
knows when he has searched a nonprobed 
category, but does not use this information to 
select the probed category from the set. It is 
as if once S is in a "searching mode” it costs 
more in either time or effort to switch to a 
"selection mode" to locate the probed category 
in the set than it does to search additional 
categories, 

In addition to the two nonexhaustive, three- 
stage search models presented here, the slope 
reductions for multicategory sets could be in- 
terpreted in terms of a faster comparison 
model. In this class of models, memory search 
of the multicategory sets is considered to be 
exhaustive, but some or all of the memory 
comparisons are proposed to be made more 
quickly than in the one-category sets. Assume 
that the members of the probed category are 
compared to the probe at the one-category 
rate (x msec. per item), but the items in non- 
probed categories can be compared more 
quickly (y msec. per item). Thus, when the 
positive set consists of K categories of equal 
size, the average search time per item is 
x + (K — 1)y/K msec. The decrease in the 
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time required for search of the 
items would be represented 
multicategory slope. If it is í 
that the items in the nonprobe 
searched at twice the rate as 


ved category 
decrease in 
1er assumed 
ategories are 
tems in the 


probed category, this model : s the same 
prediction as the Random Ent lodel. The 
nonexhaustive models are cun. ‘ly favored, 


however, given that the fa comparison 
models cannot adequately a: nt for the 
intercept and variance param: in Experi- 
ment I and the increased s! reductions 
obtained in Experiment II. | mentation 


is currently under way to direct ‘Terentiate 
| 
' memory? 
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Di*-CTED FORGETTING AS A FUNCTION OF EXPLICIT 
WITHIN-LIST CUING AND IMPLICIT 
POSTLIST CUING? 


ADDISON E. WOODWARD, Jr.? 
Governors State University 
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Explicit within-list instructions to forget, as well as postlist implicit instruc- 


tions to forget, were presented within the same list structure to 60 .Ss. 


The 


results indicated that explicit forget items were far less likely to be intruded 


in immediate recall, recalled finally, or recognized on a final recognition test 


than implicit forget items. 


forget items differ from implicit forget items 


These findings support the notion that explicit 


Furthermore, while search and 


retrieval mechanisms may be a viable explanation for phenomena involving 
implicit forget instructions, differential rehearsal is the most likely explanat ion 
for directed forgetting resulting from explicit forget instructions. 


Che ability of Ss to differentiate between 
items that are to be remembered and items 
that are to be forgotten has merited a 
significant amount of research (Bruce & 
Papay, 1970; Elmes, 1969; Epstein, 1969, 
1970; Woodward & Bjork, 1971). Various 
experimental designs have incorporated 
either implicit (Epstein, 1969, 1970; She- 
bilske, Wilder, & Epstein, 1971) or explicit 
Davis & Okada, 1971; Woodward & Bjork, 
71) instructions to forget in order to 

estigate directed forgetting in memory. 

dicit forget instructions typically ask 

o exclude some portion of the input 
[roi output at the time of recall (Epstein, 
1959, 1970). Usually, Ss are not asked to 
recall those excluded items, thus the im- 
plicit forget label. 

Explicit forget instructions are most 
often incorporated in experiments using 
item-by-item cuing. The Ss are presented 
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4 Now at the University of Iowa. 


list items, with each item being followed 
by either an instruction to remember or 
an instruction to forget. A final recall test 
is used to determine the effectiveness of 
the forget instruction. 

Both kinds of paradigms have generated 
phenomena that have been labeled in- 
stances of intentional forgetting or directed 
forgetting. Presumably, a search for the 
mechanisms underlying forgetting as a 
result of implicit forget instructions would 
also explain forgetting as a result of explicit 
forget instructions. It is the contention of 
the present authors, however, that the 
basic phenomena resulting from implicit or 
explicit forget instructions are markedly 
different and, although each paradigm may 
tell us something about the memory pro- 
cess, neither tells us much about the 
other. 

The present study brings together within 
a single experiment both implicit and ex- 
plicit forget instructions. Implicit cuing 
to forget is defined as any cue that in- 
structs S, at the time of output, to dis- 
regard part of the input—a situation very 
similar to the procedures employed by 
Epstein (1969, 1970). Explicit cuing to 
forget is defined as any signal or cue, oc- 
curring after input but before output, in- 
structing S not to remember the item(s) 
in question. The focus is on the fate of 
those items with forget instructions. at 
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immediate recall, at final recall, and at 
final recognition. 

The explicit cuing is within-list, while 
the implicit cuing is postlist. Thus, time 
of cue is being manipulated in a way 
similar to Bjork's (1970) Experiment III. 
The implicit forget (F) items in this ex- 
periment are similar to items followed by 
Bjork's remember yellow:forget yellow in- 
struction which urged Ss first to remember 
the yellow items and then later to forget 
those items. Bjork's forget yellow:remem- 
ber green instruction is equivalent to items 
followed by an explicit forget instruction. 
The present experiment differs from Bjork's 
in that item-by-item cuing was used, the 
task was free recall, not paired associates, 
and a final recall test as well as a final 
recognition test were administered. 

Within each of six lists there were three 
classes of items: items followed directly by 
a forget instruction, and two remaining 
sets of items followed by remember in- 
structions but functionally separate. Each 
list was followed by one of six recall in- 
structions, If one set of remember (R) 
items is labeled A, the other B, then the 
recall instructions include recall only A, 
recall only B, recall A then B, recall B then 
A, free recall A and B, and finally, after 
one list, Ss were asked to recall all the 
items, even those followed by explicit for- 
get instructions. Every list then contained 
a set of items followed by explicit forget 
instructions. Lists that incorporated im- 
plicit forget instructions included the re- 
call-only-A instruction (B items were the 
implicit F items), and the recall-only-B 
instruction (A items were the implicit F 
items). At the conclusion of the experi- 
ment a final recall test was administered 
followed by a final recognition test. 

The data of interest include the following 
comparisons: (a) the probability of im- 
mediate recall, final recall, and final recog- 
nition of explicit F items vs. implicit F 
items (A items from the only-B condition, 
and B items from the only-A condition); 
(b) the probability of immediate recall, 
final recall, and final recognition of explicit 
F items vs. A items and B items in the 
recall-all condition; (c) the probability of 
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immediate and final recall i: 
nition of A items in the o 

vs. the probability of reca! 

the A then B condition anc 
only-B condition vs. B item 
A condition. 


final recog- 
\ condition . 
i items in 
tems in the 
the B then 


METHOD 


ite students 
l in order to | 


Subjects, "The Ss were 60 under 
at Albion College. They partici; 
fulfill introductory course requiren» 

Materials and apparatus. All e presented — 


six 24-item lists, Words were c« ; unrelated 
four-letter nouns, Eight random! cted words 
were followed by a red dot, eight black dot, 
and eight by a blue dot. In each ! uarter, two 
words each were followed by one of : olored dots 


more than 
ns used in 
m the same 
Phe words, ^ 
the recall 
d memory 
ated by à 
the length 


and the same colored dot did not ap 
twice consecutively. The distract 
the final recognition test were selecte 
pool of words as were the target wor: 
the cues to remember, and forget, 
instructions were presented on a high 
drum (change time less than .05 sec 
prepunched paper tape which determ: 
of item presentation and recall time. 


Design. All Ss saw the same lists « ns, Each 
list was followed by a different outp struction. 
The six instruction conditions were r only Ay 
recall only B, recall A and B, recall A © :: B, recall 
B then A, and a special list after whic’. Ss were t0 


recall all items—A, B, and F. 

Item sets were designated by th 
colors. Each word was followed by «her a red, 
black, or blue dot. Hence, there were ‘wo sets of 
remember items and one set of forget ‘ems differ- 
entiated by color. Each S was initial!y instructed 
that two colors designated remember items and one 
color designated forget items. 

Counterbalancing techniques were employed 80 
that, across Ss, each item served in each of the three 
conditions. This manipulation occurred as a result 
of changing the instruction associated with a pat 
ticular color cue from one § to the next. Also, 
across Ss, each output instruction occurred at each 
list position with the exception of the recall-all list : 
Which was always paired with the third list. Each 
list began with a 3-sec. ready signal and ended with 
a 30-sec. recall period. Each item was shown fot 
3.3 sec. and each instruction for 1.0 sec. ; 

Procedure. The Ss were run individually, and in 
the initial set of instructions they were told that 
they would see lists of words and that each wor 
would be followed by a colored dot which designate 
whether the item was to be remembered or for- 
gotten. Their task was to try to recall as many 9 
the to-be-remembered items according to the Sl 
structions following each list. They were informe 
that, following each list, there would be one of sit 
possible recall instructions and were told what these 
instructions would be and that it would be their 


different 


w-——— ls 


best st: 
forget : 


to try to remember R items and to 
and not to try to anticipate when the 
«call list would occur—a procedure em- 
Reitman, Malin, Bjork, and Higman 
i Bjork and Woodward. (1973). Such a 
was deemed successful by Bjork and 


proced 

Wood <d in that their Ss did not appear to 
antic, «e the occurrence of the special list, Ex- 
plicit get instructions existed at the onset of the 
expe nt when Ss were told which of the three 
colc ignated forget items. Implicit forget in- 
stri ns existed for A items in the only-B condition 
and B items in the only-A condition, Although 
the tial instructions to Ss indicated what the 


lik recall conditions would be, they did not 
in, ue which recall instruction was paired with 
w list. Since Ss did not know how many lists 
w -ol be seen, it seemed unlikely that they would 
t ble to accurately ascertain what a particular 
ij instruction would be. 
he Ss were given three practice lists, e ch con- 
ting of 12 two-digit numbers in order to familiarize 
hem with the procedure. The recall conditions 
paired with the practice lists included recall A then 
B, recall A and B, and recall only A. After all of 
the experimental lists had been presented, 5s were 
given a final recall test of both R and F items for 
as much time as they needed. At this point, Ss 
were asked to write down each and every word from 
the experiment that they remembered, regardless 
of the items' previous instruction. Following final 
recall, they were then given a final recognition test 
consisting of the original 144 items and 144 distrac- 
tors, Their task was to circle any item previously 
presented to them again regardless of the it m's 
previous instruction. The immediate recall tests, 
he final recall test, and the final recognition test 
ere all written, The Ss were instructed in the 
-dered recall conditions (A then B, B then A) to 
rite down as many of the words to be recalled first 
: they could, to then draw a line and write down 
many of the words to be recalled last as they 
;ud. The Æ attempted to monitor the recall 
process in those instances providing à partial safe- 
zuard against Ss who wished to circumvent the 
instruction. 


RESULTS 


Table 1 presents the probability of im- 
mediate recall, final recall, and final recog- 
nition of A, B, and F items. Recall of 
explicit F words is consistently lower than 
implicit F words. The probability of in- 
truding F items immediately in the only-A 
condition was .040, while the probability 
of intruding B items, items that were not 
to be recalled either, was .119. A corre- 
lated / test indicated this difference to be 
significant, £ (59) = 442, p < .001. In 
the only-B condition, explicit F items were 
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TABLE 1 


PROBABILITY OF IMMEDIATE RECALL, FINAL RECALL, 
AND FINAL RECOGNITION AS A FUNCTION OF 
RECALL INSTRUCTION AND ITEM TYPE 


Item type 
Immediate recall 
instruction 
F A B 

Immediate recall 

A only .040* .292 ,119* 

B only 031 117" 327 

Aand B 038 p .302 

A then B .039 271 .250 

B then A .033 .300 283 

Free recall all .081 275 254 
Final recall 

A only 048 121 123 

B onl 031 123 158 

A and B .051 135 125 

A then B 065 144 152 

B then A 042 187 131 

Free recall all .060 .100 087 
Final recognition” 

A only 4215 A35 ALD 

B m 252 .369 A27 

A and B 215 E 406 

A then B 207 448 429 

B then A 248 437 437 

Free recall all 271 ,391 387 


Note, Each point is based on 480 observations. See text for 


explanation of abbreviations, 
È These figures represent intrusions, since Ss were Instructed 


not to recall the items in question. 
+ False alarm probability was ‚085, 


intruded with a probability of .031, while 
A items were intruded with a probability 
of 117. This difference was also significant, 
t (59) = 424, p < .001. When Ss were 
asked to free recall all items from the list, 
F items were recalled with a probability 
of .081, while A and B items were recalled 
with probabilities of .272 and .254, re- 
spectively. A one-way analysis of variance 
indicated that indeed these differences were 
real, F (2, 177) = 25.66, p < .001. 

A 6 X 3 analysis of variance (Recall 
Conditions X Item Types) on the im- 
mediate recall data indicated a significant 
item effect, F (2, 1062) = 228,36, p < .001, 
a significant effect of recall instruction, F 
(5, 1062) = 4.98, p < .001, and a signifi- 
cant Item Type X Recall Instruction in- 
teraction, F (10, 1062) = 10.57, p < 001. 
To summarize the immediate recall data, 
we can say that explicit F-item recall re- 
mains invariant over recall instruction, 
with the possible exception of the free- 
recall-all condition; A- and B-item recall 
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does not. In the only conditions, Ss re- 
called less of the excluded items. 

A 6 X 3 analysis of variance on the final 
recall data indicated a significant item 
effect, F (2, 1062) = 44.77, p < .001, but 
neither a significant effect of recall instruc- 
tion, F (5, 1062) = 1.99, p > .05, nor a 
significant Item Type X Recall Instruction 
interaction, F (10, 1062) = 1.56, p> .05. 
The lack of interaction between item type 
and recall instruction indicates that the 
probability of recalling A, B, and F items 
remains invariant over recall instruction. 
The lack of an instruction effect suggests 
that the instruction effect obtained in im- 
mediate recall was short term and did not 
carry over into final recall. The significant 
item effect indicates that explicit F items 
were recalled less frequently than A or B 
items, When Ss, then, are asked to recall 
all items from the experiment, they still 
do not recall many explicit F items, al- 
though explicit F-item recall does increase 
slightly from immediate to final recall. 
This increase is entirely consistent with 
past studies using the final recall pro- 
cedure (Woodward & Bjork, 1971). The 
B items from the only-A condition and A 
items from the only-B condition were re- 
called finally about as well as they were 
immediately. 

The results of a 6 X 3 analysis of variance 
on the final recognition data were similar 
in nature to the analysis on the final recall 
data. A significant item effect was ob- 
tained, F (2, 1061) = 60.97, b < .001, but 
there was neither a significant effect of 
recall instructions, F (5, 1062) = do, > 
.05, nor a significant Item Type X Recall 
Instructions interaction, F (10, 1062) = 
73, p > .05. Once again there were large 
differences between explicit and implicit F 
items, this time in their recognizability. 
Implicit F items were more likely to be 
recognized, just as they were more likely 
to be recalled. Final recall data, as well 
as final recognition data, are noteworthy, 
not in the small differences that occur for 
à given item type as a function of the recall 
instruction, but for the uniformity for a 
given item type across the various recall 
instructions. The F items clearly differ 
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from A or B items in terms c/ | cir recalla- 
bility and recognizability, bu’ ey do not 
differ as a function of the ll instruc- 


tion. The A and B items do differ asa 
function of recall instruct: "ther, or 
from each other. 

The data are not suggestiv- of an only 
effect.  Epstein's (1969) oi effect is 
defined as superior recall of t — only con- 
dition compared with the app iate both 
condition. Recall of A items i; the only-A 
condition was .292, and in i^ ^ then B 
condition, .271. This differ: was not 
significant, / (59) = .91, p > Recall 
of B items in the only-B condii was not 
significantly greater than in th then A 
condition, / (59) = 1.50, p > .0.. Clearly, 
however, the items to be exclud: t output 
are recalled, or actually intrude | far less 
frequently than items that arc be rex 
called. Thus, A-item recall in only-B 
condition and B-item recall in only-A 
condition is far less likely the: in any 
other condition. The Item Typ: Recall 
Instruction interaction stems la: from 
this low probability of recall of «> items 
to be implicitly forgotten. 

Finally, it should be noted thai he data 
reported for the A then B condi:.on and 


the B then A condition do not izke into 
account order of recall. Although Ss were 
asked to recall (in accordance with the in- 
struction) A items first followed by B 
items, or vice versa, recall of all itens was 
scored, Taking order into account, and 
not accepting as correct A items intruded 
into B-item recall in the A then B condi- 
tion, or B items intruded into A-item re- 
call in the B then A condition, and the 
reverse, the probability of recalling A and 
B items was .183 and .214, respectively, 
in the A then B condition, and .210 and 
.194 in the B then A condition. There is 
some indication then that Ss had problems 
differentiating between A and B items. 


DISCUSSION 


The fact that these data did not yield an 
only effect in retrospect is not surprising 4 
is consistent with Epstein's research (1969, 
1970). Epstein found that the only effect 
appears only when the two groups of items 
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are fun: lly separate. The interspersing 
of imp! ems in this study through the use 
of the : idual cuing technique resulted in a 
loss of functional autonomy of each group 
of ite» and, subsequently, no only effect. 
Becau no only effect was obtained, one 
could stion the appropriateness of the term 
impl forgel instruction. In other words, is 
the ence of an only effect a necessary or 
prere dsite condition for the implicit forget 
inst) tion label? The’present authors believe 
not What is crucial are the operations r 

qt in producing the implicit forget in- 
str on. In this regard, our operations were 
ti me as those of Epstein's (1969). The 


i iction. was given after input and before 
ut, and in addition, Ss did not know that 
would ever be asked to recall or even 
gnize former list items. The recall-only-A 
truction, as well as the recall-only-B_ in- 
truction, although not successful in producing 
an only effeet, were undeniably successful in 
reducing output of the excluded items. 

The immediate recall data indicate that Ss 
had very little difficulty separating implicit 
from expli items, but that making à 
similar discrimination within the implicit 
dimension (between A and B items) was con- 
siderably more difficult, although not impo 
ble. Furthermore, final recall data indicate 
that the effect of the explicit instruction was 

ong term, while the effect of the implicit 

struction was short term. That is, recall of 
plicit F words was exceedingly low, both 
mediately and finally, while a low probability 
ecall of implicit F items immediately (i.e., a 
Ls recall of B items in the only-A condition 
cad a low recall of A items in the only-B 
condition) did not reflect a similar trend in 
final recall. 

ihe data strongly support the notion that 
explicit F items are different from implicit F 
items on indices of recall as well as recognition. 
Explicit F items are recalled rarely, immedi- 
ately, or finally, and do not depend upon the 
recall instruction. Implicit F items are re- 
called with much higher probability than 
explicit F items, immediately and finally, and 
are also more likely to be recognized as old 
In addition, even when Ss are asked 
immediately to recall explicitly labeled F 
items, they are less successful than when asked 
to immediately recall other types of items. 

Epstein (1970) suggests that directed for- 
getting may be attributable to search and 
retrieval processes. Such an explanation may 
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be appropriate for his pardigm which involves 
implicit postlist cuing and the only effect, 
However, the present authors would argue 
that the only effect is not representative of 
most directed forgetting experiments, and that 
implicit F items are not really forgotten. 
Directed forgetting, measured most reliably by 
either a final recall test or a special immediate 
recall test such as that which occurred with 
List 3, results with explicit within-list cuing, 
and its explanation involves rehearsal and 
other processing mechanisms at input, not 
search or retrieval pro at output. 

The main difference, and a crucial one be- 
tween explicit and implicit forget instructions, 
is the time of the instruction, In this experi- 
ment, explicit forget instructions followed the 
items immediately. The implicit forget in- 
struction was delayed until the time of output 
and was far less effective in terms of leading 
to "forgetting." When the instruction is €x- 
plicit, Ss can stop further processing of the 
item. If the item is not processed, or pro- 
cessed minimally, it is not likely to be recalled 
or intruded even though it may be recogni ad., 
Items later followed by implicit forget instruc- 
tions are processed by Ss in the same way no 
doubt as items that they will attempt to 
recall. The Ss, in an attempt to conform to 
the task requirements, exclude the implicit F 
items from recall, or attempt to, but this 
exclusion in no way affects later recall or 
recognition, 

In conclusion, the experiment and the re- 
sults reported in this paper point to clear 
differences both in recall and recognition be- 
tween items that < e explicitly told to 
forget as opposed to items that are implicitly 
to be forgotten, Put another way, there are 
vast paradigm differences among studies that 
fall under the label of intentional or directed 
forgetting and investigators should be sensi- 
tive to such differences. 
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variables, 
pared to rote-repetition instructions. 


tion or pair recognition. 


imagery ratings of the pairs. 


In recent years, two distinctly differ- 
ent techniques have been used to study 
"imagery" when the basic experimental 
materials were pairs of English nouns. 
One technique used nouns with scaled 
ratings of imagery (imagery ratings), and 
the other technique instructed Ss to imag- 
^ ine the referents of the nouns intera sting 
in some way in their imaginary scene 
imagery instructions). An interesting 

pect of these two variables, which may 

may not elicit imagery, is that they 
a pear to have similar effects upon recall, 
d spite the marked differences in experi- 
n: ntal manipulations (e.g., Paivio & Yuille, 
1009), The similarity of effects could be 
interpreted as the manifestation of a single 
underlying process that was induced by 
both variables, or as the reflections of inde- 
pendent but similar processes. Presuma- 
bly, such identical or highly similar pro- 
cesses would influence various measures of 
retention, such as recognition and recall. 

Suppose the variables trigger a single 
underlying process: both variables should 
then have similar effects on recognition and 
recall, Experiment I explored this possi- 
bility. As noted above, both variables 
apparently facilitate recall of the nouns 


! Requests for reprints should be sent to M. J. 
Peterson, Department of Psychology, Indiana Uni- 
versity, Bloomington, Indiana 47401. 


OF IMAGERY INSTRUCTIONS, IMAGERY RATINGS, 
NUMBER OF DICTIONARY MEANINGS UPON 
RECOGNITION AND RECALL 


meanings upon recognition and recall. 
duced superior recognition of stimulus words, response words, and pairs com- 
Response recall also was higher following 
imagery instructions than following repetition instructions. 
ratings aided stimulus recognition and response recall, but not response recogni- 
Thus, the two imagery-encouraging variables had 
different effects upon recognition and recall and cannot be considered as two 
techniques for initiating the same underlying process. 
number of dictionary meanings were independent of the effects of the two 
imagery-encouraging variables, althoug 
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Three experiments investigated the effects of two “imagery-encouraging” 
instructional set and imagery ratings, and of the number of dictionary 


Interactive imagery instructions pro- 


High imagery 


The effects of the 


h the meanings interacted with the 


compared to control conditions, but the 
effects upon recognition are less. clear. 
Bower (1970), using only concrete nouns, 
reported no differences in recognition as 
a function of instructional set. However, 
his use of the concrete nouns may have 
produced a ceiling effect that precluded 
his finding a difference. The relevant 
recall literature suggested that words with 
low imagery ratings should be recognized 
less often than words with high imagery 
ratings when imagery ratings are varied, 
and these effects were verified by Peterson 
and Murray (1973). ‘Their studies also 
showed better stimulus recognition follow- 
ing imagery instructions than following 
repetition instructions. 

While the Peterson-Murray (1973) re- 
sults indicated that both variables had 
similar effects upon recognition and recall, 
their experiments tested stimulus recogni- 
tion (as did Bower's 1970 experiment). 
Experiment II of the present paper ex- 
amined recognition of the response terms 
and of the pairs themselves às functions of 
imagery ratings and the instructional set. 

Another purpose of the present studies 
was to probe the relationships between the 
two imagery-encouraging variables and à 
semantic characteristic of the words. We 
know that the imagery-encouraging varia- 
bles are not reducible to the meaningful- 
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ness or the familiarity of the words (see 
Paivio, 1971, for a review), or to lexical 
complexity (Kintsch, 1972). However, the 
imagery-encouraging variables may be re- 
lated to a more cognitive index such as the 
conceptual complexity of the words. On 
an intuitive basis, words with high imagery 
ratings appear to differ in conceptual com- 
plexity from words with low imagery 
ratings, and the instructional set might 
produce similar differences. The possibility 
of relating the imagery variables to a single 
semantic index has theoretical appeal, of 
course. Consequently, Experiments II and 
III compared a measure of conceptual com- 
plexity, the number of dictionary mean- 
ings, with the two types of imagery- 
encouraging variables. 

To summarize, the primary purposes of 
the series of studies were to investigate: 
(a) the effects of two imagery-encouraging 
variables upon the recognition of individual 
words and words in pairs and upon the 
recall of the word pairs, and (b) the sym- 
metry of effects produced by the two 
imagery-encouraging variables and by the 
number of dictionary meanings. 


EXPERIMENT I 


Experiment I was designed to test recog- 
nition of the stimulus terms and recall of 
the pairs when the stimulus terms were 
shown alone, to provide an opportunity 
for imagining the referents of even low- 
imagery nouns, and when the pairs were 
then presented together. "There were three 
phases. Phase showed the stimulus items 
alone under instructions. to imagine the 
referents of the words or to silently repeat 
the words. Phase 2 presented the stimulus 
items paired with the response words. 
Half of the Ss given imagery instructions 
for Phase 1 were told to imagine the refer- 
ents of the words in teracting in their visual 
scene, and the other half were told to repeat 
the pairs silently. Similarly, Ss who re- 
ceived repetition instructions before Phase 
1 were split into imagery and repetition 
instruction groups for Phase 2. In Phase 


3, all Ss were tested for stimulus recogni- 
tion and for response recall. 
If imagery instructions and imagery 


ratings have essentially the same effect 
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upon stimulus recognition 
recall, then both recogniti 
should be higher with image: 
and high imagery ratings th 
repetition instructions and 
ratings. No interaction woul: 
the other hand, if the va: 
independent operations, the: 
act. For example, when Ss rc 
instructions, they may attem; 
the referents (or some referen: 
of the imagery ratings of the w 
Ss receive rote-repetition inst: 
words with high imagery ra 
induce the strategy of imaginin 
ents, while words with low ima 
would not. In this case, an 
would be found such that the hi 
items shown under imagery ; 
would be recognized and recalle 
curately, and the low-imagery i: 
under rote-repetition instruction 
recognized and recalled least ac 


ions, the 
might: 


imagel 
uctioni 


Method 


Design and materials, Four groups oí 
36 nouns during Phase 1, Two of the 
given instructions to visualize the obj: 
by the words in a vivid way in their \ 
(imagery instructions). The two other 
told to repeat the words lently (ro: 
instructions). The groups then studied 
nouns during Phase 2, One of the groups given 
imagery (I) instructions during Phase [ 
the groups given rote-repetition (R) i: 
during Phase 1 received imagery instructions for 
Phase 2. These groups were identified as 1-1 and 
R-I, respectivel The two remaining groups Te 
ceived rote-repetition instruction just before Phase 
2 began. These groups were de ted as 1-R and. 
R-R. | 

The nouns shown in Phase 1 were the stimuli of 
the Phase 2 pairs. Thirt additional nouns wet 
used as the Phase 2 response terms, Eighteen of the 
pairs were composed of concrete nouns, and the 
other 18 pairs contained abstract nouns. The 
concrete nouns had imagery ratings excceding 6.00 
and the abstract nouns had imagery ratings of less. 
than 4.50, according to the Paivio, Yuille, am 
Madigan (1968) norms. The words were chosen t0 
be approximately equal in rated m aningfulness 
The high-7 words were paired randomly to yield 
18 pairs, and the low-7 pairs were made up in the 
Same way. Eighteen additional high-/ words am 
18 more low-/ words were used as distractors if 
the stimulus recognition test. These words met the 
criteria stated above, 

The words were typed on mats for slide presenta- 
tion. The mats were ordered randomly for Phases 
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TABLE 1 
ORMANCE ON STIMULUS RECOGNITION AND RESPONSE RECALL Tests: EXPERIMENT I 
Stimulus recognition Response recall 
cry ratings 
f Mean hits Mean false alarms ' Ug Mean correct — | Conditional correct 
»up I-I 
High 17.19 Ad .98 8.50 49 
Low 17.44 38 98 5.88 34 
oup I-R 
High 17.19 94 97 5.28 EJ 
Low 16.75 1.25 96 4 
Group R-I 
High 16.00. .88 95 6.69 ES 
Low 16.06 2.56 .90 3.12 20 
Group R-R 
High 15.38 1.88 94 3.50 :21 
Low 15.81 3.31 91 1.44 .09. 


ve, For group names, the first letter indicates Phase 1 instructions, the second, Phase 2 instructions. I = imagery, R = rote 


ition, 


; and 2 and for the stimulus-recognition and re- 
sponse-recall tests, All Ss received the same order- 
ing of mats for each phase, 

Procedure. The Ss were told that there were 
three phases in the experiment and that th 
remember the items of each phase. They 
the appropriate instructions and were shown the 
slides of Phase 1 at a S-sec. rate. Following Phase 1, 
Ss were read their assigned instructions for Ph 
and the Phase 2 slid » shown at an 8-sec. rate. 
After a 60-sec. break, instructions were given for 
the recognition-recall tests, The Ss were told to 
write down the stimulus word as it shown on the 
screen. They then indicated their certainty that 
the word had been shown during Phases 1 or 2 by 

hecking a 5-point scale: Certainly Old, Probably 
Jid, Don't Know, Probably New, and Certainly 
Jew. The next task was to recall the word paired 
with the word written on the sheet, if Ss thought 
hey had seen the stimulus word previously. 1 
lide was shown for 15 sec. during the recognition- 
recall task. 

Subjects. Students in introduct 
courses at Indiana University participated as one 
method of fulfilling a course requirement. Sixty- 
four Ss were randomly assigned to 16 replications 
of the block of four groups. 


y psychology 


Results 


Stimulus recognition. The mean numbers 
of correct stimulus recognitions (hits), the 
mean numbers of incorrectly labeled new 
stimuli (false alarms), and A’ estimates? 
(Pollack & Norman, 1964) are given in 
Table 1. The hit and false-alarm rates 
were calculated from judgments of Cer- 
tainly Old and Probably Old given to true 


? The authors wish to thank Donald E. Robinson 
for supplying the tables of A’. 


old stimuli and to the distractors, respec- 
tively. The mean number of hits for the 
groups receiving imagery instructions before 
Phase 1 reliably exceeded the mean number 
of hits for the groups with rote-repetition 
instructions for Phase 1, F (1, 60) = 214.28, 
but neither the instructions given for Phase 
2, F < 1, nor the imagery ratings, F (1, 60) 
= 1.02, p > .05, produced significant dif- 
ferences in the hit rates. 

The false alarms varied as a function of 
the instructions given for both phases. 
The false-alarm rates were higher following 
rote-repetition instructions given before 
Phase 1, F (1, 60) = 55.86, p < .01, and 
before Phase 2, F (1, 60) = 16.93. The 
imagery ratings did not have a reliable 
effect upon the false-alarm rates, 

The more frequent labeling of a new 
stimulus as old following rote-repetition 
instructions suggested that Ss in these 
groups were using a more lax criterion for 
judgment than Ss in the imagery-instruc- 
tion groups. To explore detection sensi- 
tivity further, A’ estimates were obtained 
for each condition, and these estimates 
were subjected to an analysis of variance, 

The mean A’ estimate for groups receiv- 
ing imagery instructions before Phase 1 
was significantly greater than the mean A’ 
for groups with rote-repetition instructions 
before Phase 1, F (1, 60) = 34.00, p < OL. 
The instructions that preceded Phase 2 did 
not show significantly different estimates 
of A’, F (1, 60) = 2.00, p > .05, nor was 
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the interaction between the Phase 1 and 
Phase 2 instructions reliable (F < 1). 

The other method of inducing imagery, 
the imagery ratings, also showed significant 
effects upon stimulus recognition when the 
bias toward labeling new stimuli as old 
was taken into account. The A' estimate 
was reliably higher for stimuli with high 
imagery ratings than for stimuli with low 
imagery ratings, F (1, 60) = 20.62, p < .01. 
Further, the imagery ratings interacted 
with the Phase 1 instructions, F (1, 60) — 
12.00, p < .01. The mean A’ scores for 
the conditions contributing to this inter- 
action were high-I words, imagery in- 
structions (.98), low-/ words, imagery 
instructions (.97), high-/ words, repetition 
instructions (.95), and low-I words, repeti- 
tion instructions (.91). Tukey tests indi- 
cated that the means for high- and low-I 
words under imagery instructions did not 
differ reliably: all other comparisons were 
significant at the .01 level. These results 
indicated that when Ss received imagery 
instructions, recognition performance was 
uniformly high, regardless of the imagery 
ratings of the pairs. The imagery ratings 
of the pairs became an important determi- 
nant of recognition performance only under 
rote-repetition instructions. No other in- 
teractions were reliable. 

Response recall. Table 1 contains the 
mean numbers of correctly recalled re- 
sponses and the proportions of correctly re- 
called responses conditionalized upon cor- 
rect stimulus recognition. Analyses showed 
the same effects for both response measures, 
hence, only the second measure will be 
discussed. Reliably more responses were 
recalled following correct stimulus recogni- 
tion under imagery instructions than under 
rote-repetition instructions for both Phase 
1 and Phase 2: for Phase 1, F (1, 60) = 
40.00, p < .01; for Phase 2, F (1, 60) = 
152.24, P <.01; and when the pairs had 
high imagery ratings, rather than low 
imagery ratings, F (1, 60) = 14.70, p < .0t. 
No interactions were reliable. 


Discussion 


As expected, both imagery-encouraging vari- 
ables were associated with increased response 
recall, and the two variables did not interact. 
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These results imply that : 
flect the same or highly < 
the stimulus recognition d 
dictory evidence. Phase 1i 
facilitated stimulus recogni! 
repetition instructions, but 
produced differences only w! 


> variables re- 
srocesses, but 
ided contra- 
» instructions 
re than rote- 
igery ratings 
mbined with 


rote-repetition instructions ^e, these two 
variables have different ef on stimulus 
recognition and must be | to induce 
distinct underlying process 

Phase 2 instructions h lifferentias 
effects upon stimulus recog The most 
plausible explanation for th« of Phase 2 
instructions to influence s ecognition 
was that Ss who received in istructions 
before Phase 1 were likely ue to use 
an imagery strategy during | egardless 
of the instructions that imm: preceded 
Phase 2. The carry-over | instruc- 
tional set to the other may | ited from 
consideration by comparing ion per- 
formance on high-I and rds for 
Groups l-I and R-R. This c n showed 
the same effects as those de ibove for 
stimulus recognition (see Ta! z 

Experiment I may have u: mated re- 
call, particularly for the grou; ing rote- 
repetition instructions. It ible that 4 
when Ss were uncertain of their gnition of 
the stimulus words or were unable ‘o recognize : 
the stimulus words, they did not try to remem- 
ber the responses, thus suppressing recall. 
Experiments II and III were designed to 
avoid these potentially confounding effects. 


EXPERIMENT ll 

Experiment I] asked if the imagery- 
encouraging variables could be subsumed 
by the number of dictionary meanings of 
the words of the pair. Such a prediction 
would oppose the contemporary theories 
(e.g., Atwood, 1971; Bower, 1972; Paivio, 
1971), which assume visual imagery and, 
verbal attributes to be distinct but inter- 
acting processes. If these processes do 
interact extensively, then questions about 
the likelihood of enhanced encoding oF 
whatever occurs when the imagery-encour- 
aging variables are manipulated might 
better focus upon the conjoint relationship 
between measures of the verbal-linguistic 
characteristics of the words and of imagery 
ratings. F 

It is possible that the more meanings @ 
word has, the greater are the chances that 
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se these characteristics will be imagin- 
al When an easily imagined noun has 
eanings, or when Ss attempt to 
the referents of a noun with many 
igs, these meanings may "enrich" 
image or they may produce multiple 
images. Other easily imagined words with 
fewer meanings (or less meaningful words 
whose referents are to be imagined) would 
be expected to yield fewer images or images 
with less enrichment. On the other hand, 
the possession of many meanings might 
interfere with the encoding and acquisition 
of words which are more ditficult to imagine. 
Thus, Experiment II manipulated the 
number of dictionary meanings, instruc- 
tional set, and the imagery ratings of the 
pairs of words. The ratings of the pairs 
were assumed to provide a better index of 
the saliency of an image combining the 
referents of the two words than an average 
oi the imagery ratings of the individual 
words of the pair. The imagery ratings of 
the individual words were held relatively 
ccnstant, while the rating of the pair varied. 
For example, JUDGE-TABLE had a high 
imagery rating as a pair and JUDGE-WHEAT 
had a low rating as a pair, although the 
imagery ratings of the individual words 
. were approximately the same. 


mag 


Method 


Preliminary scaling and design. Eighty nouns 
with imagery ratings of 6.00 or higher and with 
meaningfulness ratings of 5.50 or higher were 
drawn from the Paivio et al. (1968) norms. Seventy- 
two randomly selected “stimulus” nouns were then 
paired with each of the eight remaining “response” 
nouns. The imagery ratings of these pairs were 
obtained by administering booklets containing the 
pairs to 30 students in introductory psychology 
courses at Indiana University. Six different book- 

| lets were used so that a particular stimulus noun 

- would be paired with only one of its response words 

~ per booklet. The instructions for rating were similar 

^to those used by Paivio et al., and Ss rated the 
imagery of the pairs on a 7-point scale. 

In addition, the numbers of different “meanings” 
or separate entries found in Webster's Unabridged 
Dictionary (1966) were tabulated. This variable 
was labeled “dictionary-meaning”’ (dm). The num- 
bers of dictionary meanings were calculated for 
pairs of words by adding the number of dictionary 
meanings for the two words of each pair. Finally, 
12 pairs were selected for each of the four cells of 
the design that arose from the two levels of imagery 
ratings (high and low) and the two levels of dictio- 
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TABLE 2 


Mean ImaGery RATINGS, MEAN NUMBER OF 
DICTIONARY MEANINGS, AND SAMPLE PAIRS 
or MATERIALS: EXPERIMENT II 


iti Imagery | Dictionary n 
Condition ratings» | meanings Sample pair 
High I-high dm 5.76 13.62 JUDGE-TABLE 
High J-low dm 5.74 6.92 DOCTOR-PUPIL, 
Low I-high dm 2.70 13.04 DOCTOR-EARTH 
Low I-low dm 3.20 6.96 JUDGE-WHEAT 


Note. Abbreviations: J = rated imagery, dm = number of 
different dictionary meanings. 
^ Maximum score for high imagery was 7.00. 


nary meanings (high and low). ‘The mean values 
are given in Table 2 for each cell. Table 2 also 
shows sample pairs. It should be noted that for the 
imagery variables, each stimulus noun was paired 
with two responses. This technique was used to 
control the imagery ratings of the stimuli and be- 
cause Experiment III was to test recognition of the 
response words. 

Two lists were prepared, using the first pairing 
for a stimulus in one list and the other pairing in 
the second list. Each list contained six high-J, 
high-dm pairs; six high-J, low-dm pairs; six low-Z, 
high-dm pairs; and six low-J, low-dm pairs. 

Subjects. Sixty-four new Ss from the same pool 
were assigned randomly to 16 replications of the 
block of two nd two types of instructions. 

Procedure. The Ss were given either imagery or 
rote-repetition instructions. They were then shown 
slides with the noun pairs at the rate of 8-sec. per 
slide. After about a 2-min. interval, the stimuli 
were presented alone, and Ss were instructed to 
write the appropriate response on their answer 
sheets. The procedure of Experiment IJ was com- 
parable to Phase 2 of Experiment I, followed by a 
test for response recall, Experiment Il had no 
Phase 1 counterpart, nor was any test for stimulus 
recognition introduced. 


Results 


The major results are shown in Table 3. 
The Ss given imagery instructions recalled 
significantly more responses than Ss given 
rote-repetition instructions, F (1, 60) = 
34.17, p < .01, and more responses were 


TABLE 3 


MEAN NUMBER OF CORRECTLY RECALLED 
Responses: EXPERIMENT Il 


Imagery instructions Repetition instructions 


Dictionary 
meanings 
High I Low I High I Low I 
High 4.81 4.03 3.22 1.97 
Low 4.50 4.38 2.84 2.59 
Nole. Abbreviation: J = rated imagery. 
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recalled from the high-7 pairs than from 
the low-7 pairs, F (1, 60) = 20.47, p < 01. 
The latter result indicates that response 
recall reflects the imagery ratings of the 
pairs, not simply the ratings of the individ- 
ual words of the pairs. No other main 
effect was reliable. 

The failure to demonstrate a significant 
main effect for the number of dictionary 
meanings indicated that this variable did 
not explain the effects of the two imagery- 
encouraging variables upon response recall. 
However, the number of dictionary mean- 
ings did interact with the imagery ratings 
of the pairs, F (1, 60) = 12.42, p < 01. 
The high-Z pairs were recalled more often 
when they had more dictionary meanings 
than when they had fewer dictionary 
meanings, but the opposite was true for 
low-/ pairs, as shown in Table 3. The 
number of dictionary meanings did not 
interact with the instructional set. 


Discussion 


The results of Experiment II supported 
theories which posit independent visual and 
verbal components or processes and argued 
against the possibility that the imagery-encour- 
aging variables might be reduced to a measure 
of conceptual complexity such as the number 
of dictionary meanings. The interaction be- 
tween the imagery ratings of the pairs and the 
number of dictionary meanings may have 
been produced by increases in the number of 
meanings which "enriched" the image or 
produced multiple images, while the extra 
meanings hindered the processing of words 
Which were difficult to imagine. 

Experiment II also showed that the imagery 
ratings of the pairs of words predicted recall 
more satisfactorily than the imagery ratings 
of the individual words, providing additional 
Support for the notion that the association of 
the words is important. Experiment III in- 
vestigated recognition of response terms and 
of the pairs under the same conditions as 
Experiment II. 


ExPERIMENT III 


Experiment III presented pairs of words 
with high and low numbers of dictionary 
meanings and with high and low ratings 
of imagery. During the exposure trials, 
one group received imagery instructions 
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and the other group, repi 1 instruc: 
tions. Half of each grou given 

response recognition test a lf, a pai 
recognition test. If the ima encourage 
ing variables affect the cı ig of the 
words, subsequent recogniti the indi- 
vidual words and of the p hould be 
better for pairs shown und igery ins 


structions or for pairs with imagery 
ratings compared with pair: 1 under 
rote-repetition instructions r pai 
with low imagery ratings. | predic- 
tions have been tested for timulus 
items (Bower, 1970; Peterso: Murray, 
1973; Experiment I of the pre series). 
Experiment III examined the . s upon 
recognition of the response term | of the 
pairs. 

The number of dictionary m« s was 
included because other investig s have 
reported that recognition was ersel 
related to the frequency of occ: ice of 


words (Shepard, 1967) and to thc aning- 


fulness of dissyllables (McNuls. 1963), 
while recall showed a positive rel: ship: 
The finding that dictionary meani:...s mir- 
rored these results while imagery-encourag- 
ing variables did not would be ad itional 
evidence that the imagery-encouray ie vari- 


ables differ from the various measures of 
verbal-linguistic characteristics, 


Method 


Materials and design. The materials wer 
used in Experiment Il. Half of the Ss saw | 
during the acquisition stage and the other h: 
List 2. All Ss were then transferred to a rec 
list that contained all of the pairs or responses from: 
Lists 1 and 2, Thus, on the recognition test, half 
of the items would be old and half would be new, 
for all Ss. 

One group was told to imagine the referents of 
the words, and a second group was told to repeat 
the words silently. These two groups were sub- 
divided for the recognition test. Half of each group 
received the response recognition test and half, the 
pair recognition test. 

Subjects. One hundred and ninety-two new 
students from the same pool were assigned to 24 
replications of the block of eight cells produced by 
the two lists and the four instruction groups. - 

Procedure. After hearing the appropriate in- 
structions, the pairs of the acquisition list were 
presented at an 8-sec. rate. The Ss were reminded 
to use imagery or rote repetition during the recogni- 
tion test to help them make their decisions. The 
Ss rated their confidence of their judgments by 
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TABLE 4 
PERFORMANCE ON RESPONSE AND PAIR RECOGNITION Tests: EXPERIMENT II 
Imagery instructions Repetition instructions 
»rmance measure High I Low High I Low I 
High dm | Lowdm | Highdm | Lowdm | Highdm | Lowdm | Highdm | Low dm 

Reton eccogniton Esa | 5t i ^ SF 

Mean hits 5. 5.46 5.25 .50 4.15 4.58 4.21 4.75 

Mean false alarms | 2.04 | 79 1.50 1.00 1.96 .96 1.62 1.38 

A! Tee | 94 ‘88 ‘93 ‘80 185 78 ‘85 
Pair recognition 

Mean hits 5.71 5.91 5.71 5.71 5.21 5.08 5.42 5.21 

Mean false alarms .08 08 21 16 .29 25 33 239 

A’ 98 99 97 98 95 94 95 94 


Vote. Abbreviations: Z = rated imagery, dm = number of different dictionary meanings. 


ircling numbers on the same 5-point scale used in 
iexperiment I, No measure of response recall was 
obtained in Experiment HI. 


Results and Discussion 


The mean numbers of hits, false alarms, 
and the A’ estimates are given in Table 4 
for the four groups. The recognition of the 
response words was significantly better 
following imagery instructions than follow- 
ing repetition instructions, F (1, 46) = 
13.64, p < .01, and for words with few 
rather than many dictionary meanings, F 
(1, 46) = 34.16, p < .01. Imagery ratings 
did not have a differential effect upon 
response recognition (F < 1). No inter- 
actions were significant. 

The finding that words with few dictio- 
nary meanings were recognized with greater 
accuracy than words with many dictionary 
meanings, while the opposite was true for 
recall, corresponded to similar results re- 
ported by Schwartz and Rouse (1961) and 
Shepard (1967) for the frequency of occur- 
rence of words and by McNulty (1965) for 
the meaningfulness of dissyllables. 

In general, detection of old from new 
pairs was superior to the detection of old 
from new responses. Imagery instructions 
also facilitated the recognition of the pairs, 
F (1,46) = 5.95, p < .05, but neither the 
imagery ratings, F (1,46) = 1.00, p > .05, 
nor the dictionary meanings, F < 1, were 
effective in enhancing the detectability of 
the pairs. These results make it clear that 
the response recall found in Experiment IT 


for these pairs could not be explained by 
simple recognition of either the response 
term alone or of the pair as a unit. 


GENERAL DISCUSSION 


The results of the three experiments indi- 
cated that theoretical explanations of imagery 
must accord separate treatments to the two 
imagery-encouraging variables, instructional 
set and imagery ratings, because the two v: ri- 
ables had different effects upon recognition. 
Their effects upon recall were highly similar 
and did not interact. 

Imagery instructions aided recognition of 
the stimulus words, the response words, and 
the pairs to a greater extent than control 
(rote-repetition) instructions. Imagery ratings 
were directly related to recognition of the 
stimulus words experienced under rote-repeti- 
tion instructions but not to recognition of the 
stimulus words shown under imagery instruc- 
tions or to recognition of the responses or the 
pairs, irrespective of the instructional set, 
Clearly, the instructional variable has a wider 
scope of influence than the imagery ratings, 
and the two techniques for encouraging im- 
agery induce processes which have differential 
effects upon recognition. 

Despite their disparate effects upon recogni- 
tion, the imagery-encouraging variables showed 
a highly similar influence on recall, Signifi- 
cantly more responses were correctly recalled 
following imagery instructions than following 
repetition instructions and for pairs with high 
rather than low imagery ratings. 

Why should the two imagery-encouraging 
variables have different but interacting effects 
upon recognition but not upon recall? One 
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explanation is that one of these variables, 
imagery instructions, influences both encoding 
and retrieval, thus affecting recognition and 
recall, while the other variable, imagery 
ratings, exerts its primary effect upon re- 
trieval. This explanation would have to be 
modified to permit imagery ratings to affect 
encoding when no imagery instructions were 
in effect (see Paivio, 1971, for a review). 

A varient hypothesis is that imagery instruc- 
tions induce a process that facilitates both 
recognition and recall Imagery ratings, of 
secondary importance, are effective only to the 
extent that they induce the same process. This 
hypothesis predicts that recall performance 
should mimic recognition performance, in con- 
tradiction to the present results. A slightly 
different version of this hypothesis is that recall 
of the pairs is facilitated by both variables, 
through enhanced recognition of the cue term, 
and that imagery instructions are dominant 
over imagery ratings. Further, imagery in- 
structions simply have additional effects, such 
as the enhancement of the responses and of the 
pairs, that are irrelevant to recall. This possi- 
bility seems most reasonable. 

The process or processes that underlie the 
imagery-encouraging variables may or may 
not involve visual components. It is quite 
plausible that the imagery instructions, for 
example, trigger active, but not necessarily 
visual, rehearsal mechanisms on the part of 
S, whereas the rote-repetition instructions 
elicit a far more passive process. So long as 
the process produces sufficient encoding, recall 
will be possible. The more active rehearsal 
might well yield differential confidence ratings 
of recognition performance, but recall would 
not differentiate between rehearsal and more 
passive repetition if the items were encoded 
at some minimal level, 

Neither imagery-encouraging variable was 
explainable by a measure of conceptual com- 
plexity such as the number of dictionary 
meanings. Increases in the number of dictio- 
nary meanings facilitated recall but hindered 
recognition of the responses, whereas the 
imagery instructions, in particular, were posi- 
tively related to both recall and recognition 
of the responses. These results also refuted 
the earlier proposal that high-imagery words 
with many meanings might yield more en- 
codings or an enriched encoding. Instead, the 
encoding accorded high-imagery words with 
many meanings presumably is less distinctive 
than the encoding of high-imagery words with 
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a limited number of dictionary 
explain the superior recall o^ 
despite the less accurate recogni: 
sary to assume that retrieval 
more effective with the high-: 
meaning words. 

Thus, we are faced with | 
dilemma of acknowledging 1 
sometimes contradictory effects 
tion and recall of two variables : 
induce imagery processes witho 
to assess precisely why these 
effective and without being abl e these 
variables to other characteristi verbal 
materials known to affect rec: om and 
recall. 


itriguing 
rlapping, 
recogni- 
sllegedly 
ng able 
bles are 
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Undergraduate Ss were required to indicate whether each stimulus in a series 
of CCCs, CVCs, WORDs (CVC type) and meaningful CCCs (eg., JFK, 
LSD) was a word or a nonword. Reaction time (RT) was the dependent 
l measure. The serial model of word-nonword classification, which states 
that phonological evaluation precedes memory search, would predict no dif- 
| ferences in RT between the meaningless CCCs and the meaningful CCCs, 
since the two items are equally unlawful. The parallel model, which assumes 
that phonological evaluation and memory search occur simultaneously, would 
predict that RTs to meaningful CCCs would be greater than those to meaning- 
less CCCs, because for the former type of item, there is confusion in finding 


meaning in an unlawful nonword. The results supported the parallel model in 


that RTs to meaningful CCCs were con: 
In a second experiment, 


ingless CCCs. 


sistently greater than those to mean- 
RTs to frequent CCCs equalled RTs 


to nonfrequent CCCs, suggesting that the word-nonword decision was not 


based on a frequency determination. 


A considerable amount of attention has 
recently been paid to what Norman (1969, 
pp. 162-163) has termed the "mantiness 
problem." A person is able to recognize 
in a very short amount of time that 
"mantiness" is not a word in his vocabu- 
lary. The problem, of course, is how this 
accurate decision is arrived at so quickly. 
It has therefore been the goal of several 
recent studies to discover the processes 
underlying this ability. 

When confronted with "mantiness" it is 
highly unlikely that the person engages in 
a scan of all the words he has stored in 
memory (this process would take too much 
time), but it is generally agreed that recog- 
nition of a word or rejection of a phono- 
logically lawful nonword requires some 
search for meaning. In addition, several 
authors (Norman, 1969; Rubenstein, Lewis, 
& Rubenstein, 1971; Stanners, Forbach, & 
Headley, 1971) suggest that rejection of 


1 This research was supported in part by National 
Institutes of Health Training Grant 00084. Part 
of this paper was presented at the meeting of the 
Eastern Psychological Association, Washington, 
D. C., May 3-5, 1973. The author wishes to 
express his gratitude to Ron Lecours for his as- 
sistance in the collection of data and to his advisory 
committee for their helpful suggestions. 

2 Requests for reprints should be sent to Neil 
Novik, Department of Psychology, University of 
Connecticut, Storrs, Connecticut 06268. 


a phonologically unlawful item such as 
"mgsptzy" may be accomplished by means 
of some preprocessing of the item which 
would determine its phonological lawfulness 

The present paper deals with the ques 
tion of how this preprocessing and the 
memory search may be related. The two 
basic alternatives are that the operations 
are sequential or that they act in parallel. 
In the sequential model, the person first 
decides whether the item could be a word, 
according to phonological rules. If a stimu- 
lus is rejected, no search through memory 
for meaning is needed. 1f, on the other 
hand, the item is accepted as being phono- 
logically lawful, the person must search for 
meaning. A word response would be emit- 
ted if meaning was found, while a nonword 
response would follow the failure to find 
such meaning. This type of model has 
been described by Rubenstein et al. (1971) 
and Stanners et al. (1971). Its support 
comes mainly from studies in which 5 is re- 
quired to respond to a visually presented 
stimulus simply on the basis of whether the 
item was a word or a nonword. ‘The model 
predicts that reaction times (RTs) for re- 
jecting phonologically unlawful units (e.g., 
consonant-consonant-consonants—CCCs) 
would be faster than for accepting words, 
since no search for meaning is required in 
the former case. Furthermore, RTs to 
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phonologically lawful nonwords (e.g., con- 
sonant—vowel-consonants—CVCs) should 
be slower than those to words, since an 
exhaustive search is needed when the item 
is not a word. 

These predictions have been supported in 
several recent studies. Lawful nonwords 
have been shown to have longer RTs than 
both real words (Rubenstein, Garfield, & 
Millikan, 1970; Snodgrass & Jarvella, 1972) 
and unlawful nonwords (Rubenstein et al., 
1971), The strongest support for the se- 
quential model comes from the study by 
Stanners et al. (1971), in which Ss re- 
sponded word or nonword to a series of 
visually presented CCCs, words (CVC type) 
and CVCs (nonwords). As the sequential 
model predicts, CCCs were responded to 
the fastest, words were intermediate, and 
CVCs were the slowest (approximate RTs 
were 600, 715, and 840 msec., respectively). 
The authors also found that the frequency 
in the English language of the initial and 
terminal consonants of the stimuli affected 
the RTs for words and CVCs but not for 
CCCs. These results suggested that pho- 
nological evaluation precedes the search 
for meaning and that the former operation 
is unaffected by at least one variable 
(consonant frequency) affecting the latter 
operation, 

The parallel model assumes that phono- 
logical information and the search for 
meaning are processed simultaneously. Al- 
though, at present, there is no direct 
support for such a model, several lines of 
evidence have demonstrated that familiar- 
ity can have a very early effect in the 
processing of visual information. At least 
two studies have shown that the recogni- 
tion of individual letters is improved when 
those letters are embedded in words 
(Reicher, 1969; Wheeler, 1970). Words 
have also been shown to have an advan- 
tage over nonwords in physical matching 
(Eichelman, 1970) and visual scanning 
(Krueger, 1970; Novik & Katz, 1971). 

These studies are important in showing 
that higher order information such as 
meaning may have very early effects in 
visual information processing. Although 
not directly supporting the parallel model 
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described earlier, these data 
might be possible for the 
given item to be processed e^ 
tion, even simultaneously w 
cal information. 

The present study seeks to ; 
support for that type of p 
Specifically, it was hypothes 
kinds of information, one co" 
nological lawfulness and ti e 
cerning the meaning of the i 
processed simultaneously. Th 
is based on the procedure used 
et al. (1971). The Ss were pri 
a series of three-letter items ; 
indicate whether each item wa 
a nonword. The dependent n 
RT. As in the Stanners et à! 
were presented with CCCs, ( 
CVC words. In contrast to Stan 
Ss in the present study also s 
of CCCs which presumably co 
good deal of meaning (e.g., J! 
etc.). The main comparison in th 
study was between the RTs to t. 
ingful CCCs and those to the noni 
ful CCCs. If a determination o! 
logical lawfulness were made pric: 
search for meaning, and S was «ble to 
reject an item on the basis of thai deter- 
mination, then the RTs to JFK and IRE 
should not differ. This is true because the 
items are presumably equally unlawful and 
because meaning does not affect the phono- 
logical decision in the sequential model. 
If, however, the two operations occurred 
in parallel, then the finding of meaning for 
JFK would conflict with the decision of 
phonological unlawfulness, causing a delay 
in the nonword response. In summary; 
the sequential model would be supported. 
by the finding of no differences in the RTS 
to the two types of CCCs, while such à 
difference would tend to support the parallel 
model. 


le direct 
model, 
hat twos 
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ings 
hono- 
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EXPERIMENT I 
Method 


Subjects. Twenty right-handed female under 
graduate students enrolled in introductory psy- 
chology at the University of Connecticut served 45 
Ss in the experiment. Each S received 1 hr. 0 
experimental credit toward the course requirement: 


PARALLEL PROCESSING IN WORD-NONWORD CLASSIFICATION 


App (he main piece of apparatus was a 
Kodali usel slide projector with a Pronto Press 
tachis shutter. The equipment used in 
progr lide presentations consisted of 
two c. timers, a Lafayette triple 


pul ner, and a Hunter interval timer. 
Ti ve were three telegraph keys placed on a table 
fy atof S. The center key was used to initiate 
a ' al, while the two outside keys were us ad for 
m king responses. A Hunter digital clock me red 
i milliseconds the interval between the presenta- 
ion of the stimulus and the response. 

The slides were projected onto a rear-projection 
screen placed 5 ft. from S: The screen measured 


14 X 14 in, but all except a center portion of the 

1, measuring 5 X 3 in, was blocked out by 
c oard taped to the back, The projected image 
of he stimulus was 3 X 1 in, and was projected 


nter of the ser 
The stimuli cor 


ed of WORDs (CVC 


ty} Cs, and CVCs (nonwords), typed in upper- 
cas vs and copied onto transparency film by a 
3M mofax No stimulus contained more 
tha ít 


I were-two 60-trial practice blocks. 2 
bloc! -re were 30 WORDs, 15 CC nd 15 CVCs. 


The Cs and CVCs were constructed by randomly 
selec’ ug 15 of the WORDs and inserting a random 
cons «ant or vowel in the center position (the only 


rest; tion being, of course, that for the CVCs, 
ano ser WORD did not result). For example, if 
CA* was the WORD, the CCC and CVC could 
have been CHN and CEN, respectively. Thus, in 
cach practice session, 15 of the WORDs had two 
corresponding nonwords, while the other 15 WORDs 
were filler items. 

For the test trials, 15 meaningful CCCs (e.g. 
JF IS, LSD) were chosen, The selection of these 
items was based on the results of questionnaires in 
which pilot Ss were asked to indicate the meaning 
of 20 such meaningful CCCs. The five items re- 
sponded to most incorrectly were eliminated. From 
cach meaningful CCC (CCC+) another CCC and 
a CVC were constructed in the same manner as 


In addition, 45 


NBC, LBJ). 
CCC +s, 15 CCCs 
Within each practice block and the test block the 
stimuli were randomly mixed. Each 5 received the 
same order of stimuli. 

Procedure. The S was instructed that pressing 
the center key would initiate the trial, and that 2 
sec. after she pressed the key, the stimulus would 
be presented. The Ss were told to respond to that 
stimulus as quickly and as accurately as possible on 
the basis of whether it was a word or a nonword. 
The key on the right was designated for a word 
response, while the left key was to be used for a 
nonword response. The 5 used her right hand for a 
word response and her left hand for a nonword re- 
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TABLE 1 


MEAN REACTION Tres (RTs) (IN MSEC.), 
STANDARD DEVIATIONS, AND PERCENT 
ERRORS FOR THE Test TRIALS 


IN EXPERIMENT I 
Stimulus} Numbct | Mean RT| SD — | % etror 
ccc+ 15 695 66 1.6 
CCC 15 672 59 1.6 
WORD 45 671 AT 3.9 
CVC 15 185 18 11.6 
Note. Abbreviations: C = consonant, V = vowd, + = 
meaningful. 


sponse. ‘There were no requirements as to which 
hand was to be used to press the center key, but S 
was instructed that both hands should be ready to 
respond when the slide was presented. No 5 re- 
ported any difficulty in getting ready for cach 
presentation. 

During the first practice block, speed and ac- 
curacy were emphasized by E approximately every 
15 trials. There was a 5-min. rest period between 
the two practice blocks. There was also a 10-min, 
interval between the second practice block and the 
test trials, during which time Ss were told that for 
the test trials, they would receive 4¢ for every RT 
under 900 msec., and that 25¢ would be taken away 
for every error. There was à slight delay midway 
through the test trials, long enough for Æ to change 
slide trays. 

For each trial, a maximum of 4 sec. was allowed 
"The failure criterion for the test 
Two Ss made more than 10 


for a response. 
trials was 10 errors. 
errors and were replaced. 


Results 


Median RTs of correct responses for the 
test trials were obtained for each S under 
each condition. Table 1 presents the RT 
data as well as the error data for the test 
trials. A one-way repeated measures analy- 
sis of variance was performed on the RTs 
to the three nonword conditions. The RTs 
to the WORDs were eliminated from the 
analysis because all of the WORDs were 
filler items, i.e, they were chosen indepen- 
dently of the nonwords. The analysis re- 
vealed a highly significant treatment effect, 
F (2, 38) = 82.67, p < .001. Comparisons 
between conditions were made by a series 
of Duncan's multiple range tests. As in 
several of the studies previously cited, the 
RTs to the lawful nonwords (CVCs) were 
longer than those to the unlawful nonwords 
(CCCs and CCC + s) (p < .01). More 
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importantly, however, the RTs to the CCC 
+ s were significantly longer than those to 
the CCCs (p « .05). The latter effect was 
also found in individual trends, with 18 of 
the 20 Ss having a greater RT to the 
CCC + s than to the CCCs. The CCC+ 
vs. CCC difference was also found with in- 
dividual pairs of stimuli. Median RTs 
across Ss for each CCC+ and CCC were 
obtained, and of the 15 pairs of stimuli, the 
RTs to 12 CCC + s were greater than 
those to their corresponding CCCs, while 
3 pairs showed the reverse trend. Both of 
these proportions are significant (p < .025) 
according to sign tests. 

In order to compare the RT data for 
WORDS to the data for the other condi- 
tions, a series of related £ tests were per- 
formed on the individual medians. The 
RTs to WORDs were found to be signifi- 
cantly shorter than those to CVCs, £ (19) 
= 8.70, p < .001, but no other differences 
were found. 

A series of sign tests showed that there 
were significantly more errors made with 
the CVCs than with the other stimuli 
(b « .001), while no other differences in 
percent errors were found. Since the RTs 
of incorrect responses might be indicative 
of strategies in processing (Stanners et al., 
1971), median RTs for the incorrect re- 
Sponses were obtained for the CVCs and 
WORDs. The medians were based on a 
total across all Ss of 35 incorrect responses 
in both conditions. For the CVC errors, 
the median was 711 msec., which was con- 
siderably lower than the mean RT for CVC 
correct responses. The median for WORD 
errors, on the other hand, was 756 msec., 
which was greater than the mean RT for 
correct WORD responses. For both the 
CCC and CCC+ conditions, there were too 
few errors for a median to be meaningful. 


Discussion 


The most important finding of the experi- 
ment was that the RTs to the meaningful, but 
unlawful, items (CCC + s) were greater than 
those to the unlawful meaningless items 
(CCCs), a difference that would not be pre- 
dicted by the sequential model described 
above. According to that model, there is a 
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l classifica- 
honological 
and if the 
unlawful, a 


two-stage process in word-n: 
tion. During the first stag: 

structure of the item is evali 
item is found to be phonologi 


nonword response may be L If the 
item is found to be lawful, S is memory 
in order to determine if the indeed a 
word. 

In the present experiment, + sand 
the CCCs were presumably « unlawful, 
and therefore, according to su del, both 
types of items should have be« ted in the 
first stage, yielding equal R7 finding 
of a consistent difference in R veen the 
CCC-F and CCC conditions bri: sequen- 
tial model into serious doubt. present 
data suggest that a phonologi: iluation 
does not necessarily precede memory. 
scan, and in fact, the two op on may 
operate in parallel. It is hyp: ed that 
the RTs to the CCC + s are d: because 
there is a conflict of information phono- 
logical evaluation leading to a ord re- 
sponse is opposed to the finding aeaning 
which might normally lead to a w: "sponse. 

It is important to note that for varallel 
model, no substantial changes need ve made 
regarding the nature of the operat: them- 
selves. It has usually been assume: “hat the 
phonological evaluation involves ;e rule 
process which analyzes and evalu phono- 


logical lawfulness (Stanners et ) 
Rubenstein et al. (1971) suggest thai pa 
this process also involves some pl: 


coding of the visual stimulus. The 


been de- 


meaning, on the other hand, has 
scribed as a serial self-terminating scan through 
a subset of memory representations in search 


of some form of semantic marker (Stanners 
et al., 1971). 

The postulation of a self-terminating scan 
for meaning was supported in several respects 
by the present data. First, the finding that 
RTs to CVCs were consistently longer than 
those to WORDs suggests that when the item 
was not a real word, S had to continue his 
search until he exhausted the set of representa- 
tions he was searching. When meaning was 
found, however, the search was terminated, 
yielding faster RTs to WORDs. Secondly, a 
self-terminating scan leads to two related pre- 
dictions concerning the RTs of incorrect re- 
sponses. An error with a CVC would mean 
that a semantic marker was incorrectly de- 
tected, terminating the search. The RTs for 
CVC errors, therefore, should be shorter than 
those for correct CVC responses. This rela- 


tionship should be opposite with WORDs, 
© since an error indicated that S failed to find 
the marker, making the scan exhaustive. The 
data support these predictions: the CVC error 
RTs were 74 msec. less than the CVC correct 
responses, while the WORD errors were 79 
msec. longer than the WORD correct re- 
sponses. Similar results and conclusions were 
also reported by Stanners et al. (1971). 

The parallel model not only has the advan- 
tage of being able to account for the CCC+ 
vs. CCC difference, but it is also not incon- 
sistent with aspects of the present data 
and previous used to support the sequen- 
tial model. A dy mentioned, the finding 
that CVC RTs were longer than those to the 
other kinds of stimuli is easily predicted by 
the parallel model. Similar results were re- 
ported by Stanners et al. (1971), Rubenstein 
et al. (1970), and Snodgrass and Jarvella 
(1972), and this is probably the most consistent 
finding in this type of experiment. 


: The sequential model, as originally stated, 


clearly predicts that RTs to WORDs should 
be longer than those to CCCs, since with the 
latter stimuli, no memory search is required. 
This prediction was supported by Stanners 
et al. who found that RTs to CCCs were more 
than 115 msec. faster than those to WORDs. 
In the present study, however, RTs to WORDs 
essent equalled those to CCCs. While 
this failure to replicate the previous findings 
may have been due to the fact that WORDs 
were responded to with the right hand, while 
nonwords were responded to with the left 
i hand, it should be pointed out that at least 
two other studies have also reported that 
j 


WORD RTs were as fast or faster than RTs 

to unlawful nonwords.  Rubenstein et al. 
(1971) obtained RTs of 842 and 876 msec. for 

their words and unlawful nonwords, respec- 
tively, while Snodgrass and Jarvella (1972) 
reported RTs of 578 and 621 msec. for these 
types of stimuli, respectively. In both studies, 
the authors explain their data by asserting 
that when an item is phonologically unlawful, 
nreprocessing and the detection of that un- 
s!ness, leading to a nonword response, 


nger than the processing of lawful 


issarly, the parallel model can account for 
RI ; WORDs being slower or faster than 
those to CCCs by asserting that one of the 
operations takes longer than the other. It is 
also possible that the two operations may be 
differentially sensitive to such experimental 
manipulations as instructions or reward con- 
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tingencies emphasizing either speed or ac- 
curacy. Therefore, it is clear that the rela- 
tionship between the RTs to WORDs and 
CCCs is not critical for distinguishing between 
the parallel and sequential models, since both 
models can account for both kinds of data. 


EXPERIMENT II 


Experiment lI was designed to control 
for the possibility that the memory search 
was sensitive not to meaning, but simply 
to the frequency of occurrence in the 
English language. Frequent CCCs were 
compared to infrequent CCCs much in the 
same way that CCC + s were compared to 
CCCs in Experiment I. If the memory 
search were sensitive to frequency, then, 
according to the present argument, RTs to 
the frequent CCCs should have been longer 
than those to the infrequent CCCs. 


Method 


Subjects. Twenty female undergraduate students 
enrolled in introductory psychology at the Uni- 
versity of Connecticut served as Ss in the experi- 
ment, All of the Ss were experimentally naive. 
Each S received 1 hr. of experimental credit toward 
the course requirement. 

Apparatus. The apparatus was the same as used 
in Experiment I. 

Stimuli, The stimuli in the practice sessions were 
the same as in Experiment I. For the test trials, 
the 15 most frequent CCCs were selected on the 
basis of the total score in the norms reported by 
Underwood and Schulz (1960). The range of the 
frequency scores was 50 to 534, and the mean score 
was 153, From each of the frequent CCCs (CCCfs) 
another CCC was constructed by replacing the 
middle consonant with another consonant so that 
the resulting CCC had a frequency of 0 in the 
Underwood and Schulz norms. In addition, CVCs 
were constructed from each CCCI by inserting a 
random vowel in the center position (with the re- 
striction that a word did not result). There were 
also 45 WORDs used in the test session, which were 
the same WORDs as used in the test session of 
Experiment 1. Thus, there were again 90 test 
trials: 15 CCCfs, 15 CCCs, 15 CVCs, and 45 
WORDS. 

Procedure. All procedures were identical to those 
of Experiment I. 


Results 


All scoring and analyzing procedures 
were the same as in Experiment I. The 
results for the test trials appear in Table 2. 
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TABLE 2 
Mean Reaction Times (RTs), STANDARD 
DEVIATIONS, AND PERCENT ERRORS 
FOR THE TEST TRIALS 
IN EXPERIMENT [I 


Stimulus Number 


type of stimuli | Mean RT SD % etror 
CCCI 15 678 47 2.0 
CCC 15 677 46 xt 
WORD 45 685 49 14 

VC 15 792 75 7.6 
Note. Abbreviations: C = consonant, V = vowel, f = fre- 


quent. 


The analysis of variance revealed a sig- 
nificant treatment effect for the three non- 
word conditions, F (2, 38) = 49.11, p< 
:001. A series of Duncan multiple range 
tests revealed that RTs to the CVCs were 
again greater than those to the other stimuli 
(p < .01), but there was no significant dif- 
ference between CCCfs and CCCs, ruling 
out the frequency hypothesis. The lack of 
a CCCf vs. CCC difference in the overall 
means was paralleled by individual trends, 
and intrastimulus pair differences, 

All other aspects of the results, including 
the RTs to WORDS, closely replicated the 
results of Experiment I. 


Discussion 


The hypothesis that Ss decide that an 
item is a word or a nonword on the basis of 
a simple frequency determination was not 
supported by the present data. Although it 
might be argued that the CCCfs in the present 
study were not frequent enough to cause a 
delay in RT, it should be pointed out that 
they are more frequent than the CCC +s 
used in Experiment I, which did cause such 
a delay. Therefore, it seems likely that it was 
the meaning of the CCC +s, rather than 
their frequency, which caused the effect. As 
Rubenstein et al, (1970) suggest, the role 
of frequency might be to limit the subset of 
memory representations to be searched, but 
frequency itself is probably not the deciding 
factor. 


CONCLUSION 


The sequential model and the parallel model 
described above are based on many of the same 
arguments and assumptions, and can account 
for much of the same data. In Experiment I, 
however, the CCC vs. ccc+ comparison 


NEIL NOVIK 


separates the predictions of the t, 
the data clearly support the | 
The results of Experiment II su 
memory search was not a sim; 
determination. The parallel moc 
principle to other previous mode! 
one described by Ingalls (1972). 

as in Experiment I, Ss could ha 
on the basis of the prelimina: 
alone, if the organization of the o 
sequential, His data, however, sı 
the higher order operations were | 
to complete the task, without res: 
information gained by the prelimi 
tions, Similarly, the most reason! 
sion to be drawn from the present « 
Ss are able to analyze the meaning o! 
lus without first evaluating it« p 
properties. 


requeney 
similar in 
ch as the” 
* study, 
sponded | 
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\IPORAL“COURSE OF RECOVERY FROM INTERFERENCE 


value of an interposed intertrial interval. One interpretation of 


uggesting that this interpretation of the effect is inadequate. 


the deficit occurs in the encoding, storage, 
or retrieval stage, but simply that prior 
experience. with the same class of items 


Journal oí ntal Psychology 
1974, Vol «1. 6c 1021-1028 
THE 
AND DEGREE OF LEARNING IN THE 
BROWN-PETERSON PARADIGM ! 
DELOS D. WICKENS? anp M. M. GITTIS 
The Ohio State University 
In the Brown Peterson paradigm, performance declines across successive 
trials with the same class of materials, but it improves as a function of the 
fect is that the previous interfering items are forgotten with time. This 
i pretation was tested by repeating items prior to the increased intertrial 
interval in order to reduce their rate of forgetting. No evidence was obtained 
io indicate that this variable influenced the temporal course of recovery from 
e interference, 
the Brown-Peterson paradigm 
(Bi 1938; Peterson & Peterson, 1959), 
suc e trials with the same class of 
mat : are given at an intertrial interval 


in : ieighborhood of 30 sec., and with a 
con: «nt retention interval for all trials, 
per^ mance drops markedly and reaches 
asy. ptote in three or four trials (Keppel 
& ' aderwood, 1962; Loess, 1967; Wickens, 
19/9). Two kinds of interpretations have 
becas made of these data. One is that $ 
becomes bored or “inattentive” and hence 
performs less well. ‘This interpretation 
seems most implausible, for the asymptote 
is reached within 2 min, after the experi- 
ment is begun, and if S's attention were 
as fickle as this interpretation implies, one 
would have to regard as suspect the results 
of almost all laboratory research in 
psychology. 

The most plausible interpretation of the 
decline is that lower performance on the 
later trials is engendered by the inter- 
ference resulting from the prior inputs. 
Since an asymptote is reached, it must be 
assumed that the growth of interference is 
limited to some maximum achieved within 
about four trials. Obviously, this is a 
proactive interference interpretation. Such 
an interpretation does not state whether 


V The research was supported in part by National 
Science Foundation Grant GB-33680 to the first 
author. 
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Wickens, Department of Psychology, The Ohio 
State University, Columbus, Ohio 43210. 


inhibits performance on subsequent items. 
The occasional occurrence of intrusions of 
prior items during the recall of later items 
offers some positive evidence of competition 
at the time of retrieval, although it does 
not negate an interference effect at the 
other stages. 

The degree of decline in performance is a 
function of the intertrial interval (€ 'ermak, 
1969; Loess & Waugh, 1967); but what is 
more important for the present ch 
is that if after » trials, an increase in 
intertrial interval occurs before trial n + 1, 
performance will improve on trial à + 1 
(Peterson & Gentile, 1965). 

Kincaid and Wickens (1970) found an 
orderly increment in trial n+ 1 as the 
critical intertrial interval was incr ed 
beyond the usual 30 sec. by 15-120 s 
At this latter temporal value, performance 
was increased by about two thirds of the 
drop from Trial 1 to Trial 4. This general 
result was also obtained by Hopkins, 
Edwards, and Cook (1972). 

A further interesting characteristic of 
this spacing effect is found in the research 
of Nield (1968). He used consonant 
trigrams (CCCs) as his target items and 
color naming as the rehearsal preventative 
tas! During the lengthened intertrial 
interval, whose increment was 16, 48, or 
80 sec. for different groups, his three sets 
of groups engaged in different types of 


= 
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TABLE 1 


ScHEMATIC DESCRIPTION OF THE PROCEDURE FOR 
THE VARIOUS GROUPS OF THE EXPERIMENT 


Group Trial sequence 

Experimental 0 CoNoCi1C«C3CiC:CiC4 
Control 0 CoNoCsCoCrCiCaCsCa 
Experimental 1 CoNoCiC2CsCiC2CaNiCa 
Control t CoNoCsCeCIC1C2C3N1C4 
Experimental 2 CoNoCrC2CaCiC:CsNiN2C4 
Control 2 CoNeCaCaC;CiCHCNINSCa 
Experimental 4 CoNoCiC2CsCiC2CaNiNzNaNiCa 
Control 4 CoNoCsCeCiCiCaCaNiN2NaNaCa 
Experimental 5 CoNoCiC2CsCiCoCaNiNeNaNaNsCa 
Control 5 CoN oCsCeCyCiC2CaNiN2NaNaNsCa 

Note. Subscripts indicate particular items (0 = practice 


item). Abbreviation: C = consonant trigram, N = number 
trigram, 


activities. One set of groups simply 
rested, another set named a matrix of color 
patches, while a third set continued with 
the Brown-Peterson memory task, using 
three-digit numbers as the target items. 
This latter task was chosen because there 
seems to be little or no interference between 
CCC items and number trigram (NNN) 
items. Note also that the distractor task 
was color naming, not digit subtraction as 
in the original Peterson and Peterson 
(1959) experiment. The set of groups did 
not differ from each other either in the 
rate or magnitude of recovery from the 
inhibitory effect. Within the limits of the 
kinds of tasks employed—resting, color 
matrix reading, or the Brown-Peterson 
task with items of a different conceptual 
class, the passage of time alone seems to 
have been responsible for the recovery. 
The phrase “within the limits of the 
kinds of tasks employed” should be empha- 
sized, since there is evidence that the 
recovery effect is influenced by the nature 
of the interpolated activity between trial 
n and n + 1 and the nature of the target 
activity. Evidence on this topic will be 
presented in the Discussion section. 
Several possible ways of accounting for 
these results seem available. One is that 
they are due to a forgetting of the inter- 
fering material either because of mutual 
interference among these items —an “acid 
bath” (Posner & Konick, 1966) type of 
approach—or to some time-related weaken- 
ing factor or factors. Another interpreta- 
tion is a list differentiation process made 
possible through the separation of the n 
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and the »+1 trials b; 
intervening materials (Go: 
1974). 

The present experiment v 


different 
& Riley, 


signed to 


test the forgetting-of-interfe interpre- 
tation. It was assumed tha me could 
delay the forgetting of th iterfering 
items, one would also depre: rate of 
recovery of performance acro ne. The 
method used to reduce forg of the 
interfering items was simply t : a dual 


presentation of the to-be-inter: g items 
for one group and only a single ; ntation 
of the items for the other (co: group, 
This method is based simply o time- 
honored belief that repetition ii ses the 
degree of learning and hen: retards 
forgetting. 
METHOD 

Subjects. The Ss were 960 male un iduate 
students at The Ohio State Universit: > chose 
to participate in this experiment to fulfi!! . require- 


ment for their introductory psychol 
The Ss were assigned to the 10 different 


rotating order of their appearance at the | atory, 
with 96 Ss in each group. 

Material and apparatus. The target ii of the 
experiment consisted of eight CCCs with as: ciation 
values between 25% and 33% (Witmer, 1°29) and 


six three-digit NNNs. The NNNs were : »mpiled 
from a table of random numbers such that no NNN 
had two identical digits or comprised a ning- 
ful" sequence (e.g., 321). The digit 7 was not used. 
Slides of the Stroop (1935) test were employed as 
the distractor or rehearsal preventative activity? 
They consisted of 15 color names appearing in a 
3X5 matrix. These were photographed from a 
sample given by Underwood (1966, p. 540). A 
plain 3 X 5 in. index card with a hand-printed 
sample of the Stroop test was shown to each S$ 
during the instructions. A metronome was used 
to produce clicks during the presentation of the 
Stroop slides. AII slides were presented by a Kodak 
projector and timed by a 35-mm. timer. 

Procedure, An asterisk presented for 2 sec. 
served as a ready signal for the 1.5-sec. presentation 
of the CCC or NNN which followed. Next was à 
12-sec. presentation of a Stroop slide accompanied 


*The Stroop test was chosen for the rehearsal 
Preventative activity because we have, in The Ohio 
State University Laboratory, found it to be a very 
effective distractor task in the sense that Ss become 
quite involved and challenged by the task itself— 
more so than a number subtraction ta It seems 
to be highly effective for distracting Ss from re- 
hearsing the memory task. 
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by 15 be he metronome, and finally there 
appeared resentation of a question mark. 
Each tr sre, took 21.5 sec., and each trial 
was ex e same except for the target item 
presente: | the specific Stroop slide given. 

The n of the experiment contained two sets 


of grc , the experimental set and the control set. 


The erimental set experienced repetition; the 
cont set did not, Table 1 presents the basic 
out! ic of the procedure. In that table the letter 


“C stands for the consc 
letter “N” repre 


igna cular item: 


ant trigram, while the 
trigram. The subs 
with the zero referring 
(Co and No) 


periment to accustom 


ants a dig 


practice m 


i , oup 2, which is typical. 
It t vO 


practice trigrams followed by 
thr ch ona d ent CCC, For the next 
thrc { same trigrams were repeated in 
their -quence. Then, with no break in 
tem; ceived two NNNs, and finally they 
were the critical trigram C4. Its control 
has < program except that no repetition is 
given will be noted that trigrams Ci, C», and 
Ci i just before the shift to NNNs as they 
also in the experimental group. The other 
group differ in the number of NNNs inserted 
betw: i Cy and the critical item Cy. Since the 


intertrial interval was 21.5 sec, the time value 
betv. on Cy and C; can be determined by the number 
of interpolated N trials, The use of NNNs during 
the recovery period was based upon the Nield (1968) 
finding of no apparent difference between type of 
activity interpolated and the beneficial value of 
the spacing effect. In addition, dealing with 
subsequent memory load activities tended to pre- 
vent Ss from thinking about any prior items, And 
finally, the present experiment is not concerned 
with the precise form of the recovery function but 
with the comparison of different groups’ recovery. 


RESULTS 

The effect of repetition. Figure 1 shows 
the percent correct recall based on items 
correct per trigram for the experimental 
and control groups across the first six 
trials. An analysis of variance of these 
data showed the performance of the experi- 
mental group to be significantly superior to 
that of the control group on the repetition 
series of trigrams, F (1, 958) = 48.22, 
p <.01. The effect of the trial block 
(Trials 1-3 vs. Trials 4-6) was also highly 
significant, F (1, 958) = 3,002.12, p < .01, 
as was the interaction of the two main 
effects, F (1, 958) = 16548, p< .01. 
These results are essentially a replication of 
the work of Cermak (1969). 
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Figure 1. Performance of the experimental and 
control groups on the six items prior to the introduc- 
tion of the increased intertrial interval. 


It should be remarked that the usual 
decline in performance across the early 
trials does not occur at a significant level. 
Presumably this is because PI had already 
built up as a result of the practice trials, on 
which performance was not recorded. How- 
ever there seems to be a slight downward 
trend across the first three or four trials 
for the control groups, showing the usual 
nadir of performance at about Trial 4. 

The effect of temporal spacing om per- 
formance. The percent correct recall on the 
final trial as a function of the number of 
NNN slides interposed between the last 
presentation of a CCC and the critical 
trigram is shown in Figure 2. An analysis 
of variance of these data suggests that 
repetition had no effect, F (1, 950) = .285. 
‘There was no significant interaction be- 
tween the effects of time and repetition, 
F (4, 950) = 1.928. The main effect of 
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FIGURE 2. Performance on the new (critical) 
item as a function of the length of the added spacing 
interval, The duration of any intervening item 
presentation and test is 21.5 sec; 


time, however, 


was highly significant, 
F (4, 950) 


= 13.752, p < .01, 


Discussion 


In beginning this discussion, it should be 
emphasized that the purpose of the experiment 
is not concerned with the retention of a 
previously presented target item, but rather 
with the measure of the effects of the strength 
Of prior target items upon the recall of a 
different target item which is presented follow- 
ing a lengthening of the usual interval. Much 
data (Keppel & Underwood, 1962; Loess, 
1967; Wickens, 1970) point to the fact that 
performance declines across the first three or 
four trials with the same class of materials, 
Presumably it does so because the input of 
the prior items interferes in some way with 
the performance on later items. With an 
increase in intertrial interval after trials with 
à constant interval, performance improves, 
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The interference-from-prio: 
tion of the loss across tri 
that such gains would be du 
of the prior items, and he 

interference and increment i 


interpreta- 
ild suggest 
forgetting 
lecrement in 


ormance, 
This interpretation, ther ould tend 
to account for the improvem« erformance 


which is associated with an 
time between trial 7 and » 4 
forgetting of the interferin: 
present experiment was desi 


ed span of 
1 increased 
rial. The 
» increase 


the retention of the interi items by 
repeating them in the carlie: Such an 
interpretation would predic t better 
learned (and hence better ren ed) items 
would persist longer in tim: therefore 
would depress the temporal : y curve. 
The analysis of variance gave n nce of à 
difference between the experim: ind con- 
trol groups, although the tin ect was 
significant for both groups. D the lack 
of main effect or interaction, th o groups 
with two and five NNNs wer wed by 
a ! test, and they fell short of 05 level. 
Considering that there are 96 Ss i; h group, 
the failure to find significance | een the 
experiments and controls is quite vincing. 

Assuming that the repetition di: engthen 
the first three triads, and Figure idicates 
that it did, one should expect then ve better 
retained across the interpolated aci:vity and, 
by producing greater interference, uppress 
the recovery curve. Obviously this did not 
occur, and so no support for the forgetting 


of the interfering material is found 

Research on the present topic | Hasher, 
Goggin, and Riley (1973) and also by Goggin 
and Riley (1974) led the authors to a list 
differentiation interpretation of the « fect. In 
their research, they worked with word triads 
as their target items, and after presenting 
three triads all drawn from the same category, 
they (a) shifted to a new category for three 
trials; (b) continued with triads of the same 
category for three trials; or (c) shifted to a 
rest condition in which viewed pictures. 
On the seventh trial they reverted to a triad 
from the original category. Enhancement of 
performance on the seventh trial was found 
only for the rest groups, despite the fact n 
the group shifted to a new category on Trial 
showed release from PI (Wickens, 1970). , 

The interpretation they made of EL 
results was that the interposed highly differen 
activity of "rest" permitted or generated. E 
differentiation between the specific mated 
of the first three trials and the material of the 
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! even though all four items came 


sevent 

from me category. By so doing, the 
interí of the previous three items was 
redu: luring seventh-item recall. Words in 
gen even though of a different category, 
are ble to produce the necessary psycho- 
log) ú contrast for differentiation. 


position that the differentiation was due 
ply to the change in the nature of the task 


in be contradicted for several empirical 
reasons, It will be noted in the Nield (1968) 
experiment described above that the group 
thi io perform the Brown-Peterson 
task, but ; new class of materials—NNNs 
ther than CCCs—demonstrated no differ- 
r from the groups which either color named 
o te 
cond experiment, Nield (1968) 
ch the task and did or did not change the 
na of thé materials, Half of his groups 
be cir Brown-Peterson trials with CCCs 
as target items and half with NNNs. 
Ai ! trials, they were shifted to a new 
ta which consisted of simply reading a 
ma: x of numbers or of consonants. It had 


be» clearly impressed upon Ss that this shift 
wo: d occur and that they were not to be 
te'od on the recall of the matrix material. 
Obviously, for half of each group the matrix 
wa: the same material as the target items, and 
for half it was different. On the subsequent 
Brown- Peterson trial, Ss were tested with the 
class of materials of the original three triads. 
Ii the materials were the same type throughout 
the Brown-Peterson, matrix, Brown-Peterson 
sequence, no enhancement of performance on 
the final trial was formed. If the Brown- 
Peterson material and matrix material differed, 
however, performance on the final trial 
markedly facilitated. Clearly, simply changing 
the task from a memorial one to a simple 
processing one does not achieve the psy! 
logical contrast effect which is nec 
list differentiation. It is apparent that, in 
our present state of knowledge, it is difficult 
to predict what manipulation will produce 
differentiation (if that is the basis for the 
improvement); although we seem to know 
that the employment of the same general class 
of material throughout, even though in a 
different manner, will prevent it. 

An example of the difficulty in predicting 
when differentiation will make itself felt is 
found in the results of the present experiment. 
Underwood (1969) has presented a consider- 
able body of evidence to the effect that 
frequency is a significant attribute of memory, 
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and Ss are highly capable of responding to 
differences in frequency of presentation in- 
dependent of response strength itself. Our 
experimental group received two presentations 
of the three pre-interpolated-task triads, and 
the post-interpolated triad received only one. 
The ratio of 2:1 should have established a 
differentiation between the first set of triads 
and the test triad. This frequency differential 
would not have been available to the control 
group. Hence, one should predict that the 
experimental group would excel the control 
group, but it did not do so. 

In conclusion it would appear that the for- 
getting-of-interfering-materials concept has no 
support from the data of this experiment; and 
that a differentiation interpretation, even 
though it is not without problems, may be 
somewhat more promising. 
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The nature of facilitation and interference effects was examined within the 


Brown-Peterson paradigm. 


Subjects were first given a block of three tests on 


items from a single taxonomic category, Three variables dictated the sequence 


of events thereaft 
f items and Blo 


rent ite fferent category 
juent block of inte 


hin the delayed prese 


a 


ı memory task or with rest. 


a) the similarity relationship between a subsequent block 
1 items, i.e., same category-same items, same category- 
; (b) the time interval between Block 1 


, immediate vs. delayed presentation; 


on conditions, whether the interval was 


Recall was best when items were 


ted, worst with different items from the same category, and intermediate 


) the category changed. 


A rest delay improved recall in all conditions 


relative both to no delay and to a delay filled with work. A differentiation 
»othesis was used to account for these results. 


Re Hasher, Goggin, and Riley 
(197 onstrated that both facilitative 
and ory effects occurring within the 
Brow eterson. paradigm (Brown, 1958; 
Peter & Peterson, 1959) persist over 


intervals that would normally be considered 
outs. i:e the range of short-term memory 
(STM). In that experiment, Ss were first 
tested with three triads of words from one 
taxonomic category, then three triads from 
another category, and finally three more 
triads from the first taxonomic category. 
The time intervening between the first and 
the third sets was approximately 90 sec. 
When the third set of triads was identical 
to the first set, facilitation over an appro- 
priate control was demonstrated, thus 
showing a memorial effect for specific items. 
When, on the other hand, the third set of 
triads was from the same category as the 
first but was comprised of different words, 
inhibition equal to a control tested on the 
same category throughout was demon- 
strated. Controls made it clear that the 
inhibition was category specific. 

Previous research (e.g., Cermak, 1969; 
Hebb, 1961) has shown the facilitative 
effects of repetition over rather long inter- 
vals in an STM paradigm, but studies in- 
vestigating the persistence of inhibition 


1 This study was supported by National Science 
Foundation Grant GB-36210 to the first author. 

2 Requests for reprints should be sent to Judith 
Goggin, Department of Psychology, University of 
"Texas at El Paso, El Paso, Texas 79968. 


have shown considerable loss of inhibitory 
effects. That is, for comparable time inter- 
vals, facilitation persists, but inhibition is 
lost. Some studies (e.g, Hopkins, Ed- 
wards, & Cook, 1972; Kincaid & Wickens, 
1970) have found a moderate amount of 
recovery (approximately 62% and 48%, 
respectively) from the build-up of proactive 
interference (PI) after a 90-sec. interval, 
and Cermak (1970) found almost complete 
recovery following a 66-sec. rest interval. 
In the Hasher et al. (1973) study, on the 
other hand, such recovery was not found. 
We speculated that the critical difference 

between our study and previous ones lay 
in the nature of the interpolated material 
and its effects on S's ability to differentiate 
clearly between original learning and later 
learning. Only in Hasher et al. (1973) were 
the interpolated material and task identical 
in nature to those of original and final 
learning. ‘The Ss were required to recall 
items different only in that they came from 
a different taxonomic category. ‘The other 
experiments used different kinds of activi- 
ties in the intertest interval, such as symbol 
cancellation, Stroop color naming, and rest, 
but in no case were Ss engaged in a memory 
task of the same nature. To determine 
whether or not the differences between 
these two classes of experiments are related 
to the nature of the intervening activity, 
it is necessary to include within the same 
design conditions that have a memory task 
as interpolated activity and conditions that 
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TABLE 1 
EXPERIMENTAL CONDITIONS 
Block 
Condition 
1 2 3 
(Tests 1-3) (Tests 4-6) (Tests 7-9) 
1 [o rest Cu 
2 Cı C: Cis 
3 Cı Cis rest 
4 Ci rest Cu 
5 Ci C: Cia 
6 €i Cua Cu 
7 Gi rest C: 
.8 Cı C: C: 


Note. Abbreviations: C = taxonomic category, s = repeti- 
tion of the same items, d = use of different items from the same 
category. 


have some other sort of nonmemorial filler 
task. This was the purpose of the present 
experiment. 


METHOD 


Design. Outlined in Table 1 are the eight experi- 
mental conditions, each one of which was divided 
into three temporally equivalent blocks of learning 
or rest activity. A block of learning activity con- 
sisted of three Brown-Peterson tests; all to-be- 
remembered items in any one block belonged to 
the same taxonomic category (C). Numerical sub- 
scripts represent category membership. Thus, 
Condition 8 received a different category in each of 
the three blocks (C;, Cs, C3), while all items belonged 
to the same category in Condition 6 (Ci, Ci, C). 
When there was category repetition across blocks, 
letter subscripts designate the degree of item 
similarity. Repetition of the same items is repre- 
sented by the letter "s"; the use of different items 
from the same category is represented by the letter 
"d." This distinction can be illustrated by an 
examination of Conditions 1 and 4. If the words on 
the first test of Block 1 had been, for example, 
MUSKRAT, WOLF, ELK, these same words would 
again appear on the first test of Block 3 in Condition 
1, while Goat, MULE, CAMEL might appear in the 
corresponding position in Condition 4. 

All conditions were treated alike in the first block, 
receiving three STM tests on items from a single 
category. Dictating the sequence of events there- 
after were three main variables: (a) the similarity 
relationship between a subsequent block of items 
and Block 1 items, i.e, same category-same items 
(Same), same category-different items (Different), 
or different category (Release); (b) the time interval 
between Block 1 and a subsequent block of interest, 
i immediate or delayed presentation; and (c) 
within the delayed presentation conditions, whether 
the interval was a work interval filled with a memory 
task or a rest interval in which Ss looked at scenic 

slides. Consider, for example, the same c: 


ategory— 
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same item relation. Both 
received Cy, but Ci, was te 
Condition 1, whereas it was | 
Condition 3. Comparison of 
can also be made between ( 
which were identical except th 
interval in the former and wa 
new category in the latter. Pai 
same category-different items 
tions 4, 5, and 6. Cia was test 
Condition 6, after a rest delay 
after a work delay in Condit 
formance on a new category c: 
no delay (Conditions 2 and 5 
(Condition 7), and after a wor! 
8). 

Materials. Foods, body part 
the three taxonomic categories | 
were chosen. The categories « ed equally 
often within each condition and red equally 
often in each of the three block ial, Each 
category was composed of 36 wo: vided arbi- 
trarily into 12 triads. Assignment e 36 triads 
to conditions and test positions wi "onditions 
could not be equalized, but was r nably well 
balanced. The 3 words in each tri: e arranged 
diagonally across a slide, 


ons 1 and 3 
ter a delay in 
mmediately in 
mance on Cy 
ms 1 and 2, 
clay was a rest 
vith tests on a 
mparisons on. 
ible in Condi- 
r no delay in 
dition 4, and 
Finally, per- 
ssessed after 
1 rest delay 
(Condition 


rimals were 
hich stimuli 


Procedure. Subjects were first 1 practice 
trials to familiarize them with the Practice 
trials were identical to regular test ls except 
that symbols, instead of words, wer: to-be-re- 
membered items. Each § then re three 90- 
sec. blocks of learning or rest activit». A block of 
learning activity consisted of three B n Peterson. 
tests, each of which ed 30 sec. sequence 
of events during this 30-sec. period was as follows: 


(a) a ready signal for 2.5 sec.; (b word triad 
presented for 1.5 sec., during which time S read 


the words aloud; (c) a 15-sec. presentation of a three 
digit number from which S was ructed to count 
backward by threes as fast as possible; and (d) & 


question mark displayed for 11 sec., which was S's 
signal to recall the word triad, in order if possible: 
If another test was to follow, it began immediately. 

During blocks of rest, Ss wei 
slides. These slides were pre: 
time intervals GEA TS 1 x 
Would have been used had the block been filled 
with STM tests. The Ss were instructed simply t9. 
relax and to view the pictures. If a block of learning 
activity followed.a block of rest, Ss were forewarned 
shortly prior to the appearance of the ready signal. 
Slides were presented by a Carousel projector, with 
slide duration regulated by a Gerbrands tape 
timer. 5 

Subjects. A total of 384 Ss was used in the experi- 
ment, with 48 in each condition. The Ss were 
students at the University of Texas at El Paso who 
participated in the study to receive bonus points In 
introductory psychology. The Ss were randomly 
assigned to conditions in blocks of 8. Thirteen 58 
were discarded for failure to follow instructions of 
equipment malfunction. 


nted for the same 
and 11 sec.) that 


shown 12 scenic a 
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Figure 1. Mean correct recall a 
breviations: | = immediate presen 
delay, Diff — different.) 


RESULTS 


\n item was considered correct if it was 
given, whether or not it was placed in cor- 
rect serial position. Mean correct recall is 
presented in Figure 1. The figure is divided 
into three panels that correspond to the 
three blocks of trials. All conditions were 
treated alike in Block 1 and, consequently, 
are shown as a single curve. Those condi- 
tions treated alike in both Blocks 1 and 2 
were similarly collapsed in Block 2 for 
simplicity of exposition. None of these 
conditions so treated differed significantly 
from each other. 

Replication of basic phenomena. High re- 
call on Test 1, followed by a drop in reten- 
tion with succeeding tests on similar 
stimuli, is characteristic of the Brown- 
Peterson paradigm. The rapid drop in 
recall, which has been attributed to increas- 
ing PI across tests, is evident in Panel 1. 
This main effect of tests was significant, 
F (2, 752) = 140.93, p < .001; no other 
effects were. 

Effects of block similarity. 
and third blocks of work activity, there 
were three types of items defined by their 
similarity relationship to the Block 1 items: 


In the second 


LI 


| (Diff) Í (5) * 
Y. 9 
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Control ~a AA "x 


PASS 


5 6 8 


Tests 


s a function of condition and test. (Ab- 
tation, RD = rest delay, WD = work 


Items were drawn from the same category 
(Different-Control), were drawn from a 
new category (Release), or were repetitions 
of items that had appeared in Block 1 
(Same). The effect of the similarity vari- 
able was examined on Test 4, across Block 
2, and across Block 3. 
On Test 4, the variations in item simi- 
larity had a strong and reliable effect on the 
level of recall, F (2, 141) = 16.52, p < .01. 
After PI has built up within a category, à 
change to a new category produces release 
from interference as evidenced by a marked 
improvement in recall (cf. Wickens, 1970). 
Replication of this phenomenon can be seen 
by comparing Release and Control condi- 
tions; a planned comparison showed that 
this difference was significant, F (4, 141) 
z1248/ p «300. Test 4 analyses also 
showed that repetition facilitated recall 
since Same Ss performed better than Re- 
lease Ss, F (1, 141) = 4.88, p < .05. 
Performance across the three tests of 
Block 2 is displayed in Panel 2. The gen- 
eral trend of recall by the Release condition 
was similar to that found in Block 1. On 
the first test with the new category» recall 
was high; on succeeding tests with similar 
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FIGURE 2. Mean correct recall as a function of 
condition and intertest interval. (Abbreviations: 
I = immediate presentation, RD = rest delay, 
Diff = different.) 


materials, recall decreased, presumably be- 
cause of cumulating interference. Thus, 
Control and Release Ss, whose recall was 
quite discrepant on Test 4, were performing 
at similar levels on Tests 5 and 6. Per- 
formance in the Same condition showed the 
benefit of item repetition, with recall being 
maintained at a high level throughout the 
three tests. Analysis of Block 2 showed 
that both conditions, F (2, 141) = 25.75, 
p < .001, and tests, F (2, 282) = 6.60, 
p < .01, produced reliable differences. In 
addition, the interaction between condi- 
tions and tests was significant, F (4, 282) 
=2.78, p < 05. 

The effects of item similarity are some- 
what more difficult to see in Block 3 be- 
cause there were two groups for each 
similarity condition. The two groups are 
distinguished on the basis of whether Block 
2 consisted of a rest interval or a work 
interval. Nevertheless, the general picture 
presented in Panel 3 is similar to that in 
Panel 2. The differences among similarity 
conditions occurred again and were signifi- 
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cant, F (2, 282) = 54.96, p ‘01, Over- 
all, Same Ss showed the ! t of item 
repetition; Different Ss sl the de- 
pressing effect of cumul: PI; and 
Release Ss showed the fa ion that 
accompanies a category Recall 
decreased over tests in the bl * (2, 564) 
= 25.88, p « .001. 5 n effect, 
however, must be interpret onjunc- 
tion with the reliable Cond X Tests 
interaction, F (4, 564) — b «05 
Recall decreased more rapi er tests 
in the Same and Release co 1s than 
in the Different condition, < ntly re- 
flecting some lower limit on in the 
last condition. i 
In summary, variations in it: imilarity 
between blocks of tests produced sirong and 
consistent differences in recall ese dif- 
ferences were similar to thos: und by 
Hasher et al. (1973). After P d built 
up within a taxonomic categor all was 
depressed if there was contin: testing 
on that category, or even if the: s later 
return to that category after an i: -rval of 
time. If there was a change in coding 
category (Release), on the other hand, re- 
call improved on the first test wit! he new 
items and then decreased as m items 
were presented in that category “nally, 
repetition of items (Same) produced an 
improvement in recall in comparison to the 
release conditions, indicating that specific 
items were retained in this paradigm. 
Effects of intertest interval. It has been 


pointed out that triads used in Blocks 2 
and 3 were related to Block 1 triads in 
three ways, i.e., items were repeated, came 
from a new category, or were different items 
from the same category. One question of 
interest was whether performance on these 
later tests would be affected by the time 
interval separating them from Block 1 tests. 
This can be determined by comparing the 
level of recall for Same, Release, and Differ- 
ent conditions tested immediately with 
that after rest delay. Figure 2 shows this 
comparison. 

Recall was better after a rest interval 
than after no delay on the first test of the 
block, F (1, 282) = 7.42, p < .01. Al 
though the individual similarity conditions 
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TABLE 2 
FREQUENCY AND SOURCES OF INTRUSIONS 


Condition 
sions E s 
pss 2 | 3 4 $ 6 7 8 
jock | 38.0 380 | 45.0 43.5 43.7 51.3 39.5 31.7 
ition from each source | 
alist 93 88 | .92 85 .89 .88 71 E 
1.00 96 1.00 | 1.00 93 1.00 1.00 .88 
79 77 | 72 | 61 .68 .63 .80 18 
14 16 | Bea TMA 45 AT 20 19 
07 06 wa n06 AT 20 0.0 04 


dia ally reliable effect, 
the effect in each of the 
thre: plus the absence of any 
reliai ion (F = .02) makes it clear 
that iacilitated recall in each con- 
ditio: ror recall after a delay has 
been Í by others under similar con- 
ditior Different Ss (Cermak, 1970; 
Kinca:: & Wickens, 1970) and, in one case, 
for bo: Different and Release Ss (Hopkins 
etal., 1972). 


Delay did not have a reliable effect when 
the block was considered in its entirety. 
'The interaction between delay and tests 
was, however, reliable, (2, 564) = 3.06, 
p< 05. As Figure 2 shows, the facilita- 
tion by delay was maintained across tests 
in the Release condition, but not in the 
other two. 

Effects of interval filler. 
pose of the present experiment was to de- 
termine whether the nature of the material 
interpolated between the original learning 
task (Block 1) and the final learning task 
(Block 3) would affect the level of recall in 
the third block of tests. These data are dis- 
played in Panel 3 of Figure 1. 

Condition 6 was not included in the over- 
all analyses of interval filler, but it was 
compared to Condition 5 in a separate 
analysis. These groups did not differ 
statistically in Block 3 (F < 1), and their 
overall performance was strikingly similar. 

ing to an old category with 


Thus, Ss returnt 
new items after an intervening different 


category (Condition 5) performed the same 
as Ss maintained throughout on the old 
category (Condition 6), i.e., inhibition did 
not dissipate with time even though release 


The main pur- 


on had occurred in Block 


from this inhibiti 
Hasher 


2. Similar results were found in 
et al. (1973). 

Overall analyses of fi 
conducted both on the first test of the block 
and on the block as a whole. On Test 7, 
recall was higher when Block 2 was filled 
with rest than when filled with work, 
F (1, 282) = 14.68, p < 01. Preplanned 
comparisons showed a significant effect of 
filler activity for both Different and Release 
Ss, Fs (1, 282) = 8.70, ps < .01, but for 
Same Ss, this effect, while in the same direc- 
tion, was not significant (F <1). The 
interaction between condition and filler 
activity was not reliable. 

Examination of Panel 3 shows that these 
effects persisted throughout Block 3. Re- 
call was higher when a rest interval occurred 
between the first and third blocks than 
when the interval was filled with tests on à 
different category, 1. 1,7282) = 7.82, 
p «t. 

Despite this pattern of results, 
action between condition and 
not reach significance, making cor 
about the differential effect of filler on 


similarity conditions ambiguous. Again, 


preplanned comparisons showed that recall 
higher than after 


after a rest interval was 
a work interval for Different and Release 
conditions, Fs (1, 282) — 19.46 and 18.52, 
ps < .01, respectively, but a difference in 
the Same condition was not found (F < 1). 
Error analyses. A breakdown of the in- 
correct responses is shown in Table 2. 
The measure of errors/block was used be- 
cause conditions differed in the number of 
blocks of learning activity. The most 


ller activity were 


the inter- 
filler did 
nelusions 
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intrusions were made in Condition 6, in 
which all items were different but belonged 
to the same category ; the fewest intrusions 
occurred in Condition 8, in which a differ- 
ent category of items was learned in each 


block. This would be expected if number 
of intrusions reflected the amount of 
interference. 


Most of the errors came from within the 
list; most of these, and in some cases all, 
were intrusions from an appropriate cate- 
gory. ‘Thus, Ss appeared to be able to 
monitor their recall and to restrict their 
output to the correct taxonomic category. 
By far the largest percentage of intralist 
errors, 72%, originated from the immedi- 
ately preceding test trial. Frequency of 
intrusions from earlier test trials was not 
high, although in three groups some errors 
were Block 1 intrusions into Block 3 recall. 
These percentages were 14%, 15%, and 
6% for Conditions 4, 5, and 6, respectively. 
It appears that even after 90 sec. of rest 
or other learning activity, Block 1 items 
are available to occur as intrusions if they 
are of the appropriate category. Frequency 
of this kind of error in the other conditions 
was almost nonexistent. 


Discussion 


The present study was addressed to three 
issues arising from differences in the results ob- 
tained by Hasher et al. (1973) and previous in- 
vestigators. First, of the various studies that 
have examined the persistence of proactive in- 
terference in the Brown-Peterson paradigm 
over time, only Hasher et al. found no reduc- 
tion in interference. That study, however, did 
not have an appropriate nonwork condition, 
Consequently, one cannot be sure that loss of 
inhibition would have been found under con- 
ditions comparable to other experiments. This 
study provides such a comparison. Second, 
the present study allows a direct comparison 
between a retention interval filled with rest as 
opposed to further rehearsal of words from a 
different category and further rehearsal of 
words from the same category. This com- 
parison is necessary in order to make a direct 
connection between the presence or absence of 
persisting inhibition and the nature of the 
filler activity. Third, this experiment, by 
extending the question of the effects of the 

interpolated activity to release and repetition 
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of original learning conditi 
of the PI decay hypothesis 

Consider first the effect me. 
Results section, these effect 
both on the first trial and o: iree trials of 
the relevant blocks. While ter measure 
may be of interest, it is o: e first-trial 
data that give an unambigu: isure of the 


lows a test 


In the 
re analyzed 


effect of previous experim operations. 
Consequently, our emphasi will be on 
this measure. Figure 2 show t our Differ- 
ent-Delayed condition was superior 
to the Different-Immediate « on. Thus, 


we replicated the results of 
menters showing that PI dissi; 
This facilitative effect of dela 
and to the same degree, in t 
Same conditions. 


us experi- 
with time. 

occurred, 
¿lease and 


The findings of Hasher et a! produced 
in this study in that there wa ich dissi- 
pation of PI in the Different « ion if the 
interval was filled with tests : ms from 
another category. Filling the val also 


on; while 
dition, it 


depressed recall in the Release c 
this effect appeared in the Sam: 


was less reliable. It is of intere » observe 
that the facilitative effects on r« of time 
and those associated with a chang: "ategory 
are additive. That is, the superiori: ~ of recall 
following a rest interval as oppost a work 
interval was as great in the Release condition 
as it was in the Different condition. ‘The 
absence of an interaction between category 


change and intertest interval can also be seen 
in Hopkins et al. (1972). 

It has been suggested (e.g 
Wickens, 1970) that PI decreases 
of intertest interval because the 
dependent decay of the encoding ch: 
responsible for PI. Several aspects of the 
present data argue against such an interpreta- 
tion. Presumably, a change in the type of 
items increases recall because there is a release 
from the interference that has accrued for the 
original encoding category (Wickens, 1970). 
If this is the case, a rest interval should be of 
no particular benefit when it is followed by @ 
category change. That is, since the items 
following a change in category are not encoded 
in the same way as the original items, decay of 
the encoding characteristics of the original 
items associated with a long intertest interval 
should be irrelevant. A rest interval in the 
Present study, however, facilitated recall just 
as much when there was a category change 
(Release) as when there was no category 
change (Different). 


Cincaid & 
function 
s a time- 
acteristics 


INTENANCE OF INTERFERENCE IN SHORT-TERM MEMORY 


There i ier feature of these data that 
weighs a time-dependent decay inter- 
pretatic effects of delay. One set of 
Ss (Sa: eived a later repetition of the 
origina! k of items; in this case, recall also 
improv^.! as a function of interblock interval. 

E It se somewhat unreasonable to suppose 
that :etention of items on their second presen- 
tation was facilitated because there was 
more forgetting of the items from their first 


present 


en established by this 
of other investigators: 


ing a rest interval; and 


(b) Is nc è I d 


) when the interval is filled 
pat iifferent taxonomic category. 
The | fact, however, that must be 
add ler to complete the picture. 
Seve iments (Cermak, 1969; Nield, 
1968 s & Gittis, 1974) have demon- 
strat after PI has built up in a Brown- 
Peter jesign with consonant trigrams as 
stimul:. materials, filling the intertest interval 
with «ts on number triads does not retard the 
reduciion of PI. That is, with CCC stimuli, 
there is no apparent difference in performance 
on the next retention test whether the interval 
is untilled or whether it is filled with tests on 
NN? 


One explanation of these three facts must be 
ruled out. The task used as an interblock 
interval filler cannot be responsible for the 
dissipation or the maintenance of PI in the 
studies under consideration. Although Nield’s 
(1968) results, for example, showed dissipation 
of PI over a filled interval and the present 
study did not, both procedures were ones in 
which S engaged in a Brown-Peterson learning 
task during the interval. 

An alternative hypothesis, more consistent 
with the present data, is that the amount of 
PI depends on the differentiation that has 
occurred during the intertest interval. Two 
types of differentiation can be distinguished, 
the first being dependent on time, the second, 
on item similarity. An unfilled interval allows 
facilitation of recall, not because of a weakening 
of Block 1 items, but because the time gap 
allows them to be dis netively identified as 
coming from a different part of the experiment. 
The pool of items from which Ss must retrieve 
the correct items during Block 3 will tend to be 
restricted to those encountered recently. Pro- 
active inhibition is thus reduced because the 
Block 3 items have a new, unique encoding cue 
to distinguish them from Block 1 items. If, 
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on the other hand, the interblock interval con- 
tains more of the same memory task with 
similar material, temporal differentiation cues 
are eliminated. Even though Block 3 is 
separated from Block 1 by the same nominal 
time whether the interval is filled with work 
or rest, the functional time is different. 

Why should materials that are sufficiently 
different from Block 1 items to allow release 
from PI (i.e., items from a different taxonomic 
category) produce such an effect? Perhaps 
as S tries to recall Block 2 items, he must 
repeatedly examine and reject erroneous in- 
trusions from Block 1. This activity may in 
turn bridge the temporal gap between Blocks 
land 3. When, however, letters are the to-be- 
remembered items and numbers are learned 
during the interval (e.g., Wickens & Gittis, 
1974), the items are sufficiently different in 
kind that letters do not intrude while S is try- 
ing to recall numbers. In these conditions, 
Block 1 items do not intrude and need not be 
rejected during Block 2, temporal differentia- 
tion between Blocks 1 and 3 occurs, and loss 
of PI in Block 3 is observed. The greater 
release of PI from Block 1 to Block 2 with 
letters and numbers (Wickens & Gittis, 1974) 
than with different taxonomic categories as 
in the present study is consistent with this 
interpretation. 
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\CTION OF POSITIVE AND NEGATIVE LABELS WITH 
ORY COMPOSITION IN ATTRIBUTE IDENTIFICATION 
CONCEPT PERFORMANCE? 


THOMAS C. TOPPINO AND PEDER J. JOHNSON? 


University of New Mexico 


The effect of 


l fect of ca 


label (example or nonexample) was separated from the 
mposition (the stimuli contained in a conceptual category) 
s of complementary concepts and requiring S to solve two 
with only examples or only nonexamples. It was pre- 
- positive label (example) would facilitate performance in cate- 
sition conditions characterized by a high relative frequency of 
timulus values, while the negative label (nonexample) would facili- 


;erformance in category composition conditions characterized by a low 


e frequency of relevant stimulus values. 
nfirmed the experimental prediction. 
response label had disappeared, leavir 


Results on the initial problem 
By the second problem, the effects of 


ng only a category composition effect. 


(he experimental hypotheses were discussed in relation to an earlier hypothesis 


based on preexperimental experience. 


A long-standing generalization in the 
concept literature is that instances from 
the positive category (examples) lead to 
more rapid solution than instances from 
the negative category (nonexamples). This 
generalization gained considerable support 
- — from early attribute identification (AI) con- 
cept experiments (rule known, relevant 
stimulus values unknown) where S was re- 
quired to solve a given concept by using 
instances from only the positive or only 
the negative category (Freibergs & Tulv- 
| ing, 1961; Hovland & Weiss, 1953). How- 
ever, recent studies (Davidson, 1969; Top- 
pino & Johnson, 1973) have shown that 
the earlier experiments confounded two 
potential sources of solution difficulty: 
category composition (the stimulus types 
composing the category, ie, TT, TF, FT, 
and/or FF), and response label (whether 
| the category is called positive or negative). 

T ILE 
is research was supported by National Insti- 
f Child Health and Human Development 
C 11D-03390-06 awarded to the second author. 
¿thors wish to thank Brian Euler for assisting 
se collection of the data. 
2 Requests for reprints should be sent to Peder 
J. Johnson, Department of Psychology, University 
of New Mexico, Albuquerque, New Mexico 87131. 

3 Any stimulus population can be partitioned into 
four stimulus types (TT, TF, FT, and FF) depen- 
ding upon the joint presence (T) and/or absence (F) 


of two relevant values (Haygood & Bourne, 1965). 


In short, the positive and negative cate- 
gories for any given concept differ in two 
respects: They contain different members 
of the stimulus population, and they are 
assigned different valenced labels. Thus, 
differences in performance associated with 
positive and negative instances of a given 
concept could be due to either or both of 
these factors. 

The independent effects of category com- 
position and response label can be experi- 
mentally isolated by using a pair of com- 
plementary concepts for which the positive 
category of each concept has the same 
composition as the negative category of its 
complement (Davidson, 1969). When the 
effects of category composition and re- 
sponse label are disentangled in this manner, 
results indicate that a positive label does 
not necessarily facilitate performance, and 
that solution difficulty depends primarily 
upon category composition factors (David- 
son, 1969; Toppino & Johnson, 1973). 
However, response label is not completely 
without effect. In each experiment, there 
was at least one condition in which positive 
superiority was obtained independently of 
category composition. These findings sug- 
gest a weak form of the early positive 
superiority generalization, viz., while re- 
sponse label does not always influence per- 
formance, when it does, solution is easier 
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with a positive label. The purpose of the 
present experiment was to investigate the 
Dossibility that, under certain category 
composition conditions, a negative response 
label may result in superior AI performance. 

Toppino and Johnson (1973, Experiment 
1) employed two pairs of complementary 
concepts (inclusive disjunction/joint denial 
and biconditional/exclusive disjunction) re- 
sulting in a positive and a negative label 
condition within each of four category com- 
position conditions [(TT, TF, FT), (FF), 
(TT, FF), and (TF, FT)]. The results 
indicated that a positive label yielded a 
clear advantage only when category com- 
position had a high relative frequency of 
relevant values [ie (TT, TF, FT) J 
Furthermore, the advantage of a positive 
label relative to a negative label, appeared 
to decrease as the relative frequency of 
relevant values decreased across category 
composition conditions [i.e., (TT, TF, FT) 
> (TF, FT) > (TT, FF)]. 

The above findings suggest that a posi- 
tive label somehow facilitates or encourages 
the use of relative frequency cues. Since 
the label “example” seems to connote 
Presence of something, a positive label may 
encourage S to use an affirmational Strategy 
whereby stimulus values that are present 
in the training instances become the basis 
of S's pool of hypothes h a strategy 
would facilitate perform: the extent 
that the relevant values with a high 
relative frequency. Howe negative 
label may have the opposite function. 
Since the label "nonexample" seems to 
connote the absence of something, a nega- 
tive label may encourage S to use a nega- 
tional strategy whereby stimulus values that 
are present in the training instances are 
excluded from S's pool of hypotheses. This 


* For a stimulus population de 


ned on four three- 
valued dimensi 


, there are 9 TT instances, 18 TF 
instances, 18 instances and 36 FF instances, 
Thus, the (T , TF, FT) category contains 45 
stimuli and each relevant value appears on 27 
stimuli, "Therefore, the relative frequency of relevant 
values for the (TT, TF, FT) condition is .60. By 
similar computations, it can be shown that the 
relative frequency of relevant values is .50 for the 
(TF, FT) condition, .20 for (TT, FF) condition, 
and .00 for the (FF) condition. 
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would predict that a negativ. ‘bel should 
lead to superior performan: the (FE) 
category composition conditi vhere the 


relevant values are never pri d. 


While Toppino and John 1973, Ex- 
periment I) found virtually lifference 
between the positive and : ive (FE) 
conditions, this finding may bh ributable 
to their use of distributed stimuli, 
These stimuli were generated 1 12-cell 
grid (four columns and th: ws) by 
illuminating 1 cell (value) i: i of the 
four columns (dimensions). T) the non- 
presented stimulus values v always 
present and visible but simp! nillumi- 
nated. This characteris ! stimuli 
may have reduced the difficult: solution 
in Toppino and Johnson's (197- Experi- 
ment I) positive (FF) conditi. ecause, 
if S attended to the unlit cells re grid, 
the (FF) problem could be solve! a con- 
junctive (only the relevant vais were 
never illuminated) by using an afi “ational 


rat- 
»priate 


strategy, However, an affirmatio 
egy would have been totally inap: 


if conventional geometric stimuli hod been 
employed because, with geometric stimuli, 
nonpresented values would |! truly 
absent. 

The present experiment employed inclu- 


sive disjunctive concepts and their « omple- 
mentary joint denial concepts in order to 
generate two conditions of category com- 
position [ (TT, TF, FT) and (FF) crossed 
with two conditions of response label 
("example" and "nonexample"). 1n addi- 
tion, conventional geometric stimuli were 
employed instead of distributed grid stimuli. 
It was predicted that the use of these 
stimuli would substantially increase the 
difficulty of solution in the positive (FF) 
condition. The result should be an interac- 
tion between category composition and re- 
sponse label such that the positive label 
would yield superior performance in the 
(TT, TF,FT) conditions, while the negative 
label would yield superior performance 1n 
the (FF) conditions. Finally, each S was 
presented two successive problems to solve 
in order to investigate the stability of the 
interaction. between category composition 
and response label. 


METHOD 


d Subjects were 50 introductory 
idents who earned extra class credit 


psych 


for « ental participation. Two were dis- 
mise è to equipment malfunction, The remain- 
ing 5« vere randomly assigned by blocks of 4 to the 
four ^xperimentál conditions, generated by requiring 


S c use only examples or only nonexamples to solve 
citer an inclusive disjunctive problem or a joint 
In addition, each S was required 
p in his particular 


denial problem 
to solve two succ 
experir tal 


5 timulus population 
valued dimensions resulting 

MI Ss were told that the 
would vary along the dimensions 
quare, or circle), arrow orienta- 


0 r 
ti ing up, sideward, or down), dot position 
f t middle, or bottom), and border (dotted, 
da olid). 

‘vali were mounted on 35-mm. slides and 
we rojected onto a translucent screen by 
me Kodak Carousel 750 projector. Two 
res ush buttons labeled “example” and “non- 
exa were located on a box situated in front 
of 5 «nd directly below the screen. 

l ce pairs of relevant values (solid border and 
do: in the middle, triangle and arrow pointing up, 


and dot at the bottom) were cted at 
dom from among the 54 possible pairs. Each 
selected. pair was relevant for one of the three 
problems which were prepared for each condition. 
‘Two of the three problems were presented to each 
S and the problems were signed so that, in every 
condition, each of the three problems was presented 
to one third of the Ss as the first problem and to 
another one third of the Ss as the second problem, 
Sequencing restrictions u d in the construction of 
these problems is described in detail in Experiment I 
of Toppino and Johnson (1973). 

Procedure. ‘The procedures of the present experi- 
ment were exactly the same as those detailed in 
Toppino and Johnson's Experiment | (1973). 
Briefly, S was given extensive rule training on his 
particular rule, followed by a test of rule requisition 
to insure that he had achieved a maximally high 
level of rule learning prior to AI training. When 5 
had correctly classified eight consecutive stimuli in 
the rule acquisition test, the nature of the AL task 
was thoroughly explained. A study-test procedure 
was employed, in which blocks of five training trials 
containing all positive or all negative stimuli were 
alternated with blocks of four test trials (without 
feedback) containing both positive and negative 
instances. Criterion was 12 consecutive correct re- 
sponses accumulated over three or four test blocks. 


RESULTS 
Analyses of trials and error data yielded 
virtually identical results. Therefore, only 
trials data are reported. 


/TERACTION OF CATEGORY COMPOSITION AND INSTANCES 
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FicuRE |. Mean test trials to criterion in terms of 
theantilogs of the means of loge (x + 1) minus one. 


Test trials to criterion on the Al task 
were transformed into x’ = loge (x + 1) in 
order to approximate homogeneity of vari- 
ance and symmetry of distributions. The 
data are presented in Figure 1 in terms of 
the geometric means of (test trials to cri- 
terion +1) — 1 [i.e., antilogs of the means 
of log.(« + 1) — 1]. 

Inspection of the data in Figure 1 sug- 
gests that the predicted two-way interac- 
tion between category composition and re- 
sponse label was obtained on Problem 1 
but not on Problem 2, implying a three- 
way interaction, An overall analysis of 
variance (category composition, response 
label, pairs of relevant values, and prob- 
lems) indicated that the overall two-way 
interaction between category composition 
and response label was marginally signifi- 
cant, F (1, 24) = 3.45, p < .10, and that 
the three-way interaction with problems 
approached significance, F (1, 24) = 2.81, 
p = 11, However, a more precise test of 
the experimental predictions. Was afforded 
by a separate analysis of variance for each 
problem. 

On the initial problem, the predicted in- 
teraction was significant, F (1, 36) = 5,30, 
p <.05. The positive label yielded su- 
perior performance in the (TT, TF, FT) 
condition, while the negative label yielded 
superior performance in the (FF) condi- 
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tion. Neither category composition nor 
response label yielded a reliable main effect, 
both Fs (1, 36) « 1.19, p > .20. 

Performance on the second problem in- 
dicated that practice eliminated the Re- 
sponse Label X Category Composition in- 
teraction, F (1, 36) = .82, p > .20, leaving 
only a main effect of category composition. 
The (FF) condition was solved more easily 
than the (TT, TF, FT) condition, F (1, 36) 
-T,p«.0. 


DISCUSSION 


The most important aspect of the present 
results was the interaction which occurred be- 
tween response label and category composition 
on Problem 1. The nature of this interaction 
indicated that a positive label facilitates pro- 
cessing of (TT, TF, FT) instances, while a 
negative label facilitates processing of (FF) 
instances. 

The hypothesis that predicted the obtained 
interaction represents an extension of an earlier 
hypothesis proposed by Bruner, Goodnow, and 
Austin (1956). According to these authors, 
preexperimental experience with conjunctive 
and affirmational concepts leads S to base 
solution attempts on stimulus elements which 
are common to positive instances (ie. an 
affirmational strategy). The present hypothe- 
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sis expands Bruner et al.'s notio 
that this same preexperimenta! 

S to eliminate stimulus elemen: 
in negative instances (i.e, a nt 
egy). Finally, if the effect of re 
based on preexperimental expe: 
be expected that the effect woul 
practice as observed in the pres: 
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for Alcohol Related Studies, University of Oklahoma Health Sciences Center 


<<...” 


Center 
Ear-response side and ear-hand components of lateral stimulus-response 
v were examined as a function of information processing 


(S-R) comp: 


) experime 
two stim 


»oice reaction time (RT) paradigms. 


nts. Experiment I investigated the compatibility 
i-one response) and Donders’ b (two stimuli 


A second experiment 


eflects ina Donders' a (simple) RT paradigm. Neither ear- 

ponse location correspondence were evident in the a. RT, 

significant in the 6 and c. Ear-hand correspondence was not 

y affected by task complexity but ear-response side correspond- 
ficantly increased from c to b and became more symmetrical. The 
ere significantly faster to right ear stimulation in the a task but signi- 


ly faster to left ear stimulation in 


tasksband c. The results are discussed 


terms of stage theory of information processing and in terms of their rele- 
ance for examination of functional asymmetry. 


A series of experiments has demon- 
str: ie existence of a potent stereotypic 
resp- se pattern (Simon, Hinrichs, & Craft, 
1970). Ina typical experimental paradigm 
Ss respond to stimuli, which are presented 
to cither the right or left side, by manual 
deflection of response keys located to the 
right and left of the body midline. The 
consistent finding has been that Ss respond 
more rapidly when the (apparent) side of 
stimulus presentation and side of response 
execution correspond. Simon, Hinrichs, 
and Craft (1970) have identified ear-hand 
correspondence and ear-response location 
correspondence as two significant factors 
in this lateral stimulus-response (S-R) 
compatibility effect. In a choice (two 
stimuli-two response) reaction time (RT) 
task in which the content of a monaural 
tone (high or low) determined the respond- 
ing hand, response side (as opposed to 
hand used) was identified as the most sig- 
nificant factor. Locus of the tone stimulus 
(right ear or left ear) was irrelevant in this 
task. When locus of a tone stimulus, 


1 This research was supported in part by U. S. 
Public Health Services Grant 14702, National In- 
stitute of Mental Health. The authors express 
their appreciation to James Bertera for his as- 
sistance in the conduct of these experiments. 

2 Requests for reprints should be sent to Oscar A. 
Parsons, University of Oklahoma Health Sciences 
Center, P. O. Box 26901, Oklahoma City, Oklahoma 
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which contained no information content 
(i.e., only one tone presented instead of two 
tones) determined the responding hand, 
both ear-response side and ear-hand cor- 
respondence effects contributed to the 
lateral S-R compatibility effect. 

Simon and his colleagues have demon- 
strated this lateral S-R compatibility with 
both auditory and visual stimuli. They 
have manipulated the strength of the re- 
sponse pattern by (a) decreasing the direc- 
tional information of the stimulus (Simon, 
Craft, & Small, 1971), (b) causing a conflict 
between the directional information in the 
stimulus content and in the stimulus loca- 
tion (Simon & Rudell, 1967), (c) introduc- 
ing higher intensity stimulation. (Simon, 
Craft, & Small, 1970), and (d) adding 
irrelevant directional stimuli in the same 
or different. modalities (Craft & Simon, 
1970; Simon & Craft, 1970; Simon, Craft, 
& Small, 1970; Simon & Small, 1969). 

As an explanation for this phenomenon, 
Simon, Hinrichs, and Craft (1970) have 
postulated that it is the reflection of an 
“innate tendency [in humans] to respond 
toward the source of stimulation [p. 1017." 
However, the fact that this effect has been 
observed only in choice RT studies raises 
questions as to the generality of the above 
explanation. For example, is the com- 
patibility effect also present in experi- 
mentally comparable, simple RT para- 
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digms? Further, what stage or stages of 
information processing are involved in the 
compatibility effect? Is the effect pri- 
marily due to stimulus choice or response 
choice or both ? 

Information processing theory offers a 
conceptual framework within which the 
lateral S-R compatibility effect can be ex- 
plored. According to this approach, a 
choice R'T paradigm contains four informa- 
tion processing stages (Smith, 1968); (a) 
stimulus preprocessing, (b) stimulus cate- 
gorization, (c) response selection, and (d) 
response initiation. Donders' (1969) three 
RT paradigms provide a systematic ap- 
proach which enables inferences to be made 
concerning stages of information processing. 
Briefly, Donders’ a is a simple RT paradigm 
(one stimulus, one response), Donders’ b 
is a choice RT paradigm (two stimuli, two 
responses), and Donders' c is a go-no-go 
choice R'T paradigm (two stimuli, one re- 
sponse). Donders’ b compared with c 
allows the role of response choice to be 
examined while Donders' c compared with 
a enables the role of stimulus categoriza- 
tion to be investigated. In this article, 
two experiments are reported which e 
amine both the ear-response side and ear- 
hand components of the lateral S-R com- 
patibility effect as a function of information 
processing complexity within the frame- 
work of Donders’ three RT paradigms. 
The results are pertinent to the following 
specific questions (which are addressed to 
both components of the compatibility ef- 
fect) : (a) Can the lateral S-R compatibility 
effect be replicated (Donders' b)? (b) Is 
the lateral S-R compatibility effect dif- 
ferentially affected by limiting response 
choice in a go-no-go paradigm (Donders’ 
c)? (c) Is the effect present when both 
stimulus choice and response choice are 


eliminated as in a simple RT experiment 
(Donders' a)? 


EXPERIMENT I 


The purpose of Experiment | was to 
answer the first two questions posed above. 
Both choice (Donders' b) and g0-no-go 
(Donders’ c) RT paradigms were given to 
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the same Ss and the comp 
were compared. 


ty effects 


Method 

Subjects. The Ss were 16 femal: nale paid 
volunteers. All Ss were right-ha | reported 
having normal hearing. The .Ss r age from | 
18 to 31 yr. with a mean of 23 

Apparatus, The Ss were seated nfortable 
chair with arm rests on either sid inse panel 


Two 
panel so 


was located directly in front of ir. 
response keys were located on the 


that they could be reached withou: the fore- 
arm from the arm rests. One ro key was 
located 15 cm. to the right, the oth 5 cm, to 
the left of S's midline. The respon were of 
the standard telegraph type, calibrat that the 
gap between contacts was maintained nm. and 
a pressure of 228 gm. was sufficient to contact. 
All responses were made by depressi: the 
with the index finger from a positio! hich the 
finger rested lightly on the button to ! ;ressed. 
The stimuli were 200- and 500-H; nes gen- 
erated by two Hewlett Packard Modi 0 oscillaz 


natched 


tors. The tones were presented throu: 


pair of TDH 39 earphones which were alibrated 
within 1 db. of each other. Output levo the ear- 
phones was maintained at 90 db. (re .00 b.) and 
was checked at regular intervals by a ibrated 
sound level meter. The intensity rem: sed cons 
stant throughout the experiment. Ton esenta= - 
tion and response time measurement re coni 
trolled by BRS 200 ies digital pro rimming 
equipment. Response times were record n milli- 
seconds on a Lehigh Valley 420-9 printou: counter. 

A green warning light located midway between 
the two response keys was presented 1 «cc. prior to 
the onset of each tone, and there was a 4-sec. 
interval between tone onsets. Both the warning 


light and the tone remained on either until S made 
a response or until 1.5 sec. had elapsed from tone 
onset, 

The S room was dimly lit by a small (6-w.) 
incandescent lamp located behind A 70-db. 
white noise background prevented disturbance from 
extraneous noises including the sound of the ex- 
perimental equipment in an adjacent room. 

Procedure. The Ss were instructed to respond by 
depressing the key with their index finger. E ach b 
performed under two RT conditions: a choice 
(Donders' b) and a go-no go (Donders' c), Half of 
the Ss received the 6 RT condition first, and the 
others received the c RT condition first. Each $ 
received eight practice trials before each condition. 

In the 6 RT condition there were two blocks of 
trials. The Ss responded on one block of trials with 
arms uncrossed, so that their left index finger was 
on the left key and their right index finger was On 
the right key. On the other block of trials Ss re- 
sponded with their arms crossed, so that their left 
index finger was on the right key and their right 
index finger was on the left key. Each block con- 
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TABLE 1 
É Mean REACTION TIMES IN b AND c PARADIGMS 
| Left hand Right hand 1 
| e i eens. 
| Left side | Right side Left side Right side x 
| x SE X SE x SE x SE 
Choice reaction time (Donders' b) 
71 12 447 | 16 390 18 442 17 414 
8 14 421 15 453 19 382 13 429 
18 | 434 422 412 421 
Go-no-go reaction time (Donders' c) 
318 11 339 13 323 13 355 14 334 
351 15 348 14 336 12 336 14 343 
| 335 344 329 346 338 


Reaction time in milliseconds. 


d of the same random sequence of 76 trials. 
ner a low pitched (200-Hz.) or a high pitched 
0-Hz.) tone was presented monaurally on each 
|l. Half of the Ss were told to respond with their 
vi cht index finger when they heard the high-pitched 
ione and with their left index finger when they 
heard the low-pitched tone; the other half of the Ss 
were given the opposite hand-tone rule. Each S 
used the same hand-tone rule throughout the entire 
experiment. Within the b RT condition the crossed 
and uncrossed trial blocks were counterbalanced. 
In the crossed block, half of the Ss performed the 
task with the left arm over the right and the other 
half performed the task with the right arm over the 
left. 

In the c RT condition there were four blocks of 
trials. Each block consisted of the same random 
sequence of tones that was used in the b RT condi- 
tion. On each block of trials S was instructed to 
respond when he heard one tone, not to respond 
when he heard the other tone, and to place only one 
index finger on one response key. Every S per- 
formed one block of trials with each of the possible 
combinations of index finger and key positions: 
right-on-right, right-on-left, left-on-left, and left-on- 
right. The nonresponding arm was placed on the 
armrest alongside S out of the way of the respond- 
ing hand. The order of presentation of the blocks 
of trials within the ¢ RT condition was randomized. 


Results 

For each S, median RTs were calculated 
for each tone-ear combination. Means of 
the median RTs are presented in Table 1. 


Planned comparison / tests were made 
on (a) order of presentation (b followed by 
c, € followed by b), (b) effect of hand-tone 
rule (high-right, low-left ; low-right, high- 
left), and (c) effect of arm over in the 
crossed conditions (right-over-left, left- 
over-right). None of these tests were sig- 
nificant, and they were, therefore, elimi- 
nated from further analysis. 

A five-way repeated measures analysis 
of variance (ANOVA) was conducted with 
condition, hand, ear, and side as repeated 
factors and sex as a between-S factor. The 
following main effects were significant : Sex, 
F (1, 30) = 7.08, p < .05, males respond- 
ing faster than females (350 vs. 409 msec.) ; 
Condition, F (1, 30) — 60.13, p < .001, € 
responses were faster than b responses (338 
vs. 421 msec.) ; Ear, F (1, 30) = 18.81, p < 
.001, left ear was faster than the right (374 
vs. 386 msec.) ; and Side, F (1, 30) = 4.78, 
p < .05, left side was faster than the right 
(376 vs. 384 msec.). 

The Ear X Side interaction F (1, 30) — 
109.71, p < .001, successfully demonstrated 
the ear-response location correspondence 
effect and replicated the finding of Simon, 
Hinriehs, and Craft (1970) in that when 
stimulus and response sides corresponded, 
response times were significantly faster 
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Donders’ b 


Donders' c 


Donders' a 


Left 


Center 
Response Side 


Right 


FiGURE 1. The Ear-Side correspondence effect 
in simple (Donder's a), two stimuli-one response 
(Donders' c), and two stimuli-two responses (Don- 
ders' b) RT conditions. (Right and left hands are 
collapsed.) 


(Figure 1). The Condition X Ear X Side 
interaction, F (1, 30) = 67.43, p < .001, 
indicated that the ear-response location 
effect was significantly greater in the 6 than 
in the c condition (Figure 1). A second 
ANOVA was performed on the Ear X Side 
interaction within the c condition and the 
effect proved significant, F (1, 31) — 14.29, 
b <.01, although predominantly on the 
left side. Thus this correspondence effect, 
while stronger in the b RT, does appear in 
both conditions. 

The Ear X Hand interaction was also 
significant, F (1, 30) = 18.53, p < .01, but 
the Condition X Ear X Hand was not 
(F = .04), indicating that the ear-hand 
correspondence effect, although present in 
both conditions, was not differentially af- 
fected by task complexity. The Ear x 
Hand interaction has another interesting 
feature; that is, the correpondence is en- 
tirely in the left hand. The means for right 
hand-right ear and right hand-left ear are 
both 377 msec. while the mean for the 
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and for 


left hand-left ear was 370 : 
the left hand-right ear, 395 ims: 

A comparison of Ear X Sid: Ear x 
Hand correspondence effects . splayed © 
in Table 2 for both b and litions, 
This table provides convenien parison 
with the results of Simon, |! s, and 
Craft (1970). 

The only other significant efi 18 Sex 
X Condition X Ear, F (1, 30 84, f 
< .01; males had less differen: tween 
the right and left ear in the < lition 
than the females. This inter: may 
reflect differences in functional a netry 
between males and females as been 
suggested by Kimura (1969). Ti- ‘ailure 
of the Hand X Side interaction (/ .69) 
to achieve significance indicates th» cross- — 
ing the hands was not a significan actor, 
Simon, Hinrichs, and Craft (1970) ‘ound 
that uncrossed trials were sign intly 
faster than crossed trials, but dire’ com- 
parisons between their experiment a:i this 
cannot be made as the relative cor t of 


the arm positions is unknown. 


EXPERIMENT II 


as to 
r the 
d be 
rs’ a) 


The purpose of this experiment ` 
answer the third question, i.e., wheti 
lateral S-R compatibility effect wo: 
found in a simple RT task (Dond. 


TABLE 2 


Reaction TIME as A FUNCTION OF EAR-RESPONSE 
Location CORRESPONDENCE AND 
EAR-HAND CORRESPONDENCE 


Donders' b condition 


bs Ear-hand | Ear-hand » 
Compatibility factor | correspon- noncor- X 
dence respondence 

Ear-side 

correspondence 380 406 393 
Ear-side 

noncorrespondence 450 450 450 
X 415 428 | 

Donders' c condition 
| 

Ear-side 

correspondence 327 336 332 
Ear-side 

noncorrespondence 338 353 346 
si | 
Xx 333 345 
Note. Reaction time in milliseconds. 


z 


x 


TASK LEXITY AND STRE 


in which b ulus and response choice 


are elin Simon (1967) examined 
ear pref in a simple RT task with a 
central! itioned response key and pre- 
ferred d responding. The possibility 
of an teraction with response side was, 
there’ re, not examined. In the present 
expenment, lateral and central response 


locations are compared with right-ear, left- 


ear, and b |] stimul 


| male paid volunteers. 


All ietermined by self-report 
and n of writing habits. All Ss 
rep mal hearing. Ages of the Ss 
rany ind 37 yr. of age with a mean of 
26. 

At fhe apparatus was basically the 
sam described in Experiment I with the 
addit third response key located on the S's 
midli iway between the two lateral response 
keys. muli were 200-Hz. tones of the same in- 
tensit, as in Experiment I. A green warning light 
locate . 15 cm. above the central response key pre- 
cedec the stimulus tone by a randomly varying 


inter al, either .5, 1.0, or 1.5 sec. The interstimulus 
interval was 5 sec, between tone onsets. 

Procedure, Eight blocks of trials were presented. 
Six consisted of single-hand responses with each 
hand performing in turn on each response key. Two 
blocks of trials consisted of two-hand responding 
(crossed and uncrossed) in conjunction with another 
experiment. During two-hand responding trials, 
the S’s task was to respond with both index fingers 
simultaneously while E recorded RT in milliseconds 
to the first key depressed. Half of the Ss received 
the two-hand block followed by the single-hand 
block, and the other half of the Ss received the trial 
blocks in the reverse order. The order of single- 
hand trial block presentation was randomized across 
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The Ss were given 21 practice trials randomly 
presented so that there were 7 each in the right ear, 
the left ear, and binaurally. Practice responses 
were ona randomly chosen single-hand response key, 
Each trial block consisted of 63 randomly presented 
stimuli, 21 each to the right, left, and both ears. 
In between trial blocks, S was told that he had 
performed very well and then told which hand- 
response key condition would follow. The Ss were 
exhorted at the beginning of the experiment and 
between each trial block to respond as rapidly as 
possible. Further, they were told that the tone 
would appear sometimes in the right ear, sometimes 
in the left, and sometimes in both. Regardless of 
where the tone appeared, they were to respond as 
rapidly as possible with the assigned hand-key 
condition. 


Results 


Median RTs were calculated for each of 
the ear presentation locations (right, left, 
binaural) within each of the trial blocks. 
Planned comparison / tests of single-hand 
vs. two-hand response trial blocks revealed 
no significant differences. Further ¢ tests 
of single-hand responding before vs. single- 
hand responding after two-handed response 
trials revealed no significant differences. 
The two-hand responses were omitted from 
further analyses. A three-way (Ear X 
Hand X Side) repeated measures ANOVA 
was then performed on the median scores 
of the single-hand response trial blocks. 
Table 3 presents the median scores. Of 
the seven possible main and interaction 
effects, only two were significant: Ear, f 
(2, 58) = 16.22, p < 01, and Hand X Side, 
F (2, 58) = 3.94, p < .05. The right-ear 
and binaural RTs did not differ, but both 
were significantly faster than the left-ear 


all Ss. 
TABLE 3 
MEAN REACTION TIMES IN THE à PARADIGM 
Left hand Right hand 
Ear Left side Center Right side Left side Center Right side x 

Ed SE X SE X SE Xx SE x SE x SE 

186 8 181 6 182 8 176 6 183 7 181 6 |181 
pee 181 8 186 6 173 7 172 6 178 7 177 6 | 176 
Right 180 7 177 6 175 7 172 7 181 6 177 6 |177 
x 182 178 177 173 181 178 178 


Note. Reaction time in milliseconds. 
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responses: left-ear mean = 181 msec.; bi- 
naural mean = 176 msec.; right-ear mean 
= 177 msec. Figure 1 presents the means 
of right- and left-ear responses on the three 
response keys for comparison with the re- 
sults of Experiment I. 

The significant Hand X Side interaction 
reflected the fact that the right hand was 
faster when operating on the left-side re- 
sponse key; similarly, the left hand was 
faster on the right side. This effect was 
not present in Experiment I and may repre- 
sent a chance finding. 


Discussion 


Lateral stimulus-response compatibility ef- 
fects. Those results which are relevant to the 
specific experimental questions are as follows: 
(a) The lateral S-R compatibility effect de- 
scribed by Simon, Hinrichs, and Craft (1970) 
in the Donders’ b paradigm is quite strong and 
reproducible. (b) The ear-response location 
component of the lateral S-R compatibility 
effect is present under the Donders’ c RT 
condition but is significantly less than in the 
Donders’ b condition. At the same time the 
symmetry of the effect increases from ¢ to b. 
The ear-hand correspondence effect is consis- 
tently present in both 5 and c conditions but 
operates almost exclusively in the left hand. 
(c) There is no S-R compatibility effect in a 
simple RT situation, Donders' a. 

The presence of the Ear X Side interaction 
in the Donders' c paradigm and its absence in 
the Donders’ a paradigm suggests that the 
stimulus categorization stage, but not stimulus 
preprocessing or response initiation, is impor- 
tant with respect to the ear-response location 
correspondence effect. When the additional 
stage of response selection is added in the 

Donders’ b paradigm, this correspondence 
effect significantly increases. Response choice 
appears to be an important determinant of the 
effect. The ear-hand correspondénce is also 
present in both b and c paradigms, but not 
differentially so, and is absent in Donders' a. 
Stimulus categorization, therefore, appears to 
be the most important stage for the ear-hand 
correspondence?effect. It is also important to 
note that both left-ear-hand and left-ear- 
response location correspondence effects are 
stronger than those for the right. Further, 
while ear-response location correspondence 
becomes more symmetrical as the task com- 
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plexity increases from c to car-hand 
correspondence is almost entiri left-hand 
effect. 

Ear effects. The significantly response 
to right-ear stimulation in t! nple RT 
(Experiment II) is consistent wi finding 
of Simon (1967). However, it i esting to 
note that in the choice RT condi Experi- 
ment I), responses to left-ear sti ion were 
faster than to right-ear stimulati eversal 
of the relationship in the simp! This 
reversal may have been due i: to the 
additional stimulus processing re: | in the 
stimulus categorization stage bourne 
(1970) has suggested that the stro: ght-ear 
bias in the simple RT may refle: atten- 
tional preference for the left hemi re as a 
consequence of Ss concurrent ve linking 
proces: ie, the hemisphere in : isplays 
a lower attentional threshold. 1 results 
from these experiments suggest an or exe 
planation within the framework Kins- 
bourne's attentional hypothesis. more 
complex b and c choice tasks appea to re- 
quire more concentrated concurré erbal 
activity as S had to remember the ^... d-tone 
rule, e.g., "high tone-right hand." e sig- 
nificant left-ear effect in these situati might 
have been due to the preoccupati: f the 
left hemisphere with verbal rehear À In 
fact, some Ss reported rehearsing the rle over 
and over to themselves during the to Sys- 
tematic control of this variable, however, was 


not attempted in this experiment. 

A second explanation for the ear chects in 
the b and c conditions may be that the right 
hemisphere is more effective in tonal discrimi- 
nation than the left hemisphere. ‘The work 
of Kimura (1967) provides support for this 
hypothesis. This would explain the facilita- 
tion of left-ear stimulus processing in those 
conditions in which high and low pure tones 
must be discriminated before responding, Per- 
haps both left hemisphere interference and 
right hemisphere facilitation are operating to 
produce the results obtained in the choice RT 
conditions. 


In conclusion, the data from these experi- 
ments suggest that information processing 
complexity, as exemplified in the additive 
model of Donders, must be taken into account 
when making inferences about the generality 
of the lateral S-R compatibility effect. These 
data further suggest that lateral asymmetries 
in RT performance are affected by the manip- 
ulation of information processing complexity. 
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ESTIMATION OF WORD FREQUENCY IN CONTIN 


AND DISCRETE TASKS! 


IAN BEGG? 
McMaster University, Hamilton, Ontario, Canada 


Long lists of concrete and abstract nouns which appeared varying number 
times were presented to two groups of Ss, One group judged how many tim 
each word occurred during presentation (online), and again after the wh 
list had been presented (delayed). The second group only judged the wor: 
after the list had been presented (discrete), All Ss subsequently recalled ti 
words. Frequency estimates were above true frequency for relatively lor 
frequencies, and below for relatively high frequencies in both the delayed aix 
discrete judgments, but more so in the latter. Additionally, the interactio: 
was more pronounced in abstract than concrete nouns. Recall was a positiv 
function of concreteness, true frequency, and subjective frequency. The rc 
sults were discussed in terms of multiple-trace and multiple-process views o! 
how frequency is represented in memory. 


This paper addresses the question of how 
Ss estimate the number of times a word 
has occurred in a given situation. Accord- 
ing to Howell (19732), there are four major 
hypotheses concerning the representation 
of event frequency in memory. The first, 
the trace-strength hypothesis, maintains 
that each presentation of an event causes 
an increment in the strength of the memory 
trace corresponding to the event, and that 
frequency estimation is effected by reading 
out current strength values. The second, 
the multiple-trace hypothesis, maintains 
that each presentation of an event pro- 
duces a new trace (Hintzman & Block, 
1971), or a new "list marker" (Anderson 
& Bower, 1972), and that frequency esti- 
mation consists of counting the number of 
traces or list markers. What the two hy- 
potheses have in common is that there is 
no unique frequency representation process. 
Rather, frequency estimation consists of 
inferring numerical judgments from a gen- 
eral memory representation. What differ- 
entiates the hypotheses is that the trace- 
Strength hypothesis has no mechanism for 
separately retaining information about the 
individual contexts in which each presenta- 
tion of an event occurs, Since Ss can 
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retrieve information about thi wation 
(Hintzman, 1970) and modali: Hintz- 
man, Block, & Summers, 1973) specific 
occurrences of events, the muli. 'e-trace 
hypothesis is more useful than t-e trace- 
strength hypothesis, which will no- be dis- 
cussed further here. 

Two further hypotheses can be -erived 
from the proposition that the situational 
frequency of occurrence of an iter: is spe- 
cifically represented in memory, an<: is not 
simply inferred from the general ::emory 


representation. One such hypothesis, the 
numerical-inference hypothesis, is that S$ 
simply count the number of times items 
have occurred, and give the curren: count 
as the frequency judgment. The second 
hypothesis, the frequency-attribute hypoth- 
esis, is that an item is remembered as a set 
of attributes, one of which increments with 
situational frequency; estimates of event 
frequency could thus be made by reading 
out the frequency attribute (Underwood, 
1969a). Since the numerical-inference hy- 
pothesis can be subsumed by the frequency- 
attribute hypothesis, they will both be 
referred to as multiple-process hypotheses, 
since they posit that frequency information 
is specifically represented in memory, rather 
than as an incidental component of the 
general memory representation of an 
event. 

It is difficult, if not impossible, to differ- 
entiate experimentally between a multiple- 
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ace and a muitiple-process hypothesis. 
- Since memory can onceived of as being 
characterized by dii ront levels of proces- 


sing (Craik & Li 1972), words 
processed for n: id simply be 
processed to a d han words 
processed for freq: a ifferences 
in memorability essed 
under the two 1 r 

ied 


reflect different 0 
by Underwood ( lect the fact 
that words proces- requency in- 
structions were | d as deeply 
as words process: ;nemory instruc- 
tions. Howell ( found that words 
processed under f: ncy instructions were 
not as well recalled .s those processed under 
memory instructions, but that frequency 
judgments were not affected by the set 
under which the events were processed. 
Howell concluded that the memory differ- 
ence reflected « different process, although 
it is also possible to interpret the results in 
terms of different levels of processing. 

If either the multiple-trace or the mul- 
tiple-process hypothesis is to provide a 
“reasonable account of frequency estima- 
tion, there are certain basic facts that must 
be taken into account. First, if Ss are 
presented with a list of words that appear 
varying numbers of times, and judge on 
- every trial the number of times a particular 
Word has occurred, average judgments cor- 
respond closely to actual frequency (Begg 
& Rowe, 1972, Experiments II & III). 
Either hypothesis could handle the finding 
easily. For the Anderson and Bower (1972) 
multiple-trace model, the mean number of 
. list markers available for an item presented 
|. ^ times in a list is » a, where a is the 
| probability of estab ing a list marker at 
| 


the time of presentation. The mean num- 
ber of list markers has an upper bound 
defined by true frequency (n) and a lower 
bound of zero. Frequency estimates are 
© Provided by a transformation of the num- 
ber of list markers. The results reported 
by Begg and Rowe could be accounted for 

y making the transformation linear (i.e. 

En = a + buo), in which case a = 0, and 

ba = 1. The multiple-process hypothesis 
could handle the result by assuming that 
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the frequency attribute, on the average, 
increments by a unit amount for every unit 
presented and is, at least during acquisi- 
tion, generally retrievable. 

The second general finding to be ac- 
counted for is that if frequency judgments 
are not made until after the list has been 
presented, the judgments generally are 
higher than relatively low true frequencies, 
but below relatively high frequencies (e.g., 
Peterson & Beach, 1967). There are sev- 
eral points to note about the interaction 
between true and judged frequency. First, 
the interaction obtains when Ss are previ- 
ously instructed either that the task will 
require frequency judgments (e.g., Howell, 
1973b; Underwood & Freund, 1970) or to 
remember the words (e.g., Hintzman, 1969; 
Howell, 1973b). Second, the interaction 
is sensitive to the frequencies used in the 
specific task, since it occurs with as few as 
2 presentations representing the highest 
frequency level (Hintzman et al., 1973), 
or as many as 10 (Howell, 1973b). That 
is, the interaction is a function of relative, 
rather than absolute, frequency. Third, a 
retention interval makes the interaction 
more pronounced, both with respect to 
overestimation of low and underestimation 
of high relative frequencies (Underwood, 
Zimmerman, & Freund, 1971). 

Since true and judged frequency interact 
when judgments are made after presenta- 
tion of a list (discrete judgments), but not 
during presentation of the list (online 
judgments), the interaction could reflect 
one of two factors, or a combination of the 
two. Online and discrete judgments differ 
both with respect to the presence of a delay 
interval interposed between the last presen- 
tation of an item and the subsequent esti- 
mation in the latter case, and with respect 
to the presence of overt running judgments 
in the former. In the present experiment, 
Ss who made online judgments were also 
asked to judge the words at the end of the 
task (delayed judgments). Comparisons 
between the two sets of judgments reflect 
retention effects, while comparisons be- 
tween delayed and discrete judgments re- 
flect the effects of the overt running judg- 
ments made in the former condition. 


1048 


The facts that true and judged frequency 
interact, that the interaction is relative to 
the range of true frequencies used, and that 
a retention interval accentuates the inter- 
action, must be accounted for by the hy- 
potheses, either by changes in the memory 
representation over time, or by variables 
affected by the presence of overt judgments 
during acquisition. First, consider the 
effects of a retention interval on the repre- 
sentations from which frequency estimates 
are inferred. According to Anderson and 
Bower (1972), the effect of a retention 
interval is to reduce the availability of 
individual list markers, so that the best fit 
line relating true and judged frequency 
would become less steep over time. The 
problem is that the model has no explicit 
provision for raising the intercept of the 
line as a function of forgetting, and so does 
not account for (a) the overestimation of 
low-frequency words, (6) the fact that the 
interaction is relative to the range of ex- 
perimental frequencies, or (c) the exaggera- 
tion of the degree of overestimation of low 
frequencies with longer delay intervals. 
However, the problem is not serious. If S 
compares his subjective mean number of 
list markers available across the full range 
to his subjective estimate of the mean 
number of occurrences of words, the differ- 
ence between the two could simply be 
added to all estimates derived from the 
number of list markers available. "That is, 
the effect of a retention interval is to move 
all judgments closer to the task-defined 
mean frequency. For Underwood's (19694) 
multiple-process model, no specific state- 
ment is made about the effects of a reten- 
tion interval on the frequency attribute. 
A reasonable result of deterioration of the 
frequency attribute, however, is that the 
attribute will become assimilated to the 
mean situational frequency of occurrence of 
the words. Thus, judgments would gener- 
ally become closer to the task mean over in- 
creasing retention intervals. Consequently, 
both models of frequency estimation can 
account for a frequency interaction, which 
is a function of relative frequency, and 
which becomes more pronounced with time. 
The forms of both models as developed here 


IAN BEGG 


would also predict that the < 
of responses would not diff 

of retention interval, as fou 
wood et al. (1971). 


ll average 
ı function 
by Under- 


Recall that, in addition retention 
interval, online and disc: udgments 
differ in that Ss in the fori: ise make 
running judgments during th sentation 
of the words. Whether s: timation 
requires closer attention, o: ides re- 
trieval practice, or whatever, | vypothe- 
ses, as interpreted above, wo vield the 
prediction that the effects o! h judg- 
ments would be effectively op to the 
effects of forgetting. That moving 
the running estimates should e judg- 
ments to become closer to thc mean, 

In the present experiment, c te and 
abstract words were presented 5s who 
either judged the frequency of irrence 
of each word as it occurred (o ) and 
then again after the list had been sented 
(delayed), or only judged freque -y after 
the presentation of the entire list (^ «rete. 
The expectation is that the oni judg- 
ments will be relatively accurate, ‘he de- 
layed judgments will show an interaction 


between true and judged frequency, and 
that the interaction will be m pro- 
nounced in the discrete judgments. The 
effects of concreteness on frequency esti- 
mation are not very pronounced in online 
judgments (Begg & Rowe, 1972), Since 
Anderson and Bower (1972) conceive of 
tagging with list markers as essentially à 
paired-associate task, abstract words should 
show a more pronounced frequency inter- 
action than concrete words with both de- 
layed and discrete judgments, because (4) 
concrete words are known to be better 
stimuli than abstract words in paired- 
associate learning (e.g., Paivio, 1969), and 
(b) concrete words are better retained than 
abstract words over long periods of time, 
particularly if the recall task is associative 
(Begg & Robertson, 1973). A frequency 
attribute, if it is conceived of as a paired- 
associate of the word, would yield the same 
predictions as the Anderson and Bower 
model. 

The Ss in the present experiment were 
also asked to recall the words after they 


had made incir terminal judgments. Re- 
call was ex ected to be a positive function 
of concretcness (e.g., Begg & Robertson, 
1973), true frequency, and subjective fre- 


quency (e.:., Underwood et al., 1971). 
METHOD 
Subject Twenty volunteers from McMaster 
Universit paid $2/hr for their participation. 


Half of the determined randomly, judged the 
frequency of cach word as it occurred, and half 
made their first judgments after the entire list had 
been presented 
Materials and procedure. Fifty concrete and 50 
abstract 1 were chosen from the Paivio, Yuille, 
and Madigan (1968) norms, with mean J ratings of 
6.39 and 3.04. All words were A or AA in frequency 
of occurrence. Ten concrete and 10 abstract nouns 
were printed on 17 cards, 10 were printed on 9 
cards, and another 10 were printed on each of Sup 
and 1 card. The 700 cards thus produced were 
arranged in 28 sets of 25 cards, with successive oc- 
rues of the words from each frequency level 
AM placed in consecutive piles. The pile in which 
ECL presentation of an item occurred was às 
om as possible, given the constraint that there 
: "nough adjacent piles for the later presentations 
ET A and that an equal number of concrete 
an! abstract nouns from any given frequency level 
ted in the same pile. After the sorting, each 
was thoroughly shuffled, and the blocks were 
placed together to form one deck of 700 cards. On 
then, 25 items separated successive 
of a particular item. 
ìs presented to each S in three stages. 
iaxe, S went through the entire 


In the pre 
list of 700 cards at his own rate, reading each word 
aloud. The Ss in the continuous-presentation con- 
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[4 drm 
DELAYED JUDGMENTS 


CLE 
9 v 135 E] 7 
DISCRETE JUDGMENTS 


FREQUENCY OF PRESENTATION 


Le Online, delayed, and discrete judgments of frequency, as a 
function of concreteness and the frequency of presentation. 


dition estimated the frequency of occurrence of each 
word in the list, while Ss in the discrete-presentation 
condition made no overt judgments, although they 
had been told that their task was to remember how 
many times each word occurred. In the second 
stage of the task, the judgment stage, all Ss were 
presented with lists containing the 100 words from 
the initial list. Six different random orders were 
used. The Ss were instructed to write down next 
to each word a number representing how many 
times they thought the word had occurred in the 
initial list. In the final stage of the task, the recall 
stage, all Ss were asked to write down as many of 
the words as they could remember, in any order. 
The whole task took about 75 min. 


RESULTS AND DISCUSSION 


Frequency estimates. Since any departure 
from accuracy in judging the frequency of 
once-presented items would necessarily be 
above true frequency, median rather than 
mean judgments were analyzed. For Ss 
in the continuous-judgment task, two sets 
of medians were calculated, one for the 
online judgments made during presenta- 
tion and one for the delayed judgments. 
Each set of medians had 10 members, 1 for 
the 10 concrete and 1 for the 10 abstract 
nouns at each of the five frequency levels. 
For the online judgments, only responses 
made to the last presentation of an item 
were scored. The medians for the delayed 
judgments andjfor the discrete judgments 
included all nouns in the final judgment 


list. 
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'The mean medians, averaged across Ss, 
are presented in Figure 1. For each of the 
three types of judgments, results are plotted 
separately for concrete and abstract nouns, 
and a perfect-accuracy line is included for 
reference purposes. There are several 
points to note in the figure. First, the 
online judgments made in the continuous 
task did not depart much from true fre- 
quency, as found by Begg and Rowe (1972), 
and there were no apparent effects of con- 
creteness. However, when the same Ss 
repeated their judgments in the delayed 
condition, the judgments showed an inter- 
action between true and judged frequency, 
and the interaction was more pronounced 
for abstract than concrete nouns. The Ss 
in the discrete task showed a more exag- 
gerated interaction than Ss in the delayed 
task, and the interaction was again most 
pronounced with abstract nouns. It is also 
perhaps of note that the means of the 10 
medians in each judgment condition (7.1, 
7.7, and 6.7 for the online, delayed, and 
discrete judgments, respectively) did not 
depart much from the true mean frequency 
of presentation, which was 7, suggesting 
that Ss have some idea of how many pre- 
sentations actually occurred, and adjust 
their responses accordingly. 

The reliability of the results was deter- 
mined by two analyses of variance. The 
first analysis compared the online and de- 
layed judgments made in the continuous 
task, to assess the effects of a delay interval 
on the judgments. Thus, time of judgment 
(online or delayed), concreteness (abstract 
or concrete), and frequency of presentation 
were all repeated factors. Overall, judg- 
ments increased with frequency, F (4, 
36) = 53.3, p < .01. More importantly, 
frequency and the time of judgment inter- 
acted, F (4, 36) = 7.44, p < .01, confirming 
the reliability of the tendency to overesti- 
mate low- and underestimate high-fre- 
quency nouns in the delayed judgments. 
Further, the three-way interaction, F (4, 36) 
= 2.65, p < .05, reflected the fact that the 
pattern found in the delayed condition was 
more pronounced with the abstract than 
concrete nouns. In fact, there is only a 
hint of the interaction in the delayed judg- 
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ments of concrete nouns. “Í esults can. 
be seen in the two left-h. panels of | 
Figure 1. 

The second analysis coin; | the two 
sets of judgments made aftei the words 
had been presented. Differ: between. 
the delayed judgments and ^ e discrete 
judgments can be attributed : > running 
estimates made by the Ss ii delayed 
task. The results were analy by2X2 
X 5 analysis of variance, wit! ne type of 
task as an independent fact ind con- 
creteness and frequency as rept | factors. 
As in the previous analysis, juc -nents in- 
creased with frequency, F (4 ) = 67.3, 
p < .01. The task variable intc:- cted with 
frequency, F (4, 72) = 6.18, p 01, since 
the interaction between true | judged 
frequencies was more pronounced in the 
discrete than the delayed task. Finally, 


there was a significant Concretcness X 
Frequency interaction, F (4, 72) = 4.22, p 
< .01, since the interaction between true 
and judged frequencies was more pro- 
nounced with abstract than concrete nouns 
in both tasks, as seen in the two right-hand 
panels of Figure 1. 

Overall, then, the interaction between 
judged and true frequency is absent in 
online judgments, is present in both de- 
layed and discrete judgments, and is espe- 
cially pronounced in the latter, Further, 
the interaction is more pronounced when 
abstract rather than concrete words are 
judged. The results conform quite well to 
the expectations of either a multiple-trace 
model that posits a comparison process 
between the number of list markers avail-_ 
able and the number of presentations made, 
or to a frequency-attribute model that as- 
sumes the frequency attribute becomes 
assimilated into the task-defined mean fre- 
quency as it deteriorates. 

, 1f the individual judgments become as- 
similated to situational frequency, then it 
should be the case that the amount of vari- 
ability around the mean decreases as the 
interaction between true and judged fre- 
quency becomes more pronounced. From 
online to delayed judgments, the mean 
Standard deviation decreased from 6.39 to 
$89, F (1, 9) = 727, p < .025, for the 
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abstract The decrease was not 
found fo jouns, which did not 
show as 1 € interaction as the 
abstract the mean stan- 
dard d yed judgments 
was 6 he discrete 
judg: ! on logio 
trat nati viations 
showed t! 18) 
= 14.6 zen- 


et ie judged situa- 


Rei r of words re- 


called by S was analyzed by 
a2 variance with task 
(cont as the independent 


facto: iess and frequency as 
repeat Recall increased with 
frequen 72) = 10.3, p < .01, with 
respective an recalls of 1.95, 2.40, 2.68, 
3.28, and .88, each out of a possible 10. 


Concreten ss was also positively related to 


recall, F (., 18) = 15.5, p < .01, with 16.8 
and 11.5 concrete and abstract words re- 
called « Í a possible 50. The Concrete- 
ness > interaction was not reliable 
(F < 1), although in absolute terms, the 
concr: ; effect was larger in the dis- 
crete tasi: (16.5 vs. 9.4) than in the con- 
tinuous task (17.1 vs. 13.7). 

In order to determine whether apparent 
frequen: fected recall, the frequency 
judgments made to the words that were 


later recalled were compared to the judg- 
ments of words not recalled. Because of a 
number of empty cells, the data were 
averaged over low frequency (1, 3, and 5), 
high frequency (9 and 17), and concrete- 
ness. [n discrete task, the mean 
judgment oí recalled words was higher than 
the judgment of forgotten words, both at 
low frequency, 6.23 vs. 5.12, F (1, 9) = 
6.70, p < .05, and at high frequency, 11.07 
vs. 8.92, F (1, 9) = 15.0, $ < .01.. The 
same was true in the continuous task, both 
at low frequency, 5.92 vs. 4.70, F (15,9) = 
7.22, p < .05, and at high frequency, 13.49 
vs. 11.89, F (1, 9) = 5.83, p < .05. Thus, 
recalled words were consistently higher in 
judged frequency of occurrence than for- 
gotten words. 
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As a second test of the hypothesis that 
subjective frequency is related to recall, 
recall was conditionalized on whether the 
final frequency judgments were above or 
below each S’s median judgment at each 
frequency level. For each S, the results 
were averaged over concreteness, and once- 
presented items were not considered, since 
for several Ss there were no judgments 
below the median. The proportions of 
words recalled, given that the final fre- 
quency judgment was above or below the 
median, were analyzed by a 2 X 2 X 4 
analysis of variance, with task (continuous 
or discrete) as an independent factor, and 
level of final judgment (above or below the 
median) and frequency (3, 5, 9, or 17) as 
repeated factors. Both subjective fre- 
quency, F (1, 18) = 13.5, p < .05, and 
actual frequency, F (3, 54) = 3.50, p < .05, 
were positively related to recall. Thus, 
items initially judged above the median 
were more likely to be recalled than items 
judged below the median, with respective 
proportions of .356 and .250, and recall 
increased over the four frequency levels 
included in the analysis, with proportions 
of .240, .280, .317, and .375. No other 
effects were reliable. 

Thus recall is a positive function of both 
subjective and actual frequency of occur- 
rence, whether subjective frequency is de- 
fined by average ratings of recalled words, 
or by recall or words partitioned according 
to the level of the frequency judgments. 
This extends the finding of Underwood 
et al. (1971). à 

The general conclusion of the paper is 
that any reasonable account of frequency 
estimation must include some process that 
allows judgments to become closer to the 
task mean, whether that process is an addi- 
tive bias (multiple trace), or assimilation 
of a frequency attribute into that mean. 
In his review, Howell (1973a) did not con- 
sider the nature of the interaction between 
true and judged frequency, or the fact that 
the interaction is a function of relative, 
rather than absolute frequency. However, 
his conclusion that it is not possible to rule 
out the alternative hypotheses is entirely 


correct. 
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The present results also have implica- 
tions for verbal discrimination learning, a 
task in which concrete items lead to better 
performance than abstract items even on 
the first trial (e.g., Rowe & Paivio, 1971). 
Ekstrand, Wallace, and Underwood (1966) 
offered an account of verbal discrimination 
in which situational frequency was viewed 
as the dominant attribute affecting per- 
formance. Such an account cannot easily 
explain imagery effects, since abstract words 
actually have higher subjective frequencies 
than concrete words for low frequencies if. 
the judgments are delayed, and the judg- 
ments are not different if they are made 
at the time of presentation of the words. 
Thus, although situational frequency is a 
powerful explanatory construct with re- 
spect to verbal discrimination learning, ad- 
ditional constructs must be invoked to 
explain the effects of imagery in the task. 
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IYPOTHESIS BEHAVIOR AS A FUNCTION 
OF AMOUNT OF PRETRAINING ' 


RAYMOND M. WHITE, Jr? 


Miami University 


Twe ients were performed to determine the effects of conceptual pre- 


trainin 
discrim 

1,2, 3, or 
problems, F 


blank tr 
tion of 


the hypothesis behavior exhibited by human Ss on simultaneous 
n problems that included blank tria 
imultaneous discrimination problems 
periment II was defined by the factorial combina- 
, 4, or 6 pretraining problems and two sets of instructions, 128 Ss 


For Experiment I, 80 Ss 
prior to four 


bein n routine instructions or directions to consider a single hypothesis 


M e, With tr. 


ning, Ss were more likely to maintain their previous 


h is after positive outcomes and to alter their hypothesis after negative 


s. The conclusion was that the uniprocess hypothesis models, s 


ch 


ne proposed in 1966 by Levine, do not accurately describe the behavior 
‘trained Ss, and that a multiprocess model of conceptual learning is 


led. 


Many, not all, theories of concept 
identifica: on. (CI) ascribe a single process 
to the problem solver in his attempt to 
identify t:e relevant attributes of a con- 
ceptual problem (e.g., Levine, 1966; Restle, 


1962). However, recently White (1972a, 
1972b discussed the possibility that 
S retains à number of means for solving a 
problem: Which of the approaches will be 
adopted in attempting to solve a problem 
depends on a number of factors such as the 


type and amount of training S has received. 

Support for White's (1972a, 1972b) multi- 
procc of CI was indirect. The Ss in 
his experiments were given one of two types 
of pretraining problems, The performance 
of Ss who were given systematic training 
(i.e., a routine conceptual problem) was 
unaffected by the length of postfeedback 
Interva!s after correct responses—indicating 
the absen | problem-solving activity 
during that inierval—but strong effects due 
to the duration of the postfeedback interval 
after errors were observed. These results 
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M. White, Jr., Department of Psychology, Miami 
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were as predicted by an error-trial-only 
theory of CI (e.g, Bower & Trabasso, 
1964). On the other hand, the performance 
of Ss who received unsystematic training, 
consisting of a problem whereby S could 
reach the solution criterion by memorizing 
the placement of each stimulus pattern, 
was affected equally by the length of inter- 
vals after both correct and incorrect re- 
sponses. The latter result is consistent. 
with the predictions of an all-trials, re- 
sponse-independent theory of CI-- (eg 
Bourne & Restle, 1959). Furthermore, as 
the amount of pretraining was increased, 
the effects of the length of postfeedback 
intervals were more consistent with the 
predicted outcomes of the opposing theo- 
ries: Systematic training provided perform- 
ance orderings as predicted by error-trial- 
only CI models, and unsystematic training 
produced orderings as predicted by all- 
trials CI models. 

Levine (1966, 1970) has provided a more 
direct way of assessing the processes under- 
lving performance in a GI problem and a 
theory to account for behavior in such 
problems. Levine’s method is to provide 
certain trials during which S is given feed- 
back concerning the correctness of his re- 
sponse (hereafter termed outcome trials, 
OTs). Following an OT, typically, are a 
series of trials for which no feedback is 
given (hereafter termed blank trials, BTs). 
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By observing the responses S makes during 
the BTs, the hypothesis S is entertaining 
can be determined. For example, in a 
simultaneous discrimination problem, if S 
always selects the stimulus appearing on 
the left side of the display as containing 
the correct attribute, it is inferred that he 
is hypothesizing that "leftness" defines 
problem solution. Through such analyses, 
it can be determined on what occasions S 
changes hypotheses and the manner in 
which the hypotheses are changed. 

Levine (1966, 1970) has reported several 
important findings using the BT procedure: 
(a) Ss consistently use hypotheses; (b) Ss 
rarely change hypotheses after having re- 
ceived a positive OT; and (c) Ss nearly 
always change hypotheses after receiving 
a negative OT. All of these results are 
consistent with the data reported by White 
(1972a, 1972b). First, Levine's Ss typi- 
cally receive extensive pretraining. Indeed, 
Levine assumes the universe of hypotheses 
is known by S and that the assumption 
can be met by training and instruction. 
White's (1971) finding was that Ss receiv- 
ing extensive systematic pretraining and 
instruction are likely to behave as hypothe- 
sis testers. Second, the behavior of exten- 
sively and systematically trained Ss was 
as predicted by an error-trial-only CI 
model; S retains his hypothesis after 
positive outcomes and returns it to a pool 
of hypotheses for further sampling after 
negative outcomes. However, the question 
of whether the pretraining and instruction 
induced Ss to develop and test hypotheses 
and thereby behave as they did in the BT 
problems has not been answered. It was, 
therefore, the purpose of the experiments 
reported herein to explore the effects of 
varying amounts of systematic training on 
performance in BT problems. Would, for 
example, highly trained Ss provide more 
response sequences during BTs that are 
ipn with hypotheses than untrained 
Ss 


EXPERIMENT I 
Method 


Subjects and design. A total of 80 Ss were ran- 
domly assigned, except for equating the number of 
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males and females, to one of í »erimental 
conditions, The five conditions lefined in 
terms of the number of pretraining ns which 
preceded the administration of four »roblems, 
The Ss received either 0, 1, 2, 3 etraining 
problems. 

Stimuli, The stimuli consisted « letters X 
and T, which were paired on eac! lus card. 
The letters also varied i ze (smal ge), tex- 
ture (clear or crosshatched), and ; (left or 
right side of card). The stimuli w« vared as 


8 X 11 in. dittos. 
A randomized blocking procedur: 
arranging the pretraining stimuli. 


ed for 
con- 


sisted of the random arrangement of muli of 
the population. Each dimension w: int to 
problem solution equally often. A : 1 order 


of the problems was selected for each 
Four of the aforementioned stimuli v 
for use during the BT problems, The 
conducted such that certain configur 
sponses during the BTs could be ind 
sistent with one of eight unidimensiona 
The remaining four stimuli were used 


OTs. The OT stimuli were randomly 
each problem with the restriction that : ppear 
equally often. 

Procedure. The Ss were run individua! ^y one 
of three Es. After being seated across a 1 from 


E, S was given routine instructions rega: og the 


pretraining problems. Each S was shown stim- 
ulus population, instructed as to the m« ig of 
feedback signals, and told that either tex posi- 
tion, size, or form of figure would defi blem 
solution, 

For the various pretraining conditio! 1 at- 
tribute for the single problem condition, or a com- 
bination of attributes for a series of problems, was 
randomly sampled. No repetitions of relevant at- 


tributes was permitted. The criterion of problem 
solution for these problems was set at 10 consecutive 
correct responses. The S was given a total of 48 
trials to solve each problem, the problem being 
terminated by E showing S the solution. A trial 
consisted of showing .$ the stimulus card for ap- 
proximately 5 sec., a pointing response by S, indi- 
cating which of the two figures contained the rele- 
vant attribute, and a verbal "right" or "wrong" 
by E. A period of approximately 5 sec. transpired 
between trials. The Ss were asked to state the 
solution upon meeting criterion, 

Following successful completion of pretraining, 
all Ss were instructed as to the procedure involving 
BT s. They were told that feedback would not be 
given consistently as it had during pretraining, and 
were asked to continue responding as they would if 
feedback were given on each trial. Following these 
instructions, a series of four 15-trial problems was 
administered. For each of these problems, Trial 1 
was designated as a positive OT. Regardless of S’s 
response, it was called correct. Trials 6 and 11 were 
also OTs. Feedback for the latter OTs was ar- 
ranged such that one trial was designated positive 
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and the oth ative. Since there are 16 possible 
combinatior feedback for Trials 6 and 11 for 
four proble ach S was randomly assigned to 
one of the s th the restriction that a single S 
be assigned t h order. 


Results 


2 ) Levine’s (1966) hypothesis 
theory, a la percentage of the response 
sequences di: ing BTs should be consistent 
with a single :ypothesis. For his problems, 
approximate! 91% of the sequences were 
found to be consistent with single hypothe- 


ccording 


ses, In the; resent experiment, the propor- 
tion of unive iate hypotheses was computed 
for each .S > means for the 0-, 1-, 2-, 3-, 
and 4-prob' -a conditions (Conditions 0-4) 
being 6 , .86, .86, and .89, respec- 
tively. inalvsis of variance applied to 
these d -vealed significant differences 
among proportions, F (4, 75) — 3.86, 
p< .01. 

One ma issumption made by certain 
hypothesis-‘esting theories of CI (eg. 
Rest! 1962) is that S retains the hypothe- 
sis whicl ted his previous response if 
that res > is called correct. Indeed, 
Levine reported that hypothesis 
retention alter correct responses occurred 
in 92%, of the cases studied. Comparable 
propor from the present experiment 


for zero through four problems, respec- 
tively, were .66, .72, .76, .81, and .75. The 
differences among these proportions were 
not reliable, F (4, 75) = 1.24, p > .05. 

A second assumption made by the afore- 
mentioned hypothesis-testing theory, and 
by Levine (1970), is that S returns his 
hypotiesis or set of hypotheses to a pool 
for resampling if the response dictated by 
the set is called wrong. Therefore, the 
probability of resampling that hypothesis 
should be low if S resamples from the 
available pool of hypotheses randomly, and 
gly low if S remembers hypotheses 
that have been infirmed. Levine (1966) 
reported that on 98% of the occasions after 
S's response had been called wrong a hy- 
pothesis different from the one manifested 
prior to the OT was observed. Again, for 
the present experiment, comparable values 
for Conditions 0 through 4 were .73, .91, 
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.88, .86, and .94, respectively. Analysis of 
variance showed the differences among 
these proportions to be significant, F (4, 75) 
= 2.54, p < .05. Inspection of the pro- 
portions shows the difference to lie between 
Condition 0 and the other four conditions. 

Since the frequency of hypothesis changes 
is a function of the number of pretraining 
problems administered, it might also be 
expected that similar observations could be 
made over the BT problems themselves. 
For that reason, the proportion of hypothe- 
sis changes after correct responses was 
computed for each BT problem, the ob- 
tained values for one through four BT 
problems, respectively, being .29, .36, .21, 
and .26. The obtained proportion of hy- 
pothesis changes after errors for one through 
four BT problems, respectively, were .85, 
.81,.91, and .90. Inspection of these da a 
indicate small differences and the lack of a 
discernible trend. Apparently, three OTs 
is an insufficient inducement for strategic 
change. 

A natural extension of the assumptions 
listed above is that the size of S's hypothe- 
sis pool remains constant regardless of the 
point at which the pool is measured. For 
example, if the pool is determined to con- 
tain, say, four hypotheses after the first 
negative OT, it ought to contain a like num- 
ber of hypotheses after, say, the third con- 
secutive negative outcome. To assess the 
size of the hypothesis pool, Levine (1966) 
computed the obtained proportion of suc- 
cesses for a group of Ss. Reasoning that 
the obtained proportion of correct hypothe- 
ses would ordinarily be computed by divid- 
ing the number of logically correct hypothe- 
ses, as determined by feedback, by the 
number of hypotheses in the pool, Levine 
was able to obtain an index of the size of 
the pool by dividing the obtained propor- 
tion of hypotheses into the number of 
logically correct hypotheses. Using an 
identical procedure, the size of the hy- 
pothesis pool after each OT was estimated 
for each experimental condition and is 
shown in Table 1. Several trends are 
apparent from these data: (a) the size of 
the hypothesis pool is reduced over OTs; 
(b) S's efficiency in handling information 
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TABLE 1 


ESTIMATED SIZE OF THE HYPOTHESIS POOL AFTER EACH OUTCOME 
TRIAL IN EXPERIMENT I 


Number of pretraining problems 
Trial sequence - — - —— 
0 a 3 4 
ils 7.31 4.57 5.56 1.65 
TEES (5.41-7.69) | (4.30-5.13) | (4.88-6.78) 30-5.41) 
+ Ps 4.93 2.37 2.75 2 52 
DUE (3.45-7.69) | (2.15-2.94) | (2.35-3.57) 1-3.28) 
EE —, BT: 6.40 3.37 3.65 3,88 
RER 3j (41.08-10.53) | (2.70-4.76) | (2.86-5.26) 04,44) 
+, BT: , BTs, —, BTs 4.00 2.91 2.20 20 
NES Ey (2.38-7.69) | (2.00-4.76) | (1.64-3.45) 3.45) 
+, BTs, —, BTs, +, BTs 3.10 1.60 1.55 00 
S Eu (2.00-5.56) | (1.70-2.22) | (1.27-2.13) 2.94) 
Vole. Ninety-five percent confidence intervals appear in parentheses, Abbreviations! BTs = blank trials, 4 »utcome 
trial, — — negative outcome trial. 
improves as a function of amount of train- some of the BT problems with ative 
ing; and (c) the information from positive OTs, while all of the BT pro'ems in 
OTs is used more efficiently in reducing Experiment I were begun with a positive 
potentially valid hypotheses than that OT. 
from negative OTs. Experiment II was designed to ss the 
reliability of the results of Exper nent I 
EXPERIMENT II and to determine what, if any, eff« M 
7 zarie ^ 's ol 
Experiment I showed reasonably strong produced by the posi ee eS ae 
training effects. In general, as S received Experiment I and Levine's (1960 Me 
s rests DS AS nt. > ^e T Exper nt 
more pretraining, he was more likely to ment. The procedures of Exp 


alter his hypothesis after having received 
a negative OT and to maintain his hypothe- 
sis after positive OTs. Furthermore, S 
used both outcomes more efficiently with 
more pretraining in the sense that the size 
of the pool of hypotheses being entertained 
was smaller after, say, a single positive 
outcome for Ss who had received three 
pretraining problems than for those who 
had received none. However, two aspects 
of the data were not entirely consistent 
with these aforementioned general trends 
and Levine's (1966) data. With four pre- 
training problems, Ss were more likely to 
alter their hypotheses after positive OTs 
than Ss with three pretraining problems, 
With respect to Levine's data, the proba- 
bility of hypothesis change after positive 
OTs was considerably higher than for 
Experiment I. One viable explanation for 
the discrepancy is that different procedures 
were used. Levine, for example, began 


were, therefore, as nearly identical to 
Levine's as was possible. In addition, a 
hypothesis arising out of Experiment I was 
tested by the manipulation of instructions. 
One interpretation of the high rate of hy- 
pothesis alteration after positive OTs is 
that 5 entertains several hypotheses simul- 
taneously, and any one of these active hy- 
potheses may be used as a basis for respond- 
ing after a positive OT. To assess the 
accuracy of this interpretation, special in- 
structions were included for some of the 
groups of Experiment II, asking them to 
consider a single hypothesis while attempt- 
ing to solve the problem. 


Method 


Subjects and design. The Ss were 128 male and 
female undergraduate students at Miami University 
who were assigned randomly, but in equal numbers 
with respect to sex, to one of eight experimental 
conditions. The conditions were defined by a fac- 
torial arrangement of two variables: number of 
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pretraining ems (0, 2, 4, or 6) and type of 
instruction utine vs. restricting). 
Material ! procedure. The materials and pro- 


cedures wer rly identical to those of Experiment 
I save for : cial instructions given to restrict 
S's hypot! 00] and the designation of the 
various OT sitive or negative. 


The Ss i special instruction condition were 
told to “co only one characteristic as possibly 
relevant one time," and to ‘never, for 
example, « er the possibility that X and/or 


levant c a cteristics.” 
os made relevant to solution were 
equency of occurrence and order to 
it this was possible. For the two- 
tion, the 12 permutations of the four 
ions were assigned to the first 12 Ss 
t condition. The four dimensions 
ed equally often in the problems 
other 4 Ss in the two-problem 
the four-problem condition, four 


large are th« 
The dim 
balanced fc 
the extent 
problem co: 
stimulus di 
assigned t 


4X4 La uares were selected to determine 
order of m :nment. The Latin squares 
selected four-problem condition were also 
used in ix-problem condition. One of the 
permuted s or one of the remaining pairs of the 
two-prob condition was added to each row of 
each squar problem condition. 

For th ms, eight sequences of positive 
(+) and negative (—) feedback were possible (e.g., 
EF F. f+ etc Four 4 X 4 Latin squares 
were coml determine the feedback sequences 
for each ol problems for the 16 Ss in each 
condition t OT of each BT called 
positive that S would not "leave" the 
univariat« pothe realm, 

Results 
The mean proportion of hypothesis 


changes after negative OTs for each ex- 
perimental condition are shown in Table 2. 
Analysis of variance applied to these data 
showed only a significant effect due to 
number of pretraining problems, F (1, 120) 
= 6.40, p < 01. The proportion of hy- 
pothesis changes was an increasing function 
of the number of pretraining problems. 

_ Table 2 also displays the mean propor- 
tion of hypothesis changes after positive 
OTs. Analysis of variance showed the 
Type of Instructions X Number of Pre- 
training Problems interaction to be the only 
significant source of variation, F (3, 120) 
73.20, p < .05. Hypothesis changes were 
especially prevalent where S had not re- 
ceived instructions to limit his hypothesis 
pool to univariate hypotheses, especially in 
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TABLE 2 


MEAN PROPORTION OF HYPOTHESIS CHANGES AFTER 
NEGATIVE AND POSITIVE OUTCOME TRIALS IN 
EXPERIMENT II 


Number of pretraining problems 
Instruction 


0 2 4 6 


Negative outcome trials 


Routine -15 .95 .94 .96 
Restricted .84 .90 .96 96 


Positive outcome trials 


Routine 36 48 .25 21 
Restricted 31 2 19 4 


the case where two or fewer pretraining 
problems had been administered. 

The mean proportions of multivariate 
hypotheses for each experimental condi- 
tion are shown in Table 3. Any response 
sequence that could not be identified as 
consistent with a known univariate re- 
sponse sequence was assumed to be the 
product of a multivariate hypothesis. Anal- 
vsis of variance applied to these data re- 
vealed significant effects due to type of 
instructions, F (1, 120) = 10.84, p < .01, 
with the groups being instructed to restrict 
their hypothesis pool to univariate hy- 
potheses showing a smaller proportion of 
multivariate hypotheses than the nonin- 
structed group. Additionally, moderately 
reliable effects were observed in the In- 
struction X Pretraining Problem interac- 
tion, F (3, 120) = 2.58, p< .10. As the 
number of pretraining problems increased, 
the proportion of multivariate hypotheses 
was reduced for the routine instruction 
condition. The proportion of multivariate 


TABLE 3 


MEAN Proportion OF MULTIVARIATE HYPOTHESES 
FOR EACH EXPERIMENTAL CONDITION 


Number of pretraining problems 


Instruction 

0 2 4 6 
Routine .25 15 2 Br 
Restricted .08 .07 .09 A3 
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TABLE 4 


ESTIMATED SIZE OF THE HYPOTHESIS POOL AFTER 
NEGATIVE OUTCOMES ON TRIALS 1, 2, AND 3 IN 
EXPERIMENT II 


Outcome trial 
Number of 
problems 
1 2 3 
0 6.90 6.25 4.55 
(5.80-8.70) | (4.55-9.09) | (3.57-5.26) 
2 6.35 4.55 4.35 
(5.48-7.84) | (3.64-6.06) | (2.94-6.67) 
4 6.06 351 3.03 
(5.26-7.40) | (2.99-4.55) | (2.22-4.35) 
6 5.80 3.45 2.44 
(5.06-7.14) | (2.90-4.35) | (1.89-3.33) 


Note. Ninety-five percent confidence intervals appear in 
parentheses. 


hypotheses was nearly constant for the 
special instruction condition. 

As in Experiment I, no systematic effects 
were observed as a function of BT problem. 

An estimate of the size of the hypothesis 
pool after negative outcomes on OT}, OT», 
and OT; as a function of the number of 
pretraining problems is shown in Table 4 
for the routine and restricted instructional 
conditions, respectively. The estimates 
were arrived at by computing the propor- 
tion of correct hypotheses after each nega- 
tive OT for each condition and dividing 
the appropriate proportion into the number 
of logically correct hypotheses (i.e., four 
after OT;, two after OT», and one after 
OT;) for the trial involved. In general, 
these data show the size of the hypothesis 
pool to be a decreasing function of the 


TABLE 5 


EsriMATED Size or Hyrornksis Poor as A Func- 
TION OF NUMBER OF PREVIOUS PosrrivE Out- 
COME TRIALS (OTs) AFTER NEGATIVE OT; 


Number of preceding positive OTs 


Number of 
problems 
0 1 2 
0-2 6.25 4.00 250 
(3:13-12.50)| (2.63-6.67) | (1.92-5.00) 
4-6 2.63 3.57 2.13 
(1.82-4.35) | (2.50-5.56) | (1.59-3.23) 


Note, Ninet: 


y-five percent confidence interval: F 
parentheses. s appear in 
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trial involved, that the size o pool is 
reduced more substantially trial to 
trial as S has more experie: ith the 


type of problem at hand, and : he size 
of the pool is smaller in the here Ss 
were specially instructed to ct their 
hypotheses to a univariate rea! 

An estimate of the size of th 'othesis 
pool as a function of the nun Í pre- 
ceding positive OTs is shown ible 5. 
The estimates were obtained b; puting 
the proportion of correct hypo: » after 
a negative outcome on OT, a: iding 
the obtained proportion into t! imber 
of logically correct hypotheses r OT; 
(i.e, one). Unfortunately, the: re no 
discernible trends where the < were 
analyzed separately for each tion. 
Quite likely, too few data poin: ain 
most cases, as few as eight) wer ;lved 
in the estimates. To improve tb relia- 
bility of the estimates, the data ^ the 
one- and two-pretraining-probler: condi- 
tions (collapsed over instruction: were 
combined and compared with the co7:bined 
results of the four- and six-probler < ondi- 
tions. In general, Table 5 shows sys- 
tematic reduction in the size of > hy- 
pothesis pool as a function of the number 


of preceding positive OTs for the ze: o-two 
problem combination, but no reduction in 
size of the pool for the four-six problem 
combination. J 


DISCUSSION 


Hypothesis theory. Levine (1966, 1970) re- 
ported that Ss attempting to solve simultane- 
ous discrimination problems for which solution 
is defined by a single cue employ univariate 
hypotheses, usually alter their hypotheses after 
negative OTs, and rarely alter their hypotheses 
after positive OTs. An additional assumption 
is that 5 samples a subset of the total pool of 
potentially correct hypotheses. Both experi- 
ments appear to justify the first three assump- 
tions, but only under certain conditions. The 
Ss who receive relatively extensive pretraining 
on a problem identical to the test problem 
nearly always alter their hypotheses after 
negative outcomes, rarely alter them after 
positive outcomes, and restrict their hypothe- 
ses to the univariate realm. The S who re- 
celves very little pretraining either entertains 
multivariate hypotheses, or no hypotheses at 


Moo MA o —-—MA ee E 
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to alter his hypothesis after 
nd positive outcomes, 
mpling assumption appears to 
w of the present data. The 
$ would alter his hypothesis 


all, and is 
both negat 
The subse 
be correct ! 
probability 


after a pos OT was much higher when 
instructions iot require him to restrict the 
size of hi king hypothesis set to one. 
Moreover, flect of such an instructional 
set is unifo ver varying numbers of pre- 
training pro 18. 

Finally, Li :ne (1966) reported that Ss used 


the informa 
their hypoth 


ı after positive OTs to adjust 
pools more than after negative 


OTs, but tl s regularly reduced the work- 
ing set of h heses after all outcomes. The 
results of t esent set of experiments gen- 
erally confi evine's observations, with one 
potentially rtant exception. In Experi- 
ment II, it be worth noting that the size 
of the wo pothesis set is unrelated to 
the numi ceding positive OTs where 
Ss had re r or six pretraining prob- 
lems. | . the reliability of this 
finding is on, since the results of 
Experin licate that Ss do reduce the 
size of thei s set over posi- 
tive OTs 1 5 whether 0, 1, 2, 3, or 4 
pretraini; is are given. Perhaps six 
or more pretraining problems would produce 
this effec eater reliability. On an 


intuitive |< eems plausible to assume 
that S wor eventually adopt a strategy 
which prod: single hypothesis to be 
tested and retained, if positive OTs ensue, 


Since S is not required to hold in memory a 
fairly large set of hypotheses, some of which 
have been confirmed and others that have not 
been tested. Tr „it is not clear at this 
point whether S ad strategy which places 
less emphasis on memory as he solves more 
probl of a particular class. 

, Multi’: rocess theory. The main assump- 
D of the ‘process point of view is that 
ene ral, perhaps many, ap- 
EEA a problem, Furthermore, it Is 
ER at, among other factors, theamount 
d deo S has had with the problem at 
E ermines the approach he will use in 

a ving the problem. The results of both 
experiments appear to be entirely consistent 
ui these general assumptions. In addition, 
ce expressed by White (1972a, 1972b) 
x nexperienced problem solvers would adopt 
in associative learning strategy, while more 
Sophisticated Ss would adopt a more efficient 
and elegant strategy of testing hypotheses, 
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appears to have been confirmed by the present 
data. 

More specific with respect to the associative- 
to-hypothesis learning progression are the data 
of untrained Ss who show a high rate ot 
unidentifiable BT sequences, a potential indi- 
cator of nonhypothesis behavior on their part. 
However, with training the rate of such BT 
sequences was reduced, but still remained 
above 10%, indicating that Ss were not simply 
dropping multidimensional hypotheses from 
their working pool. 

A corollary of the notion that S will adopt 
a hypothesis-testing strategy with training is 
that rarely will S alter his hypothesis after a 
positive OT. In fact, the rate of hypothesis 
change was reduced over training problems; 
in addition, the size of the hypothesis set re- 
mained relatively constant over a series of 
positive OTs for extensively trained Ss, a 
result which is consistent with the one- 
hypothesis-at-a-time model of Restle (1962). 

A final major hypothesis predicted that the 
rate of hypothesis alteration after negative 
OTs would be an increasing function of number 
of pretraining problems, and that the proba- 
bility of hypothesis change after positive and 
negative OTs would be equal. The first half 
of the hypothesis was confirmed by the data; 
hypothesis change occurred on 96% of the 
trials for the six-problem condition of Experi- 
ment II and was somewhat lower for all of 
the remaining conditions of both experiments, 
However, the second half of the hypothesis 
was not supported by the data in any condi- 
tion. One interpretation of the fact that the 
rate of hypothesis change was considerably 
higher after negative OTs than positive OTs 
has been suggested by White (1972a). In the 
latter paper, it was hypothesized that within 
a sample of adult human problem solvers a 
variance of strategies would obtain. For ex- 
ample, from instruction and previous problem- 
solving experience, some Ss would decide that 
their task was to memorize the response that 
was to be given in the presence of each stimu- 
lus; no consideration would be given to the 
possibility that the various stimuli are related. 
Other Ss might conceptualize the task as re- 
quiring an active reorganization of the prob- 
lem, and the formulation and testing of 
hypotheses. A proper mix of such problem 
solvers could conceivably have produced the 
data obtained in both experiments. If the 
data of a sizable number of Ss who are testing 
hypotheses, and who are altering their hy- 
potheses after errors and maintaining con- 
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firmed hypotheses are combined with a group 
of Ss who are adopting an associative strategy, 
and who are altering "hypotheses" after both 
positive and negative OTs, it would be ex- 
pected that the rate of hypothesis alteration 
would not be equal after positive and negative 
OTs, and that some moderate rate of hypothe- 
sis change would occur after positive and nega- 
tive OTs. While the notion of variant strate- 
gies among Ss has not been directly determined 
by the present experiments or White's (1972a, 
1972b) earlier experiments, the hypothesis ap- 
pears more crediblein light of these experiments. 

Theoretical sketch. The data from the pres- 
ent and earlier experiments suggest that S has 
available several approaches to a CI problem. 
The most rudimentary approach simply in- 
volves the representation of stimuli in their 
entirety with a proper response. Given the 
known difficulty Ss have with negative in- 
stances of concepts, it seems plausible to as- 
sume that S organizes the instances in such a 
way as to differentiate the negative and posi- 
tive instances; however, the negative instances 
are ignored so far as solution to the problem is 
concerned. If S finds that simple representa- 
tion of the stimuli and responses is overloading 
memory, it is likely that he will adopt a new 
Strategy. If such an interpretation is correct, 
it would be expected, for example, that training 
effects would be far less substantial if the 
stimulus population was delimited to a few 
examples. Moreover, a nonassociative strat- 
egy would be more likely to occur if S knew 
the stimulus population to be large (e.g., 
selection paradigm). 

, “Movement to the nonassociative or hypothe- 
Sis-testing realm, then, depends on the knowl- 
edge S has of the problem at hand and the 
development of the hypothesis-testing skill 
itself. The data that are available seem to 
indicate that Levine's (1966) description of the 
hypothesis-testing S is correct at least in those 
cases where 5 knows it is clearly beneficial to 
him to consider several hypotheses simultane- 
ously and to retain in memory those that have 
been infirmed, It could also be assumed that 
the hypotheses generated early in a series of 
problems are restricted to those suggested by 
positive examples. Later in training, negative 
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examples may serve as a sou * hypothesis 
generation and hypothesi Finally, $ 
may adopt the single-hypoth: -ction mode 
if circumstances are appropri r example, 
in the present experiments, rns that 16 
trials are given for each p: with four 
outcomes per problem. Sinc ency is not 
being reinforced, movement : a less tax- 


ing strategy would be expect: 
So far as training effects, 


changes, 
and the development of the : 


ty to test 


hypotheses are concerned, it the most 

powerful test of the overall | sis would 

be with young children (up : vr.), who 
are known not to possess the to test 

hypotheses (e.g. Douglass & | , 1971). 

Training effects would not ix ected for 

these Ss. 
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The present research is an extension of 
encoding variability theory to the semantic 
encoding of «:liguous words. According 
to Martin (1903) and others, the subject 
makes a perce: :al-encoding response to a 
presented s ius, Although this encoded, 
functional stiinulus may not be identical to 
the nominal stimulus, it constitutes the 
memorial representation of the nominal 
stimulus. The stimulus-as-coded (Bower, 
1972), not the stimulus-as-presented, is 
stored in memory. Encoding variability 
theory further assumes that a given S may 
make different encodings of the same 
hominal stimulus on different occasions. 
Since Martin’s research has been concerned 
with paired-associate learning and transfer 
situations in which trigrams were used as 
stimuli, he has y 'd encoding as stimulus 
selection. The principles of encoding vari- 
ability theory are, however, applicable to a 
wide variety of experimental contexts. 

The expe:i:ctal context with which we 
are concerned is recognition memory for 
familiar English words, and the type of 
encoding with which we are concerned is 
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codings for the balanced homographs. 
oding was proposed to account for the better recognition of balanced 
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«periment I compared recognition memory for two types of homographs. 
balanced homographs, there are two relatively equiprobable semantic 
odings; for polarized homographs, there is one dominant encoding. It was 
edicted, on the basis of encoding variability theory, that recognition would 
e superior for polarized homographs but the opposite outcome was obtained 
ıt two retention intervals and replicated. 
\ssociation tests were administered at two intertest intervals. The semantically 
sponses supported the assumption of greater variability of semantic 


In Experiment II, successive word 


A retrieval strategy of semantic 


the memorial representation of the semantic 
features of a stimulus. One basis for the 
presumed encoding variability of familiar 
words is their semantic ambiguity. In a 
large number of instances, the same se- 
quence of letters and phonemes may have 
two or more quite different meanings. 
Examples of such polysemous words, or 
homographs, are BANK, which may refer 
to a financial institution as well as to land 
adjoining a body of water, and CLUB, which 
may refer to an organization of people 
united by a common interest and to a 
wooden stick used as a weapon. The dis- 
tinct meanings of a homograph may be 
considered different semantic senses of the 
words that allow for different semantic 
encodings of the same nominal stimulus 
across Ss in a psychological experiment, or 
more importantly here, by the same Ss on 
different occasions. 

We believe that encoding variability 
theory has clear implications for recogni- 
tion memory research. To recognize an 
old item correctly, it is assumed that S 
must make the same encoding of the item 
at the time of study and at the time of test. 
If the test encoding of an item differs from 
the study encoding of the item, S will not 
recognize the item, since he is accessing an 
encoding that does not have appropriate 
occurrence information, whatever that may 
be, associated with it. From this perspec- 
tive, forgetting may be explained as failure 
of trace contact due to differences between 
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the stored encoding and the test encoding, 
rather than as loss of stored information. 

Another aspect of the present argument 
which needs to be made explicit is an 
assumption of encoding specificity. Ac- 
cording to this assumption, only a single 
functional encoding is made by the subject 
on à given occasion, with an occasion de- 
fined as the presentation of a verbal stimu- 
lus on either a study or a test trial. Light 
and Carter-Sobell (1970) and Winograd and 
Conn (1971) have presented evidence con- 
sistent with the specificity assumption as 
it applies to the semantic encoding of 
homographs. 

The first experiment and its replication 
are concerned with recognition memory for 
two types of homographs. A balanced 
homograph is defined here as having two 
distinct semantic senses that are relatively 
equiprobable encodings. A polarized homo- 
graph is defined as one with two distinct 
semantic senses that are markedly different 
in their semantic encoding probabilities, 
with one sense being highly likely. In our 
opinon, encoding variability theory leads to 
the prediction that recognition memory for 
polarized homographs should be superior 
to recognition memory for balanced homo- 
graphs. With high likelihood, S should 
make the same semantic encoding of a 
polarized homograph at study and at test, 
since its one dominant sense is a highly 
probable encoding. As argued above, trace 
contact and successful recognition depend 
upon such identity of study and test en- 
codings. On the other hand, since two 
senses of a balanced homograph are equally 
probable semantic encodings, S’s encoding 
of a balanced homograph may vary from 
one occasion to the next. When different 
encodings are made at study and at test, 
the consequence is failure to recognize the 
item as old. Thus, successful recognition 
of old items should be more likely for 
polarized than for balanced homographs. 

The longer the interval between study 
and test, i.e, the longer the interval be- 
tween successive encodings of an item, the 
greater should be the likelihood that the en- 
codings are independent. If this hypoth- 
esis of increased encoding variability over 
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time is correct, the predic 
recognition memory for 

graphs should be most c 
retention interval. 


EXPERIMENT 


Method 
Design. The design of Exper la and Ib. 
was a 2 X 2 factorial in which t» :omograph, 


jects vari- 
n-subjects 3 
tervals is i 
periments, 
ls were 3 
retention. 


balanced or polarized, was a witi 
able and retention interval was à 
variable. The length of the ret: 
the only difference between the 
In Experiment la, the retention 
min. and 25 min,; in Experiment | 
intervals were 3 min. and 48 hr. 

Materials. The homographs and 
encoding probabilities were taken í 
of Cramer (1970), Kausler and Kolla 
Perfetti, Lindsey, and Garson (1971 
of the homographs were selected fi 
named source. These norms had bec 
requiring Ss to give a single word ass 
series of homographs. Judges cl: 
tions according to the semantic 
represented. For example, ass 
"money," "vault," and “check” to the mograph 
BANK were classified as representing the encoding 
of bank as a financial institution; associ. tions such 
as "river," “hill,” and "slope" were classified as 
representing the encoding of “land adjoining a body 
of water.” 

Fifty-six balanced and 56 polarized hoi vographst 
were used in Experiment I. The selection ci 
for a balanced homograph was 
of the associations were classified by the judges 
under the primary semantic sense (S;). In other” 
words, the normative or cultural encoding prol 
abilities of S, for the balanced homographs were 
less than .75. For the 56 balanced homographs, 
the mean probabilities of Sy and Ss, the secondary 
semantic sense, were .54 and .31. The selection” 
criterion for a polariz homograph was that at 
least 75% of the associations were clas 
judges under the primary semantic sense. 
probabilities of Sı and Ss for the polarized homo: 
graphs were .83 and .11. The median word fre- 
quencies (Kučera & Francis, 1967) were 25 per 
million for the balanced homographs and 36 per 
million for the polarized homographs. The cor- 
responding means were 99 and 107 per million. L 

Two lists, A and B, each containing 28 balanced 
and 28 polarized homographs, were constructed. 
One random order of each list was used for study. 
The list that was not presented for study supplied 
the distractor items on the two-alternative, forced- 
choice recognition test. Three different test forms - 
were constructed by randomly pairing List A and 
List B homographs that were of the same type, € 
TTE Tq pO m B 

*A list of the words used in Experiment I is 
available from the first author. 


970), and 
majority 
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on the ' > choice was always between two 
balanced raphs or two polarized homographs. 
Since li were not significant, this variable 
is not ment 1 again. 

Proced» bjects were shown either the List 
A or Lis is at a 3-sec. rate. In both experi- 


tely half of the Ss who were tested 
| interval studied List A. Following 
h st word, Ss worked for 3 min. 
on numb ion problems. In Experiment 
Ia, half of were administered the recognition 
test immed after the filler task. Before the 
other Ss wer «ministered the recognition test, they 
received a 2. min. break during which they were 
asked to leav: room and not to discuss the experi- 
ment, In E> eriment Ib, the filler task was fol- 
lowed by ei mmediate testing, as in Experiment 
la, or test 48 hr. The experiments were con- 
ducted in ; with small groups of Ss. Experi- 


presentat l€ 


ment la ipleted before Experiment Ib was 
begun 

Subjec rticipated in the first experi- 
ment a pated in the second experi- 
ment. .ssigned randomly to lists, test 
forms, ervals, until there were 25 
Ss at ( <periment Ia and 24 Ss at 
each i: iment Ib. Subjects in Experi- 
ment | ner session students at Emory 
Unive: »m 41 were paid for participation 
and 11 lling an introductory psychology 
cours Subjects in Experiment Ib 
served d regular school year to meet an 


introductory »sychology course requirement. 


Results and Discussion 


, lie results of the experiments are shown 
in Table 1. In Experiment Ia, the main 
effect of type of homograph was the only 
signifi utcome, F (1, 48) = 21.60, 
p < 001. The interesting aspect of the 
results is that recognition memory for 
balanced homographs was superior to 
recognition memory for polarized homo- 
grapi Since this result was contrary to 
predict: nd since no effect of retention 
interval w : lent, the experiment was 
pi wii! tention interval that was 
sufficiently long to allow for a performance 
decrement, 

in Expe ‘iment Ib, recognition memory 
lor balanced homographs was again superior 
to that for polarized homographs, (1, 46) 
=20.41, p < .001. There was a significant 
decline in performance at the 48-hr. reten- 


EA reported in this paper treat words 

don $ 3 ect. If words are treated asa random 

(1973 E min F' is computed, following Clark 
3), all of the significant outcomes remain so. 
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TABLE 1 


CORRECT RECOGNITION 
PROPORTIONS 


Type of homograph 


Experiment 


Polarized Balanced 
la 
3 min. ,80 .86 
25 min. 83 87 
Ib 
3 min. .81 .86 


48 hr. i65 61 eM 7a AO 


tion interval, F (1, 46) = 18.55, p < 001, 
but there was no statistical support for an 
interaction between type of homograph and 
retention interval. 

Imagery ratings of the homographs were 
collected to determine whether an uncon- 
trolled difference in rated imagery might 
account for the difference in recognition 
memory between the two types of homo- 
graphs. Thirty-seven undergraduates from 
the Emory University subject pool rated 
each homograph on a 7-point imagery 
scale, according to the procedure described 
by Paivio, Yuille, and Madigan (1968). 
The difference between the mean rating of 
the balanced homographs (4.94) and of the 
polarized homographs (4.00) was not 
significant. 

The data of Experiments la and Ib evi- 
dence a relationship between semantic en- 
coding probabilities and performance in a 
recognition memory task. Since the direc- 
tion of the relationship was contrary to 
predictions, it is necessary to reconsider our 
argument. Although this reconsideration 
will be undertaken more fully after presen- 
tation of Experiment II, one hypothesis 
that led to the second experiment is men- 
tioned here. It is conceivable that Ss are 
more likely to encode both semantic senses 
when there are two encodings of similar 
strength, as is the case with balanced homo- 
graphs, than when there is one culturally 
dominant encoding, as with polarized 
homographs. If two semantic representa- 
tions are encoded, then either encoding at 
the time of test can result in trace contact 
and successful recognition. While such 
double encoding violates the strong form of 
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the encoding specificity assumption out- 
lined earlier, it is consistent with a weak 
form of encoding specificity, namely, that 
one or more semantic representations may 
be tagged on a given occasion. As Light 
and Carter-Sobell (1970) point out, this 
assumption is still distinguishable from an 
assumption of generic encoding, since it is 
not assumed that all senses sharing the 
same lexical form are encoded whenever 
that form is studied. 

Mnemonic performance in Experiment I 
was predicted from encoding data obtained 
by other investigators. These encoding 
data were taken from word associations 
given on a single occasion, while our con- 
cern is with variability within Ss on two 
occasions. Therefore, Experiment II was 
conducted to assess the stability of se- 
mantic encodings over the time periods 
used in the first experiment with the expec- 
tation that such data would illuminate the 


recognition memory performance of Experi- 
ment I. 


EXPERIMENT II 
Method 


Design. "Two coordinate procedures were em- 
ployed. In one procedure, discrete word associa- 
tions to the same homographs were obtained from 
the same Ss on two occasions Separated by either 
5 min. or48hr. In the other procedure, the method 
of continued association was used to obtain from 
‘Ss two different associations to the same homographs 
on one occasion. From the first procedure, data 
concerning the relative stability of semantic encod- 
ings over time were obtained. From the second 
procedure, the likelihood of S making two semanti- 
cally distinct encodings on the same occasion, when 
instructed to produce two different responses, was 
determined, 

Materials. More stringent selection criteria were 
used to choose 25 balanced and 25 polarized homo- 
graphs from the norms cited in Experiment I. A 
homograph was considered balanced if the norma- 
tive encoding probability of S; was less than .58 
and the normative encoding probability of S, was 
at least .40. The mean published values of S; and 
S: for the balanced homographs were .49 and .41 
A homograph was considered polarized if the norma- 
tive encoding probability of S; was at least .78 and 
if the normative encoding probability of Ss was at 
least .10. The mean published values of S: and S, 
for the polarized homographs were .81 and .14. 
The actual S; and Sy values for the Emory popula- 
tion, as distinguished from the published values, 
are reported in the results section. "Thirty non- 
homographs were also selected, half of which were 
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used in the booklets for the firs: nd the other 


half of which were used in t! lets for the 
second test at 5 min. or 48 hr. T homographs 
were chosen from words whose d y (Random 
House Dictionary of the Eng uage, 1968) 
meanings represented only one sense in the 
authors’ judgment. The frequencies 
(Kučera & Francis, 1967) were ilion for the 
balanced homographs, 30 per mi r the polar- 
ized homographs, and 29 per n for the 
nonhomographs. 
In the booklets for the dis 


lation pro- 


cedure, a single word appeared on e with a 


space for S's response next to t] In the 
booklets for the continued assoc procedure, 
the same word appeared twice o: se with a 
space for S’s response next to each « nce of the 
word. Each S received a 67-page b A space 
for S's name and instructions w« d on the 
first 2 pages. The remaining 65 asisted of 
the 25 balanced homographs, the ed homo- 
graphs, and 15 of the nonhomograp): rs. Four 
random orders of the words were used struction 


of the booklets 


Procedure. Small groups of Ss we: d. Sub 
jects in the discrete association proc were in- 
structed to write down the first word t came to 
mind upon seeing a stimulus word. A complet- 
ing the first association te > discrete 


number 
, the Ss 


association procedure recei 
progression task. In the 


min. conc 


then were readministered the associatio st, while 
in the 48-hr. condition, the Ss were di sed and 
retested 48 hr. later. For the second ociation 
test, Ss received a booklet consisting of 0 homo- 
graphs on which they had been teste :viously. 
and the 15 nonhomographs on which th. had not 
been tested previously. The words in ine second 
test booklet were in a different random order from 


those in the first test booklet. The order of the 
words and the filler words were changed to reduce 
S's tendency to respond exactly as before, i.e., to 
minimize possible intralist context effects. Sub- 
jects in the continued association procedure were 
dismissed upon completion of their one test booklet. 

Subjects. The Ss were 105 Emory undergraduate 
students who Participated to fulfill an introductory 
psychology course requirement. Groups were 
randomly assigned to conditions and booklets. In 
the discrete association procedure there were 38 Ss 
in the 5-min. condition and 30 Ss in the 48-hr. 
condition. In the continued association procedure, 
37 Ss were tested. 

Scoring. Two judges independently classified the 
associations according to the semantic senses that 
Were represented. The semantic senses that were 
used in scoring were similar to those that were re- 
ported in the norms cited in Experiment I. There 
Was 95% agreement between the two judges. A 
third judge resolved the scoring disagreements. 
The third judge and one of the first two judges were 
unaware of the purpose of the scoring and of the 
expected distributions of encoding probabilities. 

Six percent of the associations were unscorable 


in that 

reason: 

exclud: 

of the : 

sentative 

scorable 

S, or S 

tertiary - 

number « 

Results a 

Stability f group norms. 

of the obtai :ed group norms as well as their 

comparabi! y with the norms of other 

investigat s will be reported first. Based 

on the d from the first test for the 68 

Ss in t screte association procedure, 

the ob ! mean values of S; and S» for 

the 25 ized homographs were .87 and 
" 
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iges could not classify them under a 
antic sense. These associations were 
hereported analyses. Three percent 
g associations were scored as repre- 


hird semantic sense, i.e., 97% of the 
tions were classified under either 
ind conditional probabilities for the 
ire not reported because of the small 
itions that were classified under Ss. 


Mscussion 


The stability 


P12. thé 

mean 

at 48 

These 

the : 

obtain 

in E 

stabilit; 
For 1 of the 

but for ni Ji 


5-min. retest (n = 38), the 

[| S, and Ss were .88 and .12; 

30), they were .86 and .12. 

in good agreement with 

es of .8t and .14 that were 

à the published norms cited 

nt I and also demonstrate the 
he group norms. 

25 balanced homographs, 

e polarized homographs, 

there were discrepancies between the 

Em norms and the published norms 

as to which semantic sense was dominant, 

i.e., as to the definitions of Sı and Sx. This 

outcor 


o i surprising, since the criterion 
lor a balanced homograph was that its 
Semantic encodings were relatively equi- 
probable, When S, and S; are redefined 
according. to the obtained norms, the 
bala ii «d homographs became more polar- 
ized than when they are based on the pub- 
lished values At the first, 5-min., and 
48-hr. tests, the mean S, and S» values 
were .63 and .34, .62 and .34, and .62 and 
x spectively, In subsequent analyses, 
Sige ns of S, and S; and their en- 
coding probabilities are based on the 
Emory data. 

Although the balanced and_ polarized 
homographs are different with respect to 
the probabilities of their primary associates 
when the associates are defined semanti- 
cally, it should be noted that they are not 
different when the primary associate is 
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TABLE 2 
ENCODING PATTERNS FROM TEST TO RETEST 


Time Polarized Balanced 

P (same) 5 min. .88 (902) .19 (886) 
48 hr. .87 (710) T1 (699) 

P(S2|Si) 5 min. .06 (793) .15 (552) 
e 48 hr. .07 (616) 47 (448) 
P(S,|Se) 5min. 44 (106) .28 (310) 
48 hr. 48 ( 86) .28 (238) 


scored in terms of words. In word associa- 
tion norms, the primary associate usually 
is defined as the single word that the sub- 
jects gave most frequently. For the present 
materials, collapsed across procedures and 
tests, the mean probabilities of the primary 
word associates were .26 for the polarized 
homographs and .24 for the balanced 
homographs. 

Within-subject encoding stability. Data 
relating to the question of the stability of 
semantic encodings over time are presented 
in Table 2. The entries in Table 2 are for 
pooled observations over Ss and words. 
The number of observations in each cell is 
given in parentheses. In the upper portion 
of the table, the probabilities of making the 
same semantic encoding at test and retest 
are shown. At each interval, the prob- 
ability of the same semantic encoding is 
less for balanced homographs than for 
polarized homographs. The data were sub- 
jected to an analysis of variance by taking 
the proportions across words for each S. 
The effect of type of homograph was the 
only significant outcome, F (1,58) — 52.38, 
p «.001. The assumption of greater vari- 
ability of semantic encoding for balanced 
homographs, on which the first experiment 
was based, is supported. However, the 
assumption of increased variability over 
time is not supported. 

The likelihoods of switching from Si to 
S., and vice versa, are shown in the middle 
and lower portions of Table 2. In the 
middle portion of the table, P (S |S)) is the 
probability of a switch to the secondary 
encoding at retest, conditionalized on the 
occurrence of the primary semantic encod- 
ing at the first. test. Although the prob- 
abilities of switching from S; to S, are low, 
the greater likelihood of switching for the 
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balanced homographsis significant, F (1, 58) 
—44.61, p < .001. 

The bottom portion of Table 2 contains 
the probabilities of switching from S: to S, 
P(Si|S). These probabilities are sub- 
stantially higher than those of the preced- 
ing analysis and are an indication that sub- 
jects are less likely to repeat an encoding 
if it is the normative secondary than if it is 
the normative primary. It is evident that 
the probability of an encoding switch from 
the normatively weaker to the normatively 
stronger encoding is greater for polarized 
homographs, F (1, 52) — 10.39, p «.0t. 

Independence of encodings. The question 
of whether responding at the second test 
was independent of responding at the first 
test was examined next. The observed 
joint probability of particular test-retest 
encoding patterns was compared to a pre- 
dicted joint probability that was deter- 
mined from independent estimates of S 1 and 
S» encoding probabilities. Since all values 
lor 5 min. and 48 hr. were almost equal, 
these data were combined. For example, 
the observed joint probability of an S, 
encoding at both test and retest, P(S, and 
$3), was .52 for balanced homographs. The 
predicted P(S, and Si), if it is based on 
independent estimates of P(Si) on each 
occasion, is P(S,)P(S;), or (.63) (.62), or 
29 n every comparison, there was a 
lack of agreement between the predicted 
and observed values and the discrepancies 
were consistent. The observed probability 
of making the same encodings over time 
was greater than expected on the indepen- 
dence assumption, and the observed prob- 
ability of two different encodings over time 
was smaller than expected. The differ- 
ences are significant by chi-square, There 
ts no evidence that semantic encoding on 
the second test is independent of the first 
test. 

Scoring. by word. So far, the data have 
been considered largely in terms of semantic 
types or senses. The probability of obtain- 
ing the identical word as the response to 
the same homograph at both tests is also 
of interest. For the polarized homographs, 
the probabilities of the same word at both 
tests were .54 at both 5 min. and 48 hr. 
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For the balanced homo the cor- 
responding values were . .45. The 
effects of type of homo; ime, and 
their interaction were n ignificant. 
Semantic scoring appa: was not 
affected by differential b to repeat 
particular words for eith: inced or 
polarized homographs. Th come was 
expected, since the strengt! primary 
word associate, as reported was the 
same for the two types of h phs. 
The absence of a signifi ffect of 
test-retest interval on word tition is 
consistent with the outcom: mantic 
scoring. Our data are in cl. reement 
with those of Fox (1970) witb -spect to 
within-subject change. Fox nistered 
identical word association : to the 
same Ss with a test-retest i || of 59 
days and scored the respons exact 


words rather than by semantic s. He 
reported that the probability : within- 
subject word change was .52, l« ır data, 


the overall probability of a wor change 
was .48. Thus, across the two studies, 
the probability of S changing his a: ciative 
response does not increase from lin, to 
59 days. Both here and in Fo. - work, 
there is evidence for considerable norma- 
tive or within-group stability in « onjunc- 


tion with considerable within-subject in- 
stability. That is, while Ss change their re- 
sponses, the group norms remain constant. 

Continued association. In the continued 
association procedure, the subjects were 
required to produce two successive but 
different associations to each homograph. 
The probabilities that the two associations 
represented the same semantic sense were 
-65 for the polarized homographs and .52 
for the balanced homographs. The differ- 
ence was significant, F (1, 72) = 39.06, 
P < .001. When the probability of a 
switch to the secondary encoding is con- 
ditionalized on the occurrence of the pri- 
mary encoding as the first response, i.e, 
P(S:/S1), the resulting values are .26 for 
the polarized homographs and .36 for the 
balanced homographs, F (1, 72) = 19.27. 
P <.001. For the opposite pattern, ie 
P(S,|S), the conditional probabilities are 


-75 and .54 for the polarized and balanced 


j 
4 


‘ 
: 


SEMA 
hom« respectively, F (1, 70) — 
34.46, 001. The patterns are the 
same e obtained for discrete associa- 


ble 2), but the likelihood of 
ater in all cases for continued 
This difference may be due 


to the raint of not being allowed to 
repeat ime word in the continued 
associa! ocedure. 


ENERAL DISCUSSION 


Experime:; I was based on some assump- 


tions abor mantic encoding processes lead- 
ing to the liction that recognition memory 
for bala: omographs would be inferior to 
recogniti nory for polarized homographs. 
Howe: pposite state of affairs pre- 
vailed ition memory for balanced 
homo uperior to that for polarized 
homo finding was replicated, and 
the a is concerning the variability of 
semar "vv of the two types of homo- 
grap! d by administering a word 
associ on two occasions. The word 
associa lings were clear-cut: In agree- 
ment : issumptions underlying the 
recognit mory experiments, semantic en- 
coding w ore variable for balanced than for 
polarized | iphs. 

Since the results of Experiment II support 
the “ssumptions on which the experimental 
tests of encoding variability theory were based, 
we must reconsider what is happening in recog- 
nition Earlier, a model that in- 


d the assumption of multiple encod- 
is mentioned as a possible fit for the 
recognition memory data. In a simple form 
of such a model, it is assumed that 5 may 


achieve, on some occasions, more than a single 
Semantic encoding of a homograph. If 
multiple entations are encoded at study, 


recognition inemory should be improved, since 
trace contact can result if any one of the 
previously stored encodings is made at test. 
* encoding data from the continued associa- 
n ecure of Experiment II are evidence 
E . more than one encoding is at- 
empted, S is more likely to arrive at two 
semantically distinct encodings for balanced 
homographs than for polarized homographs. 
When Ss were required to provide two re- 
Sponses, the second response represented a 
different semantic sense significantly more 
often for the balanced homographs. Thus, the 
superior recognition of balanced homographs 
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is consistent with an assumption of multiple 
encoding. 

Rather than assume multiple semantic en- 
coding at study, we consider it more likely 
that Ss in a word-memory experiment make a 
single encoding of an item at study but, when 
forced to choose between two seemingly un- 
familiar items, attempt to recode the test items 
semantically at the time of test. (It is assumed 
that, if S makes a test encoding of an old word 
that differs from its study encoding, he is 
placed in the situation of having to choose be- 
tween two unfamiliar items on a forced-choice 
recognition test.) Furthermore, Ss may be 
more likely to recode if they notice that the 
words in an experiment are homographs. 

We do not know of any evidence to compel 
an assumption of recoding at test over the 
alternative assumption of multiple encoding 
(double tagging) at study. However, at pres- 
ent, we believe that it is both parsimonious and 
plausible to continue to assume encoding 
specificity at study, unless an alternative 
strategy is made more efficient by special con- 
ditions such as producing puns, This view 
preserves the original encoding variability 
argument on which this research was based, 
with the addition of an assumption concerning 
the likelihood of additional retrieval operations 
at test. 

An alternative account of the superior 
recognition performance for the balanced 
homographs would be in terms of word fre- 
quency. Although the balanced and polarized 
homographs were equated on word frequency, 
it is obvious that they were equated only on 
frequency of the homograph form, rather than 
on frequency of particular meanings. For 
example, assume that the homographs BANK 
and CLUB are of equal frequency. If CLUB is 
balanced and BANK is polarized, then the pri- 
mary encoding of CLUB necessarily will be 
lower in frequency than the primary encoding 
of BANK, since the primary sense of a balanced 
homograph must have a smaller slice of the 
frequency pie than the primary sense of a 
polarized homograph if the pies have been 
kept the same size. Thus, it is possible that 
the superior recognition of balanced homo- 
graphs is a special case of the well-documented 
finding that recognition memory is better for 
low-frequency than for high-frequency words 
(Schulman, 1967; Shepard, 1967). Whether 
differences in frequency of the semantic types 
used here are sufficient to account for the differ- 
ence found in recognition memory is an em- 


pirical question. The reader will note that the 
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selection of items matched in S; frequency but 
differing in Sı and Ss probabilities not only is a 
formidable task in itself, but necessarily must 
result in unequal frequencies of the homograph 
forms. 

Finally we note some interesting features of 
the word association data of Experiment II. 
With regard to stability, a distinction must be 
made between scoring for repetition of the 
same word, the traditional word association 
measure, and scoring for repetition of the same 
category of semantic meaning. With respect 
to the exact associate, S is as likely to change 
his response as to repeat it. If our data are 
compared to Fox's (1970), this within-S in- 
stability is relatively constant from 5 min. to 
59 days. If word associations are scored 
semantically, they are much more stable on 
the whole (see Table 2). But, even with 
semantic scoring, time between successive 
tests has very little effect, at least over a 
48-hr. interval. 

We seem to have assessed a relatively per- 
manent characteristic of the semantic memory 
system. In distinguishing between semantic 
and episodic memory, Tulving (1972) points 
out that information in semantic memory is not 
temporally dated while information in episodic 
memory is. In these terms, the present re- 
search was an attempt to predict some simple 
phenomena of episodic memory, using a 
recognition memory task, on the basis of in- 
dependently determined characteristics of 
semantic memory, Although our particular 
predictions were not confirmed, we have found 
a relationship between the two systems that 
must be accounted for. 
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E OF A PRIMARY FRUSTRATION EFFECT FOLLOWING 


\LITY REDUCTION IN THE DOUBLE RUNWAY ! 


HENRY A. CROSS? AND WILLIAM N. BOYER 


Colorado State University 


ts were trained with sucrose reward in both the midbox and the endbox of 
e double alley. Subsequently, Ss were divided into groups which continued 
receive half of their trials under training conditions (nonfrustration trials) 
nd half ("frustration" trials) under either zero, alfalfa, regular, or sucrose 
[ ward. There was evidence of a primary frustration effect (FE) when reward 
juality was reduced in the midbox. Experiment II examined quality reduc- 
on in the midbox in a between-Ss design and again there was an FE with 


Results were interpreted as com- 


tible with the notion that an FE can be produced with incomplete reward 


1974, X 069-1075 
EVI 
ward quality reduction in the midbox. 
duction in the midbox. 
A The (Boyer, Cross, & Anderson, 
in pres recently observed in both 
barpr ‘ree-choice preference situa- 
tions prefer, at least in limited eat- 
ing si Noyes 93% sucrose pellets to 
the regui et of the same manufac- 
turer iso prefer the regular pellet 
to th s alfalfa-base pellet. The 
ordin iouship of these reward prefer- 
ences ; possible a test for a primary 
frustrat effect (FE) in a double-runway 
situation wisel, 1967) where midbox 
(Goal Box 1) reward can be "reduced," 
eve; though the weight or quantity of 


reward is actually held constant. Accord- 
ing to Scull (1973), who has surveyed the 
frustr ‘iierature, there has never been 
an adequate demonstration that an FE can 
be produced thro ugh nonzero or incomplete 
reward reduction in the midbox of the 
traditional frustration apparatus. A num- 
p ople (e.g., Bower, 1962; Daly, 
968 ; Ni & Ludvigson, 1965) have 
Ta ue demonstrate the energizing effects 
a eward magnitude reduction in the mid- 
X, but such attempts have either been 
i tioned on methodological grounds or 
nave raised some new problem. The pres- 
ent investigation is an attempt to circum- 
vent some of the previous difficulties and 
T The authors wish to thank Barbara N. Hammack 
end Michael J. Sandifer who collected the data of 
Experiment I. 
eos for reprints should be sent to Henry A. 
qe i ED n Psychology, Colorado State 
y ollins, Colorado 80521. 


to ascertain if incomplete reward reduction 
in Goal Box 1 leads to energized behavior 
in Alley 2 of the double runway. 


EXPERIMENT | 
Method 


The Ss consisted of 36 male albino rats from 
Carworth of New City, New York, which were 60-90. 
days old at the start of the experiment, 

A traditional two-alley frustration apparatus was 
used throughout the experiment. All start and 
goal boxes were 30 cm. long, and the two alleys were 
89 cm. in length. The apparatus was 15 cm. wide 
and 30 cm. in depth and was painted gray except 
for the black endbox (Goal Box 2). The floors of 
the start box, midbox (Goal Box 1), and endbox 
were spring loaded and permitted the recording of 
latency and running times in both segments, When 
S was placed in the initial start box, an electric 
circuit was opened which was later closed when Æ 
raised the start-box door. The first Standard elec- 
tric clock, begun with the opening of the door, 
stopped when S left the start-box plate. Clock 2, 
which started with S’s departure from the start-box 
plate, stopped as S entered and depressed the mid- 
box (Goal Box 1) plate. Clocks were read, reset, 
and readied for identical activation in the second 
alley. Times were recorded in hundredths of a 
second. 

The Ss were housed in pairs and allowed a 5-day 
habituation period with food and water ad lib, On 
Day 6, Ss were placed on a 23-hr. deprivation 
schedule which continued throughout the experi- 
ment except for the time constraints encountered 
as the number of trials was increased during acquisi- 
tion. On Days 8-10, Ss were placed in pairs in the 
open apparatus and allowed 10 min. of exploratory 
activity. On Days 11-12, Ss were allowed 10 min. 
of rewarded exploratory activity which consisted of 
free access to a cup of Noyes 45-mg. regular pellets 
in each goal box. 
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GROUP S 
SUCROSE TO SUCROSE 


GROUP NR 


SUCROSE TO REGULAR 


MEAN SPEED (1/TIME,sec x 100) 
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E FRUSTRATION TRIALS 
: 4 ^ 1 
a 4 
2 4, 
[3 1 / 
o / 
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| 


3 4 
TEST DAYS 


FiGURE 1. Alley 2 speed scores for frustration and nonfrustration test 
trials for the four basic groups over eight test days. (Abbreviation: R = mean 
score for the group at the end of acquisition training.) 


Acquisition training was begun on Day 13 and 
continued through Day 35. All Ss were given a 
total of 66 trials which were rewarded in both the 
midbox and the endbox with four 45-mg. Noyes 
9396 sucrose pellets. The trials were initially given 
at the rate of 1 trial/day but were ultimately in- 
creased to 6 trials/day according to the following 
daily schedule: 1, 1, 2, 2,2, 2, 2, 2,2, 2, 1, 1, 1, 1; 
1, 1, 6, 6, 6, 6,6, 6, and 6. The Ss were given 3 sec. 
in the start box before the door was raised and were 
subsequently retained for 30 sec, in the midbox. 
The Ss were taken from the goal box after they 
finished eating their reward. All Ss were run in 
squads in a rotation which generated an_intertrial 
interval of 10-15 min. 

Following acquisition trials, the total running 
speed for Alley 2 was determined for the last six 
acquisition trials. All 36 Ss were rank ordered in 
terms of their terminal running speed and the four 
fastest Ss were then “blocked” and each member 
of the “block” was randomly assigned toa different 
treatment group. This procedure was continued 
until Ss had been distributed into nine such blocks. 
During the test trials, the four treatment groups 
each continued to receive two daily trials in which 
sucrose pellets, as in training, were available in both 
the midbox and the endbox. These daily test trials 
were ordered according to the following schedule of 
frustration (F) and nonfrustration (N) trials: 
ENFN, NFFN, FENN, NFNF, FNNF, NNFF, 
FNEN, and NFFN. On the other two daily trials, 
the “frustration” trials, Ss received either sucrose 


© 


3.748 5 
TEST DAYS 


reward, regular reward, alfalfa reward, o: ; ewards 
in the midbox according to their respective grou 
designation: S, R, A, or NR, respectiv u 
reward was given on designated frustratioi ids 3 
always involved the same weight amount of foo 
which had been employed during acquisition, ie, 
four 45-mg. pellets. There was no ci: box 
reward from that employed during acqu 
During this test phase, trials were ru^ at the rate | 
of 4/day and were continued for a total of eight 
days. 


Results 


The time scores (in seconds) for the 
second alley were reciprocally transformed 
(1/Time X 100) for each test trial. Subse- 
quently, a mean speed score was computed 
for the frustration and nonfrustration con- 
ditions for each test day, and these mean 
speed scores served as the basic datum in 
a multifactored randomized block analysis 
of variance. 3 E 

Figure 1 graphically depicts the experi- 
mental findings over the eight test days. 
It can be seen that in Group S there is no 
reliable difference between mean running ( 
speed on frustration. and nonfrustration 
trials. This result was predicted since, 10 


j 


this cc zroup, there is actually no dif- 
ferent: ‘tment for frustration and non- 
frustr rials. However, for the other 
group e is a noticeable increase in 
runnin) eed as one moves from the non- 
frustra to the frustration trials and this 
differe ie., frustration vs. nonfrustra- 
tion, tistically significant as indicated 
Be by | significant main effect for 
frust: condition, F (1, 8) = 62.67, 
p< and a significant Group X 
Frus: Condition interaction, F (3, 
324) 4, p «.002. There were two 
other cally reliable differences: one 
differ ated to an overall increased 
speed test days, F (7, 56) = 11.66, 
p nd the other to a Days X 


Fr Gon Condition interaction, F (7, 56) 
= 10.00, p < .000001. These basic statis- 
al find revealed in Figure 1. 
con ns with Duncan 
O01) indicated that only 
: no reliable difference 
peed for all shift trials 
t en frusi .89) and non- 
fr ions (X = 121.97). In 
h: groups the frustration 
Qr luced significantly faster 
u did the corresponding non- 
4 ustration trials: Group R (X — 131.06 
VS. A 117.00), Group A (X = 138.39 
vs. X = 124.83), and Group NR (X = 
7143.20 vs. X = 117.49). 
urther demonstration of the increased 
running speed produced by quality reduc- 
tion in the midbox can be seen in the group 
comparisons on nonfrustration trials. The 
only significant comparisons were obtained 
between Group A and Group NR and be- 
tween Group A and Group R. The slight 
elevation in running speed on nonfrustra- 


‘in ( 
in m 


_ anomalous result. In Groups S, R, and 
a NR, the nonfrustration condition compari- 
., sons did not differ. A comparison of only 
4 the frustration condition across groups 
suggests something of the relative strength 
| of a possible FE produced by quality reduc- 
tion in the midbox and bears on the initial 
hypothesis that an FE would be inversely 
related to quality of reward obtaining on 
frustration trials. Groups R, A, and NR 


JZMARY FRUSTRATION AFTER REDUCTION IN REWARD QUALITY 


| tion trials in Group-S animals was an, 
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all differéd significantly from. the control 
situation, Group S, in frustration condi- 
tion speeds. Beyond that, Group A did 
not differ from Group NR but did signifi- 
cantly exceed Group R and Group S in 
running speed. Group R Ss were also 
significantly faster than Group S animals 
on frustration trials. 


Discussion 


The results of this study suggest that it is 
possible to produce a primary FE by nonzero 
or incomplete reward reduction. In this par- 
ticular case, the incomplete reward reduction 
was achieved, in Group A and Group R 
animals, while actually holding constant the 
weight of the food available on frustration and 
nonfrustration trials. The fact that Ss of 
Groups A and R had available on frustration 
trials a reward of equal weight and one that 
most probably contained more nutritional or 
satiety value than the nonfrustration sucrose 
reward would seem to militate against any 
attempt to invoke the original form of the 
“demotivation’’ or "response-depression" hy- 
pothesis (Seward, Pereboom, Butler, & Jones, 
1957) as an explanation of the present results. 
On the other hand, it is true that the design 
of the present study does not effectively guard 
against the interpretation that the observed 
results are attributable to absolute effects 
stemming out of such things as subtle differ- 
ences in eating times caused by different re- 
ward texture, etc. Another interpretation 
problem results from the fact that there was a 
Frustration Condition X Days Interaction. A 
perusal of Figure 1 shows that, with the excep- 
tion of Group S, there is an increasing differ- 
ence between frustration and nonfrustration 
trials over test days. It is possible that what 
is interpreted in the present study as an un- 
learned energizing effect of frustration is con- 
taminated with an association. component, 
For these reasons, it was decided that à second 
experiment should be attempted in which con- 
trol conditions are established in a between-5s 


design. 
EXPERIMENT Il 
Method. 


The Ss were 44 male i 
from National Breeding Laboratories, Creve Couer, 


Missouri, which were 60-90 days old at the beginning 
of the experiment. The same two-alley frustration 
apparatus (Experiment I) was employed except that 
it was modified by painting the second alley black 


albino rats purchased 
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and by placing 3-in. hardware cloth on the start box, 
first alley, and midbox. Both Alley 2 and the end- 
box (Goal Box 2) were black and were devoid of 
hardware cloth. 

The Ss were housed in pairs and were allowed a 
20-day habituation period in their cages with food 
and water ad lib, On Day 21, Ss were placed on a 
23-hr. food deprivation schedule which continued 
throughout the experiment except for the time con- 
straints encountered as the number of trials was 
increased during acquisition, On Days 22-26, Ss 
were placed, before their feeding period, in groups of 
six in the open apparatus devoid of all food cups 
and were allowed 5 min. of nonrewarded exploratory 
activity. On Days 27-29, the deprived Ss were 
placed in groups of two in an unpainted rectangular 
pine box (46 28 x 25 cm.) and were permitted to 
eat specific reward pellets from specific and differen- 
tiable food cups affixed on opposite ends of the box, 
The food cups were of different shape and the white 
one, which was later to occupy the midbox (Goal 
Box 1), was either empty or contained a large supply 
of 45-mg. pellets appropriate to the experimental 
group in which the two Ss had been randomly 
placed. The black cup, which was to occupy the 
endbox or Goal Box 2, always contained a large 
quantity of 45-mg. pellets appropriate to the experi- 
mental group in which the two Ss had been ran- 
domly placed. The black cup, which was to occupy 
the endbox or Goal Box 2, always contained a large 
quantity of 45-mg. 93% sucrose pellets since all Ss, 
regardless of their experimental group, were ulti- 
mately to receive such sucrose reward in the endbox, 
On Days 30-32, all 44 Ss were given, before their 
feeding period, a single goal placement in the black 
endbox of the two-alley frustration apparatus and 
were allowed to eat five 45-mg. Noyes pellets com- 
posed of 93% sucrose, The endbox door was down 
and the pellets were given in the black food cup 
appropriate to the endbox (Goal Box 2). On Day 
33, all Ss received their first acquisition trial in the 
entire apparatus, The 5 was placed in the start 
box and the start-box latency and Alley 1 running 
times were recorded as in Experiment I. When S 
entered the midbox he received either no reward, 5 
pellets of Noyes 45-mg. alfalfa. pellets, or 93% 
sucrose. The difference in midbox reward constitu- 
ted the only difference in the treatment of Ss during 
acquisition. After a 10-sec, delay in the midbox, E 
activated the midbox door and again latency and 
running times were recorded for the second portion 
of the apparatus. For 4 days, Days 33-36, each 
rat received only a single trial in the double-alley 
apparatus. This days, Days 


At the start of the experiment all Ss were ran- 


1 f four experimental grou 
(n = 11) which differed only in their midbox (Gout 
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Box 1) reward experience. Group A- 5s always 
received five 45-mg. alfalfa pellets in midbox 
and five 45-mg. 93% sucrose pellets in ti- endbox, 
This reward experience was continuous -ver the 
entire experiment and did not change ng the 
final 12-day test period. Group O-S S: eived 
continuous training throughout. Grou; mals 


differed from Group A-S Ss only in i all 


trials they found no reward in the whi } cup 
in the midbox, Group SS-A Ss all 181 
training trials in which both the midbx the 
endbox always contained five 45-mg. rose 
pellets. On Trial 82, the initial frus rial, 
SS-A Ss received five 45-mg. alfalfa pe tead 
of their customary 93% sucrose. Since i ES 
5s had never before encountered alfalfa, ince 
pilot work indicated that the t encoun with 
alfalfa produces “shock” and long midbox | cies 
and Alley 2 running speeds, these Ss we: iven 
five alfalfa pellets to eat from a Gerber's b jar 


lid in the neutral pine box approximately hr. 


before their first test experience. "This w ‘one 
only once and was not repeated during th ldi- 
tional test days. The Ss of Group SS-O r ved 


five 45-mg. 93% sucrose pellets in both the 1 box 
rial 


and endbox for the first 81 trials, Again, on 
82, these Ss received their initial frustratio 
when they encountered an empty midbox. 
12 test days, the initial daily trial for 
frustration groups, SS-A and SS-O, w; 


lva 


warded. The single daily frustration trial, ‘he n, 
occurred either on Trial 2 or Trial 3 and was ran- 
domly determined according to a schedule ch 
had only the restraint that there be an equal nuis ver 


of frustration trials in each position (Tria! 2 and 
Trial 3) over any 4-day test block. The actual 
schedule during test days was as follows: NNF, 
NFN, NEN, NNF, NEN, NEN, NNF, and NNF. 


Results 


To test that the four treatment gr 
were performing in a similar way at 
end of acquisition, the Alley 2 time scores 
for the last six trials of acquisition (Trials 
74-79) were reciprocally transformed and 
subjected to a split-plot analysis of vari- 
ance. The group or treatment effect was 
not significant, F (3, 40) = .31, and the 
Group X Trials interaction was also in- 
significant, F (15, 200) — .64. Only the 
trials effect was significant, F (5, 200) — 
6.69, and this was attributable to the fact 
that the initial daily trials: (in this case 
Trial 74 and Trial 77) were depressed rela- 
tive to the other trials witkin a given day. 

Since all groups were essentially alike in 
Alley 2 speed at the end of acquisition, 
additional analyses centered around Alley 2 
Speed during the 12 test days in which 
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FIGURE 2. 
tion Tri 
blocks. 


frustration trials were administered. Alley 
2 runway times were reciprocally trans- 
formed and the transformed speed scores 
were collapsed into four blocks of three 
doys each. The mean transformed value 
ior a given block was then computed for 
the first nonfrustration trial, the second 
nonfrustration trial, and the frustration 
trial. These mean speed values were em- 
ployed as the basic datum in a multi- 
factored split-plot analysis of variance in- 
volving groups, frustration condition, and 
blocks. Figure 2 depicts the speed data. 

'The main effect for groups was not sig- 
nificant, F (3, 40) = .89, but there was a 
significant frustration condition effect, F 
(2, 80) — 28.42, p « .000001, and a sig- 
nificant first-order Groups X Frustration 
Condition interaction, F (6, 80) = 3.36, p 
< .006. In addition, the only other sig- 
nificant effect was that for blocks, F (3, 120) 
= 8.14, p < .00006. 

Subsequent comparisons with the Duncan 
range test (a = .001) revealed that all 
major comparisons were as predicted. The 
Ss of Group SS-A ran significantly faster 
on the frustration trials (X = 106.59) than 
on either the first (X = 89.35) or second 
(X = 90.05) nonfrustration trial. In addi- 


2 3 4 
TEST BLOCKS 


Alley 2 speed scores for Nonfrustration Trial 1, Nonfrustra- 
al 2, and frustration trials for the four basic groups over four test 


tion, these same Ss ran significantly faster 
on the frustration test trials (X = 106.59) 
than the control Ss, Group A-S, ran on the 
same trials (X = 90.52) but under condi- 
tions of continuous alfalfa reward in the 
midbox. The designation of a frustration 
trial for Group A-S animals was arbitrary 
and was dictated by the reward schedule 
of the parallel Group SS-A animals. 

The other experimental group, Group 
SS.0, also showed the within-Ss FE in that 
mean frustration trials (X = 106.98) sig- 
nificantly exceeded mean first (X = 85.50) 
and second (X = 90.77) nonfrustration 
trials. In addition, and more importantly, 
the frustration trial (X = 106.98) of Group 
SS-O animals was significantly faster (p < 
.025) than the mean (X = 97.48) of desig- 
nated frustration trials in the corresponding 
control group, i.e., Group O-S. 

As a test of the relative capacity of 
alfalfa test trials to energize behavior, 1.&., 
produce the FE, it can be seen that the 
mean (X = 106.59) of frustration trials 
of Group SS-A animals is essentially that 
of the mean (X — 106.98) of the frustra- 
tion trials of Group SS-O Ss. Neither 
experimental group differs significantly in 
their nonfrustration trials and the corre- 


rua: 


sponding frustration means in their respec- 
tive control groups, Group A-S and Group 
O-S, are not reliably different, 

"The absence of any significant. interac- 
tion involving blocks argues against the 
notion that the observed increase in speed 
— on frustration trials is essentially acquired 
- or is contaminated by an association 
|. component. 

Finally, another observed result which 
=- Was not anticipated was the greatly de- 
© pressed initial daily trial, i.e., the first non- 
—— frustration, of both of the control groups, 

Group A-Sand Group O-S. In both groups, 

these means, 70.16 and 87.27, are signifi- 
. cantly below their corresponding second 
- nonfrustration trials, i,e., 90.57, 100.70. 
-. The initial nonfrustration trial for Group 
- O.S animals does, however, appear to be 
- essentially in line (compare with 89,35 and 
. the 85,50 of the same trial for Group SS-A 
- and Group SS-O,) In Group O-S, then, it 
- is the second nonfrustration trial which 
. appears to be "out of line.” At any rate, 
- there is no ready explanation for this first- 
- trial depression of Alley 2 speed in Group 
— A-S and Group O.S Ss, 


GENERAL Discussion 


The reported experiments, 


particularly Ex- 
) periment Il, argue strongly 


that an FE is 


€s preferred, 

‘Such a notion 
. Mense and, in fact, Tinklepaugh (1928) long 
- A&o demonstrated, at least at an anecdotal 
‘ , that a 


with a normally 
Acceptable reward of lettuce, and would, when 


denied the expected reward, engage in ener- 


but also to apply to situations where the ex. 
pected reward was merely reduced to a lower 
McHose and Ludvigson 
bbe and McHose 
t ower's interpretati 

by suggesting that his observed paia it. 
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of nonzero reward reduction were co: 
by response inhibition stemming ou: 
rat's consumption of more or less foo 
first goal box. The argument is that a 
Ss design like Bower's does not prec! 
possibility that what appears to be a 
ment in frustration drive with greate: 
plete reward reduction is, in fact, si 
function of absolute reward magnitude i 
Box 1. In short, the Bower results ar: 
fectly compatible with the so-called “den 
tion hypothesis," which attributes diffe: 
in Alley 2 running speed between frust 
and nonfrustration trials not to incr: 
energy on frustration trials, but rather 
depressed running speed following th: 
warded nonfrustration trials, 

Still later, Daly (1968) objected to ce: 
inadequacies of the McHose and Ludvig 
(1965) procedure. While these latter inve 
gators had, unlike Bower (1962), emplo: 
control groups to test for a possible Alley 
speed depression stemming out of a decre: 
in hunger drive, they too encountered so: 
methodological difficulties. Even with th 
controls, they did have their Ss experienci 
more than one reward-reduction conditi 
ie, a partial within-Ss design. They : 
failed to control for the direction of shift in 
that only in one group, the group shifte 
zero, was the shifted magnitude below 5e 
magnitude value employed in the second oal 
box. Daly's work pointed up further ¢om- 
plexities in this area in that her attemp to 
deal with previous design limitations lec. in 
some groups, to differential acquisition er- 
formance in Alley 2 even though equal rew: rds 
were given in Goal Box 2. 

The present approach Sought to avoid some 
of these difficulties by employing a procedure 
wherein reward “reduction” in the midbox was 
restricted to a quality dimension, In Experi- 
ment II reward magnitude as such was always 
the same, in both goal boxes, except for one 
group, Group SS-O. Although the previous 
design complexities in this area prompt caution, 
it is reasonable to conclude that the present 
Fesults are compatible with the interpretation 
that àn incomplete reward reduction in the 
midbox leads to a primary frustration effect 
as exhibited by faster running in Alley 2 on 
designated frustration trials, It could be main- 
tained that the present findings result from 
quality change per se and cannot be unequivo- 
cally attributed to quality reduction or down- 
ward change. ‘There are Studies, for example, 
which show the invigorating effects of reward 
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or se (Meyer & McHose, 1968; 
1968), and such an interpretation 
discounted. On the other hand, 
| changes in Experiment I which 

ic reward preferences argue against 

rpretation, In addition, a recent 

n by the authors? of the Crespi (1942) 

ace shifts with reward quality change, 

of reward magnitude, offers further 
igainst a simple change hypothesis. 


Crespi, W. N. Boyer, and J, T. Melella, 
ved study, 1973. 
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VARIABLES AFFECTING THE INTERMANUAL TRANSFI: 2 
AND DECAY OF PRISM ADAPTATION ! 


CHONG SOOK CHOE? anp ROBERT B. WELCH 


University of Kansas 


The study examined the effects of exposure (continuous vs, terminal) and 
practice (massed vs. distributed) upon prism adaptation, its intermanual 
transfer, and spontaneous decay, Subjects were measured before and 0, 5, 10, 
and 15 min. after 30 trials of exposure to 11.3° Jateral prism displacement on 
target-pointing accuracy (each limb separately), placing the arm straight ahead 
(each limb separately), and setting a pinpoint light source straight ahead, 
Adaptive pre-post shifts in these measures represented “negative aftereffect,” 
“proprioceptive shift,” and “visual Shift," respectively, The major findings 
were that (a) negative aftereffect was greater with distributed than with 
massed practice, (5) all four groups experienced intermanual transfer of nega- 
tive aftereffect, (c) proprioceptive shift transferred intermanually for the 
massed practice but not the distributed practice groups, (d) visual shift oc- 
curred only for the distributed practice-terminal exposure group, and (e) both 
ipsilateral and contralateral negative aftereffect and proprioceptive shift de- 
cayed significantly over the 15-min, postexposure period. 


It is now well documented that human 1966) have demonstrated partial i: er- 
beings are capable of adapting to the effects manual transfer of prism adaptation - ith 
of prismatically displaced vision (eg. Hay  S'sheadina fixed position during expo: ve, 
& Pick, 1966; Held & Hein, 1958).- Of The crucial difference between experi ts 
recent interest is the issue of whether or which have found intermanual transfer id 
not these effects transfer, in part or whole, those which have not may be in the nə ire 
from exposed to nonexposed limb. Some ofthe exposure condition employed. W: cre 
investigators (eg. H. B. Cohen, 1963; intermanual transfer has not occurred S 
Mikaelian, 1963) have reported no such has been allowed a noninterruptec wW 
transfer, However, Hamilton (1964) ob- of his limb throughout the prism-expo-ure 
served adaptive shifts for both exposed period, while in experiments in w. ich 
and nonexposed limbs when S was allowed transfer was obtained, exposure to the hmb 
free head movement during prism exposure; occurred only at the termination of © ch 
no transfer occurred when (as in previous visuomotor response. M. M. Cohen (1 
studies) S's head was immobilized. He Sug- directly compared continuous and 
gested that the freedom to engage in head minal prism exposure with respect to 
movements leads to a change in felt orienta- termanual transfer of prism adaptation 
NE a ve MM vs and WE results lend support to this argu- 
eM lar Bot fea ts p & focali- ment; only terminal exposure resulted in 

The situation if com licated by the f; s iron rr E this 
Xr MERE. EU p z um act result, Taub and Goldberg (in press) have 
Taub, & Berman 1967: Ke M SUE n. erg, Proposed that continuous exposure is actu- 
1S Tee 5 ; alil & reedman, ally a form of “massed” practice, whereas 


! The Present investigation formed the basis of a M terminal exposure condition penents 
master’s thesis by Chong Sook Choe, The research distributed Practice. They pointed out 
us supported by Grant EY-00560-02 from the that Spatz (1966) had found intermanual 

ationa -ye Institute of the National Institutes of transfer of à visuomot d it 
Health, The authors wish to thank John J. Uhlarik rotor) only whe eee skill sul 
and Robert L Bermant for their comments and y aUe NES) a rest interva 
criticisms of this manuscript. between ipsilateral training and contra- 
‘ * Requests for reprints should be sent to Chong lateral testing. Other investigators (e.g. 
S m ecd of Psychology, University Irion & Gustafson, 1952: Kimble, 1952) 
on ansas, Lawrence, Kansas 66045, have reported similar results. Taub and 
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. (in press) supported their hypoth- 
fading intermanual transfer with 
ous exposure condition when it 
spersed with short rest intervals. 
cless, it remains tenable that both 
yat es—type of exposure and type of 
prac ice -influence the occurrence and 
am. nt of intermanual transfer of prism 
ad ation The primary goal of the 
pr -ent experiment was to investigate this 
pe sibility. 
second aim was to examine the nature 
o: the adaptive shift underlying the inter- 
n. aual transfer of adaptation. At least 
too end products of prism adaptation have 
‘on identified—a change in felt limb 
position (e.g. Harris, 1965) and a change 
vocentric visual direction (e.g., Uhlarik 
Canon, 1971). The latter investigators 
have hypothesized that the relative mag- 
nitiles of these two components of prism 
vation are determined by which infor- 
.tional source is attended to during prism 
exposure; it is primarily the nonattended 
modality that is modified. They argued 
that with terminal exposure S is forced to 
pay attention to the felt position of his 
limb throughout its movement, until finally 
seeing its displaced image. 'Thus, in this 
condition felt limb position should remain 


esis 
a c 
was 


Ne 


relatively unmodified, while vision is 
changed. Conversely, vision “dominates” 


in the continuous exposure condition and 
therefore proprioception should be altered. 
Their results confirmed these predictions. 
Therefore, it is conceivable that the pres- 
ence of intermanual transfer of prism 
adaptation only in experiments using 
terminal exposure results from the fact 
that this condition elicits a shift in vision, 
an effect which necessarily determines 
pointing accuracy for both limbs. Clearly, 
this interpretation differs from that sug- 
gested by Taub and Goldberg (in press). 
Finally, there remains the possibility that 
intermanual transfer of a shift in felt limb 
position also occurs and represents at least 
a partial basis for intermanual transfer of 
the prism-correcti ve changes in eye-hand 
coordination. 

A third goal of the present investigation 
was to examine the spontaneous decay of 
prism adaptation, in both exposed and 
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nonexposed limbs, as a function of expo- 
sure condition and type of practice. Some 
investigators (Hamilton, 1964; Hamilton 
& Bossom, 1964; Taub & Goldberg, in 
press) have reported rapid decay of nega- 
tive aftereffect during postexposure periods 
of 15 min. in duration. Others (e.g. 
Dewar, 1970; Goldberg, Taub, & Berman, 
1967 ; Welch, 1971) failed to observe decay 
during this same postexposure interval. 
Again, it appears that exposure condition 
is an important variable. All the studies 
employing continuous exposure have pro- 
duced rapid decay; those involving ter- 
minal exposure have led to little or no 
decay. It was possible to directly compare 
these two conditions in the present experi- 
ment. Only two reported studies have 
examined the effect of type of practice 
upon decay of adaptation, and their find- 
ings appear contradictory. Dewar (1970), 
using terminal exposure, found that adapta- 
tion resulting from massed practice decayed 
faster than adaptation generated in a dis- 
tributed practice condition. On the other 
hand, Taub & Goldberg (in press), who 
used continuous exposure, appear to have 
found just the opposite result. The 
implication that type of exposure and type 
of practice interact with respect to rate of 
prism adaptation decay was examined in 
the present study. 


METHOD 
Design 


All Ss were measured before and after prism 
exposure on accuracy of target pointing (visuomotor 
coordination) and placing the arm st ht ahead 
of the nose (felt limb position). For each task 
both exposed and nonexposed hands were tested. 
A third measure was S’s accuracy in setting à 
pinpoint light source straight ahead of his nose 
(visual straight ahead). The postexposure mea- 
sures of each of these three tasks were taken at 
0, 5, 10, and 15 min. after the termination of the 
prism-exposure period. For none of the pre- or 
postexposure measures was S allowed visual feed- 
back regarding the accuracy of his performance. 
Type of exposure—continuous and terminal—was 
combined with type of practice—massed and dis- 
tributed—to produce four experimental groups. 


Subiects 


Sixty-four students from introductory psychology 
classes at the University of Kansas served as 58 
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in the experiment in order to fulfill a course require- 
ment, All had normal vision without correction and 
were naive as to the nature of the experiment. 
"They were randomly assigned to four groups, each 
containing 16 Ss, 


Apparatus and Procedure 


Displacement of the visual field was produced by 
à Risley rotating prism (Bausch and Lomb, No. 
71-48-59), mounted in the right eyepiece of a pair 
of welder's safety goggles, The left eyepiece of the 
goggles was occluded. 


panel, circumscribing an are whose radius was 57 
The panel extended 
approximately 11 cm, above and below the hori- 
A position transducer (resistance 


his index fingers S wore a tubber finger with a metal 
1 The measure of target-point- 
ing accuracy was taken by requesting S to reach 


Providing E with a measure of S's target-pointing 
The experimental session was divided 
phases: Exposure and 
Ostexposure. 
At the outset of the Preex- 
posure Phase an impression of Ss’ teeth was taken 
to provide the basis for a biteboard used during 
tai experiment to stabilize the 
Position of his head, After instructions and several 
practice target-pointing trials (with visual feedback) 
the goggles were placed 


whose Position 
among three 


of the three target positions with one hand and 
then this was repeated with the other hand. The 
order of the target positions Was the same for both 
limbs and for all Ss, 


hand was tested first and the left hand second; for 
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the remainder the order was reversed, r one half 
of the Ss the hand tested first was to | e prism- 
exposed one; the remainder were tes rst with 
the nonexposed hand. For the meas: of felt 
limb position § placed his arm so that vas felt 
to be located directly in front of his nose le arm 
movement was similar to that used for arget- 
pointing response. Vision was occluded l accus 
racy was measured four times with eac} b, the 
order of hand tested being the same * the 
target-pointing response. The third ty pre- 
exposure measure was .S's accuracy in ng a 
pinpoint light source to the straight-ahe sition 
in the otherwise dark room. The .S pusher oggle 
switch (using the hand that would not be « ed to 
the prism) which caused a dot of light move 
/ 


laterally at a slow, constant speed (3.5; 
just above the top of the pegboard panel, i 
mately eye level. This light (T1-5 Va run 


1.5 v.) 
jecte 


was attached to the end of a rod which 

vertically from a toy car located on an ar track 
behind the vertical pegboard panel. F mea- 
sures were taken, the starting position vary y from 
far right to far left (in a “RLLR” order) rhe S 
was allowed to reverse the direction of t} ght if 


it had gone past apparent straight ahead 
recording of white noise was played di 
measure in order to mask potential audi 
tion cues resulting from the sound of the 
electric motor, Preexposure target-poi; ic 
curacy was always recorded firs Howeve lea- 
sures of felt limb position and visual straight nead 
were taken second and third, respectively half 
of the Ss, and in the reverse order for the other half, 
Exposure Phase, During the Exposu 
the experiment .S was allowed to see his li 
the exposure slot, which was either co: 
partially uncovered, depending upon 
mental condition, A luminous rubber was 
placed over the index finger of the hao: to be 
exposed. The task during this part of th peri- 


ment was to reach forward and point to one ùf three 
targets—the numbers vat RY con- 
structed of luminous tape and far 
edge of the board at points corresponding io the 


À the variable prism was ad- 
Justed to 20 diopters, with the base oriented either 
tight or left, Producing a lateral visual displacement 
of approximately 11.39. The 5 was informed by Æ 


After the room lights had been extinguished, 
S opened his eyes and repeatedly pointed at each of 


out in a predetermined block-randomized number 
The sequence was the same for all Ss. 
The luminous rubber finger which .S wore allowed 
him to see his target-pointing accuracy, A constant 
Tate of responding was maintained by means of a 
tape-recorded 2.sec. metronome beat. The S’s hand 

egan from a Starting position at the near edge of 


the table and traveled approximately 60 cm. to its 
destination, 
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instructed to point as accurately as he 
ot to correct for an error while the 
was in view. It was during the 
hase that the experimental conditions 
:mented. 
the Ss were provided with continuous ex- 
to the limb by completely uncovering the 
re slot. The remainder experienced terminal 
re: for these Ss the slot was covered except 
j-cm. gap at its perimeter. These two groups 
further subdivided into massed and distributed 
tice conditions, Massed practice was achieved 
iaving S point at a target on every 2-sec. beat 
he tape-recorded metronome for the 30 exposure 
thus providing 1 min. of uninterrupted 
ict In the distributed practice condition S 
ponded every 2 sec., and then “rested” for 2 sec., 
a total exposure period of 2 min, (i.e., 1 min. of 
ictice), Thus, as defined by the nature of their 
tivity during the Exposure Phase, there were four 
roupsz Continuous-Massed, Continuous-Distrib- 
ited, Terminal-Massed, and Terminal-Distributed. 
Postexposure Phase. Immediately after the final 
prism-exposure trial, the Postexposure Phase began. 
The S closed his eyes, the exposure slot was covered, 
the number targets removed, the variable prism 
reset to 0 diopters, and the luminous finger removed. 
The S was told that his vision was once again 
normal? Then no-feedback measures were taken 
of target-pointing accuracy (three trials for each 
limb), felt limb position (two trials for each limb), 
and visual straight ahead (two trials). These three 
measures constituted the 0-min. Postexposure Test. 
The order of hand tested was the same as in the 
Preexposure Phase for a given S, as was the order 
in which the measures of visual straight ahead and 
felt limb position occurred. After these initial 
postexposure measurements had been obtained S 
was requested to rest quietly in his chair with s 
closed and arms placed at his sides. After 5 mi 
another set of measures was taken, (the 5-min. 
Postexposure Test). This procedure was repeated 
at 10 and 15 postexposure min. Including the time 
spent in taking the measures, the total length of the 
Postexposure Phase was approximately 25 min. 


” 


‘The rationale for informing S that his visi 
was or was not displaced is that in prism exposure 
involving target pointing, and hence error-corrective 
feedback, the existence and nature of the visual 
rearrangement quickly becomes apparent to S. 
Consequently, it is crucial that he be informed prior 
to the Postexposure Phase that his vision will once 
again be normal. Otherwise, his negative aftereffect 
is likely to be contaminated by the conscious and 
deliberate correction for errors that he is likely to 


have used during the Exposure Phase and which he 


if he thought that his vision 
]t is assumed that the after- 
effect of prism exposure is, at least with respect to 
target-pointing, a more valid measure of adaptation 
in the present experimental situation than is any 
change in behavior observed to occur during the 


Exposure Phase. 


would continue to use 
remained displaced. 
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RESULTS 


Three operationally different measures of 
adaptation were obtained, each defined as 
the difference between pre- and postexpo- 
sure performance. A prism-compensatory 
shift in target-pointing accuracy is com- 
monly referred to as "negative aftereffect"" 
(NA); a shift in felt limb position may be 
termed “proprioceptive shift” (PS) and a 
change in visual straight ahead, “visual 
shift” (VS). For the purposes of the 
statistical analyses, a pre-post difference in 
the adaptive direction was given a positive 
sign while an "antiadaptive" shift was 
negatively signed. Because magnitude of 
NA did not differ between the prism base- 
right and prism base-left conditions, £ (62) 
=.99, p > .05, the data for the two prism 
orientations were combined for the sub- 
sequent analyses. Also, the data were 
collapsed over hand exposed, order of hand 
tested, and order of visual and proprio- 
ceptive measures. Figure 1 depicts the 
levels of the three measures of prism adap- 
tation and their decay rates over 15 min. 
for each of the four experimental groups. 

Negative Aftereffect. Clearly, all groups 
manifested with the exposed limb a mean 
NA (see Figure 1A), which was significantly 
greater than zero at the 0-min, Postexpo- 
sure Test. With respect to the unexposed 
limb a £ test revealed that the adaptive 
shift at the 0-min. test for all conditions 
combined was statistically significant, / (63) 
=6.19, p « .001, indicating that partial 
intermanual transfer of adaptation had 
occurred. A three-factor mixed analysis 
of variance (Exposure X Practice X Limb) 
was carried out upon the NÀ scores at the 
O-min. Postexposure Test. The results in- 
dicated significant effects for Practice, 
F (1, 60) = 4.98, p < .05; Limb, F (1, 60) 
—100.20, p < .001; and their interaction, 
F (1, 60) = 5.36, p < .025. It is clear from 
an examination of Figure 1A that the inter- 
action represented the fact that the differ- 
ence in NA between massed and distributed 
practice (in favor of the latter) existed only 
for the exposed limb. 

As a means of testing for the presence of 
decay of NA in the exposed limb across 
the four Postexposure Tests, a three-factor 
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FIGURE 1. Mean Pre-Post Shifts (deg.) at each P. 


NA = Negative Aftereffect, PS. 


mixed analysis of variance (Practice X Ex- 
posure X Postexposure Test) was per- 
formed on the data. The Postexposure 
Test effect was clearly significant, F (3, 180) 
—20.81, p < .001, indicating that a cer- 
tain amount of decay occurred. The 
interactions between Practice and Post- 


ostexposure Test. (Abbreviations: 


= Proprioceptive Shift, VS = Visual Shift.) 


exposure Test, F (3, 180) = 2.56, p < .055, 
and between Exposure and Postexposure 
Test, F (3, 180) = 2.50, p < .055, were 
marginally significant. The same analysis 
carried out on the results for the unexposed 
limb also revealed a significant F for Post- 
exposure Test, F (3, 180) = 3.58, p < .025. 
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Howe ne of the remaining factors Or 
inter was statistically significant. 
Pr eptive Shift. Although it is ap- 
part m inspection of Figure 1B that 
PS o «cred for the exposed limb at the 
0-mi- Postexposure Test, it is not clear 
tha: his was true for the nonexposed limb. 
In cta t test on the PS scores for the 


«posed limb for all groups combined 
fa od to find the mean to be significantly 
g -ater than zero, t (63) = 1.15. However, 
v. on each group was tested individually 
ontralateral PS the means for the 
ninal-massed and continuous-massed 

»ups each proved to be greater than zero, 

(15) = 1.86, P «05; ! (15) = 1.93, 
n 03; indicating the presence of inter- 
anual transfer of PS for these groups. 
A mixed analysis of variance (Exposure x 
Practice X Limb) on the PS measure for 
both limbs at the 0-min. Postexposure Test 
found only the difference between limbs to 
be statistically significant, F (1, 60) = 
27.00, p < -001. A Practice X Exposure 
analysis of variance was carried out upon 
the PS measures for the nonexposed limb 
alone and revealed a significant effect for 
Practice, F (4, 60) — 5.71, P € 1025 — 
supporting the conclusion that only the 
massed practice groups experienced inter- 
manual transfer of PS. 

An Exposure X Practice X Postexposure 
Test analysis of variance on PS for the 
exposed limb revealed that the effect of 
Postexposure Test was statistically signifi- 
cant, F (3, 180) = 8.43, p < .001. This 
indicates that a decline in ipsilateral PS 
occurred. during the postexposure period. 
None of the other factors Or interactions 
was significant. The same analysis applied 
to the results for the nonexposed limb pro- 
duced similar results—Postexposure Test 
was the only significant factor, F (3, 180) 
—349, p < 025. 

Visual Shift. A test revealed that mean 
VS at the Q-min. Postexposure Test for all 
groups combined was significantly greater 
than zero, £ (63) — 2.60, p < 01. How- 
ever, individual / tests revealed that only 
the mean for the terminal-distributed group 
was statistically significant, / (15) = 2.06, 
p< 05. A two-factor analysis of variance 


nt 
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(Exposure X Practice) on the data for the 
Q-min. Postexposure Test revealed no 
significant main effect nor an interaction. 
When Postexposure Test was added to 
these factors, in order to examine potential 
changes of V. S over the Postexposure Phase, 
none of the factors or interactions was 
significant. 

Comparison of Negative Aftereffect, Pro- 
prioceptive Shift, and Visual Shift. To 
examine the effects of practice and exposure 
on intermanual transfer of NA and PS a 
four-way mixed analysis of variance (Ex- 
posure X Practice X Type of Adaptation 
Limb) was carried out on the 0-min. 
Postexposure Test measures. A significant 
F was obtained for Type of Adaptation, 
F (1, 60) = 10.44, p < .005. Comparison 
of Figure 1, A and B, reveals that NA was 
greater than PS. Limb was another signifi- 
cant factor, F (1, 60) — 86.49, p < .001, 
confirming the results of the analyses re- 
ported previously. Also statistically signifi- 
cant were the Practice X Type of Adapta- 
tion interaction, F(1,60) — 6.15, p < .025, 
as well as the Practice 7X Limb interaction, 
F (1, 60) = 7.02, p< 025. Inspection of 
Figure 1 indicates that these interactions 
represented the fact that practice had a 
differential effect on NA (in favor of 
distributed practice) for the exposed limb 
only and on PS (in favor of massed prac- 
tice) for the nonexposed limb only. 

As a means of investigating the possi- 
bility that type of practice and type of 
exposure have differential effects on adapta- 
tion as a function of which measure is used, 
an Exposure X Practice X Type of Adapta- 
tion analysis was performed upon VS and 
ipsilateral PS taken at the 0-min. Post- 
exposure Test. Only Type of Adaptation 
proved to be statistically significant, T 
60) = 8.73, P< .005. Examination of 
Figure 1, B and C, indicates that ipsi- 
lateral PS was greater than VS. 

In order to examine the relative rates of 
decay of adaptation for each of the three 
measures, an analysis (Exposure x Prac- 
x Type of Adaptation X Postexpo- 
sure Test) was done on VS and the NA and 
PS measures taken with the exposed limb 
only. Type of Adaptation, F (2, 120) = 


tice 
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9.66, p < .001, Postexposure Test. Fe (3, 
180) = 1408, p< .001, and their inter- 
action, F (6, 360) — 6.28, p < .001, all 
were significant. The interaction appears 
to be due to the fact that only VS failed 
to decay over the Postexposure Tests. 


Discussion 


Contrary to expectation, all four experimen- 
tal groups demonstrated intermanual transfer 
of NA. It is possible that the relatively short 
prism-exposure period (30 trials of target 
pointing, spread over 1-2 min.) may have been 
an optimal situation for generating transfer, 
irrespective of experimental condition. Freed. 
man (1968) reported large intermanual transfer 
when training was terminated at a preset 
criterion, while no transfer was observed 
when Ss were overtrained. In the present 
experiment, intermanual transfer of NA was 
not influenced by type of exposure. This re- 
sult is contrary to the differential exposure 
effect on transfer reported by M. M. Cohen 
(1967). It is not clear whether massed ys, 
distributed practice affected transfer, Initi- 
ally, the absence of a main effect for Practice 
might be viewed as an indication that the type 
of practice did not affect the intermanual trans- 
fer of NA, However, intermanual transfer is 


pretation; that is, 
do indeed lead to different amounts of transfer, 
The significant Practice X Limb interaction 
Postexposure 


nonexposed limb h 
reported and it is not at all apparent how this 
might occur. Another finding difficult to 
interpret is that in the two distributed groups 
the PS measure for the nonexposed hand 
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showed the change in the antiadapti 
tion. Perhaps the task of pointing 
ahead of the nose in the dark is gove 
only by felt limb position but also by a : 
tendency to point at an imagined . 
ahead position. It could be argued 
that both proprioception and this 
tendency are altered by prism expos 
that only the latter component (or a | 
of it) transfers to the contralater 
Aside from the validity of this explan 
remains of interest that only the 
practice groups demonstrated transfer 
Furthermore, regardless of the basis 
form of transfer, its ‘istence sugges 
the intermanual transfer of NA m 
always be the result of a change in v 
some investigators (e.g., M. M. Coh 67; 
Kalil & Freedman, 1966; Wilkinso: 
have apparently assumed. 

Both NA and PS revealed a signifi 
cline across the four Postexposure T 
both limbs. A review of previous lite 
led to the prediction that NA (for ex 
limb) resulting from continuous « 
would decay more rapidly than that 
by terminal exposure. Such an ) 
Postexposure Test interaction was fou 
the present study, although the signifi: uce 


level was marginal (p < .055). One c na- 
tion for this tentative finding is that terminal 
prism eXposure leads to a relative ge 


change in vision (cf. Uhlarik & Cano: 
and that a prism-adaptive change i: 
decays less rapidly than does propric 


adaptation. Partial Support for this 

pretation is seen in the finding that VS d 
to decay during the Postexposure Phase n 
fact, there was a nonsignificant increase i S 


for three of the four groups. The fact t 
VS appears to be more resistant to decay tha: 

S may be related to the results of a study 
by Hay and Pick (1966) who found that over a 
6-day prism exposure period visual adaptation 
ultimately replaced the relatively transient 
initial change in felt limb position. 


i massed 
practice, a reported by 
Taub and Goldberg (in press) with continuous 
exposure, but not found by Dewar (1970), 
Who used terminal exposure, It is possible 
of the “prism shaping" 


tech: i Howard (1968) led to such a 
;unt of adaptation for both groups 


ects of the practice variable were 
[o imed.” Examination of Figure 1B 
re an unclear picture regarding the 
0] ‘| condition for eliciting PS. None of 
t ctors or interactions for PS in the ex- 


po d limb at the Q-min, Postexposure Test 
ed statistically significant. With respect 

5 the most effective condition at the 0-min. 

texposure Test was distributed practice- 
minal exposure. The finding by Uhlarik 
i Canon (1971) that terminal exposure 
duces primarily VS and continuous exposure 
marily PS was not supported by the present 
‘ata when the practice variable was collapsed. 
in the other hand, Figure 1, B and C, suggests 
he existence of this predicted Exposure X 
Dype of Adaptation interaction, provided 
that only the distributed practice groups are 
examined. It should also be noted that in the 
study by Uhlarik and Canon (1971) the ob- 
tained interaction held only for the “direct” 
measures (ie those taken with the prism 
displacement present) ; no interaction was 
found for the aftereffect measures which, of 
course, were the only kind taken in the present 
experiment. 

With the present data it is possible to ex- 
amine the "additive model" of prism adapta- 
tion presented by several investigators (e.g. 
Wilkinson, 1971), which holds that NA = VS 
-FPS. When adaptation as manifested at the 
(-min, Postexposure Test was examined for 
all groups combined, PS + VS = 4.57, while 
NA = 5.64, This difference of 1.07 proved 
to be significantly greater than zero, £ (63) = 
1.74, p < .05. While it is possible that the 
finding of NA > PS + VS in this experiment 
was due to the fact that NA was measured 
before PS and VS and, therefore, was perhaps 
less affected by spontaneous decay, this seems 
an unlikely event, given that only three mea- 
sures of NA for each hand (a total period of 
approximately 45 sec.) were taken prior to 
the measures of PS or VS. Furthermore, an 
interesting observation is made when this 
additive model is tested for each group 
separately. For the massed practice groups 
the model held quite well—the difference be- 
tween NA (4.60) and PS + VS (4.55) was not 
significantly greater than zero, £ (31) = .10, 
p> .05. On the other hand, the model failed 
to predict the results for the distributed 
practice groups, in that the difference between 
NA (6.68) and ps + VS (4.58) was different 


from zero, t (31) = 2.76, p < .003. The 
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tentative conclusion is that distributed prism 
exposure leads to a third component of prism 
adaptation, which adds to PS and VS in deter- 
mining the magnitude of NA. The precise 
nature of this postulated component remains 
to be clarified. 
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ticular letter. 
letter. 
(less than 7 words) but at some point 
even with categories as large as 32. 


operating for larger sets. 


How is information retrieved from long- 
term memory ? Although it is too early to 
give a detailed answer to this question, 
some recent work does appear to have 
eliminated some of the logical possibilities. 
In particular, there is now substantial 
evidence that the normal process of in- 
formation retrieval involves little or no 
successive scanning of items in the memory 
store, This conclusion is based on a series 
f studies which showed that the amount 
time taken to retrieve a word from a 
large category was either equal to or only 
slightly greater than the time taken to 
retrieve a word from a small category. 
Landauer and Freedman (1968) used a 
series of nested categories in which each 
was a subset of the larger one, thus guar- 
anteeing without an actual count that the 
categories could be ranked according to 
size (e.g., living thing, animal, bird). They 
und that it took longer to decide that an 
item was not a member of a large category 
than to decide that it was not a member of 
a small category, but there was no appreci- 
able difference in decision times for positive 
instances. Collins and Quillian (1969), 
Meyer (1970), and Landauer and Meyer 
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In four experiments, Ss learned categories of varying size and were then 
required to produce a member of one of the categories that began with a par- 


There was only one word in the category that began with the 


Reaction times increased as à function of size with small categories 
levelled off, with no additional increase 
These findings suggest that two processes 
are involved in retrieval—a successive scanning mechanism being used with 
categories small enough to be held in immediate memory (i.e., about 7) and a 
different. mechanism (probably involving little or no successi 


sive scanning) 


(1972) using somewhat different procedures 
report that identification took somewhat 
longer for larger categories. In contrast 
Neisser (1964) compared identification time 
for "animals" and "first names" and found 
no difference in latency. Although most 
of these studies did find a small difference 
favoring the small categories, the magni- 
tude of the difference is very small relative 
to the size of the categories involved. For 
example, Landauer and Freedman report 
that the correct identification time with 
negative instances was 53 msec. longer for 
large than for small categories. Given the 
relative size of the sets to be searched, in 
order for such a small difference to be 
caused by the additional time it takes to 
scan the larger categories, successive scan- 
ning would have to be enormously rapid, 
on the order of a thousand words per 
second, which seems improbable. 

More recently Freedman and Loftus 
(1971) departed from the previous work 
that had studied identification time and 
instead measured the speed with which S 
could actually produce a word himself. 
Instead of giving S a stimulus and asking 
him to decide whether it was a member of 
a category, he was given a category and 
asked to produce a word that belonged in 
it. Several studies by these authors found 
that the speed of producing a word was 
independent of the size of the category to 
which the word belonged. (Freedman & 
Loftus, 1971; Loftus, Freedman, & Loftus, 
1970). 


1085 


1086 JONATHAN L. FREEDMAN 
Although the results of this previous 
work are fairly convincing, the use of al- 
ready existing or natural categories presents 
certain difficulties. Any existing category 
is already large or small—it cannot be 
both. Thus, the size of a category is neces- 
sarily confounded with the specific category 
itself. Accordingly, the effects obtained in 
these studies could be due to certain char- 
acteristics of the specific categories used or 
to the hierarchical system that exists in 
the language. Despite the fact that some 
of these studies have used a large number 
of different categories and found quite con- 
sistent results, it is possible that within 
the language there are certain character- 
istics common to all large categories that 
differentiate them from smaller ones. For 
example, large categories tend to be higher 
up in any hierarchical structure since 
naturally as one goes up the hierarchy a 
category tends to be more inclusive and 
therefore have more members. Thus, the 
lack of effect due to category size could 
conceivably be due to other factors that tend 
to be correlated with the size of the cate- 
gory and which for one reason or another 
facilitate retrieval from large categories. 

A number of authors (e.g., Juola, Fischler, 
Wood, & Atkinson, 1971; Tzeng, 1972) 
have reported studies in which Ss learned 
one or two sets, and were then shown a 
word and asked if it belonged to the set 
they had learned. As with other studies 
of identification time, this work generally 
found small but significant effects of set 
size. It does appear, however, that in this 
research the lists, while learned to criterion, 
were not learned to the point that the Ss 
knew them well. In the Juola et al. study 
Ss were given the list 24 hr. before the 
experimental session and were asked to 
study it until one successful complete recall 
could be made. In Tzeng's study, there 
was a short training session. Thus, al- 
though Ss could recall all of the items on 
the list, it would be difficult to argue that 
they had learned them to any great extent, 

The level to which Ss had learned the 
categories is critical. One can see this, for 
example, by examining more closely Juola 
et al's two-stage model of recognition. 
Their first stage involves contact between 
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input and long-term memory neces: ry to 
assign a recognition strength to the input. 
If the item is highly familiar, the 5 is as- 


sumed to be able to output a fast re ?nse, 
without checking the item again- the 
memorized list. More specifically, tHe re- 
sponse to a highly familiar item is as: :med 
to be rapid, error prone, and indep: «dent 
of the number of items in the men: ized 
list. If the presented item is only roder- 
ately familiar, however, the S is pres med 


to check the memorized list before res>ond- 


ing, and the time to search will be ^ ‘unc- 
tion of the number of items in th: list. 
The above reasoning appears to an 
admission that many of the items w not 
learned very well. Since we are iv. «ti- 
gating the difference between sho nd 
long-term memory, and since amo of 


learning is one element in that diffe: e; 
it is not clear how useful these results ire 
for our purposes. In any case, it seersed 


desirable to conduct a study in wh vi Ss 
were given extensive training or h 
constructed categories and productio- (s 


opposed to identification) from these : 
gories was investigated. This paper ~e- 
ports four such experiments. 


EXPERIMENTS | AND Il 
Method 


Materials. For Experiment I, 50 words \ 
selected from different categories in the Battig 
Montague (1969) norms. Twenty of these we 
were divided evenly among four lists, so that 
5 words in a list began with each of the letter 
MEP S and T. The remaining 30 word ( 
different initial letters) were randomly assigned : 
fillers to each list so as to create lists of length: 5 
(no fillers added), 10, 15, and 20 words. 

For the first S, the 50 words were typed on a 
sheet of paper in such à way that the 5-word list 
was designated List 1, the 10-word list designated 
List 2, etc. The next S studied lists created by 
rotating the 30 filler words, so that his List 1 had 
the same 5 target words as the first S's, but had the 
15 filler words which were previously on List 4. 
Thus, for S 2, List 1 had 20, List 2 had 5, List 3, 10 
and List 4, 15 words, This process was continued 
for the next two Ss so that for each set of four Ss, 
each group of 5 target words was in a total list of 
either 5, 10, 15, or 20 words. For each set of four 
55, 20 new key words with different initial letters 
were selected. In all, five such sets of lists were 
created resulting in a total of 20 unique lists. 

The lists for Experiment II were constructed in 
an identical manner to the lists for Experiment I 


with eption that all words in a particular list 


came thesame category. The categories were: 
anini »y's name, girl's name, or country. Each 
S st ! one list from each of these categories. 
So, for example, for S 1, List 1 consisted of five key 
anin List 2 of 10 girl's names (5 of them key 
namo), List 3 of 15 boy's names and List 4 of 20 
cour 1es. 


Subjects. Undergraduates at the Columbia Sum- 
mer School were paid $6 for 3 1-hour sessions and 
were promised a bonus if they performed well. In 
Experiment I, 20 Ss participated of which only 18 
completed the experiment. In Experiment II, 16 Ss 
completed all sessions. A $2 bonus was paid to all 
Ss who completed the experiment. 

Procedure. At the first session the Ss were told 
that they would be given four lists of words and 
that their task was to learn them extremely well. 
They were supposed to learn what words were in 
each list and also be able to tell what list a par- 
ticular word was in. The instructions said, “you 
should learn these words so that they are almost 
like second nature." During the first session the 
Ss were handed a sheet containing the four lists 
and given 10 min. to study it. They then were 
given 5 min. to write down as many of the words 
as they could remember in the appropriate lists. 
Next, they were shown cards containing a word 
followed by the number of its list or the opposite 
order (e.g., camel—1, i1—camel). The Ss looked at 
these cards and said aloud both halves. Each word 
in the list was presented twice in this manner, once 
with the word first and once with the number of the 
list first, a different random order being used for 
each S. Finally, the Ss once again tried to write 
down as many words as they could remember in 
each list, A¢ the end of the session they were given 
a list and told to study it hard, to practice, and an 
appointment was made for the next session. 

The second session began with a recall task in 
which the S tried to write down as many items in 
each group as he could remember. (Any S who 
could not recall every item was told to go home, 
restudy, and a new appointment for the second 
session was made.) If the S recalled perfectly, he 
then received a number of training tasks in which 
each of his words was presented to him an equal 
number of times. In the first task, E read each of 
the words aloud, the S telling what group it was in 
and making up à sentence containing the word. 
If the S gave the wrong group, E corrected him. 
In the next task, Æ read the words aloud again in a 
different random order and the S was required to 
give another word in the same group. He was told 
to try to give a different word each time but to 
respond quickly. Whenever he made an error he 
was so informed. Finally the recall task was re- 
peated, the Ss were encouraged to study hard and 
a third appointment was made approximately two 
weeks hence. The first two sessions were for pur- 
poses of training; the third session produced the 
essential data. Stimuli were presented on a machine 
similar to that used in previous studies (cf. Freedman 
& Loftus, 1971). It consisted of a half-silvered 
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TABLE 1 


Reaction Times AS A FUNCTION OF CATEGORY 
SIZE FOR EXPERIMENTS I AND II 


Category size 


Experi- 
ment 
5 | 10 | 5 20 
I 1.88 3.98 | 6.25 5.17 
I ii | 1.94 | 2.37 1.93 


mirror behind which a card was placed. When a 
light went on behind the mirror the S could read 
what was printed on the card. The left side of the 
cards could be illuminated at a predetermined inter- 
val before the right side. As soon as the right side 
was lit a timer started which was stopped by the S 
speaking into a unidirectional microphone. In this 
experiment the Ss were shown the number of a 
category to the left and then after a pause of .5 sec. 
(automatically timed) the right side was lit reveal- 
ing a single letter. The S's task was to say aloud a 
member of the indicated category that began with 
that letter. Subjects were instructed to respond as 
quickly as possible but to be certain not to make 
mistakes: "speed is essential but so is accuracy." 
"They were given the following example—“you might 
see the category ‘trees’ and the letter ʻo’ and you 
should say ‘oak’ as fast as you can." Subjects were 
given three practice trials and after each one were 
asked if they fully understood the instructions. If 
during these trials they made any errors, they were 
once again told that they should respond quickly 

but not make errors. The critical trials consisted 

of a random presentation of the 20 possible com- 

binations of the four categories and the five critical 

letters. Thus for each category the 5 gave five 

responses, One beginning with each of the five 

selected letters, The E did not inform Ss whether 

they were correct. If the .S did not respond. within 

10 sec., the trial was terminated and the next trial 

started. 


Results 

In both experiments all of the Ss learned 
the lists well enough to produce the total 
list without errors during Session 2 and to 
respond correctly to virtually all of the 
test trials in Session 2. In addition, on the 
test trials of Session 3, there were fewer 
than 4% errors in Experiment I and fewer 
than 2% in Experiment II. On the other 
hand, there were a considerable number of 
trials on which the Ss gave no response 
within the 10 sec. allowed. For the 5, 10, 
15, and 20 item categories, failures to re- 
spond were 4, 12, 29, and 28% respectively 
for Experiment I, and 2, 2, 12, and 9% for 
Experiment 11. Table 1 presents the mean 
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reaction times for both experiments as a 
function of the size of the category. These 
means are based on median times for each 
SS for each category, with the error and no 
response trials excluded. 

The results for both experiments are 
quite similar. Reaction times are faster 
for categories of 5 than for larger categories 
and for Size 10 than for 15, but are actually 
slower for 15 than they are for Size 20. 
Using the Newman-Keuls procedure (Kirk, 
1969) for simultaneous testing of pairwise 
comparisons, only the following differences 
are significant in Experiment I: (a) Size 5 
is different from all other category sizes, 
p< .05. (b) Size 10 and 15 differed, p < 
.05. For Experiment II, size 5 is different 
from all other category sizes, p < .05, while 
no other differences are significant. 

Clearly responses to the smallest category 
are faster than to any of the others while 
categories of sizes 10, 15 and 20 do not 
produce consistent differences in reaction 
times. If anything, the largest category is 
somewhat faster than the next largest. 
This is not a significant effect, but is the 
opposite of what would be expected from 
a scanning model. Beyond size 10 there is 
no evidence that larger categories take 
longer. 

Before discussing the significance of these 
findings in more detail, it seemed advisable 
to replicate this study once more. The 
possibility existed that the results obtained 
were for some reason dependent on the 
specific list sizes employed and it therefore 
seemed advisable to investigate the effects 
of different size categories. In addition, 
we wanted to include at least two categories 
of sufficiently small size so that they would 
ordinarily be considered to fall within the 
scope of short-term or immediate memory 
since it occurred to us that perhaps two 
different processes were involved in the 
production of members from the smaller 
as compared to the larger categories, Ac- 
cordingly, Experiment III was conducted 
using lists of size 3, 6, 12, and 24. 


Experiment [II 
Method 


Subjects. Twenty-eight Ss were recruited from a 
local newspaper advertisement, Most were under- 
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graduates from nearby colleges. They were ©: id 
$5 for three sessions and were promised a bony of 
$3 if they performed well. 

Materials. A set of 12 target words and a « 

33 filler words were selected from the list of 

nouns provided by Paivio, Yuille, and Madi: 
(1968) on the basis of three criteria: (a) the wo 
were four to eight letters in length; (b) the tar 
words began with each of the letters F, S, and 

(c) none of the filler words began with the let 
F, Sor M. Lists of length 3 (no filler words add 
6, 12, and 24 were then constructed accordin 

the procedures used for list construction in Ex; 
ment I. 

Procedure. The first session was identical to 
of Experiment I, After the session Ss were gi 
the lists to take home and study, and an appoi 
ment was made for the next session approximat: 

1 wk. hence. 

The second session began with a recall task 
which the S tried to write down as many items 
each group as he could remember. Any S who d 
not perform perfectly was sent home and his seco: 
session was rescheduled. During the second sessi 
a series of 90 stimuli was presented on the machin 
used in Experiment I. The stimuli consisted of th: 
number of a list (category) and a word. There was 
an interval of .5 sec. between the category number 
and word. Subjects were supposed to say 3 
when the word was a member of the category and 
“no” otherwise. They were asked to respond 
quickly as possible but to make no errors. Of : 
90 total trials, half were positive and half w 
negative because each word on S's list was paire 
once with a correct list number and once with an 
incorrect one. Finally, the recall task was repe«ccd, 
the Ss were encouraged to study hard, and a third 
appointment was made approximately 1 wk. hence. 

During the third session, stimuli were pres 
on the machine and the Ss’ reaction time for ne g 
a member of a list beginning with a given lette 
measured. The critical trials consisted of a 
presentation of the 12 possible combinations oí 
four categories and the three critical letters. T! 
for each category, the S gave 3 responses, o! 
beginning with each of the 3 selected letters. Rea 
tion times were recorded for all Ss. The E did no: 
inform the S whether he was correct. 


vas 


Results 


There were fewer than 4% errors. The 
percentages of response failures were 2, 8, 
20, and 18 for the 3, 6, 12, and 24 item 
lists, respectively. The mean reaction 
times for correct responses for production 
of a list member are presented in Table 2. 
Using the Newman-Keuls procedure re- 
vealed that responses to items in categories 
of size 3 are significantly faster than re- 
sponses in all other categories, p < .05. 
No other differences are significant. This 
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finding parallels that of the first two ex- 
periment With small categories, the 
larger the set, the slower the reaction times; 
but at some point, a further increase in 
size does not produce longer latencies, and 
there is even a decrease in time for the 
largest category. 

Although the results of these three studies 
were quite consistent, it was decided to 
collect data on a wider range of category 
sizes. Accordingly, Experiment IV in- 
volved 6 categories ranging from 4 to 12. 


EXPERIMENT IV 
Method 


Subjects. Thirty-six Ss, mostly college students, 
were recruited from a local newspaper advertise- 
ment. They were paid $3.50 for 3 sessions, and 
were promised a bonus of $3.50 if they performed 
well. 

Materials, A set of 24 target words and a set of 
58 filler words were selected from the Paivio, Yuille, 
and Madigan (1968) nouns on the basis of three 
criteria: (a) the words were from 4 to 8 letters in 
length; (b) the target words began with one of 
the letters B, F, L, and R and (c) none of the filler 
words began with the letters B, F, L, and R. The 
24 target words were divided into 6 sets such that 
each set had 4 words, each beginning with a different 
letter. Lists of length 4, 5, 7, 12, 22, and 32 were 
created using the procedure described in Experi- 
ment I. A given S received four lists, two short 
lists and two long lis Thus, the first S received 
the 4, 5, 12 and 22 word lists, typed on a sheet of 
paper in such a way that the 4 word list was desig- 
nated List 1, the 5 word list was designated List 2, 
and so on. The lists for the next 3 Ss were created 
as in previous experiments. For the next group of 
4 Ss, the same process was repeated using the 
original lists of 4, 5, 12 and 32 words. In all, 36 
unique sheets of paper were created, each contain- 
ing 4 typed lists. 

Procedure. Upon arriving at the experimental 
lab, S was told that he would be given four lists of 
words and that his job was to learn them extremely 
well. He was given a sheet containing the four 
lists, told to study it every day and an appointment 
was made approximately four days hence. 

The second session began with a recall task in 
which S tried to write down as many items in each 
category as he could remember. If he missed more 
than one word, he was required to go home, relearn 
the list, and return on a different day. Next S was 
given the standard identification task, using the 
machine described in Experiment I. He was shown 
cards containing a word and a list number (e.g., 
camel—1, 2—shirt) and he responded “yes” into 
the microphone when the word was a member of 
the category and "no" when it was not. Negative 


TABLE 2 


Reaction Times AS A FUNCTION oF CATEGORY 
SizE FOR EXPERIMENTS III AND IV 


A Category size 
peri- 
ment | | 4 | A 

HI |144 | 2; 

IV 2.63 | 2.71 


trials were created by pairing words on the list with 
incorrect category numbers. The S saw an equal 
number of positive and negative trials, with the 
word to be categorized printed before the list 
number on half the trials, and after it on the re- 
maining half. 

The third session occurred approximately 2 days 
later. First S was asked to write down as many 
items from each category as he could remember. 
Again, if he missed more than one word, his third 
session was rescheduled. Next, S was given the 
“item-production” task; stimuli were presented on 
the same machine using the same procedure de- 
scribed previously. In all, a random presentation 
of the 16 possible combinations of the 4 categories 
and the 4 key letters was presented to S. Thus, 
for each category, S gave 4 responses, one beginning 
with each of the 4 key letters. 


Results 


There were fewer than 5% errors. The 
percentages of response failures were 4, 5, 
4, 8, 11, and 13 for the 4, 5, 7, 12, 22, and 
32 item categories, respectively. The mean 
reaction times for correct responses for the 
six category sizes are presented in Table 2. 
Once again there is an increase in RT’s as 
category size increases from 4 to 5 and from 
5 to 7; but at that point no further increase 
occurs, with 12 and 32 actually being some- 
what faster than 7. Using the Newman- 
Keuls procedure, size 4 and 5 are different 
from all other category sizes, p < .05, but 
not from each other; no other differences 
were significant. 


GENERAL Discussion 


At first glance, the present results may 
appear to be at odds with those reported by 
Juola et al. (1971). Subjects in their experi- 
ment memorized a list of either 10, 18, or 26 
words, and then were tested for recognition 
of those words. The recognition test consisted 
of presenting a series of single words to S, 
and for each word S made a positive or nega- 
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tive response to indicate whether or not the 
word was a member of the learned list. Re- 
sponse latency was found to increase as list 
size increased. However, the Juola et al. 
paradigm is different from the present one in 
several major respects. Their Ss learned a 
single list of words; ours were tested with a 
production test. Their S could respond on the 
basis of a subjective judgment of the familiarity 
of the test item (and this is precisely what 
Juola et al. claim their Ss do); while our Ss 
could not. 

The four experiments reported here have 
produced consistent results—with small cate- 
gories, the larger the set, the slower the reac- 
tion time; at some point, this effect ceases, 
and additional increases in set size do not 
affect latencies. Although the failures to re- 
spond increased only slightly with category 
size for the large categories, one might claim 
that the stable latencies for correct responses 
are simply an item selection artifact. The 
argument is that as category size increases, the 
strength of any given item in the category 
progressively decreases, until eventually the 
item's strength does not allow it to be recalled 
within the 10 sec. limit. Thus, although the 
average strength of all items decreases with 
category size, the weakest items move from 
being recallable within 10 sec. to being re- 
sponse omissions. This would leave the mean 
correct-response latency unaffected by cate- 
gory size, However, such an item selection 
notion would also predict that the entire fre- 
quency distribution of correct response laten- 
cies would progressively shift in a positive 
direction as category size increases, and this 
result did not occur. 

A more reasonable interpretation of these 
results is that two different retrieval processes 
are involved when items are retrieved from 
small versus large categories. The smaller 
categories are within the capacity of short- 
term memory. The individual is able to keep 
as many as seven items in his immediate 
memory store. When he is given the category 
name and then asked to produce an item 
beginning with a particular letter, he may put 
the full category in his immediate store, scan 
it, and find the appropriate item. This would, 
of course, produce a category size effect, with 
larger sets taking longer (Sternberg, 1966). 
However, once the category is too large for 
the immediate store, this process is no longer 
feasible, Instead S must rely on a mechanism 
for retrieving information from long-term mem- 
ory, and as previous research has indicated, 
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this does not involve a successive search 
is accordingly not affected by the size o 
set. 

This implies that a process equivale: to 
that involved in short-term memory ma: be 
activated, The critical element, accordi: to 
our notion, is the size of the set to be sear «d. 
If it is small enough to be held in short- rm 
memory, it will be; and the search proces: vill 


be the familiar successive scan. If the is 
too large for the short-term store, a difi t 
process will be employed, one involving le 


or no successive scanning. We would add at 
if the normal process for searching long n 
memory should fail (i.e, S does not fin e 
desired item), it seems plausible that S v | 
again resort to successive scanning and 

the size of the set would once more be c: 
That is, very difficult items from long 
memory might force S to use some í 

succi ye processing. Thus, we would 
pothesize three distinct mechanisms oí 
trieval: (a) successive scanning for small sc 
(b) nonsuccessive scanning for larger sets 
long as the items to be found are reas 

easy or for whatever reason the search ` 
cessful within a reasonable period; (c) 
siderably slower and tedious successive 

for large sets when necessary. 

The notion of dual strategies is not n 
with us; evidence has been found by oth 
investigators, in different situations, For ¢ 
ample, in an experiment by Mandler and Dean 
(1969), Ss were shown a list one word at : 
time, As each additional list item was pr: 
sented, S had to recall all of the words pr: 
sented up to that time. With each successi 
input item, the S’s organization of the ' 
changed; with categorized lists, in particula 
these changes were quite dramatic. For the 
first few trials, an S usually began recalling 
items in the order they were presented. Afte 
roughly five items had been presented, 5 
typically began to organize the items into 
category clusters rather than recall them in 
their input order. The typical strategy, then. 
appears to be as follows: Ss begin by recalling 
the specific items on the list in STM; when 
STM capacity is exceeded, they reorganize the 
STM information. Rumelhart, Lindsay, and 
Norman (1972) have incorporated these two 
strategies into a successful simulation of list 
learning under these conditions. 
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yet set size, stimulus frequenc. 


quence, and the int 
Response-stimulus 
difficult to 


Theios, Smith, Haviland, Traupmann, 
and Moy (1973) have demonstrated large 
stimulus presentation frequency effects on 
reaction time (RT) in a Sternberg (1967) 
type fixed-set memory scanning task. 
Mean RT to both positive and negative 
stimuli increased approximately 85 msec. 
as stimulus presentation probability was 
decreased from .30 to .05.  Theios et al. 
interpreted these results as supporting a 
serial self-terminating memory scanning 
process such as the model proposed by 
Theios (1973a, 1973b). It is assumed that 
memory representations of both target and 
nontarget stimuli are organized probabilisti- 
cally, trial-by-trial into a limited capacity, 
push-down memory stack on the basis of 
stimulus occurrence. More frequent stim- 
uli (both target and nontarget) result in 
relatively fast RTs since they are more 
likely to have occurred recently and have 
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and response-stimulus interval were varied 
sign first used by Sternberg. Signifi 
to stimulus frequency, stimulus sequence 
stion between stimulus freq 
terval affected only the RT 
"count for assuming a serial exhaustive memory scanning process, 
but can easily be accounted for using a self-terminating scanning process. 


ING REACTION TIMES' 


cant effects on RT were found due 
, response frequency, response &e- 
ncy and target set size. 
tercept. The data are 


their S-R memory pairs located relatively 
early in the serial scan. Reaction time to 
infrequent stimuli will be relatively slow 
since their S-R memory pairs will be 
located relatively late in the serial scan. 

In addition, the Theios (1973b) self- 
terminating memory model predicts that 
there should be both stimulus and response 
sequential effects in memory scanning RTs 
similar to those found by Bertelson (1965), 
Rabbitt (1968), Remington (1969), and 
Smith, Chase, and Smith (1973) in choice 
RT. Reaction time to a stimulus in à 
memory scanning task should decrease as 
a function of recency and frequency of 
either stimulus or response occurrence in 
any random sequence of trials. A response 
frequency as well as a stimulus frequency 
effect on RT follows directly from the se- 
quential properties of the process. 

Finally, the Theios (1973b) model pre- 
dicts an interaction in RT between stimulus 
Írequency and target set size, since both 
these variables are assumed to affect the 
memory scanning process, 

The exhaustive scanning process sug- 
gested by Sternberg (1966, 1967) can ac- 
count for response frequency and response 
sequential effects by a response process 
such as Sternberg's (1969) binary decision 
or response translation stages. However, 
the exhaustive scanning model does not 
have any explicit mechanism to account for 
stimulus frequency or stimulus sequential 
effects in RT. The exhaustive scanning 
model might possibly account for stimulus 
frequency and sequential effects by assum- - 
ing that stimulus recency affects the stimu- 


STIMULUS SEQUENTIAL EFFECTS IN MEMORY SCANNING 


lus encoding process such that more fre- 
quent and recent stimuli are encoded faster 
than less frequent and less recent stimuli. 
This would predict stimulus frequency 
effects on RT to target and nontarget 
stimuli, stimulus sequential effects, and no 
interaction between stimulus frequency and 
target set size. This follows since stimulus 
frequency would affect the encoding process 
and target set size would affect the memory 
scanning process, and the effects of the 
two variables would add rather than inter- 
act (cf. Sternberg, 1969). 

The present experiment and analyses 
were conducted to provide further evidence 
on the question of whether memory scan- 
ning is self-terminating or exhaustive. A 
critical question is whether there will (self- 
terminating prediction) or will not (ex- 
haustive prediction) be an interaction in 
RT between stimulus frequency and target 
set size, The design of the experiment was 
effectively a replication on our equipment 
of the essentials of the Sternberg (1967) 
intact stimulus condition. In addition, 
however, we analyzed the RT data for 
stimulus and response probability effects 
and sequential effects and for an inte 
involving stimulus probability and target 
set size. 


Mernop 


Subjects. The Ss were 48 University of Wisconsin 
undergraduates (30 females and 18 males) who par- 
ticipated in order to fulfill an introductory psy- 
chology course requirement. It was required that 
all Ss be right-handed, 

Apparatus. The apparatus was the same as that 
described by Theios et al. (1973). The stimuli were 
the set of digits 0 through 9 presented on an Indus- 
trial Electronic Engineers, Inc., Series 10 rear- 
projection visual readout unit. The S sat in an 
arm chair with a response button mounted in each 
arm beneath the index finger. A Digital Equipment 
Corporation PDP-8 computer was programmed to 
select at random the stimulus for each trial with the 
probability for the selection of any particular stimu- 
lus on a given trial preset by Æ. The computer also 
measured the RT and timed the response-stimulus 
interval. 

Design. The 48 Ss were divided into two groups 
of 24 each. One of the groups was instructed to 
press the right-hand button whenever a positive set 
stimulus occurred and the left-hand button other- 
wise, The other group pressed the left button on 
positive trials and the right one on negative trials. 
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‘The 10 stimuli were partitioned into positive and 
negative sets in the same manner as in the Sternberg 
(1967) study, For a given S, 1 digit was presented 
on approximately 4/15 of the trials, 2 others were 
presented on approximately 2/15 of the trials each, 
and the remaining 7 each occurred on approximately 
1/15 of the trials, Each S was run through four 
conditions with positive memory set size (M) equal 
to one, two, three, or four, Each of the 10 digits 
was assigned to the positive set for one condition 
and to the negative set for the three others, For 
M = 1, 2, and 4 the stimuli were assigned to the 
positive set so that the probability of a positive 
response occurring was 4/15. The M = 3 condition 
was always presented first, as a practice condition, 
and here the probability of a positive response was 
3/18 (the digits assigned to the positive set in this 
condition were the three 1/15 probability digits not 
used in the M = 4 positive set). Only data from 
the conditions where M = 1, 2, or 4 were analyzed, 
Each of the six possible orders of presentation of 
the M = 1, 2, or 4 conditions was assigned to four 
Ss within each group. Two blocks of trials were 
run for each condition, with 108 trials in each block, 
For one block, the time between S's response to one 
stimulus and the onset of the next stimulus. (the 
response-stimulus interval or RSI) was ,5 sec, For 
the other block, the RSI was 2 sec. Each of the 
two presentation orders of these blocks was assigned 
to two members of the four-subject subgroups, The 
actual assignment of particular digits to the posi- 
tive set in cach condition followed the example of 
Sternberg (1967), 

Procedure, The instructions urged 5 to respond 
as fast as possible without making errors, The 
number of permissible errors was never made 
explicit. The S was told that if he made too many 
errors in any one block of trials, he would be given 
additional blocks of trials until his error rate was 
acceptably low, Actually, the largest number of 
errors permitted in a block of 108 trials was nine. 
The S was then told the numbers in the positive 
set, After a block of trials S was allowed to relax 
while the error data was being typed out on the 
teletype. If the number of errors was within the 

limit, the next block of trials, with the 

ive set but with the other level of RSI, 
was begun, If the number of errors was too large, 
the trial block was repeated. This procedure was 
continued, alternating short and long RSI, through 
all four memory set size conditions until all eight 
different trial blocks were run. 


RrsULTS AND DISCUSSION 


The overall error proportion was less 
than .03, and trials on which errors oc- 
curred were not included in the following 
analyses. The effect of the between-sub- 
jects variable (dominant vs. nondominant 
hand on positive set) was examined by an 
analysis of variance along with several of 
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FIGURE 1, Mean Teaction time for positive 
(target) and negative (nontarget) stimuli as a func- 
tion of memory set size with the best-fitting pair of 
parallel linear functions, 


the other variables. This variable was 
found to be nonsignificant (F < 1.00), and 
in all further analyses all Ss were treated 
as one group. 

Figure 1 shows the increase in mean RT 
over memory set size for positive and for 
negative trials plotted against the best- 
fitting pair of parallel straight lines. Sepa- 
rate analyses of variance were performed 
for the positive and negative data to deter- 
mine whether the null hypothesis of lin- 
earity could be rejected for either. It was 
found that both the positive and negative 
data points differed significantly from their 
respective best-fitting straight lines. For 
positive data points, F (1, 94) = 12.88, 
P < .001, and for negative data points, F 
(1, 94) — 6,08, p «.05. However, it was 
possible to account for 97% of the variance 
among the six means by the hypothesis of 
parallel linear functions, The data are of 
the same general form as Sternberg's (1967) 
data, and the present study used four 
times as many Ss as Sternberg's, meaning 
that for the present study there is a great 
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deal more power for the analysis of va: nce 
test of the linearity hypothesis. 


Stimulus Frequency Effects 

An important test of the theore -al 
models is whether or not stimulus pre :n- 
tation probability effects can be found >- 
tween items for a given memory set 
Figure 2 shows the mean RT for nega: ve 
items at each of the levels of probabi! cy 
for each target set size. Note that R^ is 
higher for low probability items under 1 
set size condition, Two analyses of \ 
ance were performed on the data re} 
sented graphically in Figure 2, the first 
mean RT for negative items with pri 
bility 1/15 and 2/15 and with 4 and 1 ite 
in the positive set and the second on me 
RT for negative items with probabili 
4/15 and 1/15 and set size 2 and 4. ‘J 
pattern of significant effects was the san 
for both analyses. With 1 and 47 degree 
of freedom in each comparison, there were 
significant main effects due to target set 
size, F = 234.28 and F = 70.94, ps < .001 
for the first and second analysis, respec 
tively, stimulus probability, F = 8.29, p< 
01 and F = 27.26, p < .001, response 
stimulus interval F = 28.24 and F = 39.09, 
ps < .001, along with only one significant 
interaction, that involving set size and 
probability, F = 5,22, P< .05 and ^ 
8.24, p < 01. 

The finding of a probability effect among 
the negative items could not be predicted 
on the basis of an exhaustive scan of the 
positive set as proposed by Sternberg 
(1967). However, the finding is consistent 
with a model in which both positive and 
negative items are represented in memory 
and mean RT is inversely related to stimu- 
lus frequency and recency (Theios et al., 
1973). On the other hand, the possibility 
must be considered that the probability 
eflect does not reflect the nature of the 
serial comparison Stage, but rather one of 
Sternberg's (1969) other processing stages, 
presumably the encoding stage. However, 
the observed interaction between stimulus 
probability and target set size leads to 
the conclusion, on the basis of the Sternberg 
analysis, that set size and stimulus proba- 
bility affect the Same processing stage, In 
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the Sternberg papers, the assertion that 
the serial comparison stage is exhaustive 
is based on the parallel increase in mean 
RT for positive and negative items as 
target set size is increased. The target 
set size variable and the serial comparison 
stage are strongly linked. Therefore, the 
finding of the interaction between stimulus 
probability and target set size is not com- 
patible with the assumption of a serial 
comparison stage unaffected by stimulus 
probability. 

One way in which an exhaustive scanning 
model might be used to explain the proba- 
bility effects found in the present study is 
that the serial comparison stage could be 
presumed to consist of an exhaustive scan 
of the target set but with the scanning rate 
faster for high probability probe stimuli. 
However, the basis for Sternberg's asser- 
tion of an exhaustive scan rather than a 
self-terminating scan is the parallel increase 
in mean reaction time for positive and 
negative items over increases in memory 
set size, rather than an increase which is 
twice as great for negative items as for 
positive items. But that result was ob- 
tained in a situation (the present design) 
where target set size and relative stimulus 
probability were confounded. If stimulus 
probability affected scan time significantly, 
it would mean that the removal of the 
probability effect (so that the “pure” target 
set size effect could be seen) would result 
in a reduced slope for mean RT to positive 
items since items which are in small target 
sets are also highly probable in the Stern- 
berg (1967) design. Also, there would be 
an increased slope for mean RT to negative 
items since the high probability stimuli are 
negative under the larger set size condi- 
tions. The consequence, therefore, of 
adopting this particular modification of the 
exhaustive scanning model is to destroy the 
original rationale for proposing the model. 
Since stimulus probability affects mean RT, 
the rationale for the Sternberg (1967) ex- 
haustive scanning model is called into ques- 
tion regardless of whether stimulus prob- 
ability is presumed to affect the scanning 
stage, the encoding stage, or any other 
stage, since the parallel linearity of his 
obtained functions must be assumed to be 
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Mean reaction time to negative (non- 
target) i as a function of stimulus presenta- 
tion probability and target set size (M). 


partially due to variations in stimulus 
probability as well as to target set size. 


Response-Stimulus Interval 


'The mean RT for the long (2.0 sec) re- 
sponse-stimulus interval (RSI) was sig- 
nificantly higher than for the short (.5 sec.) 
RSI. The fact that RSI does not have a 
significant interaction with either target 
set size or stimulus probability implies that 
RSI does not affect serial comparison, but 
rather some other processing stage. It is 
possible that variation in RSI affects S's 
"readiness" in a continuous information- 
processing task. With RSI at .5 sec. the 
task demands complete attention and con- 
tinuous concentration for a period of less 
than 2 min. for a block of 108 trials. For 
an RSI of 2.0 sec., 108 trials take nearly 
5 min. and the length of time between 
trials is enough so that the S's attention 
may move away from the task until it is 
brought back by the onset of the next 
stimulus. It is not necessarily the case 
that the RSI affects a precomparison stage. 
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^ Ficure3. Mean reaction time (msec.) as a func- 
tion of sequence of stimuli for long and short 
response-stimulus intervals, respectively. [Let S 
répresent any particular stimulus and D represent 
any stimulus different from S. The point 5 repre- 
sents the mean reaction time to stimulus S over all 
trials. The points DS and SS represent the mean 
reaction time (over all trials and all stimuli) to 
stimulus S given that a different (D) or the same 
(S) stimulus was presented on the previous trial, 
Likewise, the letter triples represent the mean reac- 
tion time (over all stimuli and trials) to stimulus S 
given that a different (D) or the same (S) stimulus 
was presented on the previous two trials. For 
example, the point DDS represents the mean reac- 
tion time toa stimulus S given that another stimulus 
(or two other stimuli) were presented on the pre- 
ceding two trials. ] 


It may instead affect response selection or 
evocation time, 


Sequential E fects 


If the ordering of a self-terminating 
memory stack is based on frequency and 
recency, it follows that the same stimulus 
occurring twice or more in succession would 
increase the probability of that stimulus 
representation moving to an earlier posi- 
tion in the memory buffer. Accordingly it 
is useful to determine if mean RT is 
affected by whether the stimulus on a given 
trial is the same or different from the 
stimuli on immediately preceding trials. 
Figure 3 shows the stimulus-response se- 
quential effects in the RT data. The 
figure shows the mean RT to a stimulus 
(S) as a function of the stimuli which 
occurred on immediately preceding trials. 
If the same (S) stimulus is repeated, reac- 
tion time systematically decreases. On the 
other hand, if the stimulus S has been pre- 


ceded by different (D) stimuli the rea 
time systematically increases. 

Response sequential effect. The sequi 
tial effects in Figure 3 could be due t 
either stimulus repetitions or rc-ponse repe 
titions. That theré are respo quential 


effects in the data is shown ire 4, 
Here RT to a stimulus (S) is se 
function of whether the same (S po se 
or a different (D) response was c Oii 
immediately preceding trials. Hl: 

these response sequential effects arc n 
confounded, to some degree, with sti us 


repetitions. 

"Pure" stimulus sequential effect. 
order to obtain measures of the pure sti 
lus and response sequential effects, 
considered only runs of trials in which : 
same response is given repeatedly. Fig 
5 shows the pure stimulus and respo: 
sequential effects. The top branch sho: 
the pure response repetition effect whe: 
the stimulus is not repeated, but the re- 
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FIGURE 4. Mean reaction time (msec.) as a 
function of sequence of responses for long and short 
response-stimulus intervals, respectively. [Let S 
represent any particular response and D any re- 
sponse different from S. The point S represents 
the mean reaction time for response S over all 
trials. The points DS and SS represent the mean 
reaction time (over all trials and responses) for 
response S given that a different (D) or same (S) 
response occurred on the previous trial. Likew 
the letter triples represent the mean reaction time 
(over all trials and responses) for response S given 
that the same (S) or different (D) responses occurred 
on the previous two trials. For example, the point 
DSS represents the mean reaction time for response 
S given that the same (S) response was made on the 
previous trial but a different (D) response was made 
two trials previously. ] 


STIMULUS SEQUENTIAL E 


tematically decreases with response repeti- 
tion. The lower branch shows the decrease 
in RT when both stimulus and response 
are repeated. As can be seen, the decrease 
in RT is much greater for stimulus and 
response repetition than for response repe- 
tition alone. The difference between the 
nodes of any two adjacent branches of the 
sequential tree diagram is a measure of the 
pure stimulus repetition effect. On the 
basis of these analyses, it is obvious that 
there are both stimulus and response se- 
quential effects in memory scanning data, 
at both long and short response-stimulus 
intervals. 


Application of a Self-Terminating Model 

Theios et al, (1973) discussed three 
models of a self-terminating memory scan- 
ning system. The first model had an un- 
limited capacity serial scanning buffer, the 
second model had a fixed sized, limited 
capacity scanning buffer, and the third 
model had a variable sized, limited ca- 
pacity scanning buffer equal to the target 
set size. Each of these three models had 
two versions, one in which movement up 
in the memory buffer was the same for 
target and nontarget stimulus representa- 
tions (a, = a.) and one in which target 
(positive) representations moved up faster 
than nontarget (negative) representations 
(a, > a). The assumption of the equality 
of the movement parameter can be rejected 
on the basis of the data from the present 
experiment. Assuming d, equal to a, it 
was not possible to get enough slope in the 
predictions for positive RT as a function 
of target set size. Thus, with a, greater 
than a_, target representations on the 
average will be located earlier in the 
memory buffer than representations of non- 
target stimuli of equal frequency. 

The fact that there is a target set size 
effect in the data of Figure 2 is sufficient 
to reject the assumptions of either an un- 
limited capacity buffer or a fixed sized, 
limited capacity buffer. The data suggest 
that the buffer capacity increases with 
target set size. 

"Taken together, the data of the present 
experiment suggest the variable sized mem- 
orv stack model proposed by Theios et al. 
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FIGURE Mean reaction time (msec.) as a 


function of sequence of stimuli (for long and short 
response-stimulus intervals, respec tively) during 
runs of the same response. [Let S represent any 
particular stimulus and D represent any stimulus 
different from S. The point S represents the mean 
ction time to stimulus S over all trials 
points DS and SS represent the mean react ion time 
(c all trials and all stimuli) to the stimulus S 
given that a different (D) or the same (S) stimulus 
was presented on the previous trial and gi 
the response on both s was the same, I 
the letter triples represent the mean reaction time 
(over all stimuli and trials) to stimulus S given that 
a different (D) or the same (5) stimulus was pre- 
sented on the previous two trials and given that 
the response on all three trials the same, For 
example, the point DDS is the mean reaction time 
toa stimulus S given that different stimuli requiring 
the same response were presented on the two previ- 
ous trials. ] 


(1973). Using a Monte Carlo computer- 
simulation program, separate quantitative 
predictions. were obtained from the self- 
terminating memory scanning model de- 
scribed in Theios (1973b) and Theios 
et al. (1973) for thelong and short response 

stimulus interval conditions. A statistical 
response protocol corresponding to the self- 
terminating process was obtained for every 
response protocol given by a real S in the 
present experiment, under the assumption 
that the Ss were homogeneous with respect 
to the values of the parameters of the 
process. The size of the memory stack 
(Z — 1) was set equal to the target set 
size and the time to retrieve a response 
from long-term memory given an unsuccess- 
ful exhaustive scan of the buffer (tz) was 
set equal to the buffer comparison time (f). 
Chandler's (1969) subroutine STEPIT was 
used to estimate the values of the remain- 
ing five parameters. They were the RT 
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FiGURE 6, Mean reaction time as a function of 
target set size for positive (target) stimuli (open 
circles) and negative (nontarget) stimuli (closed 
circles). [The lines are predictions obtained from a 
Monte Carlo computer simulation of the self- 


terminating memory scanning model of Theios, 
1973b.] 


intercept (the combined encoding and nega- 
tive response time, C), the difference be- 
tween positive and negative response selec- 
tion and output times (D), the memory 
comparison time (/), the probability of a 
positive stimulus-response representation 
moving up to the top of the memory stack 
after that stimulus is presented (a,), and 
the probability of a negative stimulus- 
response representation moving up to the 
top of the stack after that stimulus is 
presented (a_). 
Table 1 shows the mean RT values from 
‘he computer simulation as well as the 
nean RTs from the present experiment. 
V goodness-of-fit analysis of variance was 
erformed to determine whether the ob- 
erved mean RTs differed significantly from 
hose obtained from the two computer 
mulations. There are 20 comparisons 
mean differences) involved, and a total of 
) parameters were estimated from the 
ata to obtain the predicted means. Thus, 
te numerator degrees of freedom are 10. 
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The error term for the F ratio is the average 
of the squares of the standard errors of the 
observed means, that is, the within-cell 
variance, which has 940 degrees of freedom. 
The test indicated no significant differences 
between the simulated means and the ob- 
tained means, F (10, 940) = 1.19, p > .05. 
It is reassuring to note the parameter in- 
variance obtained from the two indepen- 
dent simulations. The parameter esti- 
mates for the long and short RSI conditions 
differ appreciably only in the RT intercept 
(C). The fit of the model would be almost 
as good if the long and short RSI condi- 
tions were simulated together, using com- 
mon values for the remaining parameters, 
i.e., if only 6 parameters would have been 
estimated. 

Figure 6 shows that, contrary to common 
belief, the self-terminating process predicts 
that mean RT is an essentially linear func- 
tion of target set size with approximately 
equal slopes for target and nontarget stim- 
uli. Figure 7 shows that the self-termi- 
nating process predicts the obtained inter- 
action in RT between target set size and 
stimulus frequency. 

Inspection of the estimated values of the 
parameters in Table 1 indicates that the 
scanning rates are typical (42 msec.) and 


TABLE 1 


SERVED MEAN REACTION TIMES (MSEC.), 
STANDARD ERRORS, 


= SELF-TERMINATING 
MODEL, AND PARAMETER VALUES 
USED IN THE SIMULATIONS 


Short response- 


jl s Long response 
stimulus interval 


stimulus interval 
Condition e e 
Simu- | Oh- 

lated | served 


;| Simu- | Ob- 
SE | ated served |“ 


Target Set Size 1 | ei 
Positive 4/15 435 |434 4| 8) 485 | 471 | 10 
| 8| 
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Ficure 7. Mean reaction time (dots) to non- 
target stimuli as a function of stimulus presentation 
probability and target set size. [The lower two 
points are for a target size of one, the middle two 
points are for a target size of two, and the upper 
three points are for a target set size of four. The 
lines represent predictions obtained from a Monte 
Carlo computer simulation of the self-terminating 
memory scanning model of Theios (1973b). The 
data clearly show the significant obtained and pre- 
dicted interaction between stimulus probability and 
memory set size. ] 


that the a, estimates are larger than the 
a- estimates reflecting the greater emphasis 
on the target items in the scanning process. 
The 79-msec. difference between positive 
and negative RT intercepts reflects the 
fact that negative responses were required 
on 11/15 of the trials whereas positive 
responses were required on only 4/15 of the 
trials. Sternberg (1969) has reported simi- 
lar results. The 79-msec. difference can be 
accounted for by bias in what Theios 
(1973b) has called the response program 
selection process. 


CONCLUSIONS 


The results clearly indicate that in the 
Sternberg (1967) fixed-set memory scanning 
task there are significant (a) stimulus frequency 
effects, (b) stimulus sequence effects, (c) re- 
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sponse frequency effects, (d) response sequence 
effects, (e) target set size effects, and (f) 
effects due to an interaction between stimulus 
frequency and target set size. The obtained 
interaction is especially difficult to account for 
using an exhaustive scanning process. How- 
ever, the observed pattern of results is easy 
to account for using a self-terminating process 
(Theios, 1973b; Theios et al., 1973) as was 
shown by a computer simulation of the present 
experiment. When viewed against the back- 
ground of the research literature in the choice 
reaction time and memory scanning areas, the 
present results suggest the general theoretical 
process of human perception, cognition, deci- 
sion making, and response selection proposed 
by Theios (1973a, 1973b). 
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STIMULUS PROBABILITY AND STIMULUS SET SIZE 
IN MEMORY SCANNING! 
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In many recent studies of speeded scanning of immediate memory, variations 
in the size of the positive set (s) were confounded with variations in the 


probability (P) of the individual items within the positive set: As s increased, 
P decreased . The present experiment sought to determine whether the effect 


on RT attributed to s could be accounted for by variations in P. 


"This was 


accomplished by factorially varying both s and P. Probability effects were 
confined to items in the positive set and were insufficient to account for the 


effect of s. The results are di 


ed in terms of a model in which s and P affect 


different information-processing stages. The s affects the number of compari- 
sons between the encoded item and the items stored in the memory of the 


positive set, as proposed by Sternberg. 
information as to the particular digit that was presented is a: 


selection— 
ble to the 


The P affects respoi 


mechanisms for response selection along with the knowledge that there was or 


was nota match. The 
in tune with the P values 


The number of things that one has to 
think about and the expectancy as to the 
likelihood of occurrence of these things— 
stimulus number and stimulus probability 
—have long been regarded as fundamental 
variables in the study of cognition. The 
common finding that longer RTs would be 
produced by an increase in the number of 
possible stimuli or a decrease in stimulus 
probability was a result that was compati- 
ble with most theories of stimulus recogni- 
tion. Discriminating among the various 
theoretical accounts for these effects has 
been a more elusive task. 

One class of models holds that variations 
in stimulus probability and stimulus num- 
ber affect only a single commodity such as 
information (in bits) or repetitions, Ex- 
amples of such models are those that posit 
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the Research Foundation of State University of 
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J. Cleary, and Myra P, Leffel for their aid in the 
data analysis, 


sponse selection mechanisms are assumed to be biased 
of the items within the positive set. 


a single decision maker which operates « 
stimulus information in bits (e.g., Brigg 
& Johnsen, 1973; Briggs & Swanson, 1970: 
Hick, 1952) or sequential position in : 
stack (Theios, Smith, Haviland, Traup- 
mann, & Moy, 1973). 

In contrast, Sternberg (1: 1967) | 
advocated a model in which the time to 
scan immediate memory is affected by the 
number of possible items in memory- -no 
by their probability of occurrence. In 
series of character classification exper 
ments, Sternberg (1966, 1967) found tha 
the time to classify a character (e.g., 
digit) as a member of an arbitrary subset 
of characters (the memory or positive set 
was a linear increasing function of the 
number of items, 5, in that subset. More- 
over, the slopes for the positive (match) 
and negative (nonmatch) functions were 
identical. From these data, Sternberg 
postulated a high speed (25 items/sec), 
exhaustive, sequential scan of the items in 
the positive set. However, in Sternberg's 
experiments, the number of items in the 
positive (memory) set s was confounded 
with the probability (P) of the individual 
items in that set: As s increased, P de- 
creased. By interpreting his results sol: 
in terms of stimulus number, Sternberg was 
implicitly assuming that there was either 
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no effect of P* or else that P affected some 
tage other than memory scanning. This 
onfounding, which Sternberg explicitly 
acknowledged, was a necessary consequence 
of controlling for stimulus and response 
entropy and response probability. For 
example, in Sternberg's (1967) experiment, 
when s = 1, 2, and 4, the individual items 
hod P-values of 4/15, 2/15, and 1/15, re- 
ctively, so that the probability of a Yes 
sponse was always 4/15. Because of the 
founding between s and P, Biederman 

| Zachary (1970) suggested that the 
ower RTs to the larger set sizes might, 
in part, represent adjustments to stimulus 
probability within the positive set. (Stern- 
berg, 1966, found no effect of increasing 
number of negative set items from 2 

15, with negative response probability 
ld constant, so if probability adjustments 
re at play, they would have to be to 
sitive set stimuli.) An effect of P had 

idy been documented in other experi- 
ments (e.g., LaBerge & Tweedy, 1964), in 
faster RTs were produced by more 
able stimuli even when the different 

uli were assigned to the same response. 

he present experiment was designed to 

sal whether such probability adjust- 

m sts would be found in the Sternberg 
memory-scanning task. Specifically, the 
probability of the items within a given 
sitive set was varied. If it is assumed 
ihat P affects memorial comparisons and 
the memorial comparisons proceed at à 
fixed rate, then an exhaustive, sequential 

nning theory would predict that such a 

bability imbalance would have no effect 
on RTs. 

Recently, however, Klatzky and Smith 
(1972) argued that an exhaustive scanning 
theory could accommodate such a result 
by assuming that the adjustments to proba- 
bility were made at some stage other than 
the comparison stage; viz., a stimulus en- 
coding stage. The exhaustive scanning 
theory and the additive factors method 
(Sternberg, 1969) then predict that varia- 
tions in P and s should produce additive 
effects on RT. Such a result was reported 

lec E 


3 As defined here, P is the probability of occur- 
rence of a particular stimulus. 
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by Klatzky and Smith in a letter classi- 
fication task. Also compatible with an en- 
coding effect of P is Miller and Pachella's 
(1973) finding that P and stimulus degra- 
dation (which would be expected to affect 
encoding) interact. The RTs to items of 
low probability of occurrence were more 
adversely affected by the presence of visual 
noise than high-P items. Since Sternberg 
(1967) found that degradation did not 
interact with variations in positive set size 
(i.e., memory scanning), the Klatzky and 
Smith (1972), Miller and Pachella (1973), 
and Sternberg (1967) experiments are all 
compatible with a two-stage theory, en- 
coding and memory scanning, for the effects 
of P and s, respectively. 

The design of the present experiment 
compared the effect of P on positive RTs 
with the effect on negative RTs. An en- 
coding theory of P-effects would predict 
that response type would not interact with 
P. Theios et al. (1973), Krueger (1970), 
and Miller and Pachella (1973), in fact, 
reported such a result. However, these 
investigators all employed positive and 
negative sets that were of equal size. Such 
a procedure might well lead to S's func- 
tional (i.e., positive) set not being the one 
specified by E. When sets are of equal 
size, there is little utility for S to confine 
his scanning to the set which Æ designated 
as positive. Under such conditions, S 
might reasonably be expected to sometimes 
scan one and sometimes scan the other set. 
Or else, S could scan an amalgamation of 
sets (Hawkins & Hosking, 1969) by com- 
bining the high-P items from the negative 
set with the positive set. The design of 
the present experiment employed a posi- 
tive set that was smaller than the negative 
set hence, less scanning would be required 
if S confined his scanning to the positive 
set. 

METHOD 


Subjects. Forty-eight undergraduate students at 
the State University of New York at Buffalo par- 
ticipated in the experiment as part of their course 
requirement in introductory psychology. 

Stimuli and apparatus. The stimuli were the 10 
digits, 0, 1,. . . , 9, generated by an IEEE in-line 
display (Model 10-0K21-1820-L) set in a 3 X 3 ft. 
gray frame, The background, pre- and postexpo- 
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Ficure 1. Mean correct reaction time as a 
function of group (EQ vs. UN), response (POS vs. 
NEG), and set size, (Slopes and intercept constants 
shown on figure represent best least squares fit. 
Lower bars represent error rates as shown on scale 
on the right of the figure.) 


sure fields were black, The S’s index fingers rested 
on plastic keys which, when depressed, activated a 
microswitch which stopped a timer that was started 
by the onset of the digit. A warning buzzer of .5-sec. 
duration sounded 1 sec. before each stimulus’ pre- 
sentation of .5 sec. The interstimulus interval was 
5 sec. After each response, E verbally provided 
error and speed feedback (to the nearest .01 sec.). 
Approximately 2 min, intervened between blocks 
of trials, 

Design and procedure. Each S was fully instructed 
about the nature of the task and the relative fre- 


TABLE 1 


PROBABILITIES OF STIMULUS DIGITS IN THE 
PosrrvE SETS FOR THE EQUAL AND 
UNEQUAL PROBABILITY GROUPS 


Group 

Size of 
positive Te e 

t Equal Unequal 
sel AS e 
s= 2/15, 2/15 3/15, 1/15 
(s = 3) (1/15, 1/15, 1/15; (1/15, 1/15, 1/15) 
$24 1/15, 1/15, 1/15, 1/15 | 2/15, 2/45, 2/45, 2/45 
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quencies of each of the 10 digits. A bar grar is- 
play of these relative frequencies remained S's 
view throughout the experiment. Left and : ‘ht 
keys were assigned to Yes and No response ic- 
cording to S's preference. 


The design closely followed that of the first s n 
in Sternberg's (1967) experiment. The first ck 
for all Ss always consisted of 45 trials with a po: ‘ve 


memory-set size of three digits. Thesecondth: igh 
fourth blocks each consisted of 90 trials with © m- 
ory set sizes of one, two, or four digits in cov er- 


balanced order. Half of the Ss were run in n- 
dition (UN) where, for memory sets of Size 2 4, 
one of the digits in that set was more probab in 
the other digit(s). The remaining Ss, like th in 
Sternberg's experiment, had memory sets re 
the positive digits were equiprobable (EQ) he 
stimulus sets and probabilities are shown in T 1, 
For Set Size 2 in the EQ condition, both digit id 
a probability of 2/15. In the UN conditio ie 
of the digits had a probability of 3/15 and th« "i 


digit had a probability of 1/15. For Set Si; in 
the EQ condition, each of the four digits had - 
bilities of 1/15. In the UN condition, onc e 
four digits had a probability of 2/15 and - 
maining three digits each had a probability 
For Set Sizes 1, 2, and 4 the probability of a 
response was 4/15. For s = 3, all Ss had a 
probable condition where the probability of 
tive response was 3/15. (Therefore, the probal v 
for any one digit when s — 3 was 1/15.) For :Q 
Ss, therefore, 1 of the 10 digits had a probabilit:: of 
4/15, 2 had probabilities of 2/15, and the remain ng 
7 had probabilities of 1/15. In the UN conditicn, 
1 digit had a probability of 4/15, 1 had a probabilit: 
of 3/15, 1 had a probability of 2/15, 4 had proba- 
bilities of 1/15, and 3 had probabilities of 2/45. 
Each .S's positive sets were selected from bne of 
the three groupings used in the Sternberg (1967) 
experiment. The high and low Ps were balanced 
over the different digits. Thus, for example, one- 
third of the Ss had a grouping that when s = 1 
the positive digit was "5"; when s = 2, the positive 
digits were 4 and 9; and when s = 4, the positive 
digits were 0, 1, 3, and 7. In the UN condition, 
each digit in s = 2 and 4 occurred as the more 
probable digit an equal number of times. For 
example, for eight of the Ss in the UN condition, 
when s = 2 the more probable (P = 3/15) digit for 
four of these Ss was “4” and for the remaining four 
Ss the more probable digit was “9,” The assign- 
ment of Ps to digits remained constant throughout 
S's session. Before each block, markers were placed 
on the bars of S's display designating the positive 
digits. 


RrsuLTS 


Figure 1 shows the overall Yes-No RT 
and error functions for both EQ and UN 
conditions. The slopes for the EQ func- 
tions average 40.4 msec/item which is 
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almost identical to the UN functions which 
average 38.4 msec/item, and these, in turn, 
are similar to those reported by Sternberg 


.in his (1967) experiment (mean slope — 


35.6 msec/item). Thus, instructing S 
about the probability variations and ex- 
plicitly manipulating Ps within the posi- 
tive set had little effect on the rate and 
nature of processing. The 30-msec. average 
difference in intercepts between the EQ 
and UN groups most likely represents in- 
dividual differences in the groups: the EQ 
group was 43 msec. faster at the outset of 
the experiment, in the s — 3 condition, 
and 26 msec. faster in the s — 1 condition. 
Differences in RTs among the three digit 
groupings were negligible, F < 1.00. 

The data for probability adjustments 
within the positive set are shown in Table 
2. An analysis of variance of the positive 
stimuli of the s — 2 and 4 condition of the 
UN group revealed that the main effects 
of s (2 vs. 4) and P (high vs. low) were 
significant, F (1, 23) = 19.40, p < .001, 
and F (1, 23) = 11.85, 5 « .01, respec- 
tively. The s X P interaction was highly 
variable from subject to subject and was 
not significant, F (1, 23) — 1.42. Table 2 
shows that the effect of s on Yes RTs is 
too large to be completely attributable to 
P. Three comparisons are relevant here: 
(a) The differences in P-values between 
s = 2 and s = 4 in the EQ condition was 
only half the difference between high and 
low Ps in the s = 2 UN condition (.133- 
.067 = .061 vs. .200-.067 = .133). Yet the 
former yielded an effect on RT of 77 msec. 
which was more than twice the effect on 
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RT produced by the latter P difference 
(28 msec.). (b) A similar case can be made 
for the s = 4 UN condition. Here the 
difference in P values was .089 (.133-.044) 
—slightly greater than the difference in P 
values in the EQ condition. Yet, again, 
the 48-msec. UN difference was less than 
the effect of s in the EQ condition. (c) 
Within the UN conditions, the effect of 
s was clearly greater than the effect of P. 
The RTs to the .133 stimulus in the s = 4 
condition were slower than the .067 stimu- 
lus in the s = 2 condition. Thus when s 
and P were directly pitted against each 
other, s was the more patent variable. 

Is the effect of s independent of the effect 
of P? The data were more equivocal on 
this point but certainly did not provide 
strong evidence for an interaction between 
sand P. Although not statistically reliable, 
the within-subjects effect of P was greater 
in thes = 4 condition than in thes = 2 con- 
dition, even though the difference (but not 
the ratio) in P-values was greater in the 
s =4 condition. However, before this 
suggestion of an interaction between s and 
P is given much weight (which would be 
evidence against a two-stage theory), one 
must first consider how the interaction 
could be affected by rescaling the proba- 
bility variable since the differences in P- 
values for the different set sizes were not 
taken at the same points on the P scale. 
The variation in P for s = 4 occupied a 
lower range of the scale (.044 to .133) than 
did the variation in P for s = 2 (.067 to 
.200). Hence, if one wished to assume a 
transform between objective and subjec- 


TABLE 2 


Mean Corkecr Reaction Times (R^ 


or Group, SET Size (s), AND PROBABILITY (P) 


Ts) FoR YES RESPONSES As A FUNCTION 


Group 
IA EE T ESSE eS Se EN E rper 
Set size pl 2 ^ rad 3r minua igh 
unc Ie RT > RT 
d | Gas HighP | (msec.) Low | (msec.) 
2 133 454 .200 484 .067 | 512 28 
4 .067 531 133 528 .044 576 48 
sefiect (4 minus 2) 77 44 | 64 
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067 133 267 
ITEM PROBABILITY 
FiGURE2, Mean negative correct reaction time as 
à function of probability for the EQ condition. 


tive P that expanded the smaller values of 
the P scale, the interaction between s and 
P could be eliminated. Such an assump- 
tion could be justified on the basis of 
studies (e.g, Hohle & Gholson, 1968; 
Miller & Pachella, 1973) that have demon- 
strated that differences between small P- 
values produce larger effects on RT. than 
equal differences in larger P-values. It is 
possible to construct a between-subjects 
test of the P X s interaction—which avoids 
the scaling problem—by comparing the 
data from the EQ and UN groups. This 


TABLE 3 
Mean Correcr REACTION Times (RTs) rog Yrs 
RESPONSES AS A FUNCTION OF IZE (s) AND 
PROBABILITY (P) WITH CORRECTION FOR GROUP 


Probability 


Gir E ERA SULAT P-effect 
433 .067 (.133 minus .067) 
2| 480(Equal) | $12(Unequal) | — 32 
4 | 528(Unequal) 557(Equal) 29 


Note. A total value of 26 msec., the difference between the 
equal and unequal groupsin thes = | condition, has been added 
to the equal RTs, 
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can be done by equating for the between- 
group differences on the basis of the 
common s = 1 condition which was 26 
msec. faster in the EQ group. Thus if 
26 msec. is added to the EQ values as 
shown in Table 3, then the larger P-effect 
found in the s = 4 group for the within-s 
analysis is eliminated. 

Figures 2 and 3 show the effect on RTs of 
variations in item probability, set size, and 
response. Variations of P within the posi- 
tive set had a considerably greater effect 
than variations of P within the negative 
set. The average slope of the functions 
relating RT to P within the positive set, 
are 14 times as steep as the corresponding 
negative functions: —48 msec. per .1 in- 
crease in P for the positive slope and 
—3.4 msec. per .1 increase in P for the 
negative slope. (By comparison, if the 
effect of s is described in terms of P, then 
the resulting slope, —60 msec. per .1 in- 
crease in P, is steeper than the function 
relating RT to P within a given d 
positive set.) Probability thus had only a 
negligible effect on RTs to items within the 
negative set. In five of the six negative 
set probability functions, the F ratio for 
the effect of P was less than 1.00. In the 
JN group, s = 2 condition, there was a 
significant effect of P: F (3, 69) — 4.91, 
p < .01. However, the function for that 
condition is nonmonotonic and the effect 
is primarily attributable to the point at 
P = .067 which is slower than the point at 
P = .044. 

Repetition effects were examined as a 
function of response type, probability, set 
Size, and group. (Since a number of Ss 
did not have any entries for repetitions 
for some of the low probability stimuli in 
some conditions, the nonrepetition RTs for 
the corresponding cells for these Ss were 
not included in the analyses.) Stimulus 
repetition effects were larger than response 
repetition effects. Trials in which the 
stimulus was repeated from the preceding 
trials resulted in RTs that averaged 25 
msec. less than RTs from trials in which 
the successive stimuli were different but 
the response was the same. When a dif- 
ferent response was required, RTs were 6 


msec. longer than when the response was 
repeated but different stimuli were pre- 
sented. Table 4 shows that larger stimulus 
repetition effects were associated with 
larger set sizes. Response type, P, and 
groups did not yield a consistent effect on 
the magnitude of the stimulus repetition 
effect. The magnitude of the response 
repetition effect did not vary systematically 
with any of the major experimental vari- 
ables. Since there were relatively few repe- 
tition trials the results shown in Figures 1, 
2, and 3 would be only negligibly affected 
if only nonrepetition trials were used. 

'Table 5 shows the correct R'Ts, error 
RTs, and error rates for the major experi- 
mental conditions.  Yellot's (1971) cor- 
rection for fast guessing and the speed- 
accuracy trade-off applied to these data 
had only a negligible effect on the com- 
parisons described above. 


DISCUSSION 


Variations in P produced by variations in s 
led to considerably greateMeffects on RT than 
identical variations in P with s held constant. 
"Thus it is clear that not all the effect of s can 
be attributed to P. The results of this memory 
scanning experiment are thus consistent. with 
the information conservation studies of Hyman 
(1963) and Hohle and Gholson (1968) in show- 
ing that the effect on RTs of variations in set 
size of equiprobable stimuli is greater than the 
effect which is produced by variations of P 
within a given set size. The interaction be- 
tween s and. P was not significant although 
there was a trend, within-s, for larger P effects 

.to be present in the larger positive sets (al- 
though this was not the case in the between-5 
comparison). 

In the present experiment, if it is assumed 
that P effects reflect the functional set (i.e., 
the set which is scanned), then the lack of 
such effects in the negative set would indicate 
that the negative set was rarely functional (or 
scanned). What might have occurred in the 
Kreuger (1970), Theios et al. (1973), and 
Miller and Pachella (1973) experiments, was 
that their Ss, faced with positive and negative 
sets of equal size, varied the set which they 
scanned from trial to trial. What determines 
which set gets scanned? What does the S 
treat as positive? This decision could not 
primarily be based on response probabilities 
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Figure 3, Mean correct reaction times as a 
function of probability and response in the UN 
condition, 


since in the present experiment as well as 
Sternberg's (1967, 1969) experiments the prob- 
ability of a positive response was only 4/15. 
Moreover, variations of response probability 
independent of stimulus probability have only 


TABLE 4 


LFFECTS (IN MSEC.) AS A 
PosrriVE OR NEGATIVE), 


MULUS PROBABILITY 
| Stimulus probability 
Set | M 
| 044 | ,067 | RESI | ,200 | 207 
Positive | | | 
WA AS 54.5 
41 27 
58 
-2 
3| M 
36 | 38 
15 
ese! i 
mean RTs 


rials. Posi- 


* Insufficient data. 
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TABLE 5 
Error REACTION TIME (UNDERLINED, IN MSEC.), AND ERROR 


Mian Correcr Reaction TIME (MSEC.), 
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RATE (IN PARENTHESES) AS A FUNCTION OF Group, SET SIZE, AND RESPONSE AND STIMULUS PROBABILITY 


zs Positive response Negative response 
uoa Group equal Group unequal Group equal Group unequal 
ity 
$-1|s522]|s54|s521]|522]|554 | s1]| s=2 |.s-54 | s=1 s=2 |s=4 
2 438 442 | S16 544 
ZU bs 364 456 | 586 508. - 
089) A (021) | (023) (042) 
200 em ess TSi 424 540 
: 408 548 637 
(113) (.009) (.016) 
133 454 528 | 396 520 | 424 | 460 
; 410 584 | 490 521 | 400 | 471 | 
078; (.052) | (007) (030) | (003) | (.024) 
.067 gp 531 512 397 | 444 | 531 | 427 | 492 | 55i 
498 524 389 | 482 | 622 | 609 | 508 | 587 
(078) (32) (009) | (015) | (078) | (014) | (028) | (.013) 
044 576 426 | 470 
Su 500 | 490 | 
(142) (003) | C017) 


slight effects on RT (Biederman & Zachary, 
1970). It would appear that the number of 
items is the most potent determinant of what 
gets scanned in memory: Left to his own 
devices, S scans the set with the smaller num- 
ber of items^ It seems reasonable to assume, 
however, that when positive and negative sets 
are of equal size, other variables (e.g., proba- 
bility, repetition, value) would determine 
which stimuli are scanned on a given trial. 
Also, if S had insufficient time or memory to 
determine the complement of a positive set, 
he might scan the positive set even though it 
was larger than the negative set. 

The interaction between P and Response 
"Type observed in this experiment is contrary 
to what would have been expected from an 
encoding theory of stimulus probability as 
advanced by Klatzky and Smith (1972) and 
Miller and Pachella (1973). These investiga- 
tors reported that P did not interact with 
memory set size but did interact with stimulus 
degradation. An encoding theory of proba- 
bility effects might be reconciled with the 
above findings if it is assumed that prior to 
the presentation of the probe, S "prepares" 
only for the positive set items, and the effect 
of this preparation is on the encoding stage. 


* Perhaps it is for this reason that it is so difficult 
to follow the instruction, "Don't think about 
alligators.” 


If the amount of preparation was related to 
P then larger P effects in the positive than in 
the negative set might be expected. However, 
such a theory is inconsistent with the complete 
absence of any interaction between degrada- 
tion and response type in the Sternberg (1967) 
experiment. 

If P does not affect encoding, then where 
does it have its effect? One possibility migħt 
be at the response selection stage following 
memory scanning. If positive set matches are 
signaled not simply by a match signal but by 
the digit itself along with a match signal, then 
response selection could be biased in tune with 
the probabilities of the positive set stimuli. 
After all, S knows what the digit was that 
provided the match. Such a theory would be 
consistent with the finding that the effects of. 
P and S-R compatibility interact (Sanders, 
1970). In fact, with naming tasks (which are 
of very high compatibility), there is little or 
no effect of probability (Miller & Pachella, 
1973; Theios, 1974). Of the studies discussed 
above, the only finding that is apparently in- 
compatible with this view is the interaction be- 
tween P and degradation reported by Miller 
and Pachella. In that experiment, degrada- 
tion was produced by the presentation of a 
light field immediately prior to the presenta- 
tion of the digits of a CRT display. The field, 
which reduced the contrast between the digits 
and background, decayed over time. That is, 
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if S waited, the digits became clearer. It is 
possible that such a discriminability manipula- 
tion could have induced a response selection 
strategy which was biased in favor of high-P 
stimuli (assuming that response selection is 
achieved by a combination of digits and match 
signals), But even if another method of pro- 
ducing degradation revealed an interaction 
between P and degradation, P effects could 
still be localized at the response selection stage 
if it was assumed that the P bias was to a 
visual representation of the digit. 
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STIMULUS ENCODING AND DECISION PROCESSES 
IN RECOGNITION MEMORY ' 
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Recognition memory for words and pictures was tested using a fixed, memo- 
rized-set procedure (Experiment I) and a continuous procedure (Experiment 
II). Test items were presented twice each at lags of 4, 12, or 24 intervening 
items, and any item could be tested using the same or different stimulus form 
(word or picture) at each lag. A recognition model that assumes successive 
encoding, decision, and response stages was used as the theoretical framework 
for interpretation of the results. The analysis indicated that stimulus form and 
lag affected encoding processes in similar ways for Experiments I and 11. Differ- 
ences were obtained for the decision stage, however, as stimulus form appar- 
ently affected decision processes in Experiment II but not in Experiment 1. 


In binary-classification tests of recogni- 
tion performance, S must categorize a 
stimulus as being either an o/d, or target 
stimulus (i.e., one that has been presented 
before), or a new stimulus, or distractor 
(Le, one that has not previously been 
presented). Recognition test procedures 
are of two principal types: (a) the target 
set consists of a number of items that have 
been studied prior to the recognition test, 
or (b) the target set consists of all items 
previously presented during a continuous 
series of test items. 

The former procedure has been used to 
study both short-term recognition (Stern- 
berg, 1966) and long-term recognition 
(Juola, Fischler, Wood, & Atkinson, 1971), 
and Atkinson and Juola (in press) have 
developed a model to account for the 
data. The model proposes three processing 
stages (encoding, decision, and response) 
that are executed sequentially during 
recognition, The encoding stage includes 
analysis of the test stimulus and the con- 
struction of an internal representation that 
is unique to the form and modality of input. 


1 This research was supported in part by funds 
from Biomedical Sciences Support Grant RR-07037 
from the National Institutes of Health, University 
of Kansas General Research Grant 3294-5038, and 
Grant MH-24637-01 from the National Institute 
of Mental Health. The authors thank Greg Cermak, 
George Kellas, Roy Lachman, and Carl McFarland 
for comments on an earlier draft of this paper. 

* Requests for reprints should be sent to James F. 
Juola, Department of Psychology, University of 
Kansas, Lawrence, Kansas 66044. i 


This representation is called the percep ial 
code. Each perceptual code is then map ed 
onto the particular long-term mei “y 
location assigned to various percep al 
codes of the same item. This represe a- 


tion in memory is called the concer: al 
code. The generalized activation le if 
the information represented by the oon- 


ceptual code is converted into a familia ty 
value for the test item. Thus, the enco: ng 
stage includes all the processes involve in 
identifying a test stimulus and retriev.ag 
a subjective estimate of its familiarity. 
The decision stage of the model operates 
on the familiarity output by the encoc ag 
stage.  Familiarities are evaluated with 
respect to two decision criteria such (at 
values below a low criterion lead to im- 
mediate negative responses and values 
above a high criterion lead to immediate 
positive responses. If the familiarity falls 
between the two decision criteria, the re- 
sponse stage is delayed until after a more 
extensive search of memory has been 
completed. This search process might be 
similar to the serial and exhaustive scanning 
model proposed by Sternberg (1966, 1969). 
The major difference between the two 
models is that in Sternberg's representation 
a decision to respond is made only after 
an exhaustive search of the target set has 
been completed, whereas in the model 
described above, a decision can be made 
either before or after this search. A de- 
cision made on the basis of familiarity 
alone is rapid, but error prone, and a 


1108 


STIMULUS ENCODING AND 


iecision made after the extended search is 
'ower, but accurate. 
One of the main purposes of the present 
tudy was to extend the recognition model 
to account for data from continuous recog- 
nition experiments. In the continuous 
task, a series of items is presented, and S 
instructed to make negative responses 
all items on their initial presentations 
id to make positive responses to all re- 
ited items (e.g., Shepard & Teghtsoonian, 
61) Studies by Hintzman (1969) and 
kada (1971) have shown that both the 
mber of presentations and the lag be- 
ween successive presentations affect per- 
lormance. In general, repeated tests result 
faster positive responses and fewer 
errors, with mean positive response latency 
creasing as a nonlinear function of lag 
between tests. These results cannot be 
omodated by the Sternberg model, 
;robably because the target items generally 
e not retrievable as a set which could be 
«canned for the presence or absence of the 
test item (Okada, 1971). 
similar effects of repetitions and lags 
have been observed in studies using the 
viemorized target set procedure (Fischler & 
uola, 1971; Juola et al., 1971). Repeti- 
ons have been shown to reduce both the 
mean error rate and mean response latency 
for target items, whereas the error rate 
i s and latencies show either a slight 
increase or decrease for repeated dis- 
tractors. These results have been inter- 
preted in terms of the model by assuming 
that repetitions facilitate the encoding 
‘age and increase mean familiarity values 
for both targets and distractors (Juola, 
1973). For target items, the proportion 
of the familiarity distribution below the 
low criterion is assumed to be equal to the 
miss rate. The proportion between the 
two criteria and that above the high 
criterion are assumed to be equal to the 
proportions of slow and fast positive de- 
cisions, respectively. Thus, target repeti- 
tions should result in fewer errors, fewer 
slow decisions, and more fast decisions (and 
therefore a faster mean execution time for 
correct positive decisions), For distrac- 


in 
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tors, the proportion of the familiarity dis- 
tribution above the high criterion is as- 
sumed to be equal to the false alarm rate, 
The proportions between the two criteria 
and below the low criterion are assumed 
to be equal to the proportions of slow and 
fast negative decisions, respectively. Thus, 
repetitions of distractor items should result 
in more errors and a slower mean execution 
time for correct negative decisions. Repe- 
tition effects on the encoding and decision 
stages should be additive to produce faster 
mean response times for targets. For 
repeated distractors, latencies could be 
either greater than or less than the times 
for responses to distractors on their first 
tests, depending on the relative changes 
in mean execution times for the encoding 
and decision stages. 

The analysis of repetition” effects on 
negative and positive responses has proved 
useful in separating stimulus encoding and 
decision effects on recognition performance. 
Another technique that has been used for 
this purpose involves the use of different 
stimuli to represent a given test item. For 
example, Juola (1973) studied repetition 
effects for items tested twice using the 
same or different visual stimuli (words or 
pictures) on both tests of targets and dis- 
tractors. The results indicated that the 
encoding stage was executed more rapidly 
for all repeated items; however, this effect 
was much larger for those items tested 
with the same stimulus form on both 
presentations. The decision stage was 
apparently independent of the stimulus 
forms used. 

In the present study, a similar manipula- 
tion of stimulus forms was used to help in 
isolating encoding and decision processes 
and their relative contributions to recogni- 
tion performance. In Experiment 1 Ss 
were tested using the fixed target set pro- 
cedure. Targets and distractors were 
presented twice each at three different lag 
conditions, and either the same or different 
stimulus forms were used at all lags. Ex- 
periment II included the same manipula- 
tions as Experiment I in a continuous recog- 
nition task. 


METHOD 
Experiment I 


Subjects. The Ss were 50 right-handed, female 
undergraduates who participated for credit in an 
introductory psychology class, During the experi- 
ment, 3 5s were dismissed for reported difficulty in 
identifying the visual stimuli, and the data for 7 
additional Ss were discarded for failure to meet an 
arbitrary error criterion of 10%. The data are thus 
based of an N of 40, - 

Materials, The stimuli were selected from a pool 
of 96 different items, each of which was represented 
by a word and a picture. The words were common 
one- or two-syllable nouns, three to eight letters in 
length, and the pictures were easily identifiable, out- 
line drawings of the objects named by the nouns. 
"The stimuli were placed on 6 X 9 in. white cards for 
presentation in an Iconix tachistoscope. The words 
were typed in Orator typeface with an IBM Selectric 
typewriter, and they varied in length from .8 cm. to 
2.0 cm. Xerox copies of the original drawings were 
affixed to cards, and they varied between .9 cm. 
and 3.8 cm, in diameter. When placed into the 
tachistoscope, the stimuli appeared about 1.0 cm. 
(ie., about 36 min. of visual angle from S's view- 
point) below a central, dark fixation point, 

Design. The 96 items were randomly divided 
into two sets of 48 items each. The first set was 
used to generate target lists for 20 Ss, and the 
remaining 48 items were used as distractors. For a 
second group of 20 Ss, the target and distractor sets 
were reversed so that for the experiment as a whole, 
all items were used equally often as targets and 
distractors, Both target sets were randomly 
ordered into two different lists to provide the target 
lists used for four groups of 10 Ss each. 

The test session consisted of 192 trials during 
which all items were tested twice. The Ss were 
randomly assigned to three different test sequences, 
ln these sequences, item presentation order was 
randomly determined with the constraint that either 
4, 12, or 24 intervening trials occurred between 
successive presentations of the same target or dis- 
tractor items. Sixteen targets and 16 distractors 
were repeated at each of these three lag intervals, 
Which, after the first 24 trials, were distributed uni- 
formly throughout the test sequenc Half of the 
target and distractor items were tested as words 

(W trials) and half were tested as pictures (P trials) 
on their first presentations. At each of the three lag 
conditions for repeated targets and. distractors, four 
items previously tested as words were tested again 
as words on their second presentations (WW trials), 
four items tested as words were retested as pic- 
tures (WP trials), four items tested as pictures were 
retested as words (PW trials), and four items were 
tested as pictures on both presentations (PP trials). 

Procedure. Each 5 was contacted about 24 hr. 
before the experimental session and was given a 
d list of 48 nouns, The Ss were told to memorize 
a aie x ce order before being tested on the 

g day. The experimental session began with 
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a written, serial recall test for the target list. Mc 
Ss correctly recalled all the target words; howey 
those who made errors were retested until the word 
were recalled in correct serial order. 

The S was then seated in front of the tachist: 
scope, and the exact nature of the test procedur: 
was explained, A series of 20 practice trials on : 
modified Sternberg task was run to familiarize . 
with the procedure. For these trials, S was tol 
to make a positive response if a digit from 0 to 4 w 
presented and to make a negative response ií 
test digit was 5-9. All 10 digits were presented t 
in a random order. 

The procedure for the practice trials was identic 
to that for the experimental test sequence. | 
trial started when E placed a test stimulus « 
into the tachistoscope and gave S a ready sign 
The S then depressed a foot switch that initiate: 
a millisecond timer. The tachistoscope was pr 
grammed to illuminate a preexposure field v 
dark fixation point 10 msec. after S pressed the foot 
switch. This field was maintained for 490 m 
and was followed immediately by a 350-ms 
posure of the test stimulus. The field rem 
dark until the next trial. The S was instructed 
rest her hands on a box upon which two 3.2-cm 
circular buttons were mounted about 9,0 cm. apart. 
When either button was pressed, it activated a 
microswitch that stopped the timer. All Ss were 
instructed to indicate a positive response to target 
items by pressing the right-hand button and to 
indicate a negative response to distractors by 
pressing the left-hand button. 

The practice trials were followed by a brief res 
period during which S was told that targets and 
distractors could be tested either as words or as 
pictures, and she was instructed to respond as 
rapidly as possible on eaéh trial while being careful to 
avoid making errors. During the test session no 
feedback was provided for correct r s, but 
Ss were informed of all errors. The session averaged 
about 45 min. in length during which wer 
allowed to take a maximum of three rest periods at 
irregular intervals. 


Experiment II 


Subjects. The Ss were 22 right-handed female un- 
dergraduates who participated for credit in an in- 
troductory psychology class, During the experiment 
1 5 was dismissed for reported difficulty in identify- 
ing the visual stimuli, and the data for an additional 
5 were discarded for failure to meet an arbitrary 
error criterion of 10%. The data are thus based 
on an N of 20. 

Materials, The stimuli and apparatus were the 
same as those used in Experiment I. 

Design. The test sequences used in Experiment 
II were identical to those used in the previous study. 
The sequences were randomly assigned to . 

Procedure, The test session began with a de- 
scription of the testing procedure. A series of 20 
trials using the digits 0-9 as stimuli was run to 
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TABLE 1 


MEAN LATENCIES AND STANDARD ERRORS OF THE MEANS (IN MSEC.) FOR CORRECT RESPONSES 
AND MEAN ERROR PERCENTAGES FOR ALL CONDITIONS Or EXPERIMENTS I AND IT 


Experiment I 


Measure 7 — -— p Experiment IT 
Targets Distractors | 
First presentations 3 
A T x cn : E EE IR deti 
| w P | w | P | w P 
M [-—/:868. 55] 5: 5,048: 1:1] ees  00L ae 5155978 940 1,003 
SE | 19.2 32.2 24.7 44.5 33.8 42.3 
% errors 13.1 d Lose S dbi 4.8 7A 
— Ly H ade. ) 1 1 1 sate 
Second presentations 
LI P OR a i RTR 
ww WP PW | PP ww WP PW PP ww WP PW PP 
Lag 4 | | 
ar 680 803 702 654 760 902 | 846 730 754 820 858 718 
SE 16.6 | 25.3 | 15.5. | 17.8 | 19.0 26.9 | 29.5 17.2 | 185 | 29.3 | 25.3 | 23.0 
% errors 3.1 0.0 2.5 1,5 48 74 3.5 | 3.3 2:9 3.9 Wi 0.0 
s Aa | 
4 708 835 (ic) 707 836 | 1,050 | 884 813 831 879 871 150 
SE 18.5 | 23.7 | 20.6 | 22.2 | 24.6 | 44.9) 344 | 25.9 | 31.6 | 20.2 | 36.0 | 24.4 
% errors 1.5 1.7 17 3.3 74 7. 2.5 75 3.7 7.3 3.0 1.2 
See 
694 894 709 761 812 | 1,007 | 863 | 858 889 | 921 894 778 
SE 16.9 | 29.0 | 25.1 | 25.9 | 20.5 47.2 | 264 | 24.6 | 34.1 | 358 | 26.9 | 23.5 
o errors 6 4.8 6 3.1 2.5 3.8 2.9 | 2.3 2.7 | 104 EA! 0.0 


Note. Abbreviations: W 


tions, WP = items tested first as words and retested as pictures, PW = 


PP = items tested as pictures on both presentations. 


familiarize S with the task, For these trials, S was 
told to make a negative response to any digit on its 
first presentation and a positive response to any 
repeated digit. All 10 digits were presented twice 
in a random order. All other details of the practice 
trials and the actual test session were identical to 
those of Experiment I. The only difference for the 
present study was that, rather than making positive 
responses to items belonging to a previously memo- 
rized set, Ss were instructed to make positive 
responses to any items previously presented during 
the test session, regardless of whether or not the 
actual stimulus form (W or P) was the same on both 
tests of an item. 


RESULTS 


The mean response latencies,’ standard 
errors of the mean latencies, and mean 


å The latency data were based on 89.7% of the 
total trials run in Experiment I and 92.2% of the 
trials in Experiment IL, The data excluded were 
for Experiment I: 5.8% 5S errors, 3.6% E errors, 
and .9% "ong" responses (eliminated using the 
correction technique described by Wescourt & 
Atkinson, 1973); for Experiment IT: 4.9%, S errors, 
2.8% E errors, and .1% long responses, 


= items tested as words, P = items tested as pictures, WW = items tested as words on both presenta- 


items tested first as pictures and retested as words, and 


proportions of errors for all conditions of 
Experiments I and ll are presented in 
Table 1. The data presented here are from 
Trials 25 to 185 or 186 only (depending on 
the specific test sequence), over which the 
distributions of all types of trials were 
approximately uniform. 

In both experiments mean response times 
were slower to pictures than to words for 
items on their initial presentations. (This 
trend was obtained for 37 of 40 Ss for 
targets and 38 of 40 Ss for distractors in 
Experiment I and for 18 of 20 Ss in Experi- 
ment IJ.) The latency data for repeated 
items were analyzed separately for targets 
and for distractors in Experiment I and 
for Experiment II]. Three 2 X 2 X 3 
repeated measures analyses of variance 
were run, with factors corresponding to the 
stimulus form (W or P) used on the first 
presentation of the test item, the stimulus 
form used on the second test, and the lag 
between presentations, respectively. 
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Figure 1, Mean response latencies for four types 
of items on second test presentations as functions of 
lag between first and second presentations, 


All main effects of stimulus form were 
significant, In Experiment I, responses to 
repeated targets were faster if they had 
been tested as pictures on their first tests, 
d(T; 39). = 34:21, MS, = 9,375, p < .01, 
and as words on their second tests, F (1, 39) 
= 60.69, MS, = 8,223, b <.01. Responses 
to repeated distractors were also faster if 
they had been tested first as pictures, F (1, 
39) — 3148, MS, — 22,659, p < .01, and 
then as words, F (1,°39) = 23.48, MS, 
= 28,796, p < 01. In Experiment Il, 
response latencies were shorter for repeated 
items tested first as pictures, F (1, 19) 
= 9.51, MS, = 8,940, p < 01, and re- 
sponses were faster for items tested as 
pictures on their second Presentations, F (1 
19) = 17.60, MS, = 5042, p < 01. Tn 
general, the response time data indicated 
that latencies were shorter for repeated 
items if the same stimulus forms were used 
on both tests than if they differed. This 
conclusion was supported by the significant 
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interactions between the forms us he. 4 
first and second tests, F (1, 39 9, 
MS, = 9,919, p « .01, for ta in 


Experiment I; F (1, 39) = 
= 33,317, p < .01, for distra 
periment I; and F (1, 19) = 
= 8,872, p < .01, in Experime 

The main effects of lag were 
cant, indicating that response fo 
repeated items increased as a fi 
the number of intervening tes! 
the first and second tests of ; 
F (2, 78) = 19.09, MS, = 10,190, 
for targets in Experiment 1; F (2 
30.30, MS, = 15,015, p < .01. for 
tractors in Experiment I; and / 
= 39.42, MS, = 3,508, p < .0 
tractors in Experiment I]. In | 
I, lag interacted with the stimu 
used on the second tests of targets, / (2, ; 
= 3.82, MS, = 8,977, p < .05, ^nd dis 
tractors, F (2, 78) = 6.39, MS, 0, 
b < .01. In Experiment II, how: 
interacted with the stimulus form 
on the first tests of repeated items, F (2, 
= 5.30, MS. = 4,848, p < .01. None 
the other interactions approach 
cance, The interaction effects « nd 
stimulus forms are apparent in 
where, for Experiment I, the W 
functions showed a large and nea 
increase across lags whereas the i 
PW functions increased only sl: In 
Experiment II, the WW and W! 
exhibited the greatest lag effec 
the PW and PP functions showed 
nearly parallel increase. 
: A comparison of the latency. data 
initial vs. repeated presentations (ave 


across lags) in Experiment I showed ihat 
responses were faster on second tesis ihan 
first tests for at least 35 of 40 S all 


conditions but one. The exception was for 
distractor items tested first as words then 
as pictures, for which 32 of 40 Ss were 
slower on the second tests than on initial 
presentations, Although a similar analvsis 
of the data from Experiment II involves a 
confounding of response type (positive or 
negative) with presentation number, it 
was noted that at least 36 of 40 Ss in all 
conditions were faster on the second tests, 
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The error data for initial presentations 
of test items were analyzed using Wilcoxon 
matched-pairs signed-ranks The 
difference between the mean proportions 
of errors to words and pictures was not 


tests. 


significant for the target data from Experi- 
ment [; however, significantly more errors 
were made for distractor pictures than 
words (p < .0t, test). The 
error rate was also higher for picture stimuli 
than for words on initial tests in Experi- 
ment II (p < .05, two-tailed test). 

The error data for second test presenta- 
tions were analyzed separately for the four 
trial types collapsed across lags and for the 
lag data collapsed acros trial types. A 
Friedman two-way analysis of variance bv 
ranks indicated that there were no signifi- 
cant differences between the four trial types 
for the target and distractor data of Experi- 
ment I. For Experiment II, however, the 
Friedman analysis was significant, X* ($ 
= 9,6, p < .05. Multiple sign tes indi- 
cated that the only significant. difference 
was between WP trials (mean error rate 
= 7.1%) and PP trials (.5%). 

Although the error rate inc reased as a 
function of lag for targets in both Experi- 
ments | and II, neither increase was sig- 
nificant. A Friedman analysis showed that 
lag had a significant effect on the error rate 
for distractors in Experiment J, X* (2) 
= 93, p < .01, and multiple sign tests 
indicated that fewer errors were made at 
Lag 24 (2.9%) than at Lag 4 (4.4%) or at 
Lag 12 (6.6%). 

A comparison of the error data for initial 
vs. second presentations (averaged across 
lags) in Experiment I showed that the error 
rate decreased with repetitions for all target 
conditions (Wilcoxon test, p < .01). For 
distractors, the error rate increased from 
Presentation 1 to Presentation 2 for those 
items tested first as words (p < .01), but 
there was no significant difference in the 
error rate for P trials vs. PW and PP trials. 
For Experiment II there was no significant 
difference in the error rate for W t ials vs. 
WW and WP trials, but the error rate was 
less for PW and PP trials than for P trials 
(p < .01). 


two-tailed 
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DISCUSSION 


The present studies were designed to in- 
vestigate stimulus form and lag effects on 
performance and to test the 
generality of a recognition model. The results 
will be interpreted in terms of the model in 
that differences in error rates for different 
target or distractor conditions will be assumed 
to represent differences in the relative familiar- 
ities of those items. The assumption that 
errors are generated only from the decision 
stage will necessarily have to be modified if 
evidence that can result from 
failures in other processing stages, such as in 
stimulus encoding or response selection, On 
the other hand, differences in response latencies 
that are not accompanied by differences in 
will be assumed to represent 


recognition 


exists errors 


ror rates 
erential facilitation of encoding processes. 
In both experiments, responses were faster 


di 


to word stimuli than to pictures on initial tests. 
This is to be expected since the pictures were 
original drawings that had never before been 
seen by Ss. According to the model, these 
differences should reflect greater expected 
durations for the encoding stage on P trials 
than on W trials. The decision stage should 
not be affected by stimulus form since the 
familiarity value of any test item is determined 
by information derived from the conceptual 
code onto which the perceptual codes of the 
word and picture are mapped. This interpre- 
tation is supported by the ei data from 
initial target presentations in Experiment I 
which showed no significant differences on W 
and P trials. For initial tests of distractors 
in both Experiments I and II, however, 
significantly more errors were made on P trials. 
These results could be taken to indicate that 
distractor pictures result in different familiarity 
values than distractor words. However, it is 
also possible that, as for target items, they are 
about equally familiar and the error rate 
difference is due to a greater probability of 
misidentifying the pictures than the words. 
This effect could have been mitigated for 
targets since Ss had previously studied the 
names of the pictures. 

Responses were faster in both experiments 
for repeated items that were tested twice using 
the same stimulus form (i.e. for WW and PP 
trials) than if the forms were different (i. 
for WP and PW trials). In Experiment 1M 
there were no error rate differences between 
these four types of trials, but stimulus forms 
did affect the error rates for repeated items in 
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Experiment II. The Experiment I results are 
consistent with other data (Juola, 1973) and 
with the interpretation that the decision to 
respond before or after an extensive memory 
search is based only on the familiarity value 
derived from the conceptual code of the test 
item, Although repetitions might facilitate 
execution of the encoding stage, especially if 
the stimulus forms are also repeated, this 
facilitation apparently ends once the appro- 
priate conceptual code is uniquely identified. 
'The decision process in recognition tasks em- 
ploying the fixed target set procedure appears 
to be independent of the stimulus form used 
at the time of test or during earlier presenta- 
tions of the test item. 

The error data for repeated items in Experi- 
ment II indicate that memory for stimulus 
form might be used in the decision process. 
This decision could occur in at least two levels 
of processing : (a) during the construction of the 
perceptual code of the test item or (b) during 
extended memory search if the familiarity 
derived from the conceptual code is of an 
intermediate value. In either case a decision 
could be based on obtaining a match between 
the perceptual codes constructed on two pre- 
sentations of anitem, Thus for the continuous 
paradigm, it is possible that the decision stage 
is not separable from the encoding stage of 
recognition. 

An analysis of the lag data also supports the 
contention that encoding and decision processes 
are separable in Experiment I and confounded 
in Experiment IJ. In Experiment I, response 
latencies for all types of trials increased with 
lag, but this increase was more pronounced 
for items presented as pictures on their second 
tests, Lag effects for repeated items were 
apparently independent of the forms used on 
the initial tests of target and distractor items. 
The lack of any effects of stimulus form of 
initial items on the lag data suggests that the 
stimulus form effects for repeated items were 
due solely to differences in the encoding stage. 
The larger lag effects for WP and PP trials 
than for WW and PW trials indicate that not 
only is the encoding stage for picture stimuli 
affected more by repetitions than that for 
words (e.g., the repetition effect was much 
larger for PP trials than for WW trials), but 
also the facilitative effect of repetitions decays 
more rapidly for items tested as pictures. 

In Experiment II, increasing lags between 
presentations also caused increases in response 


latencies for repeated items. However, the 
] 
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pattern of interactions between stimulus form 
and lag effects differed from Experiment I in 
that the stimulus forms used on the initial tests, 
but not those used on second tests, interacted 
with lag. The lag effects on response latencies 
for repeated items were much greater for 
those tested as words on their initial presenta- 
tions than for those tested as pictures. This 
result is consistent with the hypothesis that 
Ss compare the perceptual code of the test 
item with retained visual information from 
earlier tests of the item. For WW and PP 
trials, a decision can be made during or im- 
mediately after the construction of the test 
item's perceptual code. For WP and PW 
trials, this decision cannot be made until the 
alternative perceptual code is activated (or 
reconstructed) which might first necessitate 
access of the common conceptual code. This 
representation of the decision process is pre- 
dicated on two assumptions: (a) visual in- 
formation for both words and pictures is re- 
tained in long-term memory (Hintzman & 
Summers, 1973) and (b) the visual information 
for words decays faster (or becomes nondis- 
criminative sooner) than that for pictures 
(Snodgrass, Wasser, Finkelstein, & Goldberg, 
1973). 

The model derived from earlier recognition 
studies using the fixed memory set paradigm 
is consistent with the data from Experiment 
I, but it needs to be modified to handle the 
Experiment II results. For the case in which 
the target items are members of a well-defined 
set, the decision to make a positive or negative 
response cannot be based upon matching a 
current perceptual code with the trace of one 
presented earlier, since both targets and dis- 
tractors can be repeatedly tested. For the 
continuous procedure, however, decisions can 
be based upon matches obtained during the 
encoding stage of recognition. In either ex- 
perimental paradigm, a decision can be deter- 
mined by the familiarity (or activation level) 
of the conceptual code that represents the 
test item. If, however, the familiarity is of 
some intermediate value that does not allow 
for relative certainty of discrimination between 
pi and distractors, à more extensive search 
E Sd HE be executed before à decision 
eie t em ch could involve the re- 
pede un ee set and the scanning of 
RUE. ie presence or absence of the 
op he set is well-learned, 

T $ continuous paradigm, 
might involve the retrieval of 


whereas, 
this search 


alternative per- 
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ceptual representations of the test item in 
a search for one that had been constructed 
earlier. 
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RETRIEVING ATTRIBUTE AND NAME INFORMATION 
FROM SEMANTIC MEMORY ' 


ELIZABETH F. LOFTUS? AND WILLIAM COLE 


University of Washington 


In two experiments, Ss retri 
(e.g. ANIMALS). 


times the instance had to possess a cert 
and sometimes its name was restricted ( 
Prior knowledge about the type of 


retrieval of these two types of items. 
instances that simultaneously sat 
v SMALL-M). 


In Experiment I, two s 
ain attribute (e.g., it had to be small), 
e.g. it had to begin with the letter M). 


ved instances of well-learned semantic categories 


types of restrictors were used: Some- 


ictor dilferentially influenced the 


In Experiment II, 5s produced category 
fied both an attribute and a name restric- 
nvariably took longer to respond when the 
presented before the attribute restrictor rather than aft 
;-network model, which embodies the notion that intrin 

ing strategies operate in the retrieval of attribute vs. 


name information, accounts nicely for these results. 


One technique for studying semantic 
memory is to ask 5$ simple questions about 
material they already know and to measure 
how long it takes them to answer these 
questions. The reaction times (RTs) ob- 
tained are then used to make inferences 
about the processes used for retrieval. 

In several experiments that have used 
this technique, S was required to produce 
a member of a category that satisfied some 
restriction, For example, he might be 
asked for the name of a fruit that is yellow, 
or an animal that begins with the letter Z 
(e.g., Freedman & Loftus, 1971; Grober & 
Loftus, in press; Loftus & Suppes, 1972). 
In the Freedman and Loftus study, cate- 
gory-adjective and category-letter pairings 

(hereafter referred to as adjective and 
letter stimuli, respectively) were randomly 
"mixed" together in a long sequence of 
presentations. In the Grober and Loftus 
study, this mixed condition was included, 
but, in addition, Ss received two nonmixed 
conditions: one in which they saw only 
adjective s i 
they saw only letter stimuli. 


It was thus 


1 This research was supported by a grant from 
the National Institute of Mental Health. The 
authorsare grateful to Earl B. Hunt, Colin MacLeod. 
and Steve Poltrock for comments and to Elaine 
Botti for running the 5s in Experiment IT. : 
> 4 Requests for reprints should be sent to Elizabeth 
F. Loftus, Department of Psychology, University 
of Washington, Seattle, Washington 98195. ay 


imuli, and the other in which 


possible to compare the speed of producing 
a fruit-yellow when it occurred in a mixed 
vs. a nonmixed condition. A major finding 
was that RT to an adjective stimulus w 
independent of whether S knew that ! 
would be seeing an adjective ; that is to say 


- RT to an adjective was equal in mixed and 


nonmixed conditions. However, S's re- 
sponses to letter stimuli were dramatically 
affected by his expectations; he responded 
significantly faster in the mixed condition. 
Because mixing had a differential effect on 
adjective and letter stimuli, two retrieval 
processes were thought to occur; that is, 
the process of retrieving a category member 
when the restriction is on a quality or al- 
iribute of that member (e.g., it must be 
yellow) was thought to be different from 
the process of retrieving a member when 
the restriction is on the name of that mem- 
ber (eg. , the name must begin with the 
letter Z). à 

An alternative interpretation was pro- 
posed to us by a grant review panel at the 
National Institute of Mental Health. The 
panel argued that the adjective-letter dif- 
ference might not be the result of intrinsi- 
cally different processing strategies, but 
rather of the fact that the letter and adjec- 
tive cues impose different degrees of con- 
straint or uncertainty. When S is in a 
nonmixed-letter condition, he first sees à 
category for some interval of time, and 
then a letter appears. He must respond as 
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quickly as possible with a member that 
begins with that letter. During the interval 
between the category and letter, there is a 
fixed number of stimulus alternatives that 
S can expect as the forthcoming restricting 
letter. When S is in the nonmixed-adjec- 
tive condition, however, the set of alterna- 
tives he can expect is ill-defined and 
innumerable. In a mixed condition, there 
is-a great deal of uncertainty as to the 
identity of the forthcoming restriction ; any 
letter or adjective is possible. The reason 
that *unmixing" reduces RT for the letter 
stimuli may be that uncertainty is con- 
siderably reduced; the reason it has no 
effect for adjective stimuli may be that 
uncertainty is hardly reduced at all. 

Experiment I was designed to test the 
feasibility of the “uncertainty” interpreta- 
tion, As in earlier studies, Ss saw nouns 
paired with adjectives or letters that were 
presented in mixed and nonmixed condi- 
tions. However, sometimes S was told 
at the beginning of the trial that the forth- 
coming restrictor would be one of two or 
one of four possibilities. Compared to a 
condition in which no information is given 
at the start of the trial, both the two- and 
four-cue conditions considerably reduce the 
uncertainty as to the identity of the forth- 
coming restrictor. 


EXPERIMENT | 
Method 


Subjects. The Ss were 24 undergraduates at the 
University of Washington. ach participated in 
one experimental session lasting approximately 45 
min. 

Materials. Fifty-four noun categories were se- 
lected, and each category was paired once with the 
first letter of a common category member (e.g., 
FRUIT-P) and once with an adjective descriptive of 
some common category member (e.g. FRUIT- 
YELLOW). Thus, 108 unique pairs were created, 
and each pair was hand printed in block letters on 
a5X8in. index card. Each card, then, contained 
a noun category on the left side and a restricting 
letter or adjective on the right. 

The 108 items were presented in nine blocks of 
trials. In three blocks, adjective (M-A) and letter 
(M-L) stimuli were randomly mixed together. Of 
the remaining six blocks, half contained only adjec- 
tive stimuli (O-A), and the other half contained 
only letter stimuli (O-L). Within each type of 
block (mixed, O-A, or O-L), Ss received before each 
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trial either zero, two, or four cues, among which 
was the restrictor being used on that trial, For 
example, in the two-cue O-L condition, just before 
the presentation of rRUrr-P, S might be told that 
on this trial the restrictor would be either A or P, 
The S was then shown the category, FRUIT, and 
after a l-sec. interval, one of the two cues men- 
tioned was shown, he S's task was to produce 
from meme an item that belonged to the noun 
category and began with the restricting letter 
On the four-cue trials, before each stimulus was 
displayed, S was given four items, among which 
was the forthcoming restrictor. On the zero-cue 
trials, the stimulus cards were displayed without 
preliminaries. 

The three cue conditions (zero, two, or four) and 
three types of blocks (mixed, O-A, and O-L) re- 
sulted in nine distinct blocks of trials, A Latin 
square balancing procedure determined the ordering 
of the nine blocks that each S received, 

Each O-A and each O-L block contained nine 
adjective and nine letter stimuli, respectively, while 
the mixed blocks contained a random selection of 
nine letter and nine adjective stimuli, A new ran- 
domization was performed for each S with the re- 
striction that throughout the experiment as a whole, 
a stimulus, such as FRUIT-P, appeared an equal 
number of times in each of the three cue conditions 
and as often in a mixed condition as a nonmixed one, 

Procedure, Each S was told that we were in- 
vestigating typical memory that he 
would see category names followed by either an 
adjective or a letter, and that he was to produce 
an item that belonged to the category and fit the 
The cue conditions were ex- 


I 


processes, 


restriction imposed, 
plained, and each § was informed before each block 
of trials which condition was in effect for the block 
Sixteen warm-up trials preceded the session, illus- 
trating the varieties of conditions in the experiment. 

The S sat in front of a screen in which was a 
window covered by half-silvered glass. The index 
card containing the stimulus was placed in the dark 
enclosure behind the mirror and was revealed by 
illuminating the enclosure, The 5 spoke his re- 
sponse into a microphone placed in front of him, 

A trial consisted of the following: (a) in two- and 
four-cue conditions, the cues were spoken by 4 
(b) a warning tone was sounded and, following a 
brief interval, the noun category was illuminated ; 
(c) after a 1-sec. interval, the restrictor was auto- 
matically illuminated, and simultaneously an el 
tric timer with a dc clutch was started; and (d) S's 
verbal response activated a voice key that stopped 
the clock and terminated the illumination of the 
stimulus. If.S did not respond in 15 sec., the trial 
was terminated by Æ. 


Results 

Only correct responses (97%) are in- 
cluded in the following statistical analyses. 
Responses were grouped into 12 classif 
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Ficure 1. Mean reaction times for adjective and 
letter stimuli in mixed and nonmixed conditions. 
Data are shown separately for the zero-, two-, and 
four-cue conditions. 


tions depending on type of restrictor (ad- 
jective or letter), number of cues (zero, two, 
or four) and presentation sequence (mixed 
or nonmixed), Median latencies were ob- 
tained for each S's responses in each of the 
12 classifications. For each classification, 
group mean latencies were obtained by 
averaging medians for individual Ss, and 
these are plotted in Figure 1. A three-way 
analysis of variance with repeated measures 
was done on the latency data. The sig- 
nificant findings were as follows: a non- 
mixed presentation led to faster responding 
than a mixed presentation (1.86 vs. 2.12 
sec), F (1, 23) = 6.57, p < .05, and this 
variable interacted with type of restrictor, 
F (1,23) = 1347, p < .01. From Figure 
1 we can see that the interaction results 
from the fact that mixing had no effect on 
adjective trials, but had a substantial and 
consistent effect on letter trials. Appar- 
ently, Ss disregard the cues on adjective 
trials, but not on letter trials. 

Although Ss were somewhat faster when 
given no cues than when given two or four 
cues (with zero cues, RT = 1.88 sec. ; with 
two, 1.95 sec.; with four, 2.15 sec.) this 
effect was not statistically significant, F 
(2, 46) = 1.85, p > 10, Furthermore, 
number of cues did not interact with any 
of the other factors (p > .10). 


Discussion 


In Experiment I, Ss were asked to retrieve 
category members that were either (a) re- 
stricted in that they had to possess a certain 
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attribute, or (b) restricted in that their name 
had to possess a certain feature. The RT for 
producing the first type of member was inde- 
pendent of whether S knew in advance that 
the restriction would be on the attribute. 
However, the RT for producing the second 
type of member was reduced when S knew in 
advance that the restriction would be on the 
name. Reducing the uncertainty as to the 
identity of the forthcoming restrictor with 
advance cues did not facilitate RT, as an 
uncertainty interpretation would predict, but 
in fact slightly increased RT. These data 
argue against the proposal that the differentia! 
effect on adjective and letter stimuli of mixing 
trials is due to differential uncertainty, and 
support the notion that intrinsically different 
processing strategies may be operating. 

Although the number of cues had no sta- 
tistically significant effect in Experiment I, 
one cannot help but notice that the RT's in 
the four-cue letter conditions were unusually 
long. One tentative hypothesis (suggested to 
us by E. B. Hunt) is that 5 pays attention to 
the cues, yet interference is produced by an 
“overloading” of short-term memory. When 
adjective stimuli are presented, S acts as if he 
is ignoring the cues. Confirmation of this 
possibility must await the outcome of further 
research. Whatever the explanation, the fact 
that providing four cues prior to a stimulus 
has differential effects on the responses to 
adjective and letter stimuli strengthens o 
view that intrinsically different processin 
strategies are operating. 

Before suggesting what strategies may be 
operating in the retrieval of attribute v* 
name information, we describe a second í 
periment that bears on the uncertainty iss 
In this experiment, Ss were presented with 
stimuli consisting of a noun category plus 
both an adjective and a letter (e.g., ANIMAL- 
SMALL-M) and had to produce a member of 
the category that satisfied the two restrictions. 
That is to say, the response had to be a member 
of the category that began with the given letter 
and to which the adjective was applicable 
(e.g, MOUSE). The Ss saw the category first, 
but the order in which the adjective and letter 
restrictors were presented was varied: on half 
of the trials the adjective came .5 sec. before 
the letter (e.g, ANIMAL-SMALL-M), while on 
the remaining trials the letter came first (e-g., 
ANIMAL-M-SMALL). Reaction time was taken 
from the onset of the last restrictor. 

When the to-be-presented stimulus is ANI- 
MAL-SMALL-M, during the interval between 


^ 
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SMALL and M there is a fixed number of stimulus 
alternatives that S can expect as the forth- 
coming restrictor. However, when the letter 
comes first, and S is anticipating the adjective, 
the set of alternatives he can expect is ill- 
defined and innumerable. In other words, 
there is a great deal of uncertainty as to the 
identity of the forthcoming restrictor—any 
adjective is possible. An uncertainty hypothe- 
sis predicts 5 would respond more quickly in 
the former case than in the latter. 


EXPERIMENT if 
Method 


Subjects, The Ss were 16 students at the New 
School for Social Research. Each participated in 
one experimental session lasting approximately 50 
min, 

Materials. X noun categories were se- 
lected, and e egory was paired with a letter 
and an adjective (e.g., ANIMAL-SMALL-M). Two 
index cards were prepared for each category: One 
contained the adjective restrictor on the left side 
and the letter restrictor on the right, while the 
other card contained the reverse arrangement. 
"Throughout the experiment, then, each category 
was presented as often with the adjective before 
the letter as with the adjective after the letter. 

Procedure. ich S was told that we were in- 
vestigating human memory. He was told he would 
be presented with a category name followed by a 
letter and an adjective, and that he was to provide 
a word that belonged in the overlap defined by the 
triplet. For example, if shown the stimulus ANIMAL— 
SMALL-M, he was to produce the name of a small 
animal that began with the letter M. It was ex- 
plained that sometimes the letter would appear be- 
fore the adjective, and sometimes afterward. 

The S sat in front of a screen in which was a 
window covered by half-silvered glass. An index 
card containing the restrictors was placed in a dark 
enclosure behind the window and was revealed by 
illuminating the enclosure. "The S spoke his response 
into a microphone placed in front of him, 

A trial consisted of the following events: (a) E 
spoke the name of the category; (6) 2 sec. later the 
first restrictor was illuminated; (c) after a ,5-sec 
interval, the second restrictor was illuminated, and 
simultaneously an electric timer with a dc clutch 
was started; and (d) S's verbal response activated 
a voice key that stopped the clock and terminated 
the trial. If S did not respond in 10 sec., the trial 
was terminated by E. Twenty warm-up trials 
preceded the experimental trials. 


Results 


Only correct responses (96%) are in- 
cluded in the following analyses. Re- 
sponses were grouped into two classifica- 
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tions depending on whether the adjective 
came before or after the letter. Median 
latencies were obtained for each S's re- 
sponses in each of the two classifications. 
For each classification, group mean laten- 
cies were obtained by averaging medians 
for individual Ss. These computations 
yielded a value of 1.00 sec. when the 
adjective came first, and a value of 1.45 
sec. when the adjective came second. 
These means are significantly different, / 
(15) = 5.612, p < .01. 

Two median RTs were also computed for 
each category, one for responses to adjec- 
tive-letter stimuli and the other for re- 
sponses to letter-adjective stimuli. For 43 
out of 46 categories, Ss produced a response 
more quickly if the adjective occurred 
before the letter restrictor; for only 3 cate- 
gories, the reverse result occurred. 

When the letter comes first, and S awaits 
the adjective, there is a great deal of un- 
certainty as to what the restrictor will be. 
An uncertainty interpretation correctly 
predicts that Ss will take longer to respond 
in this condition. 


GENERAL Disc 


SION 


The present experiments required Ss to re- 
trieve well-learned category members from 
semantic memory. In the first experiment, 
noun categories were paired with either an 
adjective or a letter, and these stimuli were 
presented in a randomly intermixed sequence 
of trials or in a nonmixed sequence. Reducing 
the uncertainty as to the identity of the forth- 
coming restrictor with advance cues did not 
facilitate RT, as an uncertainty interpretation 
would predict. 

In the second experiment, noun categories 
were paired with both an adjective and a 
letter. An uncertainty interpretation seems 
to correctly predict that Ss will take longer 
to respond when the letter restrictor is given 
first. In spite of the fact that an uncertainty 
interpretation makes a correct prediction in 
Experiment II, it in no way explains the 
results of Experiment I. What is needed is a 
model that can handle both of these results. 


The Dictionary-Network Model 


The model we propose attaches some im- 
portance to the finding of Experiment I that 
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Animal 


Bird 


Chicken 


FIGURE 2. 


the time to respond to an adjective stimulus 
is independent of whether S knew he would 
be seeing an adjective, but the time to respond 
to a letter stimulus is not independent. This 
finding suggests to us that intrinsically dif- 
ferent processing strategies may be operating, 
and this notion is embodied in the following 
model. 

First, we assume that when a word is stored 
in semantic memory, at least three kinds of 
information about that word are included: 
semantic, phonemic, and orthographic infor- 
mation. The assumption is reasonable since 
we know that Ss use these three kinds of 
information when they search for particular 
words (Brown & McNeill, 1966). 

We call the model the dictionary-network 
model, for reasons that will become obvious. 
It includes a "dictionary," as diagrammed in 
Figure 2, which is simply a listing of the 
lexical items of the language. As to the ques- 
tion of what constitutes a lexical item, we 
realize that it could be a word, a morpheme, 
an idiom, or some other unit, but we do not 
attempt to decide on the question of how the 
listing is ordered. The dictionary includes 
both phonemic and orthographic information 
about each of its entries, and, furthermore, 
each entry serves as an address telling where 
semantic information can be found. Thus, 
we have a content-addressable system in which 
each entry provides direct access to the portion 
of semantic memory that represents the mean- 
ing of that entry. 

The semantic network is a highly complex 
arrangement of interconnected concepts or 
categories. Information about the subsets of 
each category as well as its supersets is in- 
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The dictionary-network model of semanti: Lory 


cluded. Within eac! 
clusters of items that have 
mon may exist, So, for es 
and yellow fruits may be 
another in memory. 

To illustrate, the word animal nv 


ategory 


an entry in the dictionary, “ince 

nary itself contains phonem id ort 
information about the w mi 

use this information to a- >% e 
“What letter does the w ! in 


with?" To answer a question abo. 
ing of the word, it is 

part of the semantic 

information. Figure « pr 
network that might I 
someone's semantic m 
can use the information 
work to answer qu 


ie mean 


animal?" or “What is th: c of 
animal?" í 
Using this as a gener vegin 
to describe how 5 mana n answer 
to the kinds of questions i sked in our 
experiments. How does he fir a) a category 
member to which a particule 5 is 
applicable (e.g., edible birds); and (b) a cate- 
«letter 


gory member beginning with a 
(e.g., an animal beginning with Ð) 
Consider case a in which 5 must name an 
edible bird. We now assume that the re- 
trieval process has at least three major steps. 
In keeping with our earlier view (e.g. Freed- 
man & Loftus, 1971), the first step consists of 
entering the appropriate category birds, The 
next step consists of finding an appropriate 
cluster of items and picking a member of it. 
Once this member is found, the third step 
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involves looking up the “name” of the item 
by reentering the dictionary via a pathway 
from that member to its name. Thus, S finds 
that the edible bird he has picked is called a 
chicken. At first glance, the notion that one 
could find a member without knowing the 
name of that member may seem awkward. 
But, on second glance, there is no reason why 
a particular type of category member, or any 
other concept for that matter, must neces- 
sarily have available a single name for it. 
Clearly, there can be concepts without names.* 
For example, one can comprehend the concept 
of a "navigational instrument used in measur- 
ing angular distances, especially the altitude 
of sun, moon, and stars at sea," without the 
name sextant being accessible (Brown & Mc- 
Neill, 1966). The dictionary-network model 
is a reasonable system that allows access of 
conceptual meaning via the lexical entry in 
the dictionary, as well as permitting use of the 
meaning as a direct guide to the lexical entry. 

Consider case b, in which S must name an 
animal beginning with the letter D. Again, S 
begins by entering the animal category, which 
is accomplished by looking up animal in the 
dictionary and going directly to the portion 
of memory that contains information on the 
meaning of animal. The next step is some- 
what different for questions of this type. 
Here we assume that S systematically pro- 
ceeds along pathways leading from animal 
instances to the dictionary representations of 
the names of those instances. This quasi- 
parallel simultaneous search proceeds toward 
the dictionary from all possible members 
(though a less dominant member may start 
off more slowly or slightly later). The dictio- 
nary is used to determine whether a given 
animal instance is one that begins with the 
letter D, and the search terminates when one 
is found. 


Effect of Advance Knowledge 


An explanation in terms of the model of the 
differential effects on adjective and letter trials 


*This point is not particularly original. For 
example: ‘‘Concepts need not have names [Collins 
& Quillian, 1972, p. 316],” and "Concepts ... need 
not correspond in a one-to-one fashion with words. 
In fact, most concepts have no simple natural 
language equivalent [Rumelhart, Lindsay, & 
Norman, 1972, p. 216]." As to the question of 
what form of representation the nonverbal aspect 
of a concept takes, we do not attempt to decide 
that issue here. 
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of advance knowledge about the type of re- 
strictor is straightforward. On adjective trials, 
the first step of the process is to enter the 
category. The next step, which involves 
finding the appropriate cluster to enter, cannot 
begin until S knows what cluster he is to enter. 
The S essentially must wait until the adjective 
is presented before he can begin this next 
step. Thus, knowing that an adjective is 
forthcoming should be of no help; that is, RT 
to O-A stimuli should equal RT to M-A. 

For letter stimuli, a different situation exists, , 
Here, the first step of the process is entering 
thecategory. The next step is a quasi-parallel 
simultaneous search toward the dictionary. 
That is to say, some number of pathways 
leading from category instances to the dictio- 
nary representations of those instances is 
traced. This step can be started during the 
interval between the presentation of the cate- 
gory name and the restricting letter if S knows 
a letter is forthcoming. Thus, RT to O-L 
stimuli should be faster than RT to M-L 
stimuli. 


Effect of Order of Presentation of Attribute and 
Name Restrictors 


Turning now to Experiment II, the dictio- 
nary-network model can easily handle the 
main result, viz., that S is faster to produce 
a category member subject to two restrictions 
when the adjective restrictor is presented first. 
The reasoning is as follows. Whether the 
adjective comes first or second, the first step 
of the retrieval process is to enter the noun 
category. When the adjective does appear 
first, S can then enter an appropriate cluster 
of items, At this point, he systematically 
proceeds along pathways leading from in- 
stances that are within that cluster to the 
dictionary representation of the names of 
those instances. The dictionary is used to 
determine whether any of the instances begin 
with the appropriate letter, and the search 
terminates when one is found. As before, this 
quasi-parallel simultaneous search can be 
started during the interval before the restrict- 
ing letter appears, since S knows a letter is 
coming. 

When the letter appears first, however, the 
picture is somewhat complicated. The S may 
do one of two things. First, after entering 
the category, he may begin to trace pathways 
toward the dictionary, attempting to find a 
member that begins with the given letter. 
Once he finds the member, he must go back 
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to the semantic network to see if the adjective 
applies to it, This strategy requires an extra 
"trip" between the dictionary and the net- 
work. Alternatively, to save the trouble of 
this extra trip, 5 might simply wait until the 
adjective arrives, essentially converting a 
"Jetter-adjective" trial into an "adjective- 
letter" trial. Either way, the prediction is 
a slower RT when the letter appears first. 

We might mention that our prediction for 
Experiment II is not obvious: Other quite 
reasonable models would not make this pre- 
diction. For example, one type of set-theo- 
retic model assumes that once a category is 
entered, its members are sequentially searched 
in a relatively fixed order and that the search 
is terminated when an appropriate instance is 
located (see, e.g., Rosch, 1973). To the extent 
that retrieval involves such a successive con- 
sultation of the memory store, the speed of 
retrieval should not be a function of the order 
of presentation of adjective and letter. If 
each instance is successively examined to see 
if it meets both specifications, and the search 
terminates when such an instance is found, 
there is no reason to expect that presentation 
order will affect RT. 

A slightly different serial model assumes 
that Ss enter a subcategory defined by the 
category name and the first restrictor and 
begin searching in a fixed order for an instance 
which satisfies the second restriction. This 
model would predict that RT is faster when 
the letter is presented first. The reasoning is 
as follows: When the letter is presented first, 
à very small subcategory is usually defined, 
and finding an instance that fits the adjective 
restrictor should be a very easy matter. For 
example, if given Animal-M-small, searching 
the category animals beginning with M should 
result in rapid production of a small animal 
beginning with M, since roughly half of the 
members of that subcategory could conceiva- 
bly (depending on S's criterion) be considered 
small, When the adjective is presented first, 
the subcategory defined by the noun and the 
adjective is searched to find an instance that 
fits the letter restrictor. Since the standard 
of acceptability is much higher now (the in- 
stance must begin with a particular letter), 
the expected length of search for an instance 
meeting this standard is longer. In other 
words, a serial search for an instance meeting 
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the letter restriction would in onsulta- 
tion of many more items, 
A Final Comment 

Can anything be said abou e absolute 
speed of responding to a lett: ı adjective 
stimulus? Is one case more than the 
other? At the present ti have no 
definitive answer to this qu Response 
times depend to a large ex haracter- 


istics of the pool of possibi es; adjec- 


tive and letter stimuli rest: > different 
response pools. If, for the pool 
happens to contain a “dı category 


quency of 
category, 
nse will be 
tus, 1971; 
iven experi- 
iuli will be 
wuli depends 
and letters 
of responses 
reason, the 
on the issue 
tetter restric- 


member, that is, one with a 
being given as an exempla: 
then (other things equal) th: 
relatively fast (Freedman 
Loftus & Suppes, 1972). In 
ment, then, whether adject 
faster or slower than lette: 
largely on the particular adj: 
we choose to include, and t! 
to which they refer. Fc 
dictionary-network model i 
of the difficulty of adjectis 
tors per se. 
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SHORT REPORTS 


PARTIAL CATEGORY CUING: 
THE ACCESSIBILITY OF CATEGORIES 


RICHARD E, PARKER ! Axp LINDA WARREN 
Williams College 


In order to investigate the hypothesis that the retrieval of some retrieval cues 
reduces the probability of retrieving other retrieval cues, experimental Ss were 
given half of the category names from a categorized word list at the time of 
recall, while control Ss were given no category names at the time of recall. 
The results supported the hypothesis and were interpreted as supporting a 
hierarchical model of retrieval from the memory store, 


Rundus (1973) offered a conception of the re- 
rieval process which by its own nature tended to 
nit the accessibility of information in memory. 
introduced a hierarchically organized model of 
memory store with retrieval from retrieval cues 
(^ 78) down to words being accomplished by means 
ol series of random draws. All draws were as- 
sum. 1 to be with replacement. The probability of 
any particular word being accessed from a particular 
RO as equal to the ratio of that word's strength 
of yciation with the RQ to the sum of the strengths 
oí ociation with the RQ of all the words associ- 
! with the RQ. This is the ratio rule. It was 
roposed that both the presentation and recall 
¢ word increased the strength of association 
I cen the word and the appropriate RQ. This 
c ination of hypotheses produces the position 
that the retrieval of one word increases its own 
probability of retrieval, while at the same time re- 
ducing the probability for the other words associ- 
ated with the same RQ. This ultimately leads to 
th« termination of recall. 

Another implication of the model is that the pre- 
sentation of list words as cues for retrieval, by in- 
creasing their own strength of association with an 
RO, should reduce the probability of retrieval for 
other words. This is the well-known negative 
effect of giving list elements as cues for retrieval, 
first reported by Slamecka (1968). In order to 
test this prediction directly, Rundus (1973) varied 
the number of words from a category given to S as 
cues, According to the model, giving proportionally 
more list elements as cues for a category should 
cause proportionally fewer other words from that 
category to be recalled, His results supported this 
prediction, 

Rundus (1973) also proposed that the retrieval 
of RQs obeyed this same set of rules, which is the 
point of interest of this paper. In his model, the 
experimental list served as the center for a series of 
random draws with the RQs associated with the 
list serving as the pool. As before, the probability 
of retrieval was dependent on the ratio rule, and the 


1 Requests for reprints should be sent to Richard E. Parker, 
who is now at the Department of Psychology, University of 
California, San Diego, P. O. Box 109, La Jolla, California 92037, 


presentation and retrieval of an. RQ strengthened 
its association with the list. Thus, according to 
Rundus's model the presentation or retrieval of one 
RQ should increase its probability of retrieval, 
while at the same time reducing the probability of 
retrieval of the other RQs. Therefore, for the recall 
of a categorized word list, if Ss were given only 
some of the category names as RQs, then the re- 
trieval of the other RQs should be impaired. The 
purpose of this experiment was to test this higher 
level hypothesis. 

Method. There were 16 control Ss and 16 experi- 
mental Ss with conditions run in a random order, 
Each S was paid $1 for his services, 

Two sets of 40 words, each set consisting of 20 
categories, 2 words per category, were selected from 
the Battig and Montague (1969) norms such that 
none were proper nouns, The sets of words were 
so composed since an examination of Tulving and 
Pearlstone's (1966) data indicated that the maximal 
effect would be obtained by the use of a large 
number of small categories. These category names 
and members were recorded on magnetic tape in a 
blocked fashion, with the category name first, 
followed by the two category members, For the 
experimental condition, the names of the categories 
were typed on index cards. For the control con- 
dition, a set of 10 numbers was typed on index cards, 

Each S was tested individually. All Ss were 
fully advised concerning the procedure and the 
composition of the lists before the presentation of 
the first list. All Ss were told that their object 
was to recall as many category members as possible 
and that the category names were presented solely 
for their benefit. In addition, experimental Ss 
were told that their optimal strategy would be to 
first recall as many words as they could without 
using the index cards on which the category names 
were typed. After doing this, they could try to use 
the index cards to facilitate further retrieval. 
These instructions were designed to decrease any 
tendency to recall cued categories earlier than non- 
cued categories, always a concern in studies of cued 
recall. 

After the instructions, 5 was presented, by means 
of a tape recorder, one of the lists of 20 category 
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TABLE 1 


CATEGORY RECALL DATA EXPRESSED AS MEAN PER S 
nv Lisr, CONDITION, AND CATEGORY TYPE 


List 1 List 2 
Type of : S 
category REA : 
Experi- Experi- ~ 
mental Control mental Control 
Cued 8.8 5.1 8.4 3.7 
Noncued 2.9 AS 2.6 460 
otal BE feel sa AUT 114 8.3 
names and 40 category members at a 2-sec/item 


rate. If he was in the experimental condition, S 
next read aloud, from index cards at a 2-sec/item 
rate, 10 of the category names. According to the 
model, this should increase the strengths of associa- 
tion to the list of the 10 category name RQs. After 
this, the index cards were placed face down beside 
S, and he was given 2.5 min. for written recall. In 
order to cue each of the categories equally often, 
half of the experimental Ss read the category names 
that had been presented in the odd serial positions, 
while the other half read those presented in the 
even serial positions. In both cases, these names 
were read in the same order in which they had been 
presented originally. 

If S was in the control condition, the procedure 
was the same except that the cards containing the 
10 numbers were substituted for the cards containing 
the 10 category names. 

Thirty seconds after the end of the recall interval 
each S was presented the other list of 20 category 
names and 40 words. This was deemed desirable 
since Rundus's (1973) results indicated a rather 
small effect for the manipulation. All Ss remained 
in the same condition for both the first and second 
lists. 

Results and discussion. In order to determine 
whether or not giving experimental Ss category 
names as RQs introduced a differential guessing 
strategy, the intrusion data, by condition and trial, 
were compiled. All values varied around 9% of the 
total correct recall, indicating no differential guessing 
strategy. The words per category recalled by the 
two groups were also calculated. They varied be- 
tween 1.6 and 1.7 for both lists, with the control 
group tending to recall slightly more words per 
category. However, the difference was clearly not 
significant. For the experimental Ss, there was no 
difference in the number of words per category 
recalled in cued and noncued categories. 

The data of primary concern are the category 
recall data contained in Table 1. In Table 1 and 
hereafter, List 1 refers to the first list studied by S, 
regardless of its content, while List 2 refers to the 
second list studied by S, also regardless of the words 
in the list. For the purpose of data analysis, each 
control S was yoked with an experimental S who 
had studied the two lists in the same order. This 
yoking was used solely to provide a distinction 
between “cued” and "noncued" categories for con- 
trol Ss. A category was scored as recalled if either 
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category member was recalled. The category rcc! 
data for List 1 indicated that experimental 
called slightly more categories than control 
(1, 30) = 6.51, p < .05, effect size ot = .04. 
type of category, whether cued or noncued, had a 
highly significant effect, F (1, 30) = 52.93, p < .001, 
w? = .32; and the Category Type > iti 
interaction was also highly signi i 1,30) 
41.10, p < .001, «* .25. Thus, the data indi icate 
that experimental Ss recalled more cued 
than control < ecalled fewer noncued 
than control Ss, s is as predicted by 
(1973) model. As is apparent from Table 1 
data for List 2 were very similar to the data for 
List 1. Thus, the category recall data suppor 
proposition that the retrieval of some cate; 
reduces the probability of retrieving other categor 
The same effects were present for the word re 
data: Experimental Ss recalled 70% of the cued 
words as opposed to 22% of the noncued words; 
control Ss recalled about 38% of each type of word 
Another possible explanation of these 
be based on the retention intervals of cued a 
cued words. Maybe the effect is due to a 
retention interval for the cued words i 
a shorter retention interval for the ci 


for the noncued words, then it would not 5e possib 
to evaluate the validity of Rundus's (19 del. 
To test this possible explanation, experi: 
response sheets were divided into fifths on 

of total number of correct words recalled, 

data for both List 1 and List 2 a in Tab 

in terms of the percentage oí tota! cued-wor 
noncued-word) recall and in terms of t! 
amount of recall for each fifth. For hS wis 


nd some nonc 


r List 2), the ziedia 


recalled some cued word 
(n = 14 for List 1, » 


recall position occupied by cued and noncued words 
was also calculated. For List f, the mean ol « 
median positions for cued words was 10.9 and tor 
noncued words, 6.5, by randomization p< 


Thus, for experimental Ss the cu 
appeared later in the recall oupu 


nón 


words. This is the exact opposite of what a retei 
tion interval argument would req! \s the me 
TABLE 2 
TAGE (AND MEAN NUMBER) OF Worps RECALLED 
FOR EACH FIFTH OF 
2 
Fifth rd ec ed E 
Cued Noncued Cued Noncued 

1 15.3 3&3 | 200 20.0 
(2.14) (1.59) (2.75) (86) 
2 18.6 24.3 19.8 20.6 
(2.60) (1.13) (2.13) (89) 
3 21.5 15.4 19,7 20.9 
(3.01) (71) | (2.71) (.90). 
4 23. 89-* | 19.7 20.9 
(3.31) CAT) (2.71) (.90) 
5 21.0 17.0 20.7 17.7 
(2.94) (79) | (2.85) WO, 
Total 100 100 100 dm 

(14.0) (4.6) (13.8) (4.3 

URS. 
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amount of recall data in Table 2 indicate, this 
difference was basically due to the difference in 
total amount of recall. Since S$ knew that the 
category names would be available later, he may 
have tended to withhold cued-word recall This 
may have contributed to this result. Any such 
tendency, of course, could have only had the effect 
of reducing the magnitude of the obtained results. 
In any case, the retention interval for cued words 
was not shorter than the retention interval for non- 
cued words. Similarly, for List 2 the mean median 
word position for both cued and noncued words 
was 9.9, Thus, these data indicate that the argu- 
ment of different retention intervals is not tenable. 
"The lengths of the retention intervals did not differ 
in the required direction. 

It should be noted that both the reading of the 
10 category names and the freedom to use them 
later as cues were necessary in order to eliminate 
the above possibility of different retention intervals. 
Having S merely read the 10 category names and 
not to have them available later would create the 
problem of different retention intervals. It would 
be to S's advantage to recall the cued categories 
first before he forgot their names. Furthermore, 
according to the model it was necessary to have S 
read the cues in order to establish different strengths 
among the RQs. Thus, the dual manipulation was 
necessary. 

A second alternative explanation for the obtained 
results that experimental Ss spent more time 
trying to retrieve from the cued categories to the 
neglect of the noncued categories. However, this 
explanation does not seem viable for the current 
experiment both because of the long recall interval 
and because experimental Ss recalled more cued 
words than noncued words in each fifth of their 
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output. This is apparent from the mean amount 
of recall data in Table 2. 

In conclusion, the results clearly indicate that 
when an S is given some of the category names 
from a categorized word list as RQs at the time of 
recall, his recall of these categories is enhanced. 
This result is consistent with the findings of Tulving 
and Pearlstone (1966). Moreover, at the same time, 
S's recall of the other categories is reduced. This is 
the more surprising and revealing result. It is also 
consistent with Rundus's (1973) model of the recall 
process, While the present data support Rundus's 
model, it should be noted that his model is not en- 
tirely adequate. In particular, it is not able to 
account for sequential aspects of recall. Even in 
free-recall experiments the output order does become 
stereotyped (e.g., Bousfield, Puff, & Cowan, 1964). 
This is in addition to other designs in which S is 
specifically asked to order his output. In amy case, 
the present data do support Rundus's conception 
of the recall proce: By so doing, they support a 
hierarchical model of the memory store, with re- 
trieval of one RQ reducing the probability of 
retrieving other RQs. 
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Retroactive inhibition (RI) was measured in the A-B, A-D; A-B, C-D; and 
rest-control paradigms. No RI was evidenced for any group by the conventional 
measure of number of correct responses in a recognition test of retention. How- 
ever, strong indications of interference were found using a reaction time mea- 
sure of response latency to the first-list pairs tested at retention. This extends 
the Postman and Stark results and indicates that the associative matching 
test is not entirely free of interference in the sense of generalized response 


competition. 


The hypothesis that retroactive inhibition (RI) 
in the A-B, A-D paradigm may be due to suppres- 
sion of the repertoire of first-list responses was 
proposed by Postman and his associates (Postman 
& Stark, 1969; Postman, Stark, & Fraser, 1968) on 
the basis of the fact that when learning of both lists 
and the test for retention of the first list are carried 
out by an associative-matching procedure no or very 
little RI occurs. ‘This finding, of course, contrasts 
with that seen in studies in which the responses 
must be recalled. Consequently, Postman et al. 
(1968) suggested that RI might occur on the bas 
that first-list responses were not available at time 
of test, maintaining, however, that the first-l’st 
associations are still intact as they can be manifested 
on the matching test. It should be clear, however, 
that suppression of the first-list response reperto:re 
is simply a mechanism additional to any specific 
item interference that may arise, as it does, for 
example, in the A- Br paradigm (Postman & Stark, 
1969) and that item-specific interference cannot be 
ruled out because suppression of the first-list re- 
sponse set may occur (Postman & Underwood, 1973, 
p. 24). 

Various experiments have been reported in which 
item-specific interference has been obtained, either 
in addition to or in the absence of evidence for loss 
of the first-list response set (Birnbaum, 1970, 1972; 
Delprato, 1972; Weaver, Duncan, & Bird, 1972; 
Weaver, Rose, & Campbell, 1971; Wichawut & 
Martin, 1971). Postman and Underwood (1973) 
have suggested limitations on the permissible con- 
clusions to be reached from some of these studies, 
all of which involved the recall method. It has 
seemed to the authors that a further exploration of 
the associative-matching procedure itself would 
have merit. First, it would be desirable to replicate 
the findings of Postman and Stark (1969). Second, 
it is possible, with a matching test, to add informa- 


1 This work was done during the period of A. F. Sanders’ 
tenure as adjunct professor of psychology at the Pennsylvania 
State University. The help of William G. Hughes is gratefully 
acknowledged. 
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of correct 
neasure 


tion to what is provided by the numbe 
matches by measuring reaction times, ‘I 
has often been proposed as an index of a 
strength, although it may perhaps be giver 

tive interpretations in a matching test as op; 
to a recall test. In the matcbir Its report 


by Postman and Stark performance lose to 
perfect in the A-D and certain other p n8; 
and one could therefore argue that perfor 
the matching test was not sensitive to diffe 
associative strength but that reaciion time i 
be. While it would obviously t:e possibic t: 
an experiment in which matching performance {i 
not be at a virtually perfect leve ce án 
the Postman and Stark experiment was very high. 
Since the conclusions conc: ig the supy T 
the first-list response pool come fron 

sure of perfor 


ment, evidence from another me 
ance seems desirable. 


METHOD 


The experiment involved three independen 
of 10 female Ss each, one for each of tii 
A-B, A-D; A-B, C-D; and A-B, control 
paired associates of each list cons 
letters as stimuli and one-syllable fo: 
tives as responses." 


if sergle 
-letter adjec- 
A study-test procedure was 
followed, with the test part of each trial being ina 
recognition format, as described below. The con- 


trol group performed an arithmetic tà í 
interval after A-B learning of the same duration 
as required for interpolated learning (GL) in the 
other groups. 

Subjects. The Ss were volunteers from cl 
introductory psychology at the Pennsylvani 
University who received credit toward their grades 
for participation in experiments. Assignment of Ss 
to conditions was made randomly prior to arrival 
at the laboratory. 

Materials. There were two sets of stimulus terms 
(Ai, A2) which were combined to generate four basic 


were the same as those used by Postman and 
We are indebted to L. Postman for providing 


* The mate: 
Stark (1969). 
them, 
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lists. In each group Ss were assigned so that four 
Ss learned A;-B; as original learning (OL), while 
two Ss learned each of Ai-Bs, A;-B;, and ABs as 
OL. 

Several rules were employed in the construction 
of the pairs: (a) Stimulus letter and first letter of the 
response were different; (b) there was no phonemic 
overlap between the name of the stimulus letter and 
the response term; and (c) there was no apparent 
associative connection between the stimulus letter 
and the response. In the A-D list the responses to 
the same stimuli did not begin with the same letter 
as in the A-B list. Rule c was also applied in the 
construction of the A-D and C-D lists. 

Procedure. A given study-test trial consisted, 
first, of the presentation of the 10 pairs one after 
another for 3 sec. each and, second, of 10 recognition 
tests in each of which one stimulus term was pre- 
sented simultaneously with all 10 response terms. 
"There was an interval of 2 sec. between the study 
trial and the first recognition test of the following 
test trial. An S’s response on the recognition test 
items advanced the memory drum so that the next 
stimulus term and the 10 responses were exposed. 
The interval between the tenth recognition test of 
a test trial and the next study trial was 2 sec. This 
procedure of alternate study and test trials con- 
tinued until S had achieved 3 successive perfect 
trials following the first trial on which he responded 
correctly to all stimuli. Consequently, the first list 
was overlearned. There were 10 study-test trials 
of the second list (A-D or C-D), the first presenta- 
tion of this list following 2 min, after the completion 
of OL. In the control group, this 2-min. interval 
was followed by 9 min. of work on an arithmetic 
task which required simple addition of digits, with 
the total number of digits added in successive 30-sec, 
periods being displayed to S as knowledge of result: 
At the completion of the interpolated task, there 
was a 3-min, rest interval after which S was given 
3 consecutive test trials, each conducted as in OL, 
for the A-B list. (No study trials or knowledge of 
results were given on these test trials.) 

In the study trials four different orders were used 
for the presentation of the pairs, The orders of 
presentation of the stimuli in the test trials were 
different from any of these, and there were 10 
different orders of the responses in each test trial. 
In these orders each of the 10 response positio: 
was correct once, with a different correct position 
(randomly chosen with restrictions) over the series 
of test trials. After each test trial S was told the 
number correct she had achieved on that trial. 
After the interpolated task, the first retention trial 
was given in the same trial order as the last test 
trial in OL. 

Apparatus and manner of response. Three Stowe 
memory drums were used to make unnecessary any 
changing of tapes between lists. A reaction-time 
apparatus provided 10 buttons for S, and S was to 
press the one which corresponded to that one of the 
10 responses simultaneously displayed on the mem- 
ory drum which she thought to be correct. She 


was told to be sure of the response before making it 
but to respond as quickly as possible once she had 
made her choice. Thus, each S was urged to give 
fast but accurate responses. The numbers from 1 
to 10 were affixed to the memory drum, each corre- 
sponding to one of the 10 responses displayed on 
each one of the 10 recognition tests of a test trial, 
and .5 called out the number of the response as she 
pressed the corresponding button. This enabled E 
to determine whether the response was correct. 
The Æ read the reaction time of the response from 
a digital counter in hundredths of a second. The 
S’s reaction, as indicated, advanced the drum, there 
being a }-sec. delay between the button press and 
the appearance of the next stimulus and a new 
order of the 10 responses. Hunter timers were 
employed to control the drum during the study 
trial. 


RESULTS 


First-list learning. The mean number of trials to 
the first perfect trial was 6.9 (s? = 3.2) for the A-D 
group, 7.3 (s? = 4.7) for the C-D group, and 8.7 
(s* = 9.1) for the control group. Statistical analysis 
of these means yielded no significant differences, 

To compute reaction times (RTs), the data for 
each S were handled in the following way. From 
the 10 RTs for each test trial the median and the 
first and third quartile (Q) values were determined. 
The median was used rather than the mean in order 
to avoid undue influence of extreme scores. Then, 
for the group of Ss, the mean values of the median, 
Qi, and Qs were determined over the values for 
individual s on each trial. It is interesting to note 
that the mean of the median individual RT scores 
dropped significantly from the first trial on which 
all responses were correct to the final overlearning 
trial, F (1,27) = 24.4, p < .001. This was true for 
all groups as shown by the failure of the Groups X 
Trials interaction to reach significance, The groups 
did not differ in the means of their Qı, median, or 
Qs RT values on the final overlearning trial, F 
(2, 27) — .220, .863, .905, respectively. 

Second-list learning. Although 10 trials were 
given on the second list, the first perfect trial was 
actually reached by Group A-D in 2.4 trials and 
by Group C-D in 2.3 trials. By this measure, 
Group A-D shows no negative transfer in compari- 
son with Group C-D. However, the means (in sec.) 
of the median RT on the first perfect second-list trial 
were 4.26 for Group A-D and 3.25 for Group 
C-D. This difference is significant, £ (18) = 1.80, 
p «.05. When difference scores were computed, 
i.e., when the mean of the median RTs for the final 
overlearning trial was subtracted from the mean of 
the median RTs for the first perfect IL trial, Group 
A-D had a significantly higher difference score than 
Group C-D, ¢ (18) = 2.31, p < .05. These reaction 
time differences suggest the occurrence of negative 
transfer in Group A-D relative to Group C-D, al- 
though trials to criterion on the second list did not. 

First-list retention tests. The three groups gave 
virtually identical and almost perfect scores for 
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Figure 1. Means of quartile: (Qi), median, and Qs reaction 


times (RTs) for Groups A-D, C-D, and control on the first 
perfect trial in original learning (OL), the final overlearning 
trial in first-list learning (T), and the three retention test trials. 


correct recognition, thus confirming the results for 
this measure reported by Postman and Stark (1969). 
However, a different picture appears for the reaction 
time measure, as shown in Figure 1 for mean Qi, 
mean median, and mean Q; scores for each of the 
three retention tests, the first perfect trial in OL, 
and the final overlearning trial (T). It may be seen 
that for all three measures the groups are compara- 
ble for T and for the first perfect trial in OL. In 
addition, they are quite similar for Qı over the test 
trials, the analysis of variance producing insignifi- 
cant Fs (2, 27) of 1.268, .641, and .882. However, 
the groups differ for the mean of the medians at 
Retention Tests 1 and 3, F (2, 27) = 10.304, p < 
.01, and: =3.825, p < .05; the similar smaller dif- 
ference in the second test is not significant, F (2, 27) 
= .886. A similar pattern occurs for Qs. Here the 
differences for Tests 1 and 3 are also significant, that 
for Test 2 not. The respective Fs (2, 27) are 13.359 
(p < .01), 888, and 4.512 (p < .01). Figure 1 
shows that the significant differences arise from the 
longer reaction times for Group A-D, as compared 
with the other groups which do not differ from one 
another, 

The foregoing analysis indicates that the fastest 
responses (those at Qi) in the retest of A-D do not 
differ from those found in C-D or in the rest con- 
trol. However, the responses with reaction times 
at the median and at Qs in A-D do differ from those 
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in the other groups, significantly so at the first and 


third retention tests. On the first of € tests, it 
is worth noting that half of the Ss A-D show 
longer median RTs than the slowest 5 in C-D, and 
eight A-D Ss show longer RTs than control 


Ss. There is no comparable dispersion of RTs for 
Group A-D in the final overlearning trial of OL. 
Three other analyses of the data were made. 
The first was based on the notion that in the reten- 
tion tests for A-B in Group A-D S may start by 
recalling the appropriate response from the second 


list, not making at once the switch fre he second- 
to the first-list pool of responses. As he encounters 
the first-list response set over the tests of the first 
retention trial S might be able tc nstate the 

list response pool so that he wouid not make 
and have to reject the initial res; o later 
stimuli in the trial. Such a process ild predict 


a decline in RTs over the first retention tl, An 


anal was therefore made by listi: ch 5 
his median RT on Recognition Tests 1 10 
during the first retention trial and a^ icross 
differences for Tests 1 and 3 are also that 
trial. While an analysis of ve the rest- 
control group's data for test position wa rificant, 
F (9, 162) = 3.48, p < .01, the analy was. not 


significant for the other « Hence, there is no 
evidence for recovery of the List 1 response pool 
over the 10 recognition tests in either Croups A-D 
or C-D. 

The second anal followed. that of Runquist 
(1957) who related the number of correct responses 
to a given stimulus-response (5-R) pair with its 
resistance to RI in a recall task, In the present 
study, RTs on the first retest for pairs which yielded 
incorrect responses during OL were compared with 
RTs for pairs alv correct. The analysis showed 
that the median RT for previously missed pairs was 
greater than the median RT for pairs alwa 
for 22 of 27 < (Three Ss made no errors in acquisi- 
tion.) This is significant by sign test at p « .05. 
The 5 Ss whose median RT for missed responses 


roups. 


s correct 


was less than or equal to the median RT for their 
correct responses were distributed a lows: 2 
controls and 3 AB-CD Ss. It is difio o argue 
that this is strong evidence for strengih of specific 


S-R bonds, however, particularly in the light of a 
correlational analysis by .S of the RTs on the follow- 
ing trials: (a) first OL 100% correct trial, (b) final 
OL trial, (c) first retention test, (d) second retention 
test, and (e) third retention test. While some 
correlations reached a significant level (r = .540, 
t > 1.8) for some Ss, no correlations showed a con- 
sistent trend either for all Ss or within each group. 
Thus, there was no evidence to support the role of 
individual associative strengths as measured by RT 
in interlist transfer or retention. 3 
The third analysis was an attempt to determine 
whether the responses in Group A-D whose reaction 
times were at Q, were distinguished from the rest 
of the responses. The only factors that could be 
examined were the serial position of the item in the 
first retention trial and the position of the correct 
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button for each item on that trial Neither an 
analysis of variance nor a correlational analysis 
yielded significant relations between either of these 
factors and reaction times of first-list retention-test 
responses, A 
DISCUSSION 


There are four major findings from this experi- 
ment, aside from the replication of Postman and 
Stark's- (1969) failure to find RI in A-D on an 
associative-matching test so far as correct responses 
are concerned. (a) Reaction times in second-list 
learning were elongated for Group A-D with respect 
to those for C-D. (b) Reaction times at the first 
retention test for List 1 were elongated in Group 
A-D as compared to the other groups, and this 
elongation persisted over the remaining retention 
tests. (c) Some responses in A-D (those at Qi) 
did not show this elongation. (d) Efforts to relate 
specific associative strength, position in the test 
order, serial position of the item, and position of the 
correct button failed to relate to reaction times on 
the first retention tes! 

If one defines negative transfer and RI as the 
occurrence of any deficiencies of performance in the 
A-D paradigm as compared to C-D and the rest 
control, then both have been found in this study in 
the reaction time measure, despite the absence of 
negative transfer in terms of trials to criterion in 
second-list learning and the absence of RI in terms 
of correct responses on the matching test. The 
interpretation of these findings is somewhat puzzling 
in view of the presence of the correct responses on 
each test in every trial. 

One could interpret operationally defined RI in 
terms of diminished associative strength of first-list 
responses. Such an interpretation finds no direct 
support in our data, however. A second interpreta- 
tion that fits both the negative transfer and the RI 
found is a response competition notion. Thus, nega- 
tive transfer in terms of reaction time would be 
produced in the A-D case if, when S sees a stimulus 
term, he thinks of a first-list response. He will, of 
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course, fail to find this response in the set of alterna- 
tives provided and will have to find in the set that 
response which goes with the stimulus. The same 
process would produce RI if the second-list re- 
sponses, at the end of IL, now are dominant and 
one occurs when a stimulus is presented. Again S 
would not find that response among the set of 
alternatives on the first-list retention test and would 
either have to identify the correct response in the 
set or attempt to remember what it was for that 
stimulus and then confirm the memory by checking 
for the response among the alternatives. 

This kind of interpretation, which at least seems 
plausible, essentially amounts to that of generalized 
response competition. Hence, the negative transfer 
and the RI found in the present data suggest the 
presence of generalized response competition in the 
ssociative-matching situation, not detected by the 
simple measure of number of correct choic 
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CLARIFICATION OF THE ROLES OF ABSOLUTE AND RELATIVE 
FREQUENCY ON LIST DIFFERENTIATION 


MILDRED MASON? anp LEONARD KATZ 


University of Connecticut 


Three different ratios (.20, .50, and 1.00) of List 1 to List 2 trials were investi- 
gated under two levels of absolute frequency (6 or 12 trials apportioned be- 


tween the two lists). 


learning employed three response categories (“List 1,” “List 2," 
Results clearly pointed to the effects of absolute frequency on list diff 


tion. Contrary to Winograd’s finding, 


A list differentiation test immediately following List 2 


“neither list"). 
entia- 


list differentiation improved consistently 


as the relative frequency of List 1 to List 2 trials increased. 


The notion of list differentiation (S's discrimina- 
tion of the previous list membership of test stimulus 
items) has played a large role in interference theori 
of memory (see Keppel, 1968). Until relatively 
recently, however, the phenomenon has been some- 
what lacking in both data and explanation. Wino- 
grad (1968) found that given an equal number of 
trials on both lists, list differentiation (LD) im- 
proved with absolute frequency. An explanation of 
LD has been complicated by the further finding 
(Winograd, 1968; Winograd & Smith, 1966) that 
an unequal number of learning trials on two lists 
produces better LD than do balanced frequencies. 

However, the issue of whether unequal frequencies 
do, in fact, improve LD has not been satisfactorily 
settled. Reported relative frequency effects may be 
questioned on two counts. First, as Winograd 
(1968) pointed out, absolute frequency and relative 
frequency are unavoidably confounded in designs 
where the number of trials on one list are held 
constant while varying the number of trials on the 
second list (e.g, in comparing a three trial-three 
trial group with a three trial ix trial group). 
Second, relative frequency effects may possibly be 
due to differential guessing strategies ava ble to 
Ss in the unbalanced frequency conditions. In 
previous studies LD was tested following List 2 
learning by assuring Ss that every item seen on the 
LD test had either been on List 1 or on List 2, 
with Ss required to make a forced choice into one of 
the two categories. Conceivably, 5s with many 
trials on one list and few trials on the other list 
could have assigned unfamiliar test words to the 
list with the fewer trials. This would artificially 
inflate correct responses on the weaker list and would 
account for the putative superiority of the three 
trial-one trial group to the three trial-three trial 
group. 

Bower (1972) also has raised the possibility of 
differential guessing strategies in an attempt to 
reconcile the Winograd data on unequal frequencies 
with the Anderson and Bower (1972) list marker 
theory of list differentiation. The data on the 
superiority of unbalanced frequencies on LD consti- 
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tute an embarrassment to à strength theory of LD 
as well as to the list marker theory. 

The present study was designed to clarify the role 
LD 
methodological differences exist be 


relative frequency plays in Two major 


he present 


study and previous studies. First, the lependent 
variables of absolute and relative frequency have 
been unconfounded by varying the dist ibution of 
List 1 and List 2 trials within fixe Is of absolute 
frequency. This technique has ti ther ad- 
vantage of keeping the total tempor ¡persion 
equal for all relative frequencies ithin any one 


level of absolute frequency (se ;man & Waters, 
1970). Second, the list different u test used in the 


present study presented S wit ioice into 
three “list” categories, inste< were told 
that each word on the LD test na: r been on the 
first list they learned, on the second n learned, 


or not on either list. 

In all other importan! i 
duration of word expo intertrial interval, and 
interlist interval) the nt npted to 
replicate or remain sit he Wi d (1968) 
studies. 

Method. The de 


position, 


the total 


number of trials for Li ined was 
one of two levels of absolu total of 6 
or a total of 12 trials. vel, three 
relative frequencies of List trials were 
employed; the ratios were .2! d 100, Thus, 


Ss in the 6-trial condition received one of the 
following conditions: 1 trial on List 1 trials on 
List 2 (1-5); 2 trials on List 1, 4 trials on List 
2 (2-4), or 3 trials on each list (3 Counterparts 
in the 12-trial condition were 2-10, 4-8, and 6-6. 
Three lists of 20 nouns each were ma! hed for fre- 
quency on the Thorndike-Lorge (1944) count; 48 of 
the 60 nouns were taken from the lists used by 
Winograd (1968). Each 5 receiv d two of the lists 
for learning, with the third list serving as the set of 
distractor items on the LD test. The three lists 
were counterbalanced across conditions, each list 
serving as first list, second list, and distractor list 
for 4 of the Ss in each condition. Five different 
orders of each list were used. All words were pro- 
jected onto a screen in the front of the room by a 
Kodak Carousel slide projector activated by Hunter 
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timers to advance at 4-sec. intervals. The intertrial 
interval was 15 sec.; the interlist interval was 45 sec, 

The Ss were initially instructed that they were 
participating in an experiment on memory and that 
they were to try to learn each word as it appeared 
on the screen. At the end of List 1 learning, Ss were 
instructed that they would be shown a second list 
of words and that they were to learn those words 
also. After the last trial of List 2, test instructions 
were given and test materials handed out. The LD 
test itself started 35 sec. after the last word of List 
2 was shown, and consisted of the 60 words from all 
three lists arranged in random order on three pages. 
The LD test required Ss to make a forced choice for 
each word into one of the three categories “List 1,” 
“List 2," and "neither list" on the test booklet. 
"The LD test was timed with a stopwatch, with E 
saying "next" at 4-sec, intervals indicating that S 
was to proceed to the next word. Two different 
orders of the LD test were used, with half of the Ss 
in each condition receiving each order. 

The Ss were 72 undergraduates, who participated 
in partial fulfillment of requirements for an intro- 
ductory psychology course. There were 12 Ss 
randomly assigned to each condition. Groups of 
between 1 and 4 Ss served simultaneously. 

Results and discussion. Figure 1 presents the mean 
number of correct list identifications for List 1 and 
List 2 combined. List differentiation appears to 
bea function of both absolute and relative frequency. 
An analysis of variance was performed on the com- 
bined number of correct List 1 and List 2 identifica- 
tions for each S in each level of absolute and relative 
frequency, The effect of absolute frequency was 
significant, F (1, 66) = 15.65, p < .001. The 
effect of relative frequency was in the opposite 
direction of that found by Winograd (1968). List 
differentiation was best when the two lists were 
presented an equal number of times and was worst 
when List 1 was presented only 20% as often as 
List 2. The linear component of the relative fre- 
quency effect was significant, F (1, 66) = 5.22 
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Trials 7 12 


Trials = 6 


Mean No. of correct identifications for both lists 


.20 50 1.00 
Relative frequency of List 1 to List 2 
trials 


List differentiation as a function of absolute fre- 
l trials List 1 plus List 2) and relative frequency of 


FIGURE 1. 


The mean square for the interaction was also non- 
significant and small. 

The mean number of correct identifications in 
each of the three response categories for the six 
conditions is shown in Table 1. The “Selected” data 
columns present the mean number of times a par- 
ticular category was selected, ignoring whether the 
choice was correct or incorrect. The contention that 
a forced-choice LD test into two categories biases 
LD in favor of the less frequent list is supported by 
the “Selected” data in Table 1. It is precisely 
at the most unbalanced distribution of trials (rela- 


b < .05; the quadratic component was negligible. tive frequency 20) that the “neither list" 
TABLE 1 
MEAN NUMBER OF Coma WITHIN EACH RESPONSE CATEGORY AND MEAN NuMBER 
or Tim s 
| Neither 
Relative frequency. | 
and condition =a ~ à 
Correct Selected Correct. Selected Correct Selected 
20 Js. i" 1 3 z 
1-8 12.42 17.83 | 18.00. 16.33 23,00 
2-10 15.41 17.92 18.42 | 18.92 21.83 
M 13.92 17,88 18.21 | 17.63 22.42 
| | 
-50 
2-4 13.50 16.00 18.17 2242 18.75 21,58 
4-8 18.08 | 19.58 19.25 20.75 19.25 19.67 
M 15.79 — | 17.79 18.71 | 21.59 19.00 20.63 
| | | 
1.00 | 
3-3 15.67 17.83 | 17.42 20.08 19.58 22.08 
68 18.83 19.58 | 19.25 20.33 19.92 20.08 
M 17.25 18.71 | 18.34 20.21 19.75 21.08 
Overall M 15.65 18.12 18.42 20.50 18.79 21.37 
Hit rate 86 90 88 
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TABLE 2 
ASSIGNMENT OF ERRORS WITHIN RESPONSE CATEGORIES 


“Neither” List 1 List 2 

Absolute | list errors errors errors 

frequency A 
and T 
condition | List | List | «neither | ES | “Neither” | Lt 

6 trials S | 

1-5 0 77 n | 3 21 
24 Mg 32 46 6 16 
E 5| 0 22 W | 10 | 21 

| 

12 trials | | 
2-10 à|0 35 19 | 2 art 
4-8 9 | 0 5 18 | 1 | 8 
6-6 1| 0 o ft] 2 7 


category is used more than would be predicted by 
chance; x? (2) = 8.89, p < .02. 

‘The manner in which the total number of errors 
within each response category was distributed 
within the other two response categories is shown 
in Table 2. The data clearly show more assign- 
ments of List 1 items to the distractor category 
("neither list") in Conditions 1-5 and 2-10 than in 
the other condition; x? (2) = 42.61, p < .001, for 
the 6-trial conditions, and x? (2) = 25.11, p < .001, 
for the 12-trial conditions. If there had been no 
distractor category and if we assume that List 1 
items assigned to the distractor category would 
have been placed in the List 1 category, it is easy 
to sce from the data of Table 2 that there would bea 
higher frequency of "correct" List 1 responses in the 
most unbalanced frequencies—the result reported 
by Winograd (1968). "Thus we may conclude that 
previous reports of LD being poorer in the balanced 
frequency conditions are indeed due to Ss in these 
conditions not having the optimal guessing strategy 
available to Ss in the unbalanced frequency condi- 
tions (ie, “if list membership of an item is not 
known, assign it to the weaker list"). 

The finding that LD improves as relative frequency 
increases is difficult to account for with either a 
strength theory or a list marker theory, In addi- 
tion, both theories run into other specific problems 
with the data. A strength theory of LD would 
predict that more List 2 responses would be assigned 
to List 1 as the number of trials on both lists 
approached equality. As can be seen from Table 2, 
this did not occur at the lower level of absolute 
frequency, At the higher level of absolute frequency, 
the number of List 2 errors assigned to List 1 was 
actually less for the 6-6 group than for the 2-10 
group, x? (1) = 417, p < .05. Similarly, strength 
theory would predict that more List 1 responses 
would be assigned to List 2 when relative frequency 
= 1.00 than when relative frequency = .20, This 
did occur in the lower absolute frequency condition, 
x? (1) = 5.81, p. < .02, but not in the higher absolute 
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frequency condition, where the trend was rever «d. 
We suggest that subjective organization of the s 
accounts for these results and conclude with V ino- 
grad (1968) and Hintzman and Waters (1970) that 
absolute frequency affects list organization in addi- 
tion to the effect it has on strength. 

A list marker theory would predict a high 
rate for the distractor items than for LI 
as can be seen from Table 1, did not occ 


importantly, a list marker theory would pr at 
distractor items have an equiprobable « f 
incorrectly acquiring a List 1 or a List 2 T. 
With the counterbalanced stimulus lists use i 
study, the possibilities of associative generalization 
are the same. The data from Table 2 clear! W 
that distractor items were incorrectly igned to 


Furthermore, distractor 
gned to List 1 when 

2,x* Q) = 397b 
9.74, p. < 01, 
t teme 


List 1 and not to List 2. 
items are most likely to be : 
List 1 is weak relative to Li 
p < .001, for 6 trials, and x? 
for 12 trials. 
poral dimension, 

Hintzman and Waters (1970) showed p 
performance was based on simultineot 
pendent discriminations on f ney auda 
recency dimension, We would | 
or temporal, dimension to account for iT ing 
as the number of trials on both ti che f 
Within either level of absolu 1ency I 
time is equal among levels ve fre 
Results of the present experi: r t 
that the way this total t iapersion 1 
partitioned into two successive Gin k: 1% 
and “List 2") affects temporal d ability and 
hence LD, Whether th ner in whiet tal 
time is partitioned int é 
retention of temporal rm affects the 
decision process invo! (Hinri a 
matter for future invest 


The confusion is cle n 
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RECALL ACCURACY OF EIDETIKERS! 


CHARLES J, FURST; K 


H FULD, asp MICHAEL PANCOE 


Dartmouth College 


A large group of second-, third-, and fourth-grade 


hoolchildren were shown 


pictures and asked to recall them verbally. ‘The children were also asked to 
report whether or not they could see a projected image of the stimulus picture 
after it had been removed. Recall scores were compared for those children who. 
reported such eidetic images and those children who did not. It was found 
that the eidetikers recalled considerably more material than noneidetikers, and 
this result was discussed in the context of questions raised about the nature of 


eidetic imagery. 


In the recent revival of interest in eidetic imagery 
(ET), the question has been raised as to the unique- 
ness of this memory phenomenon. Neisser (1970) 
suggested that since there have been no demon- 
strable differences in capacity between children who 
report Els and those who do not, Els should be 
regarded as ordinary visual memories which are 
somewhat arbitrarily construed by their beholders 
to be located in front of their ey The most 
thorough available modern evidence fails to demon- 
strate a strong superiority in visual recall for 
eidetikers (Leask, Haber, & Haber, 1969), although 
Haber and Haber (1964) had earlier reported a 
slight difference in favor of these children. 

We recently had the opportunity to check on the 
recall accuracy of eidetikers during the course of 
screening a large number of children for El (for 
subsequent electroencephalographie studies), and 
these results are reported here. Free-recall reports 
of visually presented material were obtained and 
scored for accuracy, using an open-ended but reliable 
scoring system. In view of Neisser's (1970) a 
tion, it was decided that the most relevant test 
would be of the null hypothesis that no differences 
in recall accuracy would exist between children 
reporting Els and those not reporting Els. Accord- 
ingly, no distinction was made in the recall reports 
of the children between those items recalled from 
an El and those recalled from other types of memory. 

Method. A total of 192 children, comprising the 
second, third, and fourth grades of a local public 
school, were tested for eidetic imagery. Of these, 
21 satisfied the criteria for being included in the 
eidetic group for this study. ‘These criteria were 
a reported EI to both of the stimulus pictures, the 
duration of at least one of which was greater than 
15 sec., and visual scanning (as judged by Æ) ac- 
companying the reported eidetic episodes. The 
remaining children constituted the control group, 
with the exception of 7 children who were not in- 
cluded in the study because they reported Els that 
did not meet the above criteria. 


1 This research was supported by Grant HD-07037 to the 
first author from the National Institute of Child Health and 
Human Development. The authors wish to thank the students 
and teachers of the White River Junction (Vermont) Elemen- 
tary School for their cooperation, " 

Requests for reprints should be sent to Charles J, Furst, 
Department of Psychology, Dartmouth College, Hanover, 
New Hampshire 03755. 


"The audiotapes of the screening sessions were 
incomplete or unscorable for 38 Ss, including 3 
from the eidetic group. This left an eidetic group 
of 18 Ss and a control group of 129. 

The children were individually conducted to a 
room near their classroom, where they were told 
that Es wished to show them pictures and ask 
them how they “sce things and remember them," 
They were first given a demonstration of a chromatic 
afterimage to a colored square on a gray back- 
ground. This was to aid them in distinguishing 
between "seeing" an object that was no longer 
present from merely remembering it. 

Each S was then shown the first stimulus picture, 
a detailed colored illustration from a child's story- 
book, mounted on a gray cardboard backing, The 
mounted picture was placed on an easel in front of 
S for 30 sec. and then removed, revealing a blank 
cardboard surface on the easel of the same color as 
the picture backing. The child was asked to look 
at the board and to report whether or not he could 
still see any parts of the picture there, If S replied 
that he could, he was pressed to make sure that 
he distinguished this "seeing" from remembering 
and al: to the location of this image, He was 
asked to describe the image and to report when it 
had disappeared. When the image had disappeared, 
the child was asked to recall everything he could 
remember about the picture, All Ss not reporting 
Els were asked to report all details of the pictures 
which they could remember. The session was 
recorded on audiotape. In addition, Æ recorded the 
duration of any reported Els and his judgment of 
whether or not they were accompanied by visual 
scanning of the easel board, At the conclusion of 
the recall of the first picture, a second stimulus 
picture was presented in a similar fashion. 

The audiotapes were scored by two persons for 
number of items accurately reported, The system 
used gave 1 point for each object in the picture 
correctly reported and 1 point for each “attribute” 
of an object, Attributes were defined to include 
such things as color, article of apparel, or activity, 
but not location in the picture, For example, a 
report correctly stating that the picture contained 
two elephants, wearing blue bow ties, standing in the 
road would score 1 point each for elephant, blue, bow 
lie, and road. An additional point would be added 
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Ficure 1. Frequency distributions of recall scores for controls 
(open circles) and eidetikers (triangles). 


for stating that there were a pair of elephants but 
not for the verb standing, which would have been 
interpreted as indicating location rather than an 
activity. Interscorer reliability was quite high 
with this method (r = .99). No distinction was 
made between items reported from an EI and those 
reported from a subsequent memory after the EI 
had faded. Each S was given a recall score which 
was the sum of his scores for both stimulus pictures. 

Results and discussion. Mean number of item 
correctly reported is shown in Table 1. An analysis 
of variance shows the difference between eidetikers 
and controls to be significant, F (1, 141) — 73.1, 
b < .001, as well as a main effect of school grade, 
F (2,141) = 34.9, p < .001. The interaction was 
not significant (F < 1). The children reporting 
Els were clearly superior to controls in their recall 


TABLE I 


Mian VISUAL RECALL SCORES FOR EiDETIC 
AND CowTROL GROUPS 


Grade 
Group 
Second Third Fourth 
Kidetikers 24,5 24.3 22.8 
SD TS 3.2 6.9 
Controls 119 16.1 16.3 
SD 5.3 6.3 Tet. 
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bya ratio of about 3:2. Distributions of recall sc 

are given in Figure 1. An examination of I 

1 reveals that although the eidetikers were gens. aily 

near the top of the distributions, there were excep- 

tions, and there were many noneidetic children wt 

did as well as the eidetiker: 
These results contradict Neisser's (1970) assc: 

that eidetikers do not have a demonstrably : 

recall of visual scenes than other people. H 

Haber (1964) reported only slight superi: 

visual recall for their eidetikers, although the 

scores were not given. Haber and Haber's 

categories were predefined (ie., Es decided bei 

hand what the admissible items would be), wi 

the system used here was more open-ended 

how a visual scene is coded tends to be a hig 

idiosyncratic affair (Bartlett, 1932), an open-end 

scoring procedure might be a more sensitive test 
The study reported here did not separate out in- 

formation supplied from a supposed E] and informa- 

lled subsequently through a more common- 

As stated above, thi m 


tion reca 
place visual memory. 
because it was desired to test the hypothe 
differences between children who report Ei 


others. We often found it di It to determin 
when a child reported an item from an E! ond when 
it was being supplied from memory. R ‘son 


(1969) s ed that the presumed superi 
of an eidetiker might be due to 
time, since he would be able to 
some time after the original kocture 
moved. Some such explanat 
the most direct. However 
are still viable. For 
children who report 
hence, report more ite 
interestingly, it might 


»olonged 
iew” the 


had been 

is would be 
| hy 
it is po: 
more talk 
during recat 


that children with the 


ible that 


ls are 


most vivid visual imagery of a p iy wA 
nary type tend to localize their images in front of 
their eyes. 

Although the eidetikers in t Ja 


clear superiority in visual rec: 
was not so far off the distributions of scores 
their classmates that one woul: stified in cor 
cluding from these data alone 
aspecialability. Stromeyer and Psotka (1970) haye 
recently reported a remarkable objective demonstra 
tion that an eidetic image can be “photographic 

in the sense of being highly detailed in texture, at 
least for one talented adult S. It is still unresuived 
whether or not the type of eidetic imagery commonly 
reported by children is of this type. Stromeyer and 
his co-workers (C. F. Stromeyer, personal communi- 
cation, 1971) have been unable to demonstrate this, 
but it remains possible that childhood eidetic 


for 


iat they pos ed 


3Some of the children reported brief EIs which disappeared 
before material from it was verbally reported. Other children 
started out reporting material “seen” on the board and then 
afterward said that most of the report had been from memory, 
when the EI had faded. For the 20 reports from which scores 
for eidetically seen and remembered material could be dis- 
tinguished, 45% of the material correctly reported came from 
the EI. There was great variability around this Tage per- 
centage (s — 31), with no relationship between the total recall 
score and the percent of the material recalled from. the EI. 
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imagery is extremely detailed but fragmentary, and 
therefore, different from ordinary visual memory. 
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PROCESSING THE TERMS RIGHT AND LEFT: 
A NOTE ON LEFT-HANDERS! 


GARY M. OLSON £ 


Michigan State University 


KEVIN LAXAR 


Naval Submarine Medical Research Laboratory, Groton, Connecticut 


A simple word-picture verification task using the terms right and left was run 


with left-handed Ss. 


Olson and Laxar (1973) reported a series of in- 
vestigations which supported the claim that the 
mental representation of right is less complex than 
that of left. Their experiments eliminated simple 
scanning biases as the source of their effects and 
suggested rather that asymmetries in the reaction 
times to simple word-picture displays were due to 
characteristics of central information processing. 
These experiments were all run with right-handed 
Ss, and it would be of interest to know what com- 
parable performance is like with left-handers. The 
present paper provides evidence on this question. 

The assumption that right is the less complex term 
is quite plausible for right-handers. Their handed- 
ness coincides with cultural and social conventions 
to make right the natural reference direction in the 
sagittal plane, just as one can argue that aboveness 
and forwardness serve as reference directions in 
other planes (Clark, 1973; Olson & Laxar, 1973). 
However, the picture is much less clear for left- 
handers. Three possibilities seem to have equal 
a priori plausibility: (a) because left-handers live in 
an essentially right-handed world, their internal 


1 This research was carried out under the auspices of the 
Bureau of Medicine and Surgery, Navy Department, Research 
Work Unit MF51.524,004-2002DX5G. The opinions and as- 
sertions contained in this paper are the private ones of the 
authors and should not be construed as official or as reflecting 
the views of the Navy Department or the Naval Submarine 
Medical Research Laboratory. 

? Requests for reprints should be sent to Gary M. Olson, 
Department of Psychology, Michigan State University, East 
Lansing, Michigan 48824. 


Jnlike right-handers, these Ss showed no evidence for 
processing asymmetries between right and left. 
their performance was comparable to right-handei 


In all other respects, however, 


model of space conforms to that of right-handers; 
(b) although social and cultural conventions favor 
right as the reference direction, the left-hander's 
handedness is the more dominant factor, yielding 
left as the simpler term; and (c) since the criteria 
leading to a and b are in conflict, no performance 
asymmetry comparable to that found with right- 
handers should emerge. This last possibility could 
signify either the absence of an underlying con- 
ceptual asymmetry or conflict between alternative 
sets of representations. 

Method. The design and logic of this experiment 
were identical to Experiment | in Olson and Laxar 
(1973). There were four different displays, contain- 
ing either the word RIGHT or LEFT in block letters 
centered in a small square and a black dot located 
either to the left or right of the square. The Ss’ 
task was to decide as quickly as possible whether 
the word correctly named the side on which the dot 
appeared. The displays in which the word correctly 
named the side were the /rue conditions, and the 
other two displays were the false conditions. 

The displays were presented to Ss in blocks of 
28 trials, each display appearing seven times per 
block. The first 4 trials in'a block were warm-ups 
and were not analyzed. Each S was given four 
blocks of trials for a total of 96 experimental trials 
plus 16 warm-ups, all within a single 45-min. session. 
The order of presentation within each block was 
independently randomized for each S and each 
block. 
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FiGURE 1. Mean reaction times for correct responses and error percentages for right- and left-handed Ss. 


The Ss sat in a special experimental booth which 
contained the viewer for a three-field tachistoscope 
and a response panel at desk-top height immediately 
below the viewer. This panel contained two keys 
for true and false responses and a start button 
centered above the keys. The Ss initiated each 
trial by pressing the start button with one of their 
middle fingers while their forefingers were positioned 
on the two response keys. The position of the true 
key was counterbalanced across Ss. One second 
after they pushed the start button one of the four dis- 
plays flashed on and Ss responded true or false as 
quickly as possible. A fixation point was used to 
direct Ss to look at the word first. On each trial Ss 
were informed of their time and correctness. 

The Ss were 16 left-handed civilian and military 
personnel, 14 males and 2 females. Left-handednes 
was judged by asking each S which was her or his 
preferred hand. All Ss wrote with their left hands. 

Results and discussion. Mean latencies were com- 
puted for correct responses only for each Block X 
Display X S combination, yielding 16 means for 
each S. Since the performance of the 2 females 
did not differ from that of the males, sex of S was 
ignored in all of the analyses, The mean latencies 


and error rates for the four displays collapsed over 
blocks are shown in Figure 1. Comparable data 
from Experiment I of Olson and Laxar (1973) are 
also shown. 

A repeated measures analysis of variance of the 256 
means for the effects of stimulus word (RIGHT-LEFT ), 
dot position (right-left) and test block (1—4) r 
vealed the following significant effects: (a) irue di 
plays were verified more quickly than fa/se one 
yielding a reliable interaction between RIGHT-LE 
and the positions right-left, F (1,15) = 2847, p < 
.001; (b) Ss became faster as the experiment pro- 
gressed, F (3, 45) = 13.32, p < 001. The Dot 
Position X Blocks interaction approached signifi- 
cance, F (3, 45) = 2.34, .05 < p < 0. Analysis 
of errors revealed no significant F ratios, although 
the Stimulus Word X Dot Position interaction, Æ 
(1, 15) = 3.87, .05 < p < -10, and the main effect 
of blocks, F (3, 45) = 241, .05 <p <.10, ap- 
proached significance. 

As in the previous experiments with right-handers, 
stimulus-response (S-R) compatibility effects were 
very strong. These can be summarized as follows. 
For each S one of the true displays referred to the 
same side as the true response key, and this repre- 
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TABLE 1 


SUMMARY OF SIGNIFICANT F RATIOS FOR ANALYSIS OF VARIANCE CONTRASTING LEFT- AND RIGHT-HANDERS 


Source of variation 


Primary effect 


Interaction with 


MS for appropriate 
handedness 


error term 


Stimulus word 
Position of dot 
Test blocks | 
Word X Dot (true-false) | 


F (1,24) = 3.39% 


| 
ns 
F (3,72) = 45,83 | 
F (1,24) = 49.09% | 


0718. 
.0272 
40131 
.0204 


*.05« p <.10, 
»* p <.05. 
ED < 001. 


sented a compatible (C) relationship. The other 
true display represented an incompatible (IC) rela- 
tionship. This was also true for the false displays. 
For each S the average difference between the 
correct reaction times for C and IC responses 
(IC — C) was computed for true and false displays 
separately. The mean difference of 171 msec. for 
true displays was significant, / (15) = 5.52, p < .001, 
while the difference of. —16 msec. for false displays 
was not. By comparison, the right-handers of 
Olson and Laxar (1973) had a mean difference of 
116 msec, for true displays and 17 msec. for false 
ones, An analysis of variance revealed that these 
S-R compatibility effects for left-handers did not 
interact with any of the effects revealed by the main 
analysis, while ¢ tests showed that there was no 
difference between right- and left-handers. 

A comparison of the results for right-handers and 
left-handers in Figure 1 reveals that their perform- 
ance was quite different. These data were subjected 
to an unweighted-means analysis of variance with 
handedness as a between-Ss factor and stimulus 
word, dot position, and test block as within-Ss 
factors. There was no main effect for handedness. 
Table 1 summarizes the F ratios for the primary 
within-Ss sources of variation and their interac- 
tions with handedness. The only significant F ratio 
among the other higher order interactions was the 
triple Stimulus Word X Dot Position X "Test Block 
interaction, F (3, 72) = 2.93, p « .05. These anal- 
yses confirm what can be seen in Figure 1. Of the 
three possibilities listed in the introduction, c ap- 
pears to be supported. The absence of any higher 
order interactions either in the main analysis or in 
the secondary analysis for effects of S-R compati- 
bility suggests that within this task there was no 
consistent pattern of trade-offs between right- 
normalized and left-normalized spatial models. 
However, no distinction between the absence of a 
reference direction and conflicting spatial models 
can be drawn from our evidence. Since our defini- 


tion of left-handedness was whether S categorized 
himself as left- or right-handed, perhaps a more 
sensitive index of handedness or of hemispheric 
lateralization of function might offer greater insight 
into the left-hander's conceptualization of space. 

The striking contrast between right-handers and 
left-handers on this task is consistent with the 
findings of other investigators, For instance, others 
have found that the performance of left-handers in 
tasks requiring spatial abilities is neither parallel to 
nor complementary to that of right-handers, but 
rather is intermediate (Luria, McKay, & Ferris, 
1973). Disorders of spatial orientation are most 
acute in the right-left dimension (Corballis & Beale, 
1970; Howard & Templeton, 1966), and there is 
evidence that right-left difficulties are much greater 
for left-handers (Gerhardt, 1959; Hécaen & de 
Ajuriaguerra, 1964). "These facts along with the 
data reported here reinforce our earlier view that. 
"the conceptualization of the right-left dimension 
is much more labile, flexible, and subject to disrup- 
tion than that of the other two spatial dimensions 
[Olson & Laxar, 1973, p. 2899.” 
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SYLLABLE-DEPENDENT PRONUNCIATION LATENCIES IN NUMBER NAM} 
A REPLICATION : 


STUART T. KLAPP? 


California State University, Hayward 


Previous experiments have shown that the latency for the initiation of pro- 
nunciation of words and two-digit numbers depends on the number of syllables 
to be named. However, Henderson, Coltheart, and Woodhouse have recently 


shown that the presence of decades (e.g 


-, 20, 30) was confounded with syllables 


in the number studies, and that this confounding could have accounted for 


some of the previous dz 
of the naming latency effi 


Several recent studies have reported that when 
single words or two-digit numbers are read aloud, the 
latency from stimulus onset until the beginning of 
overt vocalization increases as the number of s 
lables to be pronounced increases (Eriksen, Pollack, 
& Montague, 1970, Experiment III; Klapp, 1971, 
Experiment 1; Klapp, Anderson, & Berrian, 1973, 
Experiment 1). However, Henderson, Coltheart, 
and Woodhouse (1973) failed to obtain the syllable 
dependence in latency for naming numbers. The 
purpose of the present investigation was to reassess 
this phenomenon in light of the considerations 
raised by these investigators. 

In their Experiment I, Henderson et al. (1973) 
used the same general procedure and material as 
were employed in the previous number studies 
(Eriksen et al., 1970; Klapp, 1971). The stimuli 
were two-digit numbers of two-, three-, and four- 
syllable pronunciation. The Ss responded by pro- 
nouncing a number as quickly as possible after 
presentation, and time to initiate vocalization was 
determined as a function of the number of syllabl 
to be pronounced. The results were ambiguous, 
with faster latencies for two syllables than for three 
or four syllables, but with no difference between the 
latencies for three and four syllables, These am- 
biguous results contrast with the monotonic increase 
in latency as a function of syllables reported under 
apparently identical conditions in the previous 
studi However, in the Henderson et al. study 
there was no control condition to assess whether 
there were differences in the sensitivity of the 
equipment to the initial sounds associated with 
responses of different lengths, Klapp (1971) and 
Klapp et al. (1973) used a control in which Ss 
examined the stimulus for a few seconds, and then 
pronounced it when a signal appeared. Latencies 
measured from this signal should be and were inde- 
pendent of the number of syllables. Since Hender- 
son et al. did not provide these control data for 


1 This research was supported, in part, by the Small Grants 
Program of the Research Foundation of California State 
University, Hayward. The author wishes to thank Nancy 
Lankford, who gathered the data, and Alicia Klapp, who 
prepared the stimulus material. 

? Request for reprints should be sent to Stuart Klapp, De- 
partment of Psychology, California State University, Hayward, 
California 94542. 


a. The present experiment is a successful replication 
`t for number stimuli controlling for this variable. 


their equipment, it is difficult to evaluate the extent 
to which the relative latencies assigned to the dif- 
ferent classes of numbers may be an artifact of 
equipment activation. Further instability in the 
data could have resulted from the attempt to ass 
the relatively small effect of syllables from a som 
what limited number of observations—four trials 
per stimulus number for each of 15 Ss, compared to 
eight trials for 12 Ss in Klapp (1971 

Although the results of the Henderson et al. 
(1973) Experiment I may be attributed to methodo- 
logical factors, the results of their Experiment. II 
are quite another matter. They pointed out that 
in the previous studies, including their own Experi- 
ment I, the two-syllable numbers were primarily 
decades (e.g. 30, 40, etc.), while decades did not 
appear among the three- and four-sy!lable numbers, 
They then showed that two-syllabie decades were 
pronounced with shorter latencies than two-syllable 
teens (e.g., 13, 14, etc stimuli have 
matched initial soun above men- 
tioned sound activation control is of no consequence 
in evaluating this particular comparison. Thus, 
confounding of decades could account for the previ- 
ously reported shorter latencies for two-syllable 
numbers compared with three- and four-syllable 
numbers. Furthermore, most if not all of the over- 
all effect of syllables reported previously by Eriksen 
et al. (1970) and by Henderson et al., Experiment t. 
was attributable to a shorter latency for two as 
compared to three and four syllables, with little or 
no effect of three compared to four syllables (al- 
though Klapp, 1971, did report a substantial three 
to four syllable effect), Therefore, Henderson et al, 
concluded that the previous reports of syllable- 
dependent latencies for naming numbers may well 
be attributed to the confounding of decades with 
syllables without the need to assume that response 
latency is related to number of syllables. Although 
these considerations are a problem only with number 
stimuli and do not apply to experiments demon- 
strating syllable-dependent response latencies for 
words (Klapp et al., 1973), it nevertheless appeared 
desirable to attempt a replication of this effect using 
new number stimuli not subject to this confounding. 

Method. The stimuli were two-digit numbers end- 
ing in 7 (yielding four syllables in total pronuncia- 
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tion, e.g., 27), in 6 (three syllables, e.g., 26) or in 4 
(also three syllables, e.g., 24). According to the 
principle that vocalizations of more syllables require 
longer response latencies, the numbers ending in 7 
should have longer latencies than the other num- 
bers. However, which digit is used to terminate 
the number should not matter unless the two- 
syllable digit 7 is used. Therefore, numbers ending 
in 4 or 6 should yield latencies which are identical 
to each other while shorter than for numbers ending 
in 7. The particular digits 4 and 6 were selected to 
be distinctively different in vocalization, while 
equated for syllables. The first digits used in gen- 
erating the two-digit stimulus numbers were 2, 3, 5, 
8, and 9. These particular digits were selected to 
avoid teens and identical digits (e.g., 66). These 
five first digits were used with the three ending 
digits to generate a complete set of 15 stimulus 
numbers which were prepared on 35-mm. slides 
using Instantype numbers and projected to subtend 
a visual angle of 2? high by 3.7? wide. The numbers 
were presented in 15 blocks, each comprised of all 
15 numbers in random order. A brief rest pause 
occurred after each block. The last 12 blocks were 
included in the analysis, while the early blocks were 
regarded as practice. 

The Ss were adult native speakers of English 
fulfilling a course requirement or serving for $2 pay. 
The Ss were alternately assigned to control and 
experimental conditions until 8 Ss had been assigned 
to each condition. Thereafter, all Ss were assigned 
to the experimental condition, for a total of 13 Ss 
in this condition. Data from an additional S in 
the experimental condition were rejected because 
36% of the latencies exceeded 800 msec., including 
20% of the total in excess of 1 sec. and several in 
excess of 2 sec. 

The Ss were seated in a sound-isolation chamber 
into which the numbers Were projected on a rear- 
projection screen through double-glass sound insul 
tion. The E was in an adjacent chamber containing 
apparatus which automatically provided the se- 
quence of events for each trial. The response 
latency was measured by an electronic timer which 
was started when a photocell was activated by the 
onset of the image or signal light on the projection 
screen, This timer was stopped by a voice-activated 
trigger operated by a microphone placed in near 
contact with S’s mouth. Sensitivity was adjusted 
to the maximum possible, while avoiding frequent 
activation by breathing. 

For the control condition, each trial lasted 7.9 sec., 
during which the stimulus number was present for 
6sec. After the number had been visible for 3 sec., 
a visual signal light appeared and the timer was 
started. The Ss were instructed to pronounce the 
number as quickly as possible after the appearance 
of the signal, and the initial vocal sound caused the 
timer to stop. Any trials on which S pronounced 
the wrong number, corrected himself, or generated 
stray sounds were repeated at the end of the follow- 
ing block of 15 trials, and these data were included 
in the analysis. 
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TABLE 1 
PRONUNCIATION LATENCY (IN MSEC.) 


| 


| Stimuli 
3 syllable 4 syllable 
Condition - i e 
| Number ends in 
Ere | go | Number endsin 
] : 7 
| p 
Control 267.9 | 269.9 | 268.9 T 
Experimental | 422:5 | 419.9 | 4212 


For the experimental condition, each trial lasted 
6.9 sec., during which the number was presented 
simultaneously with the starting of the timer and 
remained visible for 3 sec. The Ss were instructed 
to pronounce the number as quickly as possible 
after its appearance, and the initial vocal sound 
caused the timer to stop. Except for the change in 
sequence, the procedure was identical to that of 
the control condition. 

Results. For the control condition, 4% of the 
trials had to be repeated due to number mispro- 
nunciation, while stray noise or equipment error 
necessitated repeating another 2.5% of the trials. 
On an additional .2% of the trials the latency ex- 
ceeded 800 msec. These trials were assumed to 
represent inattention on the part of S, and the 
means were calculated without these data. Means 
were determined for each S for the sets of numbers 
ending in 7, 6, and 4, and the analysis was performed 
on these means. The results appear in Table 1. 
Latencies for the numbers with the three different 
endings did not differ significantly, F (2, 14) < 1, 
and the overall mean latency for three-syllable 
numbers was within .1 msec. of that for four- 
syllable numbers. Thus, the equipment was acti- 
vated by the sounds of the different classes of 
numbers with apparently equal speed. This is 
certainly to be expected, since these sets of numbers 
were balanced with respect to initial sound. 

For the experimental condition, the rates of re- 
peated trials were 1.2% for number mispronuncia- 
tion and 1.7% for stray noise or equipment error. 
For 1.6% of the trials the latency exceeded 800 
msec. and, as in the control condition, the means 
were calculated excluding these data, There was 
no change in the direction of the data trend for 
effect of syllables as a result of discarding these long 
latencies. As is evident from Table 1, the mean 
latency was longer for four-syllable numbers than 
for either of the three-syllable conditions which, as 
expected, yielded approximately equal latencies. 
The three conditions differ significantly from each 
other, F (2, 24) = 5.6, p < .01, with both Newman- 
Keuls contrasts between four-syllable numbers end- 
ing in 7 compared to three-syllable numbers ending 
in 6 or in 4 significant beyond the p < .05 level. 
Four-syllable numbers yielded longer latencies than 
three-syllable numbers for 11 of the 13 Ss (p < .05, 
sign test), with the remaining two Ss showing 
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latencies which differed by less than 2 msec. between 
these conditions. Thus, the syllable effect for 
naming numbers has been replicated in an experi- 
ment which controls for the considerations raised 
by Henderson et al. (1973). The magnitude of the 
effect was 6.6 msec., which approximates the 5-msec. 
three- vs, four-syllable effect reported by Eriksen 
etal, (1970), but which is considerably less than the 
corresponding 14-msec. effect reported by Klapp 
(1971). 

Discussion. The conclusion that syllable depen- 
dent response initiation latencies occur for numbers 
as well as for words suggests that this phenomenon 
is not due to perceptual analysis of the visual 
stimuli into syllabic units (Spoehr & Smith, 1973), 
since syllables are represented at the graphemic 
level only for words and not for the two-digit 
numbers, Although such perceptual analysis may 
occur for words, it cannot account for the syllable 
dependency in response latencies for numbers. 
Klapp et al. (1973) suggested an interpretation 
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which assumes that a vocalization must be pro- 
grammed before it can be emitted, with longer 
programming time for more syllables. This inter- 
pretation correctly predicts that the effect of syl- 
lables on pronunciation latency would occur for 
various modes of presentation, including numbers 
as well as words, and even extending to pictorial 
representations (Klapp et al., Experiment IV 
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Memory for triads of words and meaningful environmental sounds was com- 


pared following a 20-sec. retention interval, during which S: 
Words were recalled better than sounds after music 


following poetry d 


of either poetry or mus 


distraction, while the reverse was true 


shadowed passages 


traction. This in- 


teraction suggests that short-term retention of sounds and words might be 
mediated by different memory processes. 


Memory for common environmental sounds has 
only recently begun to be explored. Miller and Tanis 
(1971) compared recognition memory for long lists 
of printed words, spoken words, and sounds and 
found accuracy scores of 84%, 75%, and 69%, 
respectively, on a two-alternative forced-choice test. 
"Thus, sounds were recognized at about the same 
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level as spoken words, but at a much lower level 
than comparable performance reported by other 
investigators for recognition of meaningful pictures; 
where the accuracy level usually exceeds 90%, 
Lawrence and Banks (1973) improved upon the 
procedure employed by Miller and Tanis by using 
high-fidelity equipment and lengthening the presen- 
tation time for individual sounds. The overall 
recognition accuracy calculated from Ss’ confidence 
ratings was 85%, a figure noticeably higher than 
that reported by Miller and Tanis, but still some- 
what below the level of picture recognition. 
Philipchalk and Rowe (1971) a: 

memory for sounds vs. words would be d 


whether 
erentially 


a 


SHORT 


affected by the requirement to recall the items in 
their presented order, as had been demonstrated for 
pictures vs. words in the visual modality. Their 
results showed comparable performance for sounds 
and words in free recall, but recall of sounds was 
significantly worse in a serial recall task. The fact 
that order information can be better retained for 
verbal material offers one potential distinction be- 
tween the processes that mediate retention of sounds 
and words and raises the possibility of separate 
verbal and nonverbal auditory memory systems. 

The experiment reported here represents a more 
direct test of the independent-systems hypothesis. 
‘Triads of sounds and words were presented for reten- 
tion in the Brown-Peterson paradigm (Brown, 1958; 
Peterson & Peterson, 1959) with either verbal or 
nonverbal auditory distractor activity, The verbal 
distractor task (repeating passages of poetry aloud) 
was intended to disrupt verbal memory processes 
and the nonverbal distractor (humming passages 
music) to disrupt nonverbal memory. If sounds and 
words are encoded, stored, or retrieved via different 
systems, selective interference in recall of the two 
types of triads by the two types of distraction is 
expected, such that sounds should be recalled better 
than words following poetry distraction and words 
should be recalled better following music distraction. 

Method, ‘Twenty-four easily identifiable sounds, 
which ranged 1.4—4.4 sec. in length (X = 3.0), were 
selected from a larger pool of items obtained from 
commercial sound effects recordings (see Philip- 
chalk & Rowe, 1971). Two separate sets of eight 
triads were constructed and recorded on tape, The 
individual sounds within a triad were separated by 
2 sec, or less of silence, giving a mean triad length of 
10.4 and 10. respectively, for the two sets, 
Parallel sets of triads of the dominant labels of the 
sounds were recorded in a male voice with the items 
being read at a rate of one every 3 s Thus the 
word triads were all 9 sec. long. 

Four different versions of the experimental tape 
were drawn up for both sounds and words. One of 
two kinds of distractor material—music (M) or 
poetry (P)—was recorded on the tape immediately 
following each triad. The order of occurrence of the 
distractor material PMMPMPPM for one 
version of each triad set and MPPMPMMP for the 


other. The M distraction consisted of the initial 
20-sec. segments of instrumental melodies taken 
from various types of popular recordings. The P 


distraction included either complete poems or com- 
plete stanzas from poems, read in a female voice 
and timed to last 20 sec. The word “recall,” 
spoken in a female voice, occurred at the end of each 
segment of distractor material, followed after 10 sec. 
by the words "stop writing," and after 3 more sec. 
by the next triad. 

Most of the Ss were paid volunteers from under- 
graduate psychology classes. Twenty Ss each were 
assigned to the sounds and words conditions, with 
distractor type as a within-5s variable. The Ss were 
tested individually or in pairs. If 2 Ss were tested 
together, they were separated by a high cardboard 
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shield that prevented them from seeing each other, 
They were instructed to listen carefully to cach triad, 
to shadow the distractor material, and then to write 
down the names of the sounds or words from the 
triad on a prepared answer sheet when given the 
signal to recall. Ordered recall was not required. 
Practice in shadowing both music and poetry was 
provided prior to presentation of the experimental 
tape. Most Ss experienced very little difficulty in 
repeating the poems aloud and humming along 
with the music segments, so no more than two trials 
on either the M or P practice material were neces- 
sary. The tapes were played through high-quality 
headphones, and Ss’ shadowing was recorded on a 
second tape recorder. Equal numbers of Ss received 
the two different sets of triads in both the sounds and 
words conditions, but a slight error occurred in 
assigning Ss to each distractor sequence for one set 
of triads in the words condition only, This resulted 
in 4 and 6 Ss being run for each sequence instead of 
5 each. Otherwise, the ignment of Ss to list 
conditions was completely balanced. 

After completion of the memory task, Ss heard the 
same set of triads a second time with no distraction 
and were allowed 10 sec. for immediate recall of 
each set. The Ss in the sounds condition were asked 
to use the same names for the sounds as they had 
used for recall in the first part of the experiment 

Results and discussion. To control for the possible 
effects of auditory recognition errors, the recall 
scores were conditionalized on performance in the 
second phase of the experiment. Thus, an item had 
to be reproduced correctly on both tests in order to 
be counted as correct. If S failed to recall an item 
on both tests, performance on the first test was 
scored out of the number of items recalled in that 
on the second time and adjusted to a de- 
itor of 12 for direct comparison with the other 

This occurred for eight Ss who heard the 
is and for one S who heard the words. No S 
:d more than one item on both tests. 

The results of the second test were also used in 
scoring the protocols of the sounds group where Ss’ 
answers did not clearly correspond to the dominant 
label for a particular item, In these cases the sound 
was scored as correct if the same name was used on 
both tests. For both conditions, words which were 
minimally discrepant from a presented item (egu 
minor misspellings and substitution of plural for 
singular nouns) were counted as correct, as were 
homophones of presented items (e.g, symbols for 
cymbals). Incomplete recalls and intrusions from 
previous trials were scored as incorrect. 

The recall scores (Table 1) were analyzed by a 
2 X 2 analysis of variance, which yielded a signifi- 
cant main effect of distractor type, F (1, 38) = 62.7, 
nd a significant Item X Distractor interaction, 
(1, 38) = 29.6, both ps < .001, The interaction 
qualifies the distractor-type main effect, since only 
word recall was significantly affected by M vs. P 
distractor tasks. Of the 20 Ss in the words condi- 
tion, 19 had lower recall following P distraction, 
while the remaining .5 had equal scores for both M 
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TABLE 1 


MEAN NUMBER OF ITEMS CORRECTLY RECALLED AND MEAN 
TIME SPENT IN SHADOWING AS A FUNCTION OF Music 
AND POETRY DISTRACTOR TASKS 


Sounds Words 
Type of e ies v 
distraction 
Recall Shadowing Recall | Shadowing 
Music 9.0 15.1 11.3 16.0 
Poetry 8.1 11.4 6.9 11.0 


and P triads. With the sounds, 10 Ss had lower 
recall following P material, 7 showed the opposite 
effect, and 3 had equal scores. Comparison of 
sound vs. word recall within each distractor condi- 
tion showed that words were recalled better under 
M distraction, ¢ (38) = 3.71, p < .001, while sounds 
were recalled better under P distraction, 4 (38) = 
1.77, p < .05 (one-tailed tests). 

Because of the difficulties inherent in scoring Ss' 
shadowing of music on an accuracy measure, 
shadowing performance for both types of distractor 
material was examined by measuring the total 
amount of time S was actively engaged in shadowing 
during the 20-sec. retention interval. The means 
are included in Table 1. Since the responses pro- 
duced in speaking and humming are qualitatively 
different, meaningful comparisons between dis- 
tractor types are not possible. It should be noted, 
however, that the lower scores produced in shadow- 
ing the poetry probably do not reflect a lower degree 
of accuracy than in the music conditions, as informal 
observation of the Ss suggested that performance 
on the poetry was quite high. The difference be- 
tween the words and sounds conditions for music 
shadowing was reliable, ¢ (38) = 2.27, p< .05 
(two-tailed test). This might be interpreted to 
mean that attempts to rehearse the words during 
the retention interval did not interfere with hum- 
ming as much as attempts at rehearsing the sounds. 
The reverse is suggested by the poetry shadowing 
data, but the difference here was not significant, 
t (38) = 1.05. 

The differential effect of music and poetry dis- 
tractor tasks on recall of sounds and words con- 
stitutes evidence for separate verbal and nonverbal 
auditory short-term memory processes. The results 
do not allow a firm decision regarding the possible 
localization of the selective interference in terms of 
the tripartite division of memory into encoding, 
storage, and retrieval phases. However, the fact 
that the recall differences between sounds and 
words are reflected in the shadowing scores, especi- 
ally for music shadowing, suggests that the dis- 
tractor tasks acted to disrupt attempts to rehearse 
the triads. This conclusion is supported by the 
informal comments of Ss, especially those in the 
words condition, who reported that it was much 
easier to “think about” the words while humming 
the melodies than while repeating the poems. Thus, 
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it seems that the interference is at least partly 
attributable to processes involved in the mainte- 
nance of the items in storage through rehearsa!, but 
possible additional effects on encoding or retrieval 
cannot of course be ruled out. 

The form of the obtained interaction between type 
of triad and type of distraction offers a clue to the 


way in which sounds are represented in “mory. 
Assume that sounds may be represer ither 
directly in an auditory imagery store or rectly 
via their verbal labels in a verbal short-tc: = store, 
or both. If only the names of the sounds held 
in memory, then music vs. poetry distractor tasks 


should have affected recall of sounds and words in 
the same way. This clearly did not happen. If 
the sounds were stored only as nonverbal images, 
we might expect that the nonverbal distractor task 
would be more disruptive than the verbal task. 
This also is not reflected in the data. The most 
plausible interpretation, then, is that the sounds 
were represented via both modes, so that retrieval 
of the image could be used to offset the effects of 
verbal interference and retrieval of the word label 
could help counteract the effects of nonverbal 
interference. 


The independence of verbal and nor erbal audi- 


tory memory stores has been previo lemon- 
strated by Deutsch (1970) who found that inter- 
polation of additional tones betwee parison 
tones in a same-different jude task impaired 
performance, whereas interpolated numburs did not. 


hown that the 
x in immediate 
restricted to 
a non- 
> find- 


ion 


Furthermore, Crowder (1971 
interfering effect of a redundant s 
recall of auditory letter sequences 15 
verbal suffixes, the effect disappearing wh 
verbal suffix (a buzzer) is used. Both of thes 
ings have been interpreted as reflecting the op 


of sensory or precategorical mem: The 
present data show that the d tween 
verbal and nonverbal auditory ; viable 
when processing proceeds to the exte; 'depth"" 
characterized by more meaningful sou 1 real 
word stimuli presented under less string :poral 


conditions. 
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SHOCK-ASSOCIATED WORDS IN A NONATTENDED MESSAGE; 


A TEST FOR MOMENTARY AWAR 


[A 


R. S. CORTEEN! asp D, DUNN 


University of British Columbia 


An experiment was carried out to test for momentary awareness of critical 


words in a nonattended message during shadowing. 


On only 1 out of 42 oc- 


casions when there was a significant galvanic skin response to critical non- 
attended words did an S signal awareness. This finding was taken as support 
for the hypothesis that there can be some semantic processing without 


awarene: 


In a recent paper Corteen and Wood (1972) 
seemed to demonstrate that words that had been 
previously shock associated, and also words that 
had a class relationship with the shock- ssociated 
words, would give rise to a significant number of 
phasic skin conductance changes when they were 
embedded in a nonattended message. The con- 
current attended message task involved shadowing 
à prose passage. The importance of the paper rests 
entirely on the claim that Ss were not aware of the 
significant words in the nonattended channel. The 
idea of semantic processing without awareness 
suggests a wide range of interesting possibilities; 
but the study would be quite trivial if it could be 
demonstrated that Ss were aware even momentarily 
of the significant words. 

Corteen and Wood (1972, p. 311) employed a 
detailed and probing postexperimental interview 
procedure which certainly appears to indicate that, 
at the time of the interview, immediately following 
the experimental sequence, Ss were not aware of 
having heard any of the critical words. This 
procedure, however, does not come to terms with a 
major criticism, which implies that Ss might have 
been aware of the critical words at the time they 
were presented but that the interference produced 
by the shadowing task was so great as to produce 
complete forgetting, not only of the specific words 
but of the experience of having been aware of 
anything at all. This is an important and relevant 
criticism and the present experiment was designed 
in an effort to provide an answer to it. 

The basic technique used in this experiment 
was a slightly modified form of the method used 
by Treisman and Riley (1969) where they instructed 
Ss to stop shadowing at once and tap if they heard 
particular target letters in either shadowed or 
nonshadowed messages. The results obtained by 
Treisman and Riley indicate that the technique is 
workable and that detection in nonshadowed 
messages can be made quite successfully. In the 
present study the only concern was with responses 
to targets in the nonattended message, but beyond 
this and the substitution of a buzzer for ruler 
tapping, the method used was essentially the same. 


! Requests for reprints should be sent to R. S. Corteen, 
Department of Psychology, University of British Columbia, 
Vancouver, British Columbia, Canada. 


Method. "The method used was, up to a point, 
identical to that used by Corteen and Wood (1972). 
Three city names were presented aurally along with 
other words and the city names were paired with a 
mild but unpleasant electric shock. Phasic skin 
conductance changes were measured concurrently 
and were used to indicate the successful formation 
of shock association. The shock. sociated city 
names, together with three city names that had not 
been included in the shock-association procedure 
and three nouns that had been in the original shock- 
association list, were now presented to one ear of 
the S, embedded in a list of randomly selected 
unrelated nouns. Prior to this, Ss were given 
practice in shadowing a prose passage. They were 
now presented with a prose passage in the ear 
opposite to that being stimulated by the city names 
and other nouns and asked to shadow the passage. 
Skin conductance changes were recorded throughout. 

All that has been described above is outlined in 
detail by Corteen and Wood (1972). The major 
change was that suggested by the Treisman and 
Riley (1969) paper. Prior to the dichotic listening 
sequence Ss were shown a Morse key which acti- 
vated a simple buzzer. They were asked to place 
the fingers of their free (right) hand on the key and 
they were told that if they heard any city names, 
in either ear, during the shadowing procedure, they 
were to stop shadowing immediately and press the 
buzzer. The importance of accurate shadowing 
was stressed but the need simultaneously to stop 
shadowing and press the buzzer upon hearing any 
city name was heavily emphasized, 

A total of 19 experimental and 17 control Ss were 
run. All Ss were undergraduate students at the 
University of British Columbia. The control Ss 
were exposed to exactly the same procedure as the 
experimental Ss except that there was no prior 
shock association. The purpose of the control Ss 
was to compare the incidence of buzzer pressing 
to city names when there had been no shock pairing, 
As it happened the control was unnecessary. 

Results, The study confirmed the original 
findings of Corteen and Wood (1972) almost exactly. 
Indeed it would have been dismaying if it had not, 
as it was an almost exact replication. The findings 
concerning awareness appear to be quite unequi- 
vocal. The 19 experimental Ss were each exposed 
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to six "critical" words, three shock-associated city 
names and three other city names. This gave, in 
all, 114 opportunities to press the buzzer. The 
buzzer was pressed only once. Naturally this 
produced a phasic skin conductance response but 
it was impossible to determine whether this response 
was to the word or to the process of buzzer pressing. 
There were 41 significant phasic conductance 
responses to the remaining 113 critical words and 
on none of these occasions was the buzzer pressed. 
Of these 41 responses, 24 (42.1%) were to shock- 
associated city names and 14 (29.8%) were to 
non-shock-associated city names. There was a 
7.3% responding rate for shock-list control nouns 
and a 5.7% responding rate for non-shock-list 
contro| nouns. The autonomic responding rate 
to city names of the control Ss was higher than the 
rate to control nouns (12.3% vs. 6,9%) but the 
difference was not significant. The size of a signifi- 
cant phasic response is defined by Corteen and 
Wood as a change in resistance of at least 1 kQ 
occurring within 3 sec. There were two buzzer 
presses by control Ss. 

Discussion. The hypothesis being tested by this 
experiment was that the results obtained in the 
Corteen and Wood (1972) study could be explained 
if Ss were momentarily aware of stimulus words in 
the nonattended channel, but that this awareness 
was of such a transitory nature that no vestige of 
it remained at the end of the experiment. The 
hypothesis was suggested, but only barely supported, 
by the findings of Norman (1969) and Glucksberg 
and Cowen (1970) that memory for nonattended 
items, when they are probed for, persists for only a 


few seconds. Neither of these two cited studies 
demonstrated or sought to demonstrate that the 
items are in conscious awareness prior to being 
probed for, but they do suggest the idea of momen- 
tary awareness with subsequent complete loss, 

The findings reported here seem to invalidate 
the above hypothesis, They do not, of course, 
deny the findings of Norman (1969) or Glucksberg 
and Cowen (1970) because, of course, the items they 
report memory for may only come into awareness 
when there is specific interruption and probe. But 
the results do certainly support the position taken 
by Corteen and Wood (1972) that there is no 
awareness of certain words in a nonattended channel 
even though those words are capable of giving 
rise to a significant phasic conductance response. 

It is certainly still possible to argue about the 
precise meaning of the word "aware": but insofar 
as it might be defined as the ability of Ss to indicate 
the presence of a stimulus at the time of its occur- 
rence, then this study strongly su s that there 
can be autonomic responding to words that are not 
in the awareness of Ss at the time of the responding. 
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RELATION BETWEEN SIMILARITY GROUPING 
AND PERIPHERAL DISCRIMINABILITY t 
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Four experiments studied the extrafoveal discriminability of a tilted T, an L, 
an X, and a sideways T from an upright T. In a patterned field in which an S 
did not know where to direct his attention, the discriminability of the figures 


corresponded to their effectivene 
of upright Ts. In an empty vis 


in producing similarity grouping in a pattern 
field in which the brightnes 


s between a figure 


and a background was able to direct an S where to attend, the discriminability 


of the figures 
grouping. 


id not correspond to their effectiveness 
The results are interpreted as supporting the hypothe: 
larity grouping involves segregating a pattern into groups on the b: 


n producing similarity 
s that simi- 
s of extra- 


foveal difference signals that are responded to in parallel, or, if sequentially, 
very quickly and independently of focal attention, 


Beck (1972) proposed that similarity grouping 
involves segregating a pattern into groups on the 
basis of extrafoveal stimulus differences that are 
readily discriminated when attention is distributed 
and not concentrated or focused. He reported 
that the similarity grouping produced by rotating 
some of the figures in a display corresponded to their 
extrafoveal discriminability in a patterned visual 
field in which an S did not know where to attend, 
but not in an empty visual field, This difference 
in the discriminability of figural differences in a 
patterned and in an empty visual field was inter- 
preted as resulting from a narrowing or focusing of 
attention. In an empty visual field, the brightness 
difference between a figure and a background was 
able to direct an individual where to attend. The 
present experiments further compare the extrafoveal 
discriminability of a tilted T, an L, an X, and a 
sideways T from an upright T in a patterned and 
in an empty visual field. Beck (1966, 1967) re- 
ported that similarity grouping occurs strongly for a 
tilted T but not for an L, and X, or a sideways T in 
a pattern of upright Ts. 

Experiment I. The patterned visual field con- 
sisted of four figures in the corners of an imagi y 
square 7.62 cm. on a side. Each target consisted of 
either four upright Ts in each of the four corners or 
three upright Ts in three corners and an L, a T 
rotated 45° clockwise (tilted T), or a T rotated 90° 
counterclockwise (sideways T) in the fourth corner. 
The lines composing the figures were always at 
right angles to each other. The figures were made 
out of black dry-transfer lines .04 cm. wide (approxi- 
mately .04°), and .32 cm. long (approximately .3°) 
affixed to white paper. The targets were presented 
by means of a Gerbrands three-channel tachisto- 
scope in which the preexposure and exposure fields 
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were masked so that a target appeared in a circular. 
field 18.89 cm. in diameter. The luminances of the 
white backgrounds of the preexposure and exposure 
fields were 17.13 cd/m*. Following the presentation 
of a target, the preexposure field immediately re- 
appeared. A black dot placed in the center of the 
preexposure field served as a fixation point, A dot 
was also placed in the center of the square array on 
each target and carefully aligned to coincide with 
the fixation point in the preexposure field, An S 
was instructed to fixate the dot in the center of the 
square array and told that this would enable him to 
perform optimally. The distances between the 
center dot on a target and the figures in the corners 
were 5.33 em. (5° 7’ of arc). 

The targets with an L, tilted T, and sideways T 
were presented in three separate sets, Each set 
consisted of four targets with an upright T in each 
of the four corners and four targets with an upright 
T in three corners, and a disparate figure in the 
fourth corner. An S was told before the presenta- 
tion of each set which. disparate figure would be 
presented in that set. He was instructed to report 
"same" for targets having all upright Ts and to 
identify the location of the disparate figure for 
targets having a disparate figure. An error was 
counted if an § reported a disparate figure when 
none was present, failed to report a disparate figure, 
or incorrectly reported the location of a disparate 
figure, Three of the 8 stimuli ina set were randomly 
added to a block of trials but were not analyzed 
(making 11 stimuli in all); The 11 stimuli in a 
set were presented twice to an S in a random order 
for a total of 22 presentations. The three sets were 
presented in a different order to each S. Thus, 
each S received 66 presentations in the experiment, 
Each S also received 66 presentations in a practice 
session before the experiment began. ‘The practice 
period was used to determine an exposure duration 
that would give an overall percentage correct of 
between 70% and 75%. In Experiment I, the ex- 
posure durations ranged between 23 and 42 msec., 
with a mean of 30.3 msec, 
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TABLE 1 


MEAN ERRORS AND STANDARD DEVIATIONS 
IN THE EXPERIMENTS 


Experiment M SD 

I: Patterned field (NV = 15) 
Tilted T vs. T 3.1 16 
L vs. T 4.9 L5 
Sideways T vs. T 6.4 1.6 


Il: Empty field (V = 10) 
Tilted T vs. T 4 
Lys. T 3. 
Sideways T vs. T 5. 


IIL: Patterned field (NV = 15) 
Tilted T vs, T 4.1 
Xvs T 6.2 
Sideways T vs. T ES 

field (N = 15) 

vs. T. 2: 

X vs. T y 34 

Sideways T vs. T Less 


Experiment II. In Experiment ll, a single figure 
appeared on one corner of an imaginary 7.62-cm. 
square in an otherwise empty visual field. Three 
separate sets of stimuli were prepared. Each set 
consisted of four targets containing a single upright 
T in each of the four corners of the imaginary 
square and four targets containing either an L, a 
tilted T, or a sideways T in each of the four corne 
An S was told before the presentation of each set 
which two figures (upright T and tilted T, for 
example) would be presented. An error was 
counted if an .S incorrectly identified either a figure 
or the location of a figure. The procedure and ex- 
perimental arrangement were the same as in Experi- 
ment I, In Experiment II, the exposure durations 
ranged between 11 and 35 ms with a mean 
of 17,4 msec. 

Experiments III and IV. In Experiment III, the 
targets were similar to those in Experiment I. 
Each target consisted of either four upright Ts or 
three upright Ts and an X, a tilted T, or a sideways 
T. In Experiment IV, the targets were similar to 
those in Experiment II. Each target consisted of a 
single upright T, tilted T, X, or sideways T on one 
corner of an imaginary square in an otherwise empty 
field. The stimulus sets, procedure, and experi- 
mental arrangement in Experiment III were the 
same as in Experiment I and those in Experiment IV 
the same as in Experiment Il, The only differences 
were that the figures in Experiments IIT and IV. 
were drawn in India ink and the vertical and hori- 
zontal lines of the exposure and preexposure fields 
were not masked. The exposure durations in Ex- 
periment I1 ranged between 10 and 20 msec., with 
a mean of 18 msec., and in Experiment IV ranged 

- between 10 and 20 msec., with a mean of 15 msec. 

Subjecs. Four separate groups of Ss who had 
either normal vision or corrected-to-normal vision 
when wearing glasses served in the experiments. 
Fifteen Ss served in Experiments I, III, and IV and 
10 Ss served in Experiment II. 

Results. Table 1 presents the mean errors and 
standard deviations for the experiments. The error 
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rate overall in Experiment I was Statistical 
tests of the greater mean error f © than for a 
tilted T, ż (14) = 3.83, p < .01; sways T 
than for a tilted T, ¢ (14) = 6.49, p < and fora 
sideways T than for an L, £ (14) = 2.96, ^ < .05, 


were significant. The error rate overall in .-xperi- 
ment III was 37%. Statistical tests of the greater 
mean error for an X than fora tilted T, | (14) = 2.54, 
P < .05, and for a sideways T than for tilted T, 
t (12) = 421, p < .01, were significant. The 
greater mean error for a sideways T than for an X 
failed to be significant, ¢ (14) = 1.60 1, Thus; 
the results show that, in a patter: isual field, 
Ss made statistically significantly ^re errors in 
discriminating an X, L, and sid from an 
upright T than in discriminating a '; I from an 
upright T. 


The error rate overall in Experi: vas 27%, 
St ical tests of the greater mean error \ side- 
ways T than for a tilted T, i (9) 1.¢ .2; for 
a tilted T than for an L, £ (9) = 2.22, p 05; and 
for a sideways T than for an L, £ (9) = 1.90, p > .05, 
were not significant.’ The error rate overall in 


Experiment IV was 24%. A sta al test of the 
greater mean error for an X than for a tilted T, 
t (14) = .26, p > .5, was not significant. The 
greater mean error for a sideways T than for a 
tilted T, ¢ (14) = 6.04, p € .01, and for a sideways 
T than for an X, ¢ (14) = 5.71, p 01, were 
significant. When a brightness difference between 
a figure and a background directed an S where to 
attend, the discriminability of an X and of an L 
from an upright T was similar to that of a tilted T. 
In an empty visual field, the discriminability of the 
figures, as hypothesized, failed to correspond to 
their effectiveness in producing similarity grouping. 
Discussion. The results are consistent with the 
hypothesis that the effectiveness of stimulus differ- 
ences in producing similarity grouping correspond 
to their peripheral discriminability in a patterned 
visual field in which an S does not know where to 
attend. Beck (1966, 1967) found that Ls and side- 
in a pattern of upright Ts grouped mor 
poorly than did tilted Ts. He a it, 


? The acceptance of the null hypoth 
problem of a Type II error, In Experi 
increased this possibilit It is, therefor 
out that the relative discriminability of a 
in Experiment II was the opposite of t! 
Though just missin; nificance, the mean error for an L was 
less than that for a tilted T in yeriment IL. In contrast, in 
Experiment I, the mean error for an L was significantly greater 
than for a tilted T, Beck and Ambler (1972, 1973) aleo found 
that the peripheral discriminability of an L fror pright 
T was greater than or equal to that of a tilted T when attention 
was focused, but that the discriminability of an L became 
worse than that of a tilted T when attention was distributed 
over a patterned field. What is suggested is that focused 
attention increases the sensitivity of the visual system to the 
variable of line arrangement in the periph Information 
about line arrangement would facilitate discriminating an L 
from an upright T but would, if anything, ause of their 
similarity in line arrangement interfere with discriminating a 
tilted T from an upright T. 

* [n Experiment IV, a tilted T was more di 
an upright T than a sidewa; 


poses the 
an N of 10 
ant to point 
nd of a tilted T 
n Experiment I 


1 iminable from 
y y T, but in Experiment II, it was 
not. A possible explanatio that the vertical and horizontal 
lines of the exposure field visible in Experiment IV served as 
cues for the tilt of a figure's lines. It is possible that masking 
the vertical and horizontal lines in Experiment II reduced the 
discriminability between a tilted T and an upright T. 


; 
$ 
| 
| 
i 
Í 
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though an X had the same slopes of lines as a tilted 
T, Xs grouped less readily than tilted Ts and similar 
to sideways Ts in a pattern of upright Ts. 

An interpretation of the relation between simi- 
larity grouping and peripheral discriminability under 
uncertainty in a patterned field can be given by 
assuming that both are based upon stimulus differ- 
ences computed from the figural properties that are 
responded to “quickly” and independently of focal 
attention (Beck & Ambler, 1972). What is proposed 
is that when attention is distributed over a patterned 
field, a tilted T is discriminated from an upright T 
on the basis of a parallel or, if sequential, almost 
simultaneous readout of differences in their line 
slopes from a visual information store (Beck & 
Ambler, 1973). The presence of confusable vertical 
and horizontal lines is assumed to interfere with a 
“quick” readout of differences in line arrangement. 
Thus, an L and a sideways T group poorly and are 
not discriminable with distributed attention in a 
pattern of upright Ts. Selective attention, however, 
can increase an S's sensitivity to peripherally pre- 
sented differences in line arrangement (Beck & 
Ambler, 1972, 1973). When a single figure is 
presented, the brightness difference between a 
figure and background directs an S where to attend 
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and thereby improves the discriminability of an L, 
sideways T, and X. 

The present results suggest that the Xs failed 
to group as well as the tilted Ts in a pattern of 
upright Ts because the central symmetry of an X 
causes it to readily lose discriminability outside of 
the fovea when it subtends a small visual angle. 
Under these conditions, the grouping of Xs in a 
pattern of upright Ts is weak. However, when the 
lines of an X are made larger, Xs grouped as well 
as tilted Ts (Beck, 1973). 


REF 


RENCES 


Beck, J. Perceptual grouping produced by changes in orien- 
tation and shape. Science, 1966, 154, 538-540, 
Beck, J. Perceptual grouping produced by line figures, 
Perception & Psychophysics, 1967, 2, 491-495. 
Beck, J, Similarity grouping and peripheral discriminability 
aunties uncertainty, American Journal of Psychology, 1972, 
9. 


Beck, J, Similarity grouping and peripheral discriminability 
under uncertainty II. (Tech, Rep. 233) College Park: 
University of Maryland Computer Science Center, 1973, 
Beck, J., & AwnrtEs, B. Discriminability of differences in 
slope and in line arrangement as a function of mask 


distributed attention on peripheral acuity. 
Psychophysics, 1973, 14, 225-230. 


Perception. & 


(Received September 17, 1973) 


INHIBITION IN FREE RECALL ! 


ISABEL M. BIRNBAUM * 
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Two succ 
ing. Half of the categori 


sive lists of categorized words were presented for free-recall learn- 
appeared on both lists (repeated categories) and 


half appeared on only one of the lists (nonrepeated categories). A standard 


control group learned one list. 
different patterns of fi 


On the final test of free recall of both lis 
st retention loss were seen for repeated and non- 


S, 


repeated categories. There was a loss of words within categories in the former 
case, and a loss of whole categories in the latter. 


Recent studies indicate that retroactive inhibition 
(RI) in free-recall learning is an increasing function 
of similarity between the category composition of 
List 1 and List 2. Shuell (1968) found that RI 
was greater when different words from the same 
category appeared on both lists than when com- 


1 The author wishes to thank Karen Krueger and Joellen 
Hartley for their valuable assistance at every stage of the 
research. A 

2 Requests for reprints should be sent to Isabel M. Birnbaum, 
School of Social Sciences, University of California, Irvine, 
California 92664. 


pletely different categories were used on the first 
and second lists. With a mixed-list design where 
half of the List 1 categories were repeated (different 
words from the List 1 category appeared on List 2) 
and half were nonrepeated (no words from the List 1 
category appeared on List 2), there was greater 
retention loss for words in the repeated categories 
(Winograd, 1968). 

The present experiment was designed to investi- 
gate recovery of List 1 repeated and nonrepeated 
categories over a 10-min. retention interval following 


1148 


TABLE 1 


Mean NuwWER or Worps RmcALLED AND MEAN NUMBER 
or WORDS RECALLED PER CATEGORY FOR REPEATED (R) 
AND NONREPEATED (N) CATEGORIES 


Controls Es | Es 


Data |— EE | 
R N R N R 
Words 14.05 | 15.22 | 11.11 | 1,50 | 12.44 | 11.66 
Words per 
category ex 3.85 | 2,94 | 339 3:25 | 3.49 
Note, Ei = experimental group with a predominance of 


study trials and Es = experimental group with a predominance 
of SS trials in interpolated learning. 
R vs. N was a dummy variable in the control group. 


two types of interpolated learning (IL); however, 
chance differences in original learning (OL) in the 
immediate recall groups precluded the planned com- 
parisons. The remaining groups provided a modi- 
fied replication and extension of the original phe- 
nomenon, and results in these groups are the basis 
for the present report. 

Method. The Ss were 54 students at the Uni- 
versity of California, Irvine, whose participation in 
the experiment contributed to the fulfillment of a 
course requirement, The Ss were run individually 
and were assigned to groups in randomized blocks 
in order of appearance at the laboratory. 

Lists for OL and IL contained 48 words, 6 words 
from each of 8 categories. A repeated category 
appeared on the OL and IL lists and the specific 
words from the category were different on each list. 
A nonrepeated category appeared only on the OL 
list or the IL list. Three different 8-category lists 
were constructed from a pool of 12 categories with 
12 words in each category, using the design de- 
scribed by Winograd (1968). Words were selected 
from the category norms of Battig and Montague 
(1969) and two subsets of 6 words from each cate- 
gory were balanced for mean frequency of occurrence 
in response to the category name, Each category 
was a repeated category one-third of the time, and 
any pair of lists contained 4 repeated and 4 non- 
repeated categories. If the three lists are designated 
A, B, and C, then one-sixth of the Ss in each group 
learned each of these OL-IL sequences: AB, AC, 
BA, BC, CA, and CB. hus, each category and 
each subset of 6 items within the category used 
equally often as a repeated and a nonrepeated 
category in OL and in IL. 

Each list was tape recorded in four dilferent 
random orders with the restriction that no words 
Írom the same category appear in adjacent posi- 
tions. The words were recorded at a 1-sec. rate and 
‘three of the orders were used equally often as 
starting orders in OL and IL. AIl Ss received four 
alternating study and test trials on List 1. The 
duration of each trial was 48 sec. and there was a 
12-sec. intertrial interval. Three groups were dif- 
lerentiated on the basis of the interpolated task. 
The control (C) group worked on a paper-and- 
pencil numerical reasoning task throughout the 
retention interval. Two experimental (E; and Es) 
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groups received 12 trials of IL and then worked on 
the numerical reasoning task for 10 min. Group 
E, received a predominance of study trials during 
IL: each four-trial cycle consisted of three successive 
study trials followed by a single test trial. Group 
E; received a predominance of test trials during IL: 
each four-trial cycle consisted of a single study trial 
followed’ by three successive test trials. In all 
groups, the final recall test began 26.5 min. after 
the last test trial on List 1. The Ss in Groups E; 
and Es were asked to recall the words from Lists 1 
and 2 in any order; recall was oral and was termi- 
nated after 4 min, had elapsed. The Ss in Group C 
were given 2 min. for oral recall of the words from 
List 1 in any order. 

Results and discussion, The mean number of 
words recalled on the last test trial of OL was 24.8, 
2 nd 24.5 in Groups C, Ey, and E», respectively, 
F (2,51) < 1.00, There was no significant differ- 
ence in recall of words from to-be-repeated and 


to-be-nonrepeated categories, F (2, 51) = 1.22, 
LE 
Shuell (1968) used a between- Ss design and found 
consistently poorer performance in IL when words 
from the same category appeared on Lists 1 and 2 
than when new categories appeared on List 2, In 
the present experiment, performance on repeated 


in the IL mixed list 
Three-way analyses of 
Cycles X Type 


and nonrepeated categori 
showed the same pattern 
variance of recall in IL. (Groups 
of Category) either on the first test trial of each 
cycle or the fourth trial of each cycle (a test trial 
for both groups) showed that fewer words (ps < .05) 
and a smaller number of words per category (ps < 
-01) were recalled from repeated categories than 
from nonrepeated categories during IL. There was 
improvement over cycles in IL and Group E; 
showed consistently higher recall than Group E. 
(ps < .01). 

First-list recall of words from repeated and non- 
repeated categories in each group is shown in Tabic 
1. The designation of words as repeated and non- 
repeated in Group C was a dummy variable, Anal- 
yses of variance of absolute recall scores on the fina! 
test or loss scores (recall on Trial 4 of OL minus 
recall on the final test) yielded essentially the same 
results. There was a significant difference between 
groups in loss scores based on number of words 
recalled and number of words recalled per category A 
F (2,51) = 7.12, p < .01, and F (2, 51) = 5.09, 
b < .05, respectively, confirming the na 
of RI in free-recall learning of categorized lists. 
There was no significant difference between E; and 
Ez on either dependent variable (ps > .10) su 
gesting that first-list retention loss is not modified 
by the pattern of study and test trials in IL when 
the number of trials is held constant, There was 
greater RI for words recalled per category in re- 
peated than nonrepeated categories, F (2, 51) 
7.96, p < .01, for the Groups X Type of Category 
interaction, and there was no Ge RI for 
nonrepeated categories alone, F (2, 51) = 1.17, p 
2.25. The results are consistent with those of 
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Tulving and Psotka (1971) who found no signifi- 
cant reduction in first-list within-category recall 
when successive free-recall lists contained different 
categories, 

Winograd (1968) found that on the final test of 
recall, List 1 nonrepeated categories were less likely 
to be retrieved than repeated categories but more 
items within a recalled nonrepeated category were 
accessible. The interplay of these factors yielded 
an approximately equivalent total number of items 
recalled from repeated and nonrepeated categories, 
and the present results in the experimental groups 
confirm Winograd's findings, Category recall, de- 
fined as recall of at least one item from the category, 
was significantly higher for repeated than non- 
repeated categories, F (1, 34) 13.28, p < .01. 
It appears that the probability of recall of at least 
one word from a category was increased by the 
greater frequency and/or recency with which the 
category had been encountered. However, when 
different words from the same category were in- 
cluded in OL and IL there was a reduction in first- 
list within-category recall; given that a category 
was recalled, a significantly greater number of words 
from the category were recalled in nonrepeated than 
repeated categories, F (1, 34) = 8.68, p < .01. As 
in Winograd's study, there was no significant differ- 
ence in the total number of words recalled from 
nonrepeated and repeated categories (F < 1.00). 
In contrast, Shuell (1968) found lower absolute 
recall as well as fewer words recalled per gory 
when the same categories appeared on the first and 
second lists. The increase in category recall did not 
compensate for the reduction in within- 
recall. On the basis of the present data 
data reported by Shuell and Winograd, it is c 
that the occurrence of words from the same con- 
ceptual categories in two successive lists enhances 
recall of the categories on the final free-recall test 
and reduces the recall of first-list words within each 
category. 
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Given the large number of procedural differences 
between Shuell's (1968).and the present study, the 
similarity of results is striking. In addition to a 
smaller number and different pattern of trials in IL, 
Shuell used an unmixed-list design : Either all or none 
of the categories on List 1 were carried over to List 2, 
Thus, the overall contextual similarity between OL. 
and IL (in terms of category composition) was 
higher when categories were repeated (same group) 
than when categories were not repeated (different 
group). In the present study, the overall contextual 
similarity of the interpolated list was somewhat 
similar to the original list for repeated or non- 
repeated categories. Yet in both studies, significant 
within-category retention loss was found only for 
repeated categories. Nonrepeated categories appear 
to be highly resistant to interference, showing no 
reduction in within-category recall whether the 
categorical composition of the second list is com- 
pletely changed (as in Shuell's study) or only 
partially changed (as in the present study). It will 
be of interest to determine whether a nonrepeated 
category remains impervious to interference when 
all but one category is c d over from OL to IL. 
In view of previous results on RI and contextual 
similarity, it seems reasonable to. ime that 
within-category retention loss in nonrepeated cate- 
gories will be a function of the similarity of the 
remainder of the interpolated list to the original 
list, 
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Unit imaginable . . . . Made to order for the Experimenter 
Who likes to innovate... . 

Like you. 

It's for black and white, and color vision experiments; 

It has easily removable back plates, 

So you can attach accessories like back illuminators 

Or insert 3-dimensional stimuli; 

And adjustable field masks, framing masks and 

A removable top, 

So you can adjust or modify what you want... When you want, 


It's the right tach because it's the most reliable tach. 
Made to order for the Researcher with rigorous demands, 
Like you, 

It has cold cathode lamps for accurate stimulus control; 
Evenly matched color and illumination in all 3 fields, 

So you don't cue the subject; 

And special anti-reflection mirrors, 

So you don't get multiple images. 


WE HAVE SUPPLIED OVER ONE-HALF OF ALL 
THE 3-CHANNEL TACHISTOSCOPES 

IN USE TODAY. 

WE THINK WE'RE ON THE RIGHT TACH. 

LIKE YOU? 
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